MATHEMATICAL THEORY OF OPTICS by R. K. LUNEBUR6 Foreword by EMILE WOLF Supplementary Notes by M.HERZBER6ER UNIVERSITY OF CALIFORNIA PRESS BERKELEY AND LOS ANGELES 1964 R. K. Luneburg University of California Press Berkeley and Los Angeles, California Cambridge University Press London, England @ 1964 by The Regents of the University of California Library of Congress Catalog Card Number: 64-19010. Printed in the United States of America FOREWORD During the Summer of 1944, Dr. Rudolf K. Luneburg presented a course of lectures on the Mathematical Theory of Optics at Brown University. The lecture material was later collected in a volume which was issued by Brown University in the form of mimeographed notes. These notes were by no means a compilation of generally available knowledge. They contained a highly original, thorough, and systematic account of the foundations of several branches of optics and numerous new and important results. The supply of copies of Luneburg's notes was soon exhausted, but demand for them has continued. The University of California Press is providing a real service to the scientific community by issuing a printed version of these notes. Fate has prevented Dr. Luneburg from seeing this volume. He died in 1949, at a time when the importance of his work was just beginning to be generally recognized. The chief contribution which Luneburg has made through these notes lies in having shown how the two main mathematical disciplines of instrumental optics, namely geometrical optics and the scalar diffraction optics, may be developed in a systematic manner from the basic equations of Maxwell's electromagnetic theory. Prior to Luneburg's work these two disciplines were, by and large, treated as self-contained fields, with little or no contact with electromagnetic theory. The starting point of Luneburg's investigation was the observation of the formal equivalence of the basic equation of geometrical optics (the eikonal equation) and the equation that governs the propagation of discontinuous solutions of Maxwell's equations (the equation of characteristics). By boldly identifying the geometrical optics field with the electromagnetic field on a moving discontinuity surface, Luneburg was led to a complete formulation of geometrical optics as a particular class of exact solutions of Maxwell's equations; This formulation is by no means based on traditional ideas; for traditionally geometrical optics is regarded as the short wavelength limit (or, more precisely, as the asymptotic approximation for large wave numbers) of the monochromatic solution of the wave equation. Luneburg was, of course, aware of this more traditional viewpoint and he touches briefly on it in §16. In fact, in a course of lectures which he later presented at New York University (during the academic year 1947-1948) Luneburg devoted considerable time to. the interrelation between the two approaches. Some of the ideas outlined in the two courses have become the nucleus from which a systematic theory of asymptotic series solutions of Maxwell's equations is gradually being developed. An account of the material presented by Luneburg in his New York V vi FOREWORD lectures and of related more recent developments will soon be published by Drs. M. Kline and I.W. Kay in a book entitled Electromagnetic Theory and Geometrical Optics (J. Wiley and Sons, New York). Chapter I of the present work contains the derivation of the basic laws of geometrical optics from Maxwell's equations. Amongst the many new results which this chapter contains, the transport equations, eq. (11.38), relating to propagation of the electric and the magnetic field vectors along geometrical rays, are of particular significance. In Chapter II Hamilton's theory of geometrical optics is formulated and in the chapter which follows it is applied to special problems . Amongst results which seem to make their first appearance in the scientific literature are some of the formulae of §24 relating to final corrections of optical instruments by aspheric surfaces; some new theorems relating to perfect optical instruments (§28.4); and the introduction in §29· of a new "perfect lens," which images stigmatically onto each other two spherical surfaces which are situated in a homogeneous medium. This is the now well known "Luneburg lens" which has found valuable applications as a microwave antenna. First and third order theories of optical systems are discussed in Chapters IVand V and, like all the other chapters, they contain a wealth of information. Chapter VI deals with the diffraction theory of optical instruments. The first sections of this chapter are devoted to the derivation, in a mathematically consistent way, of expressions for the electromagnetic field in the image region of an optical system suffering from any prescribed aberrations. A solution of this difficult problem (naturally somewhat idealized) is embodied in formulae (47.33), now known as the Luneburg diffraction integrals. These formulae are an important and elegant generalization of certain classical results of P. Debye and J. Picht. Section 48 deals with another important problem, often ignored in other treatises, namely with a systematic derivation of the scalar theory for the description of certain diffraction phenomena with unpolarized light. The concluding sections deal with problems of resolution and contain a discussion of the possibility of improvements in resolution by a suitable choice of the pupil function. These investigations are amongst the first in a field that has attracted a good deal of attention in recent years. In two appendices formulae are summarized relating to vector analysis and to ray tracing in a system of plane surfaces. They are followed by supplementary notes on electron optics, prepared by Dr. A. Blank and based on lectures of Dr. N. Chako. The volume concludes with supplementary notes by Dr. M. Herzberger, based on his lectures dealing with optical qualities of glass, with mathematics and geometrical optics and with symmetry and asymmetry in optical images. FOREWORD vii It is evident that Luneburg's Mathematical Theory of Optics is a highly original contribution to the optical literature. I consider it to be one of the most important publications on optical theory that has appeared within the last few decades. Department of Physics and Astronomy University of Rochester, Rochester 27, New York May, 1964 Emil Wolf PUBLISHER'S NOTE The present edition has been reproduced from mimeographed notes issued by Brown University in 1944. It is reprinted by permission of the Brown University Press. The University of California Press extends gratitude for help in making this edition possible to Dr. A. A. Blank, Dr. Max Herzberger, Mrs. R. K. Luneburg, Dr. Gordon L. Walker, and Dr. Emil Wolf. The author's name was misspelled in the original edition. This has, of course, been corrected, and a number of typographical errors, almost all of which were listed originally in the Errata of the mimeographed version, have also been corrected. Dr. Blank has clarified the last section of Chapter Von the basis of the Errata. No other changes have been made in this edition, which presents Luneburg's work as he left it. BIOGRAPHICAL NOTE Dr. Rudolf Karl Luneburg Born in Volkersheim, Germany, June 30, 1903; resident in United States of America since 1935; naturalized U. S. citizen 1944. Ph.D. University of Gottingen, 1930. Research Associate in Mathematics, University of Gottingen, 1930-1933; Research Fellow in Physics, University of Leiden, 1934-1935. Research Associate in Mathematics, New York University, 1935-1938. Mathematician, Research Department of Spencer Lens Company (subsidiary of American Optical Company), Buffalo, New York, 1938-1945. Visiting Lecturer, Brown University, Summer, 1944. Mathematical Consultant, Dartmouth Eye Institute 1946. Research Mathematician, Institute of Mathematics and Mechanics (now Courant Institute of Mathematical Sciences), New York University, 1946-1948. Visiting Lecturer, University of Marburg and Darmstadt Institute of Technology, 1948-1949. Associate Professor of Mathematics, University of Southern California, 1949. Died at Great Falls, Montana, August 19, 1949. PUBLICATIONS t 1. t Das Problem der Irrfahrt ohne Richtungbeschrankung und die Randwertaufgabe der Potentialtheorie. Math. Annalen 104, 45 (1931). 2. Eine Bemerkung zum Beweise eines Satzes iiber fastperiodische Funktionen. Copenhagen, Hovedkommissionaer: Levin & Munksgaard, B. Lunos boktrykkeri a/s, 1932. 3. On multiple scattering of neutrons. I. Theory of albedo and of a plane boundary (with 0. Halpern and 0. Clark). Phys. Rev. 53, 173 (1938). 4. Mathematical theory of optics (mimeographed lecture notes). Brown University, Providence, Rhode Island, 1944. 5. Mathematical analysis of binocular vision. Princeton, New Jersey: Princeton University Press, for the Hanover Institute, 1947. Research reports are not included in this bibliography. ix x BIOGRAPHICAL NOTE 6. Metric studies in binocular vision perception (Studies and Essays presented to R. Courant on his 60th birthday, January 8, 1948). New York: Jnterscience Publishers, Inc., 1948. 7. Propagation of electromagnetic waves (mimeographed lecture notes). New York University, New York, New York, 1948. 8. Multiple scattering of neutrons. II. Diffusion in a plane of finite thickness (with 0. Halpern). Phys. Rev. 76, 1811 (1949). 9. The metric of binocular visual space. J. Opt. Soc. Amer. 40,627 (1950). ACKNOWLEDGMENT These notes cover a course in Optics given at Brown University in the summer of 1944. The preparation for mimeographing [the original copies] was possible only through the assistance of Dr. Nicholas Chako and of my students, Miss Helen Clarkson, Mr. Albert Blank and Mr. Herschel Weil. I wish to express my thanks for their excellent cooperation. The supplementary note on Electron Optics has been written by Mr. Blank with the aim of giving a short derivatio!! of the main results by methods similar to those applied in the other parts of the course. This note has its source in lectures given by Dr. Chako on the physical side of this topic; in the mathematical approach it differs, however, from his presentation. To Dr. Max Herzberger of the Eastman Kodak Company I am greatly indebted for contributing the supplementary notes II, III, IV from his recent research. These notes are a record of his three lectures. I wish also to express my gratitude to Dean R.G.D. Richardson for his interest in the course and to his staff for unfailing help and cooperation in the task of preparing these notes. R. K. L. xi CONTENTS CHAPTER I WAVE OPTICS AND GEOMETRICAL OPTICS §1. The electromagnetic equations 1.1 Definitions and notations 1.2 Maxwell's differential equations 1.3 Energy relations 1.4 Boundary conditions Periodic fields 2.1 Special form of E and H in periodic fields 2.2 Complex vectors 2.3 Differential equations for u and v 2.4 Avj:lrage energy 2.5 Average :r.~x 2.6 Polarization §3. Differential equations for E and H 3.1 Second order equations for E and H 3,2 stratified media C ,' ~ '' §4. Integral form of Maxwell's equations 4.1 Eliminations of derivatives by integration 4.2 Integral conditions instead of div e:E = div µH = 0 4.3 Integral conditions instead of Maxwell's equations §5. General conditions for discontinuities ► II," 5.1 statement of the problem 5.2 First condition for E and H 5.3 Second condition for E 5,4 Complete set of difference equations for discontinuities of the electromagnetic field xiii Pages 1-5 1-2 2-3 3-5 5 5-11 5-6 6-7 7 7-8 8-9 9-11 12-14 12-13 13-14 15-18 15-16 16-17 17-18 18-20 18 18-19 20 20 xiv §6. §8. §9. §10. §11. CONTENTS Discontinuities of the optical properties 6.1 Conditions for E and H if E and µ are discontinuous Pages 20-21 20-21 Propagation of Discontinuities; Wave fronts 7.1 Homogeneous Media 7.2 Characteristic equation and characteristic hypersurfaces 7.3 Equation of the wave fronts 21-25 21-22 22-23 23-25 Bicharacteristics; Light rays 8.1 Introduction of light rays as orthogonal trajectories 8.2 Differential equations for the light rays 8.3 Interpretation of rays as paths of corpuscles 8.4 Fermat's problem of variation 25-29 25 25-26 26 26-29 Construction of wave fronts with the.~!~-~f light rays 9.1 Boundary value problem for 1/J(x,y,z) 9.2 Expression for the solution of the problem 9.3 Proof that the problem is solved by the expression in 9.2 9.4 Problem of finding the wave fronts which belong to a given special wave front 9.5 Spherical wave fronts; wavelets 9. 6 Huyghens' construction 29-36 29-30 30-31 31-33 33 33-34 34-36 Jacobi's theorem 10.1 Construction of the light rays with the aid of a complete integral of the equation of the wave fronts 10.2 Proof of Jacobi's theorem 10.3 Example: Light rays in stratified medium 36-38 36-37 37-38 38 Transport equation for discontinuities in continuous optical media 11.1 Differentiation along a light ray 11.2 Conditions for E and H on a characteristic hyper surface 11.3 Simplification of the above conditions; differential equations for discontinuities on a light ray 39-44 39 39-42 42-44 CONTENTS §12. Transport of discontinuities (Continued) 12.1 Equations for directions of the discontinuities U,V and their absolute values 12.2 Geometric interpretation of the quantities A€1/J and Aµl/J 12,3 Energy and flux on a wave front 12.4 The directions P and Q of the discontinuities depend only on the light ray and not on the wave fronts 12.5 The discontinuities U and V determine two applicable surfaces through a light ray 12.6 Non-euclidean parallelism of the discontinuities U and V on a light ray 12.7 Integration of the transport equations §13. Spherical waves in a homogeneous medium 13.1 Simplest type of solutions of the wave equation with spherical wave fronts 13.2 Method of obtaining solutions of this type by differentiation; multipole waves 13.3 Multipole _solutions of Maxwell's equations 13.4 Dipole solution of Maxwell's equations 13.5 Another form of the expressions 13.4 13.6 The vectors E and H on the spherical wave fronts §14. Wave fronts in media of discontinuous optical properties 14.1 Snell's law of refraction 14.2 The law of reflection §15. Transport of signals in media of discontinuous optical properties. Fresnel's formulae 15.1 Conditions for discontinuities before and after refraction 15.2 Orthogonal unit vectors on the rays 15.3 First set of equations for the components of U and V 15.4 Complete set of equations for U and V; Solution of these equations. Fresnel's formulae xv Pages 44-56 44-46 46-48 48-49 49-50 50-51 51-55 55-57 58-64 58-59 59 59-60 60-62 62-63 63-64 64-67 64-66 66-67 67-72 67-69 69-70 70 71-72 xvi §16. CONTENTS Periodic waves of small wave length 16.1 Radiation of a periodic dipole in a homogeneous medium 16.2 Radiation of a periodic dipole in a nonhomogeneous medium 16,3 Equations for the amplitude vectors UO and V0 for small wave lengths 16.4 Discussion and solution of the differential equations for U0 and V0 16.5 Media of discontinuous optical properties 16.6 Electromagnetic fields associated with geometrical optics Pages 73-81 73-74 74-75 75-77 77-79 79-80 81 CHAPTER II HAMILTON'S THEORY OF GEOMETRICAL OPTICS §17. Principles of geometrical optics 17.1 Huyghens' Principle 17.2 Light rays as paths of corpuscles 17.3 Electrons in an electrostatic field 17.4 Light rays in a medium with n* = Vn 2 + C 17.5 Fermat's principle 17.6 Example in which light rays are not curves of shortest optical path 82-88 82-84 84-85 85 85-86 86-87 87-88 §18. The canonical equations 18.1 Light rays in the form x = x(z), y = y(z) 18.2 Derivation of the canonical equations 18,3 Canonical equations for problems of variation in general 88-94 88-89 89-91 91-94 §19. Hamilton's characteristic function V(x 0 ,y0 ,z 0 ,x,y,z) 94-100 19,1 Problem of numerical investigation of an optical instrument 94-95 19.2 Definition of the point characteristic V for canonical equations in general 95-97 19.3 Principal theorem of Hamilton's theory regarding the differential dV 97 19.4 Proof of the theorem 98-99 19.5 Hamilton's theorem for geometrical optics 99-100 CONTENTS §20. Hamilton's characteristic functions, W and T 20.1 The mixed characteristic W 20.2 Geometric interpretation of W 20.3 The mixed characteristic W* 20.4 The angular characteristic T 20.5 Special significance of T §21. Integral invariants 21.1 Fields of light rays: necessary and sufficient conditions. Theorem of Malus. The integral !Pc n cos 0 ds is zero for any closed curve C 21.2 Non-normal congruences of rays. Poincare's invariant. Geometrical interpretation §22. Examples 22 .1 Mixed characteristic W for stratified media. Systems of plane parallel plates. Spherical aberration of stratified media 22.2 The angular characteristic in the case of a spherical mirror. First order development of T. Mirror equation 22.3 Angular characteristic for a refracting spherical surface. First order development of T. Lens equation xvii Pages 100-107 100-102 102-103 103-104 104-106 106-107 107-116 107-112 112-116 116-128 116-120 120-124 124-128 CHAPTER Ill APPLICATION OF THE THEORY TO SPECIAL PROBLEMS §23. Perfect conjugate points. Cartesian ovals 129-138 23.1 Light rays through perfect conjugate points have equal optical length 129-130 23.2 Cartesian ovals 130-132 23.3 Object point at infinity: The Cartesian ovals are ellipsoids and hyperboloids 132-133 23.4 Virtual conjugate points. Cartesian ovals for this case 133-134 23.5 An object point at infinity having a perfect virtual conjugate point. The Cartesian ovals are again ellipsoids and hyperboloids 135 23.6 The aplanatic points of a sphere. Graphical construction of light rays refracted on a sphere. Aplanatic surfaces in the front part of micro- scopic objectives 136-138 xviii §24. §25. §26. §27. CONTENTS Pages Final correction of optical instruments by aspheric surfaces 139-151 24.1 Justification of this method of correction 139 24.2 Solution of the problem with the aid of the point characteristic 140 24.3 Solution with the aid of the mixed characteristic W 140-142 24.4 Solution with the aid of the angular characteristic T 143 24.5 Example: A lens with a plane and an aspheric surface, which transforms a spherical wave into a plane wave 143-145 24.6 Example: A lens with a spherical and an aspheric surface which transforms a plane wave into a converging spherical wave 145 24.7 The principle of the Schmidt Camera 146-147 24.8 The correction plate of the Schmidt Camera 147-151 The angular characteristic for a single refracting surface 25.1 Equations for the point characteristic 25.2 Method of finding T by Legendre's trans- formation of the refracting surface 25.3 Surfaces of revolution 25.4 Example of a spheFical surface 25.5 The angular characteristic for an elliptic or hyperbolic paraboloid 151-156 151-152 152-154 154 155 156 The angular characteristic for systems of refracting surfaces 26.1 Variation problem for T for finitely many refracting surfaces 26.2 The corresponding problem of variation for a continuous medium 26.3 Another problem of variation of T 26.4 Equivalence of both problems 26.5 Systems of spherical surfaces 156-164 156-158 158-159 159-160 160-163 163-164 Media of radial symmetry 27.1 Equation of the lig4t_.rays 27.2 The form of the light rays in general 27.3 Example: n 2 = C + -r1 164-172 164-166 166-169 169-172 CONTENTS xix Pages §28. Maxwell's Fish eye 172-182 28.1 The wave fronts can be obtained by a Legendre transformation of the wave fronts of the potential field n 2 = C + .r! 172-173 28.2 The light rays are the circles through two points on opposite ends of a diameter of the unit circle. The medium n = 1/(l+r) represents a perfect optical system 173-175 28.3 Maxwell's fish eye obtained by a stereographic projection of the unit sphere 175-178 28.4 Perfect optical systems in the x,y plane which are obtained by other conformal projections of the unit sphere 178-180 28.5 Conformal projection of surfaces. Relation of optical design problem to the problem of perfect conjugate points on surfaces 180-182 § 29. Other optical media which image a sphere into a sphere }.82-188 29.1 Integral equation for the function p = nr 182-184 29.2 Transformation of the equation into an equation of Abel's type 184-185 29.3 Proof of the inversion theorem 185-186 29.4 Explicit solution of the problem 186-187 29.5 The special case r 0 = 00 , r 1 = 1 187-188 §30. Optical instruments of revolution 188-195 30.1 Euler's equations for optical media in which n = n(p,z); p = ...J x2 + y2 30.2 Transformation of Fermat's problem into a problem for p(z) alone 30.3 Skew rays can be treated as meridional rays if n(p,z) is replaced by 188-189 189-191 m(p,z) = ~ p 191-192 30.4 The characteristic functions in instruments of revolution. The mixed characteristic W depends only on u,v ,w 192-194 30.5 Similar statements for the other characteristic functions 194-195 xx §31. §32. CONTENTS Spherical Aberration and Coma. Condition for coma free instruments 31.1 31.2 31.3 31.4 31.5 31.6 31. 7 31.8 31.9 31.95 Spherical aberration L(p) of an axial bundle of rays Zonal magnification M(p) Development of W for small values of x 0 ,Yo but large values of p,q The ray intersection with the image plane. Construction of Coma flares. Abbe's sine condition Oblique bundles which are rotationally symmetrical to one of their rays Invariance of the characteristic W with respect to orthogonal transformations The characteristic W' in a coordinate system with the principal ray as the z' axis Condition of staeble-Lihotski for symmetrical bundles of rays The aplanatic points of a sphere are coma free. Graphical construction of two aspheric mirrors which possess two perfectly aplanatic points Modification for the object at infinity Pages 195-211 195-197 197-198 198 199-201 201-203 203-204 205-206 206-207 207-208 208-211 The condenser problem 32.1 statement of the problem. Intensity and illumination. Equation for the illumination of a given plane 32.2 Illumination of the center. Equation for the relative illumination 32.3 Condition for uniform illumination 32,4 Interpretation of the condition c/>(01) c/>(0) 1- for 0 0 but 211-215 211-213 213 213-214 214-215 CHAPTER IV FIRST ORDER OPTICS § 33. The first order problem in general 216-226 33.1 Canonical transformation of the rays by an optical instrument 216-217 33.2 Invariance of the canonical conditions with respect to orthogonal transformations of object and image space 217-218 CONTENTS xxi 33.3 Conditions for the matrices of the differentials dX i, dP i. Lagrangian brackets 33.4 Linear canonical transformations in first order optics 33.5 Linear equations obtained with the aid of the angular characteristic 33.6 Simplification of the equations Pages 218-221 221-222 222-224 224-226 §34. Gaussian optics 226-233 34.1 The matrices A,F ,C in systems of revolution. The linear transformations for Gaussian optics 226 34.2 Images of points of the plane z O = 0 227-228 34.3 General lens equation: Location of conjugate planes relative to a given pair of planes 228-229 34.4 Special lens equation; if the unit planes are chosen as reference planes. Unit points and nodal points 229-231 34.5 Focal points; Newton's lens equation 231-232 34.6 The point transformation of object into image space is a collineation. Images of inclined planes 232-233 §35. Orthogonal ray systems in first order optics 234-239 35.1 Definition of orthogonal systems 35.2 Primary or tangential focus; secondary or sagittal focus; astigmatism 35.3 The bundle through the point x 0 = Yo = 0. Focal lines. Primary and secondary magnifications. Astigmatic difference 35.4 The images of the points x 0 ,Yo of the plane Zo = 0 35.5 Dependence of the constants M,m,71. on the position z 0 of the object plane. Lens equation in orthogonal systems 234 234-235 235-236 236-237 237-239 §36. Non-orthogonal systems 240-243 36.1 The relations for the ray coordinates in non-orthogonal systems derived from the mixed characteristic W 240-241 36.2 The bundle through x 0 = y O = 0 241-242 36.3 The images of the points- x 0 ,Yo of the plane Zo = 0 242-243 36.4 Relation of the torsion of the image to the angle included by principal plane bundles of the axial bundles 243 xxii § 37. § 38. § 39. CONTENTS Differential equations of first order optics for systems of rotational symmetry 37.1 Formulation of the problem 37 .2 First order canonical equations. Paraxial rays and their geometric interpretation 37 .3 The corresponding problem of variation. Transversal surfaces. Jacobi's partial differential equation for paraxial rays 37.4 Interpretation of the function n1(z). Surface power. Final form of the paraxial differential equations 37.5 The general solution is expressed by two principal paraxial rays: axial ray and field ray. Conjugate planes 37 .6 Another definition of the field ray 37.7 Lagrange's invariant. The paraxial rays can be found by quadratures if one paraxial ray is known. Integral expressions for the focal length in terms of paraxial rays 37.8 The equivalent Riccati equation Pages 243-254 243-244 244-245 245-247 247-248 248-250 251 251-253 254 The path of electrons in the neighborhood of the axis of an instrument of revolution 38.1 Expressions for n 2(z,p) if n 2 (z,O) is known. Solution of the equation ~n 2 = 0 with given boundary values n 2(z,O) = f(z) 38.2 Surface power of an equipotential surface. Differential equations for paraxial electrons 38.3 The corresponding Hamilton-Jacobi equation and corresponding Riccati equation 255-257 255-256 256 257 Difference equations for a centered system of refracting surfaces of revolution 39.l Notations and definitions 39.2 The canonical difference equations 39.3 The associated problem of variation 39.4 The solution is expressed by two principal solutions: axial ray and field ray 39.5 Lagrange's invariant 39.6 The paraxial rays can be found by summations if one paraxial ray is known. Expressions for the focal length with the aid of invariant sums 39.7 The Gaussian quantities for a single lens 39.8 First order design of optical instruments 257-268 257-258 258-261 261 261-263 263-264 264-265 265-266 266-268 CONTENTS xxiii CHAPTER V THE THIRD ORDER ABERRATIONS IN SYSTEMS OF ROTATIONAL SYMMETRY Formulation of the problem Pages §40. General types of third order aberrations 269-281 40 .1 General expressions for the aberrations Axi, Ay1; Ap 0 , Aq 0 derived from the mixed characteristic W 40.2 Expressions for the third order aberrations Axi, Ay1; Ap 0 , Aq 0 obtained by developing W to the fourth order. Aberrations of meridional rays 40.3 Spherical aberrations, Coma, Astigmatism, Curvature of field, Distortion 40.4 The combined effect of third order aberrations 40.5 Dependence of the aberrations on the position of the diaphragm 269-270 271-272 273-276 277 277-281 § 41. The third order coefficients as functions of the position of object and pupil plane 281-286 41.1 The rays are determined by the intersection with two pairs of conjugate planes. Object and Image plane, Entrance and Exit pupil plane 281-282 41.2 General form of aberrations Axi, A~ 1 of meridional rays 282-283 41.3 Expression for the image aberrations Ax1 derived from the angular characteristic 283-284 41.4 Expression for the aberrations A~ 1 of the pupil plane 284-285 41.5 Simplified expressions for Ax1 and Ab 285 41.6 The case of the exit pupil plane at infinity_. A,B,C,E as functions of the magnification M of the object plane. Impossibility of correcting an optical instrument for all pairs of conjugate planes 285-286 §42. Integral expressions for the third order coefficients 287-299 42.1 Formulation of the problem 287-288 42.2 Development of the Hamiltonian function H(U,V ,z). Differential equations for the third order polynomials X ,Y P Q Corresponding ; . 3 3 3 3 boundary value problem 288-290 xxiv §43. CONTENTS 42.3 Integral expressions for Ax1 = X3 and Ay1 = Y 3 42.4 Integral expressions for the coefficients A,B,C,D,E 42.5 The quantities H ik in general 42.6 Petzval's theorem 42.7 The quantities H lk in the case of an electrostatic field 42.8 The case of spherical surfaces of refraction 42.9 Third order coefficients in the case of a finite number of spherical surfaces of refraction Pages 290-291 292-293 293-294 294-295 295-296 296-297 297-299 Chromatic Aberrations 299-304 43.1 The index of refraction varies with the wave length of light 299 43.2 Chromatic aberrations in the realm of Gaussian optics 299-301 43.3 Integral expressions for Axial Color and Lateral Color 301 43.4 Simplification of these expressions. Dispersion. Abbe's v-value. Summations formulae in the case of a finite number of refracting surfaces 301-304 43.5 "Chromatic" Aberrations in electron optics 304 CHAPTER VI DIFFRACTION THEORY OF OPTICAL INSTRUMENTS §44. Formulation of the problem 305-311 44.1 Differential equations for periodic electromagnetic fields 305 44.2 Radiation of a periodic dipole in a homogenous medium. Boundary values at infinity 305-306 44.3 The case of a nonhomogeneous medium. Directly transmitted waves 306-307 44.4 Mathematical form of the transmitted wave. The approximation of geometrical optics 307-309 44.5 Virtual extension of the transmitted wave in the image space 309-310 44.6 The transmitted wave in the image space is characterized by its boundary values at infinity and by the condition of regularity in the whole image space 310-311 CONTENTS § 45. The boundary value problem of the equation .6.u + k 2u = 0 for a plane boundary 45.1 45.2 45.3 45.4 45.5 45.6 45. 7 Formulation of the problem Integral representation of the solution Proof that the above integral satisfies the differential equation Proof that the boundary values are attained Completion of the proof The conditions at infinity The corresponding boundary value problem for Maxwell's equations § 46. Diffraction of converging spherical waves 46.1 The corresponding boundary values at infinity 46.2 The problem is solved by applying the integral formula of §45.2. Debye's integral. The associated electromagnetic field § 47. Diffraction of imperfect spherical waves 47.1 Boundary values at infinity 47 .2 Solution of the problem with the aid of §45.2. The electromagnetic wave is represented by integrals over a wave front. Introduction of Hamilton's mixed characteristic 47.3 Invariance of these integrals with respect to the choice of the wave front 47.4 Parametric representation of the wave fronts 47 .5 Simplified form of the general diffraction integrals §48. Diffraction of unpolarized light 48 .1 Introduction of the vector m(p,q) which determines the polarization of the wave at infinity 48.2 Linear polarization at infinity 48.3 Representation of the wave with the aid of the operator 'v = -27AT. .l grad. 48.4 Average electric energy of an unpolarized wave 48.5 Average magnetic energy of an unpolarized wave 48.6 Average flux vector of an unpolarized wave 48.7 Summary of the results for the case of unpolarized waves xxv Pages 311-320 311-313 313-316 316-317 317 318 318-319 319-320 321-324 321-322 322-324 324-333 324-326 326-329 329-330 330-331 331-333 333-339 333-334 334 334-335 335-336 336-337 337-338 338-339 xxvi §49. §50. §51. CONTENTS Diffraction patterns for different types of aberrations 49.1 Spherical aberration. Limit of resolution. Tolerance for spherical aberration. Raleigh limit 49.2 Coma 49.3 Astigmatism Pages 339-344 339-342 342 343-344 Resolution of two luminous points of equal intensity 50.1 Interpretation of F(x,y ,z) as Fourier integral 50.2 Diffraction integrals in the case of rotational symmetry 50.3 Absolute limit of resolution of two point sources. Case of coherence and incoherence 50.4 Inequality for I F(r) I2 . The normal pattern has the greatest value of the central maximum I F(0) 12 for a given total energy 50.5 Solution of the maximum problem: To find the diffraction pattern F of given total energy such that for a given a we have F(a) = 0 and such that the central maximum F(0) is as great as possible 50.6 Solution of the maximum problem: To find the diffraction pattern F of given total energy and given distance of the inflection points such that the central maximum F(0) is as great as possible 50. 7 The maximum problem: To find the diffraction pattern F of given total energy such that the energy in a given circle is as great as possible 344-353 344-345 345-346 346-348 348-349 349-351 351-352 352-353 Resolution of objects of periodical structure 354-359 51.1 Resolution of more than two points at equal distances 354 51.2 Light distribution in the image of a self- luminous object which has a given distribution U0 354 51.3 The light distribution in the image of, a non-self- luminous object 354-355 51.4 The light distributions of object and image are developed into Fourier series. Relation of the two series in the case of self luminous objects of periodic structure 355-356 CONTENTS 51.5 Limit of resolution for self luminous objects of periodic structure. Impossibility of decreasing this limit by coating the aperture with thin films. Appearance of detail which ' approaches the limit of resolution 51.6 Limit of resolution for non-self-luminous objects of periodic structure xxvii Pages 356-358 358-359 Appendix I Vector analysis: Definitions and theorems I.I Notation and definitions I.2 Vector identities I.3 Vector fields I.4 Vector identities: div fA, curl fA, curl(f curl A), div (A x B), curl (A x B), A x curl B + B x curl A 360-367 360-361 361-362 362-364 364-367 Appendix II Tracing of light rays in a system of plane reflecting or refracting surfaces II.I Vector form of the laws of reflection and refraction II.2 Vector recursion formulae in a system of plane surfaces II.3 Representation of the direction ·of the final ray with the aid of certain scalar quantities P;. Recursion formulae for P; II.4 The special case that all surfaces are reflectors II.5 Example of three mirrors at right angles to each other II.6 Example of a 90° roof prism II.7 Influence of the roof angle on the doubling of the image 368-372 368 368-369 369-370 370 370 370-371 371-372 xxviii CONTENTS SUPPLEMENTARY NOTES NOTE I ELECTRON OPTICS N. Chako and A. A. Blank Introduction § 1. The equation of movement 1.1 Notations and Definitions: The electromagnetic field. Force on the electron. Kinetic energy T of the electron 1.2 The generalized potential of the field. Existence of a Lagrangian kinetic potential L(x;,X;) for the electron. Explicit form of L 1.3 The corresponding problem of variation 1.4 Statement of conservation of energy Pages 373-410 373 374-377 374-375 375 376 376-377 § 2. The associated Fermat problem 377-383 2.1 Problem of finding the path of a particle through two given points if the energy of the particle is given. The corresponding problem of variation 377-378 2.2 Transformation of this problem of variation 378-379 2.3 Proof that the new integral is homogeneous in x; of order 1. Formulation of the general rule of finding the associated Fermat problem in mechanical systems where L is independent of t. 379-380 2.4 Example: L = ~gik X;Xk - U(x1) 380-381 2.5 The case of electron optics. Derivation of the "index of refraction" for electromagnetic fields. 381-382 2.6 Analogy to crystal optics. 382-383 § 3. The canonical equations of electron optics 383-384 3.1 The variable z as a parameter 3.2 Derivation of the Hamiltonian H(x,y ,z,p,q) for electron optics. The canonical equations for the functions x(z), y(z), p(z), q(z) 383 383-384 § 4. Electromagnetic fields of rotational symmetry 4.1 The electric potential cf> and the magnetic potential A in this case 4.2 Special form of H and of the canonical equations. The invariant xq - yp 384-388 384-385 385-386 CONTENTS 4.3 The potential cp for fields free of space charges. Integral formula and power series for cp(P,z) 4.4 Integral formula and power series for a(p,z) § 5. First order optics in systems of rotational symmetry 5.1 The paraxial canonical equations 5.2 Simplified form of the paraxial equation for the complex functions X,P 5.3 The coefficients D, n, w 5.4 Solution of the equations with the aid of axial ray and field ray 5,5 Conjugate planes. Rotation of the image by the magnetic field § 6. The Gaussian constants of an electron optical instrument 6.1 Equivalent focal length. The two focal points of an instrument. Newton's Lens equation 6.2 The relations between the ray coordinates at two planes z = 0 and z = P. 6.3 Integral equations for X and P. Solution by iteration. Explicit formulae for F and the position of the unit points and focal points of a system 6.4 Approximate formulae for short systems § 7. Third order electron optics in systems of rotational symmetry 7.1 Formulation of the problem for x,y,p,q 7.2 Corresponding problem for X,P 7.3 Differential equations for the 3rd order polynomials 7.4 Solution of these equations. Integral expressions for the aberrations Ai;, A7J 7.5 The third order polynomials. Integral expressions for the coefficients A,B,C,D,E;~,y,o,E 7.6 Explicit expressions for the quantities Hik (z) 7.7 Analog of Petzval's theorem xxix Pages 386-387 387-388 388-393 388-389 389-390 390-391 391-392 392-393 394-398 394 394-395 395-397 398 398-407 398-399 399-400 400-402 402-403 403-405 405-406 406-407 XXX § 8. CONTENTS Physical Discussion of the third order aberrations of an electron optical instrument 8.1 Spherical aberration 8.2 Coma 8.3 Astigmatism 8.4 Distortion Pages 407-410 407-408 408-409 409 409-410 NOTE II OPTICAL QUALITIES OF GLASS M. Herzberger 411-431 NOTE ID MATHEMATICS AND GEOMETRICAL OPTICS M. Herzberger 432-439 NOTE IV SYMMETRY AND ASYMMETRY IN OPTICAL IMAGES 440-448 M. Herzberger CHAPTER 1 WAVE OPTICS AND GEOMETRICAL OPTICS In this course we shall be concerned with the propagation of light in a transparent medium. We shall not consider absorbing media or non-isotropic media, such as metals or crystals; but we will allow the medium to be nonhomogeneous. The optical properties of such a medium can be characterized by a scalar function n = n(x, y, z) , the refractive index of the medium. In ordinary optical instruments this function is sectionally constant and discontinuous on certain surfaces. The mathematical treatrnen11 of the propagation of light can be based on two theories: The wave theory of light (Physical Optics) and the theory of light rays (Geometrical Optics), Both theories seem to be fundamentally different and can be developed independent of each other. Actually, however, they are intimately connected. Both points of view are needed, even in problems of practical optical design. The design of an optical objective is carried out in general on the basis of Geometrical Optics, but for the interpretation or prediction of the performance of the objective it becomes necessary to investigate the propagation of waves through the lens system. In view of this fact, these theories will be developed simultaneously. The wave theory is considered as the general theory, and Geometrical Optics will be shown to be that special part of the wave theory which describes the propagation of light signals, i.e., of sudden discontinuities. On the other hand, in the important case of periodic waves, it represents an approximate solution of the differential equations of wave optics. This approximate solution can be used in a method of successive approximation to develop the diffraction theory of optical instruments, as will be shown in the later parts of this course. §1. THE ELECTROMAGNETIC EQUATIONS. 1.1 The wave optical part of this course is based upon Maxwell's electromagnetic theory of light. The phenomenon of light is identified with an electromagnetic field. 2 MATHEMATICAL THEORY OF OPTICS The location in space is determined by three coordinates x, y, z, the unit of length being 1 cm. The time is determined by the coordinate t; the unit of this variable being 1 sec. The electromagnetic field is represented by two vectors: the electric vector: the magnetic vector: E(x,y ,z,t) H(x,y,z,t) (Ei,E 2,E 3) , (H1,H2 ,H3) . The components, (E 1,E 2,E 3), of the electric vector are functions of x,y,z,t; the unit of these components is 1 electrostatic unit of E. The unit of the components (Hi, H 2, H3) of the magnetic vector is 1 electromagnetic unit of H. The properties of the medium can be characterized by two scalar functions of x,y ,z (the medium thus is assumed not to change with the time): the dielectric constant: E = E (x,y,z) , the magnetic permeability µ µ(x,y,z) . (1.11) 1.2 The electromagnetic vectors satisfy a system of partial differential equations which, with the above choice of units, assumes the form: curl H - i Et = 0 , C curl E + I:!:. Ht = 0 . C (1.20) The constant c is the velocity of light, in our units numerically equal t.o C = 3 • 1010 • If the components of E and H are introduced, the above vector equations yield a system of six linear differential equations of first order: aH3 aH 2 E aE1 ay az C at 0 aH1 aH 3 E a E 2 0 az ax C at a E3 ay -a E-2 + az µ C aH1 at = 0 a E1 aE3 µ aH 2 az ax· + C at 0 , (1.21) aH 2 aH1 ax ay E a E3 C at 0 aE 2 aE1 + !!:. aH 3 ax ay C at 0 WAVE OPTICS AND GEOMETRICAL OPTICS 3 In the group of optical problems to be considered in this course, we can assume µ = 1, since our medium (glass) is not magnetic. The dielectric constant, E = E(x,y,z), will be replaced by the index of refraction of the sub- stance, according to the equation (1.22) This relation between two different properties of a medium is actually far from being satisfied by the substances we are mainly interested in. However, experience shows that the predictions of the electromagnetic theory are in excellent agreement with observation if in theoretical results the quantity -fE is replaced by the index of refraction, measured by optical methods. Furthermore it is possible to give a satisfactory explanation of the above discrepancy by molecular considerations. We prefer in the following sections to leave Maxwell's equations in the above forms, (1.20) and (1.21). The symmetrical structure of these equations will often allow us to find from one relation another one simply by inter- changing the letters E and µ, and replacing E by -H and H by E. It is customary to add two more equations to the equations (1.20), namely: (1.23) div(µH) = 0, or ...£_ (µHi) ax + ...£_ ay (µH 2 ) + ...£_ az (µH 3 ) = 0. These state that the electromagnetic field does not contain a source of electricity or magnetism. However, these equations are not independent of (1.20). Indeed, since div curl A = 0 for an arbitrary vector field A(x,y ,z,t), it follows 88t (div EE) = 88t. (div µH) = o , (1.24) i.e., both div(E E) and div(µH) are identically zero if they are zero at any particular time. 1.3 Energy. If we· form the scalar product of E with the first of the equations (1.20) and of H with the second one and subtract both results we obtain 1 E •curl H - H •curl E - - (EE• Et + µH, Ht) 0 . C (1.30) On account of the identity H·curl E - E·curl H div(E x H) (1.31) 4 this gives MATHEMATICAL THEORY OF OPTICS c div(E x H) + _! _Q_ (EE 2 + µH 2) 2 at 0 or (1.32) div 4c,r(E x H) + _£_ at _!_(EE 2 + µH 2) B1r 0. The function (1.33) W(x,y,z,t) (1.34) measures the distribution of electromagnetic energy in the field. It determines the light density in Optics. The vector S(x,y,z,t) = 4C,r (E x H) (1.35) is called Poynting's radiation vector, and the relation between W and S is given by the equation aw + div s = o at • (1.36) Let us integrate this equation over a domain D of the x,y ,z space enclosed by a closed surface r. From Gauss' integral theorem: aat fff W dx dy dz + ff Sv do = O, D r (1.37) Sv being the normal component of S on r. The first integral represents the change of the total energy of the domain D per unit time. The surface integral thus gives the amount of energy which has left the domain D through the surface. Hence we interpret the vector field S = -4C(,rExH) as the vector field (or better, tensor field) of energy flux. Let do be the area of a surface element at a point x,y,z, and N a unit vector normal to it. Then the energy flux through this surface element is given by WAVE OPTICS AND GEOMETRICAL OPTICS 5 where SN = S, N is the normal component of the Poynting vector, S. In Optics, the energy flux per unit area is called the illumination of the surface element. We have (1.38) 1.4 Boundary Conditions. The vector functions E and H are of course not uniquely determined by the differential equations (1.20), unless certain boundary conditions are added. For optical problems, the following problem types are significant: 1. To find a solution of the equations (1.20), i.e., two vector fields, E(x,y,z,t) and H(x,y,z,t), if the electromagnetic field E(x,y,z,0) and H(x,y,z,0), at the time t = 0 is given and satisfies at this time the conditions div(E E) = div(µH) = 0. 2. Let us assume that on the plane z = 0, the electromagnetic field is a known function of x,y and t when t > 0, and that certain homogeneous boundary conditions are satisfied on another plane, z = L; i.e., E = E(x,y,0,t) given for t > 0 and, for example, E(x,y ,L,t) = 0 on z = L (Figure 1). z Let furthermore E = H = 0 for t = 0. To find a solution, E and H, in the half- space z > 0 which satisfies these L boundary conditions. 3. Of greater practical importance is the case for which the electromagnetic field is established under the influence of Figure 1 a periodic oscillator. Let us assume that an electric dipole is oscillating at a given point in space, for example, in front of an optical objective (Figure 2). Under the influence of this point source, an electric field is established which represents the light wave which travels through the objective. The problem is to determine these forced vibrations of the space as solutions of Maxwell's equations. §2. PERIODIC FIELDS. 2,1 We can expect that the electromagnetic field which in the end is established by a point source periodic in the time, will be periodic in time itself and, that its frequency equals the frequency of the oscillator. On the 6 MATHEMATICAL THEORY OF OPTICS basis of this expectation, one is led to consider special solutions of Maxwell's equations which have the form E u(x,y,z)e- iwt H v(x,y,z)e- lwt , (2.11) Figure 2 where u and v are vectors independent of t. The quantity w 271" (2.12) is the frequency of the oscillator and 11. the wave length. 2.2 The above complex notation is chosen on account of its mathematical advantages. The vectors u and v are in general complex vectors u = a + ia*, V = b + ib*' i.e., complex combinations of real vectors a, a* and b, b*. Calculations involving these complex vectors can be carried out in the same way as those involving real vectors only, when i is considered a scalar quantity, with i 2 = -1. For example: Scalar product: u-v = (a•b - a*-b*) + i(a•b* + a*•b). (2.21) Vector product: u xv = (a x b - a* x b*) + i(a x b* + a* x b). (2.22) The absolute value of a complex vector: u-u = a 2 + (a*) 2 • Two complex vectors u and v are called orthogonal if U•V = u·v (2.23) 0, i.e., if a-b + a *·b* = 0 , ab* - a*·b = 0 . (2.24) WAVE OPTICS AND GEOMETRICAL OPTICS 7 If two complex vectors E and H satisfy Maxwell's equations then both the real and imaginary parts of E and H are solutions. The real parts of the vectors (2.11), for example, are given by E acoswt+a*sinwt, H b cos wt + b* sin wt , (2.25) and will be considered in the following as representing the electromagnetic field. 2.3 We now introduce the expressions (2.11) into Maxwell's equations. This yields curl v + -iw E u O , C curl u - ~ iw µ v = 0 , (2.31) i.e., a system of partial differential equations without the time variable. By introducing the constant k w C (2.32) we obtain curl v + ik E u 0 , curl u - ik µ v 0 . (2.33) It follows that div(Eu) = div(µv) = 0 (2.34) so that it is unnecessary to add these conditions explicitly, as in (1.23). 2A Energy. The period, T = 271" = ~ , of the functions (2.11) is so W C extremely short in optical problems that we are unable to observe the actual fluctuation of the electromagnetic field. Indeed, in case of sodium light, for example, we have 11. = 0.6 x 10-4 cm., hence T 2 x 10-15 sec. 8 MATHEMATICAL THEORY OF OPTICS The same is true for the extremely rapid fluctuations of the light density W(x,y,z,t) = 8~ [E(acoswt + a*sinwt) 2 +µ(bcoswt+b*sinwt) 2]. (2.41) We are, however, able to observe the average value of this energy, which is given by the integral (2.42) We can express this result in terms of the original complex vectors u and v and obtain -J -W = -l611r rL.E - U • U + µ V • V (2.43) as an expression for the observable light density at the point x,y ,z. 2.5 Flux. Similar considerations may be applied to the Poynting vector, S. By introducing the expressions (2.25) into the definition of S, (1.36), it follows that S = :'Ir (a cos wt + a*sin wt) x (b cos wt + b*sin wt) (2.51) which is also a periodic function with the small period, T. Again, only the average value, 1 -S=T1- rt, TSdt ' can be considered as physically significant. We obtain -S(x,y,z) = SC1r (a x b + a* x b*) , (2.52) or in terms of the complex vectors, u and v, S(x,y,z) = C 1671" (u x -v + -u x v) . (2.53) We can show that the vector field, S, of average flux is a solenoidal field, i.e., div S = 0. WAVE OPTICS AND GEOMETRICAL OPTICS 9 For the complex vectors u and v satisfy the equations curl v + ikE u 0' curl u - ikµv 0. (2.54) The conjugate complex vectors u and v, consequently, satisfy curl v - ikEu 0 ' curl u + ikµv 0. (2.55) It follows that ii curl v - v curl u + ik (Eu·ii ~ µv•v) 0' v curl u - u curl v + ik (Eu •iI - µv •v) 0 ' (2.56) or div (u X v) + ik (€ u. u - µv •V) 0 ' div (ii x v) - ik (EU • ii - µv •v) 0. Hence div (u xv + u x v) = O; i.e., div S O. (2.57) 2.6 Polarization. The vectors E and H given by (2.25) describe certain closed curves in space. The type of these curves determines the state of polarization of the wave at the point x,y,z, and this again represents an observable characteristic of the field. In general, the electric vector is considered as the vector which gives the polarization of the light. We have E = a cos wt + a* sin wt, or in components E1 a1 cos wt + a1* sin wt , E 2 = a 2cos wt + a 2* sin wt, E 3 a 3cos wt + a 3* sin wt . (2.61) The curve described by E is plane since E is a linear combination of the vectors a and a*. We can show easily that this curve is an ellipse. Let us introduce ~ = cos wt and 7) = sin wt. By squaring the components of E we find 10 MATHEMATICAL THEORY OF OPTICS These equations, together with the relation ~2 + 112 = 1 represent four linear equations for the three quantities ~2, 11 2, and 2~7). Their determinant thus must be zero: 1 1 1 0 Ei2 ai2 a1*2 a1a1* 0 E 22 a22 a2*2 a2a2* (2.62) Eg2 ag2 a3*2 a3a3* This is an equation of the type AEi2 + BE 22 + CEa2 = D, which means that the curve of E lies on a surface of second order. The intersection curve of a plane and a surface of second order, however, is a conic. It must be an ellipse because it is closed. The equations (2.61) show that the ellipse is symmetrical with respect to the origin, i.e., to the point x,y ,z in question. Thus we can find the length and direction of the axes by determining the extreme lengths of the vector, E, i.e., the extreme values of the quadratic form under the condition ~ 2 + 7J 2 = 1. In other words, the axes are equal to the characteristic values of the above quadratic form and are given by the two solutions, ;>..1 and ;>.. 2, of the quadratic equation la·a - ;>.. a•a* I a•a* a*-a* - ;>.. 0. (2.63) Hence, (2.64) The characteristic values, ;>.. , are real, since the expression under the radical is not negative. The characteristic values cannot be negative; for, with the aid of the inequality (a.a*) 2 ::§ a 2 • (a*) 2 , one can see that WAVE OPTICS AND GEOMETRICAL OPTICS 11 We illustrate three types of polarization: a* a Figure 3. Elliptical polarization. Two different characteristic values, Ai =/- A2 , both different from zero. The electric vector describes an ellipse. The characteristic values are equal, At = A2. This implies a* a-a a*• a*, a.a_'." 0. (2.65) a Figure 4. Circular polarization. The two components a and a* of u are orthogonal and equal in length. The electric vector describes a circle. a a* Figure 5. Linear polarization. The smaller one of the characteristic values, A, is zero. The electric vector describes a straight line. The two vector components of u have the same direction, a x a* = 0. (2.66) We can express these results again by using the complex vector, u directly. The quadratic equation for ll. may be written as follows: ¼ A. 2 - (u •u) A - (u X u) 2 = 0 ; a+ ia*, and this has the solution, (2.67) The ellipticity E of the polarization, i.e., the ratio of the lengths of the axes, is thus given by the expression U • U - v1u) 2 (u) 2 u•u + -ltu) 2 (U:) 2 (2.68) Hence, for linear polarization: for circular polarization: uxu = 0, u2 =u•u=0. (2.69) 12 MATHEMATICAL THEORY OF OPTICS §3. DIFFERENTIAL EQUATIONS FOR E AND H. 3.1 If we eliminate one of the vectors, E or H, from Maxwell's equations, (1.20), we obtain second order equations for either E or H. By differentiation with respect to t: curl Ht - -E Ett O , C curl Et +I!:. Htt 0. C We introduce Ht = - ~ curl E in the first of these equations, and Et C E curl H in the second. The results. are ~ µcurl(~ curl E) + Ett O, ?" E curl (¾ curl H) + H tt = 0 . (3.11) We apply the following vector identity, which holds for an arbitrary scalar function, f(x,y,z), and an arbitrary vector field, A(x,y,z), with continuous derivatives of the second order: curl (f curl A) = -f A A + f grad (div A) + (grad f) x (curl A) , (3.12) where A A = Axx + Ayy + Azz. Equations (3.11) become ? G t) - Ett - A E = (curl E) x grad grad (div E), ? Htt - AH = (curl H) x (e grad¾) - grad (div H) . (3.13) From the second pair of Maxwell's equations (1.23), it follows that i.e., div E defined by div E E E div E + E •grad E 0 ' divµ H µ div H + H • grad µ o,; (3.14) -E •p and div H = -H •q, where the vectors p and q are p 1 grade E q -1 µ gradµ grad (log E ) , grad (log µ) . (3.15) WAVE OPTICS AND GEOMETRICAL OPTICS We introduce and obtain from (3.11) the equations -;;zn2 E tt - 6. E = grad (p • E) + q x curl E , -;;zn2 Htt - 6. H = grad (q • H) + p x curl H . 13 (3.16) (3.17) The vectors p and q and the function n are given by the properties of the medium; they are not independent of each other but are related by the equation 21 (p + q) = grad (log n) . (3.18) In the special case of a homogeneous medium, both p and q are zero, and n = '/Eµ is a constant. The equations (3.17) become -;;z = n2 Ett - 6. E 0 , -n;;2z Rtt - 6. H = 0 . (3.19) Each component of E and H satsifies the ordinary wave equation. The velocity of the waves is given by the quantity v = c/n, which allows us to regard the quantity n = '/Eµ as the index of refraction of the medium, defined by the ratio n = c/v of the velocity of light in a vacuum to the velocity in the medium. In the case of a non-homogeneous medium, a more complicated set of equations is obtained. Since n is now a function of x,y ,z, the six equations, (3.17), no longer yield one equation in each component, for the first order operators on the right sides involve all the components of the vectors in each equation. However, it is still true that the wave velocity, v, is given by the ratio c/n. Indeed, we shall see that for the propagation of a light signal, i.e., a sudden disturbance of the electric field, only the second order terms in (3.17) are significant. These terms lead to a generalized wave equation in which the coefficient, n, is not constant. 3.2 Stratified media. Let us consider as an example the case of a stratified medium, in which the functions E and µ depend only on one variable, 14 MATHEMATICAL THEORY OF OPTICS for instance on z. This case is of considerable practical interest, since the propagation of waves through thin, multilayer films, evaporated on glass, leads to a problem of this type. Let µ = 1 and le = n(z). It follows that p = 2 grad (log n) = (o,o, 2 :•). , Hence q 0. p·E 2 n' E n 3 (3.21) grad (p •E) ( 2 !!..'._ n 8E3. ax ' 2 !!..'._ 8E3. n ay ' 2 _£. az (n' n E \) 3/ - p x curl H = 2 n' n (8H1 az _ 8H 3 ax ' -8aHy-3+ -8aHz-2 ' 0 ) The differential equations (3.17) become n2 a2E1 c 2 at2 -AE1 2n-'-B-E3 n ax n2 a2E2 c 2 at2 -AE 2 = 2n'-8E-3 nay n2 a2E3 c 2 at2 -AE 3 = 2.a£z...(nn' E~3 n2 a2H1 n' 8H1 c 2 at2 -AH1 + 2 n -az- +2nn-' -a8-xH3 + 2n-'-a-H3 nay (3.22) (3.23) We thus obtain two partial differential equations, namely, those for E 3 and H3, in which none of the other components appear. After E 3 and H3 have been determined from these two equations, they are substituted in the remaining equations of (3.23); and these equations then become modified wave equations for E 1, E 2, H1, and H2 modified in the sense that the right side is not zero, but a known function. As a result of this simplification it is possible to find explicit solutions for many problems connected with stratified media, especially with films producing low reflection. WAVE OPTICS AND GEOMETRICAL OPTICS 15 §4. INTEGRAL FORM OF MAXWELL'S EQUATIONS. The functions E (x,y,z) and µ(x,y ,z) are not necessarily continuous functions. We assume, however, that they are sectionally smooth, i.e., every finite domain of the x,y ,z space can be divided into a finite number of parts in which E and µ are continuous and have continuous derivatives. The differential equations (1.20) represent conditions for the electromagnetic field in every part of the space where E, µ, and E, H are continuous and have continuous derivatives. They are, however, not sufficient to establish conditions for the boundary values of E and H on a surface of discontinuity. This is the reason why it is advantageous to replace the differential equations (1.20) by certain integral relations. These integral equations are equivalent to the differential equations if E, µ, and E, H are continuous and have continuous derivatives. They are more general, on the other hand, since they apply equally well to the case of discontinuous functions E, µ; E, H and establish definite conditions for the electromagnetic field in this case. 4.1 Let us consider, in the four-dimensional x,y,z,t space,. a domain D which is bounded by a closed three-dimensional hypersurface r. We assume that the hypersurface r consists of a finite number of sections in which the outside normal N of the hypersurface varies continuously. This normal N is a unit vector in the x,y,z,t space given by A. if the surface r is represented by the equation
0 the vectors E
and H are different from zero in a
larger sphere of radius o + ct. In
other words we expect that the surface
which separates the parts of the space
which are still at rest from those
penetrated by the original impulse
Figure 8
travels over the space. A surface of
this type is called a wavefront. In the
above example the wave fronts are
spherical and given by the equation
cp(x,y,z,t) =✓x2 + y 2 + z 2 - cS - ct = 0.
(7 .11)
If the boundary values of E(x,y,z,0) or H(x,y,z,0) on the original sphere
of radius o are different from zero then this sphere is a surface on which the
electromagnetic field is discontinuous. We must expect that at the time t > 0
22
MATHEMATICAL THEORY OF OPTICS
the corresponding boundary values on the wavefront (7 .11) are likewise different from zero so that the electromagnetic field is also discontinuous on the new wave front. This consideration leads us to define a wave front more generally as any surface in the x,y ,z space on which, at a given time t, the electromagnetic field is discontinuous.
An observer at a point x,y ,z will interpret such a discontinuity as a sudden signal which reaches him when the wave front goes through the point x,y,z.
Instead of illustrating the equation (7 .11) by a set of surfaces in the threedimensional x,y ,z space depending on the parameter t, we can interpret such
a relation rp(x,y ,z,t) = 0 as a hypersurface in the four-dimensional space
x,y,z,t. In our example this hypersurface is the cone
and the electromagnetic vectors are discontinuous on this cone. Its "contour
lines", i.e., the cross sections of the hypercone, rp(x,y,z,t) = 0, with the
hyperplanes t = const., then represent the above set of wave fronts in the
x,y,z, space.
of rp
7.2 We may expect from
the above example that the
hypersurfaces rp = 0 which
determine the propagation of
discontinuities are not arbitrary
but must fulfill certain con-
ditions. We can derive these
conditions easily with the aid
of the general relations (5.41).
Let us assume that rp(x,y,z,t) =
0 represents a hypersurface on
which the vectors E and H are
Figure 9
discontinuous. The functions
E (x,y ,z) and µ(x,y ,z) shall be
continuous in the neighborhood
0. We introduce, on rp = 0, the vectors
u [El
V = [HJ
E2 E1' H2 - H1'
(7 .21)
which measure the discontinuity of E and H on rp 0. It follows from (5.41):
gradrpxV- i rpt u C
0.
'
U • grad rp
0. '
grad rp x U + I!:.