zotero-db/storage/HQNEX47Y/.zotero-ft-cache

518 lines
83 KiB
Plaintext

JOHN EARMAN and CLARK GL YMOUR
LOST IN THE TENSORS: EINSTEIN'S STRUGGLES WITH COVARIANCE PRINCIPLES 1912-1916"
Introduction
IN 1912 E i n s t e i n b e g a n to devote a m a j o r p o r t i o n o f his time a n d energy to an attempt to construct a relativistic theory of gravitation. A strong intimation of the struggle that lay ahead is contained in a letter to Arnold Sommerfeld dated October 29, 1912:
At the moment I am working solely on the problem of gravitation and believe 1 will be able to overcome all difficulties with the help of a local, friendly mathematician. But one thing is certain, that I have never worked so hard in my life, and that I have been injected with a great awe of mathematics, which in my naivet~ until now I only viewed as a pure luxury in its subtler forms! Compared to this problem the original theory of relativity is mere child's play.'
Einstein's letter contained only a perfunctory reply to a query from Sommerfeld
about the Debye-Born theory of specific heats. Obviously disappointed, Som-
merfeld wrote to Hilbert: 'My letter to Einstein was in vain . . . Einstein is
evidently so deeply mired in gravitation that he is deaf to everything else?
Sommerfeld's words were more prophetic than he could possibly have known;
the next three years were to see Einstein deeply mired in gravitation, sometimes
seemingly hopelessly so.
In large measure, Einstein's struggle resulted from his use and his misuse,
his understanding and his misunderstanding of the nature and implications of
covariance principles. In brief, considerations of general covariance were bound
up with Einstein's motive for seeking a 'generalized' theory of relativity; mis-
understandings about the meaning and implementation of this motivation
threatened to wreck the search; and in the end, the desire for general covariance
helped to bring Einstein back onto the track which led to what we now recognize
*Present address c/o Department of Philosophy, Universityof Minnesota, Minneapolis, Minn, U.S.A. and Department of Philosophy, The University of Illinois at Chicago Circle, Chicago, 111. 60680, U.S.A. We are grateful to Dr. Otto Nathan, Trustee of the Einstein Estate, for permission to quote from Einstein's correspondence. We also want to thank the anonymous referee for numerous improvements in an earlier draft of this paper.
'A. Hermann (ed.), Albert Einstein/Arnold Sommerfeld Briefwechsel (Basel: Schabe and Co., 1968), p. 26. The 'friendly mathematician' was Marcel Grossmann; various aspects of the EinsteinGrossmann collaboration will be discussed below. Our translations of the Einstein-Sommerfeld correspondence are taken from Helga and Roger Stuewer's translation of this book. We are grateful to them for their permission to use their unpublished translation. 21bid. p. 27.
Stud. Hist. Phil. Sci., Vol. 9 (1978), No. 4. pp. 251 - 2 7 8 . © Pergamon Press Ltd. Printed in Great Britain.
0039-3681/78/1201-0251$02.00/0
251
252
Studies in History and Philosophy o f Science
as the general theory of relativity. The purpose of this paper is to trace the influence which Einstein's changing attitudes towards covariance had on the emergence of the general theory. Though the task is basically an historical one, it is inextricably bound up with a number of issues in the foundations of physics which are still subject to lively discussion. We make no apology for our forays into these areas, for no detour around them can lead to any real appreciation of Einstein's struggles.
1. Einstein's Desire for a 'Generalized' Theory of Relativity
Einstein's motivation for the special theory of relativity derived mainly from his concern with the special principle of relativity and not with the aether hypothesis or the constitution of matter, the problems which occupied Lorentz and Poincar6. 3 To a significant degree, a similar concern with an extended version of the principle of relativity motivated the search for a theory of gravitation.
The reasons for Einstein's desire to generalize relativity, and the importance of covariance to that project, are to be found in Einstein's earliest writings on gravitation. In 1907 Einstein published a survey and discussion of experimental and theoretical work on relativity. The fifth and final part of this essay contains a discussion of the connection between gravitation and the principle of relativity, and it begins with a question:
We have thus far taken the Principle of Relativity -- the principle of the independence of natural laws from the state of motion of reference frames - - only for reference frames free of acceleration. Can it be thought that the Principle of Relativity also holds for systems which are accelerated with respect to one another?'
Einstein's answer is in effect negative. The remainder of the section develops the principle of equivalence for the first time, and explores some of its implications. This principle was the rock upon which Einstein built his gravitation theory: in nine years of work on gravitation between 1907 and 1916, the principle of equivalence was, above all, the idea Einstein refused to give up, and yet in his very first statement of the principle he finds it inconsistent with the constancy of the velocity of light and with any generalization of the principle of relativity. The velocity of light cannot be the same in accelerated and unaccelerated frames; the laws in an unaccelerated frame may have to incorporate a homogeneous gravitational field which would vanish in an accelerated frame.
Einstein published nothing more on gravitation for three years, and when in
3SeeT. Hirosige, 'The Ether Problem, the MechanisticWorldview,and the Originsof the Theory of Relativity', Historical Studies in the Physical Sciences, Vol. 7 (Princeton: Princeton University
Press, 1976),pp. 3-82. 4A. Einstein, 'lQber das Relativit~itsprinzip und die aus demselben gezogenen Folgerungen',
J~hrbuch der Radioaktivit~t und Elekronik 4 (1907), p. 454.
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 253
1911 his next paper ~on the subject appeared, it went little beyond his 1907 discussion in reconciling the principles of equivalence and relativity. Einstein did, however, suggest a sense in which his ideas on gravitation generalized the theory of relativity. For if an unaccelerated frame of reference subject to a homogeneous gravitational field is physically equivalent to an accelerated frame, then, at least when there is gravitation, it seemed to Einstein that there is no such thing as absolute acceleration. Just as the special theory eliminates absolute velocity, its generalization through the principle of equivalence eliminates absolute acceleration. This way of viewing the relation between relativity and the principle of equivalence was bound to lead Einstein to try to embody certain of Mach's ideas within gravitational theory, and Machian themes were to emerge explicitly in Einstein's 1913 theory. But in 191 l, the principle of relativity - - meaning the invariance of the form of physical laws in all appropriate reference frames -and the principle of equivalence were still at odds. Indeed, as Einstein made his ideas on gravitation more explicit, the contradiction seemed more and more irreconcilable. Einstein's 1911 and 1912 papers on gravitation develop a scalar theory of static gravitational fields in which the velocity of light plays the role of the gravitational potential? Accordingly, the theory could not be invariant under Lorentz transformations. The principle of equivalence was in outright contradiction with the special theory of relativity, and no generalization seemed at hand which would save the principles of the latter. When in 1912 Einstein undertook to master the tensor calculus with the help of his friend Marcel Grossmann, it may very well have been exactly because the new mathematical apparatus appeared to offer a means to save gravitational theory, the principle of equivalence and the principle of relativity all at once. The 'Absolute Differential Calculus' would permit the representation of quantities as geometrical objects, and would further permit the statement of relations between such quantities in such a way that they would hold in every frame of reference if they held in any. The principle of covariance would thus bring with it a generalization of the principle of relativity, and the question Einstein asked in 1907 would be answered in the affirmative.
Einstein's belief that the final form of his gravitational theory, as it emerged in late 1915 and early 1916, embodied a general principle of relativity is stated explicitly:
The general laws of nature are to be expressed by equations which hold good for all systems o f coordinates, that is, are covariant with respect to any substitutions whatever (generally covarianO.
It is clear that a physical theory which satisfies this postulate will also be suitable for a general postulate of relativity. For the sum of all substitutions in any case
SA. Einstein, 'tiber den Einfluss der Schwerkraft auf die Ausbreitungdes Lichtes', Annalen der Physik 35 (1911), 898-908.
6Ref. 5 and 'Lichtgeschwindigkeitund Statik des Gravitationsfeldes', Annalen der Physik 38 (1912), 355-369, and 'Zur Theorie des statischen Gravitationsfelds', ibid. 433-458.
254
Studies in History and Philosophy of Science
includes those which correspond to all relative motions of three dimensional systems of coordinates.., this requirement of general covariance.., takes away from space and time the last remnant of physical objectivity...7
This passage displays at once Einstein's tendency during this period to blur each of three interrelated distinctions: reference frames vs coordinate systems; point transformations vs coordinate transformations; and relativity or invariance principles vs covariance principles. The quoted passage shows that Einstein believed that because the laws of his 'general theory' were generally covariant, they automatically satisfied a general principle of relativity. The reason given for this belief shows the confusion of point and coordinate transformation and of reference and coordinate systems. Sorting out all the strands of this tangle would be a Herculean task, calling for one or more book length monographs, and we do not propose to undertake such a labor here. But a few preliminary remarks about the core of the tangle are needed to set up our discussion of Einstein's struggles with gravitational theory.
If we understand a reference frame to be defined by a congruence of timelike curves (each of which is to be considered the world line of a reference point of the frame) a n d u n d e r s t a n d a c o o r d i n a t e system {x~}, i = 1, 2, 3, 4, to be a d a p t e d to the frame just in case each curve of the congruence satisfies x~ = constant (a = 1, 2, 3), then frames of reference are related by point rather than by coordinate transformations; and the fact that laws are covariant, holding in the coordinates adapted to one frame of reference if they hold in the coordinates adapted to another, signifies by itself nothing about the status of the respective frames? This is but one version of Einstein's practice in this period of seeing in the abolition of coordinate dependence for physical laws also the abolition of space-time structure.
In the sections below we will follow the thread which is most central to Einstein's progress, or lack of it, towards a theory of gravitation between 1912 and 1916; we will concentrate first on the considerations which led Einstein to abandon the requirement of general covariance in 1913, and we will then trace the reasons which led him to reaffirm it at the end of 1915.
2. The Einstein-Grossmann Theory
That Einstein saw in covariance a solution to the tension between his two ~A. Einstein, 'Die Grundlage der allgemeinen Relativit~ttstheorie', Annalen der Physik 49 (1916), 770-822; translation from W. Perrett and G. B. Jeffrey, The Principle o f Relativity (New York: Dover, 1923), p. 117. The italics are Einstein's. 8Of course, a reference frame can be represented by a maximal class of adapted coordinate systems. Any two members of such a class are related by a coordinate transformation of the form x'* = x "° (x~), x'" = x " (x ~, x'). Thus, coordinate transformations having this form can be thought of as giving a recoordinatization of the frame while ones not of this form can be thought of as giving a change of frame. But such a coordinate representation can easily lead to a blurring of the crucial distinctions mentioned above. For more on these matters, see Sections 3-6 below; see also J. Earman, 'Covariance, Invariance, and the Equivalence of Frames', Foundations o f Physics. (1974), 267-289.
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 255
chief principles is evidenced in his contribution to the paper he published jointly with Grossmann in 1913, 'Entwurf einer verallgemeinerten Relativit~itstheorie', the first of his papers in which the apparatus of the tensor calculus is used? After explaining that the principle of equivalence requires that the velocity of light not be constant, Einstein wrote:
With the introduction of a spatial variation in the quantity c we have gone beyond what is presently understood by 'relativity theory'; for the expression of ds is no longer invariant under linear orthogonal coordinate transformations. Suppose now - - what cannot be in doubt - - that the principle of relativity is to be preserved; then we must generalize relativity theory so that the significant elements of the foregoing theory of static gravitational fields are included as a special case. '°
Einstein then introduced an arbitrary coordinate transformation and required that the motion of material points in arbitrary coordinates satisfy
6 f ds = 0
(1)
w h e r e ds 2 = gljd.vdxJ. 11 H e p o i n t e d o u t t h a t t h e go a r e c o m p o n e n t s o f a tensor -- the space-time metric tensor -- and remarked:
In the case of ordinary relativity theory only linear orthogonal substitutions are permissible. It will be shown that it is possible to obtain equations for the effects of gravitational fields on material phenomena which are covariant under arbitrary substitutions. ~2
Einstein's equation of motion for material particles is simply the requirement that their trajectories be geodesics, and so is generally covariant. Einstein and Grossmann were also able to give, as Einstein promised in the passage quoted, a generally covariant equation for the motion of incoherent matter in a gravitational field. But any field theoretic treatment of gravitation requires in addition a field equation which will determine the relation between field sources and the field they generate.
9A. Einstein and M. Grossmann, 'Entwurf einer verallgemeinerten Relativitatstheorie und einer Theorie der Gravitation', Zeitschriftfiir Mathernatik und Physik 62 (1913), 225- 261. Einstein's friendship with Grossman extended back to tl-?ir school days in Ziirich. It was Grossmann's father-in-law who helped Einstein to get a job at the Patent Office in Berne, and it was Grossmann who helped to conduct the negotiations which brought Einstein back to ZUrich as a professor of theoretical physics at the ETH. Further, Grossmann was an expert in non-Euclidean geometry and the tensor calculus - - just the mathematical tools Einstein saw he needed for his work on gravitation. Thus, it was natural, if not inevitable, that Einstein would enter into a scientific
collaboration with Grossmann. ~°'Entwurf', p. 228. ~'For sake of readability, we have modernized Einstein's notation. Latin indices run from 1 to 4
and Greek indices from 1 to 3. The Einstein summation convention on repeated indices is in effect. Ordinary derivatives are denoted by a comma and covariant derivatives by a semi-colon. In 1913 Einstein did not use the now standard convention of upper and lower indices to denote respectively contravariant and covariant components. Thus, he had to use two different symbols for contravariant and covariant tensors, a practice which makes it hard for the modern eye to scan his early
papers on gravitation.
~2'Entwurf', p. 229.
256
Studies in History and Philosophy of Science
Einstein and Grossmann searched for field equations of the form
A~i = KT,.i
(2)
The T~ are the covariant components of the energy tensor for 'matter fields'. Initially, three requirements were imposed on the A~i: (i) they are to be components of a second rank tensor so that (2) is generally convariant, (ii) they are to be constructed from the metric potentials g,i and their derivatives of first and second order, and (iii) they are to be such that in the Newtonian limit, (2) reduces to an analogue of the Poisson equation
V2~ = constant xo
(3)
where q~is the Newtonian scalar potential and 0 is the mass density. As indicated by Grossmann, a natural candidate for Av is the Ricci tensor
Ri~. R~ automatically satisfies (i) and (ii). And it seems promising as regards (iii). Since the g,:~take the place of the Newtonian scalar potential, what is wanted is an expression concocted from the g~ which, in the Newtonian limit, reduces to a sum of second order derviatives of the g,:~. This suggests that one look for a tensor expression which involves a summation over a repeated index. For tensors, this operation amounts to contraction, and R,~ is the only meaningful contraction of the Riemann tensor R~,. Einstein and Grossmann had already recognized that the Riemann tensor plays a fundamental role in gravitational theory since it vanishes if and only if the metric is pseudo-Euclidean. Grossmann, however, rejected this choice for Air. The fact that R~ -R~i~ in itself showed, according to Grossmann's reasoning, that R~ cannot have the right Newtonian limit. Grossman's idea was that as the gravational field vanishes, then so must OR,jk,, since this is a necessary condition for flat space - time and since gravitational field is null only if s p a c e - time is flat. But since R,j is a contraction of R,jk,, if the latter vanishes so must the former. '3
Grossmafi's argument does not show what it was intended to show. It does establish that when the gravitational field vaniches completely, R,:i also vanishes, but the argument does not tell one what happens in some appropriately weak but non-vanishing approximation. Grossmann could have argued validly in the opposite direction; namely, when T,:~vanishes, we should return to the situation of special relativity with its flat space-time. But with the choice of R,j for Av in (2), (2) does not have this consequence since in dimension 4, R~ = 0 does not entail Ri~I = 0. (The entailment does hold for lower dimensions.) While this argument does have the virtue of validity, its leading idea has the physically disastrous consequence that space-time cannot be curved outside mass-energy concentrations; thus, massive bodies would not 'bend' light, the sun would not "attract' the earth, etc.
'3'Entwurf', p. 257.
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 257
The actual situation as regards the Newtonian limit of
R,j = KT,~
(4)
is mathematically straightforward but is somewhat complicated to present because of the number of assumptions needed to guarantee the correct Newtonian limit. We demand first that (A,) the only sources of the gravitational field are massive particles (this is already implicit in the Newtonian equation (3)). Furthermore, we may assume that (A2) the particles are moving slowly in comparison with the speed of light. Applying these assumptions to the contravariant form of (4), it is easily seen that the only component T~'which cannot be neglected to first approximation is 7",, = Q, so the problem reduces to examining the equation
R"" = K0
(5)
In computing the Newtonian limit of R"", we are justified in assuming that (A3)
the space-time metric differs little from the Minkowski metric, i.e., that there
exists a coordinate system in which (5) holds and g~ = A~j + E/3. where A~jis the
Minkowski matrix and t is a sufficiently small positive number. If we finally
assume that (A,) the metric is stationary in the sense that there is a coordinate
system in which (A3) holds and in which OgJOx. = 0, we find after neglecting
all but first order items in ~ that
3
R,, = 1/2Y/3"".,,
(6)
/J=l
Thus, under assumptions (A,)-(A,), the contravariant form of (4) reduces to
3 44
5-/3 ,,, = constant x0
(7)
/a= 1
which is a close an analogue of (2) as could be hoped for.
Although Grossmann's objection to (4) was not sound, there are real problems
involved. In a letter to Besso dated December 10, 1915, Einstein said that he
and Grossmann had rejected (4) not only because of the considerations
mentioned above but also because they believed that the 'conservation law
would not be satisfied'.'" To the modern eye, the problem seems obvious;
namely, the conservation equation
T":j : 0
(8)
does not follow from (4). But this is certainly not the problem which worried Einstein in 1913, for (8) is not a consequence of the field equations in any of Einstein's early theories.
Another problem is that (8) in conjunction with (4) entails that the energy scalar T ~ 5-T~ is a constant, a seemingly implausible restriction on matter fields. It is doubtful, however, that Einstein was aware of this consequence in 1913, for Einstein apparently did not know the key identity
"P. Speziali (ed.), Albert Einstein/Michele Besso Correspondance (Paris: Hermann, 1972), p. 60.
258
Studies in History and Philosophy of Science
(R ii- V2g'~R);i = 0
(where R -= ZRII) needed to prove the c o n s e q u e n c e that T = c o n s t a n t . In November of 1915, Einstein returned to the equation (4), and he did not mention this consequence, although he did go through an eleborate argument to achieve the stronger result T = 0.15 A n d when he first proposed the final field equations
R,, = K (T ~-- V2g~iT)
which are equivalent to
R , i - Vzg(iR = KT,i,
(11)
the conservation law (8) was still postulated separately. '"
Assuming that the recollection Einstein recorded in his letter to Besso is
accurate, what probably worried him in 1913,was that, by his lights (8) was
not a proper conservation law. For Einstein believed that an adequate conser-
vation law would have to include the contribution of the energy-momentum
of the gravitational field itself and that the conservation law should be express-
ed in terms of an ordinary rather than a covariant divergence so that it could
be integrated to yield conserved quantities, lr Evidently, Einstein did not see a
way to achieve a c o n s e r v a t i o n law o f the desired f o r m using (4). 1~
The struggle with convariance is evident throughout Einstein and Grossmann's
joint paper. Based, apparently, on Grossman's argument against the Ricci
tensor, Einstein says in his 'Physical Part' of the paper that it has proved im-
possible to find a satisfactory tensor built from the metric tensor and its deriv-
atives up to second order. Einstein does allow that it is possible that there may
be satisfactory third order tensors, but thinks there are no good physical reasons
to expect such an object to be a satisfactory generalization of the Newtonian
potential. In his part of the paper Grossmann concludes that
We rot.st therefore leave open the question as to what extent the general theory of differential tensors linked with the gravitational field is related to the problem of
'SA. Einstein, 'Zur RelativitStstheorie (Nachtrag)', Sitzungsberichte, der KOniglichen Preussischen A kademie der Wissenschaften zu Berlin, 1915, 799 - 801.
'6A. Einstein, 'Die Feldgleichungender Gravitation', Sitzungsberichte, der KOniglichen Preussischen Akademie der Wissenschaften zu Berlin, 1915, 844-847. For more on this point, see our paper, 'Einstein and Hilbert: Two Months in the History of General Relativity', forthcoming in Archive for the History of Exact Sciences.
"See Section 3 below for more on Einstein'sattitude towards conservation laws. 'aE. Zahar has recentlyclaimed that Einstein and Grossmann bad another reason for abandoning the Ricci tensor. He writes: 'both Einstein and Grossmann thought that, given the appropriate boundary conditions, the ten equations R,~ = 0 would uniquely determine the ten functions g(~. This means that we are not at liberty to choose an arbitrary frame of referencebecausethe functions g(~are generally altered by a change in coordinates. Thus it seems that the Relativity Principle is violated', 'Why Did Einstein's Program Succeed Lorentz's?' in C. Howson (ed.), Method and Appraisal in the Physical Sciences (Cambridge: Cambridge University Press, 1976). This account appears to be unfounded. The argument ascribed to Einstein and Grossmann is not given in the 1913 'Entwurf' paper, nor does it appear in any of Einstein's papers between 1913 and 1916. Zahar puts in evidence a letter from Einstein to Sommerfeld dated November 28, 1915; but the text of this letter contains nothing of the argument in question, nor anything which could plausibly be read as implying it (see Einstein/SommerfeM Briefwechsel, pp. 32 - 36).
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 259
the equations of gravitation. Such a connection must exist insofar as the equations allow arbitary substitutions [Le., are generally covariant]; but in this case it seems that differential equations of the second order cannot be obtained. If, on the other hand, it could be established that the equations of gravitation only admit a certain group of transformations, then one could understand why the differential tensors yielded by the general theory would not be acceptable. As mentioned in the physical part [the part written by Einstein], we are as yet not in a position to discuss this question.' ~
Einstein's own approach to the problem of gravitation had made him receptive to doubts about general covariance. As we have seen, his strategy was to seek a relativistic field equation (2) which generalizes the Newtonian equation (3). Within this strategy, a natural tactic is to try to derive Ao by finding a fourdimensional covariant analogue V~ of the classical V2 operator and then to apply 7~ to the metric potentials g,~, which Einstein had already recognized as the relativistic gravitational potentials. And the most obvious try for V~is V~ ( ) = [g,,,( ) ;m];n. However, if one assumes, as Einstein did implicitly in this period, that the affine connection of space-time is compatible with the metric, then V~(g'J) = 0. After considering such results, Einstein was ready to entertain non-covariant generalizations of W. And in fact, the field equations which he
gave in the 'Entwurf' paper are not generally covariant. For later reference, they are
Aij = KT,j
(12)
Aij = llx/--g (gk,, x/--g g'~,,,),k ~ gk"g, rg'",kg~r,m + 1/2gk~g'Jg,r, kgn~,m- IAgiJgk~gn.,,g"~,m
where g = det (g~).20
In this climate, Grossmann's rejection of (4) and his expression of doubts about the possibility o f finding a suitable tensor expression for A,~ provided all that was needed to make the seed of Einstein's own doubts bloom. The
unfortunate harvest was given in an appendix attached to the main paper. The appendix is signed with the initials 'A.E.', and no one who is familiar with
Einstein's style of argument can doubt that it is pure Einstein. Two arguments are given for abandoning the now suspect requirement of general covariance.
But before turning to a discussion of these arguments, we should make a few brief remarks about Einstein's perception of Grossmann's contribution to their
joint effort of 1913. In a letter which has been lost, Sommerfeld apparently advised Einstein to
be cautious in crediting Grossmann with too large a role in the development of the theory of gravitation. Einstein's reply, dated July 15, 1915, was that
lgEntwurf', p. 257. ~*Foreaseof reference,we willsometimescall theseequations the Einstein- Grossmannequations. In all probability,however,they are Einstein's invention.The 'Entwurf' paper consistsof two main sections, a 'MathematicalPart' by Grossmannand a 'Physical Part' by Einstein; the fieldequations are givenin the latter. See also the remarks at the end of this section.
260
Studies in History and Philosophy o f Science
Grossmann will never claim to be codiscoverer. He only helped me orient myself in the mathematical literature, but contributed nothing materially to the results?'
From the point of view of hindsight, Einstein's remarks seem at once overly kind and ungenerous. While Grossmann did help to orient Einstein in the mathematical literature, he also helped to disorient him in its interpretation. On the other hand, Grossmann did alert Einstein to the possible importance of the Ricci tensor, an idea that came to Einstein's rescue at the end of 1915. But writing in July, Einstein was aware of neither his disorientation nor his salvation.
3. Einstein's Arguments Against General Covariance
Nothing is easier for a first-rate mind than to form plausible arguments that what it cannot do cannot be done. In the course of writing the joint paper with Grossmann, Einstein constructed two arguments that seemed to establish that satisfactory generally covariant field equations are impossible. The main argument, whose style (though not its invalidity) is absolutely characteristic of Einstein, is short, simple and lucid. It is also pure hokum.
Einstein's main argument against general covariance is based on the idea that the field equations should satisfy the causality requirement that the distribution of mass-energy uniquely determines the space-time metric and, hence, the gravitational field. Einstein's argument is supposed to show that causality in this form will be violated for generally covariant equations. Towards this end, we are asked to consider a domain D of space-time which is devoid of massenergy. Further, we are asked to imagine two coordinate systems {x~} and {~} which agree outside D but which diverge inside D. The components of the energy-momentum tensor in these two coordinate systems will agree everywhere since inside D we have T 'j -- T 'j -- 0 and outside D we also have T 'j = T 'j because x ~ = x k there. But inside D, g,~ ~: g,j. Assuming general covariance, however, the gi~ m u s t satisfy the field equations if the g,~ do. Hence, Causality fails.
Any student of modern differential geometry will immediately put his finger on the flaw in Einstein's argument: by construction, the g~j and the g~i are simply different coordinate representations of the same intrinsic object g, the metric tensor. There is not the least reason we cannot have equations that remain true under a coordinate transformation that changes the components of the metric tensor but not the components of the stress-energy tensor. Thus, the argument cannot possibly show that g is underdetermined.22
As a matter of mathematical and physical fact, however, there is a good deal of truth in Einstein's conclusion. But a proper understanding of the basis of the kernel of truth reveals that general covariance is in no way impugned. The
2'Einstein/Sommerfeld Briefwechsel, p. 30. 22See B. Hoffmann, 'Einstein and Tensors', Tensor 26 (1972), 157-162.
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 261
mathematics of the situation is straightforward, but the treatment it receives in most standard physics textbooks is still sufficiently non-perspicuous as to make a brief discussion appropriate here. This discussion will also facilitate an understanding of a further evolution of Einstein's argument.
Let us suppose that the field equation for gravitation is local in the sense that if (M, g, T) (where M is a four-dimensional differentiable manifold, g is Lorentz signature metric for M, and T is an energy momentum tensor) is a solution and D C M i s an open subset of M, then (D, gl~,, TIo) is also a solution. (Most of the theories of gravitation which, at one time or another, have been under serious consideration do have this property.) This property allows one to concentrate on the source-free field equations when we are in an Einstein domain D where TI,, = 0. The source free field equation will be represented symbolically as
L~r(g,i) = 0
(13)
To establish Einstein's conclusion, it must be shown that if (13) is generally covariant, then there are two solutions (M, g,) and M, g2) where g, 4=g2. This is easy to do. Let 0 by any diffeomorphism of M onto itself. Then by general covariance, if (M, g) is a solution, so is (M 0*g)where 0*g denotes the 'dragging along' of g by 0. By choosing a map which is not an isometry of g, i.e., 0*g = g, we have a case in point. (Note: general covariance is sufficient for this result; but as will be discussed later, it is not necessary.)
It must be emphasized, however, that this kind of freedom to perform diffeomorphisms is trivial in two senses. First, although g and 0*g may be different metrics, they are of the same type, e.g., if g is flat or pseudo-Euclidean, then so is 0*g. Second, this freedom has nothing to do with general relativity theory per se, for a similar sort of freedom extends to any generally covariant theory as long as all of the quantities of the theory are simultaneously determined via field equations. As a simple illustration, consider the scalar wave equation
O2¢/Oxa + O2¢/Oy 2 + O20/Oz 2- O2¢/Ot 2 = 0 (14)
(c-l)
encountered in special relativity theory. To construct a generally covariant
version, we postulate
Rijk,(g,,,) = 0
(15)
and
gm,~;,, ;- = 0
(16)
Equation (15) says that the metric is flat, so together with the side condition that the manifold is R ' , it specifies that the space-time is a copy of Minkowski space-time. Then in an inertial coordinate system, (16) reduces to the more
262
Studies in History and Philosophy of Science
familiar form given in (14). Obvious, if (R', g, O) solves (15)-(16), then so does ( R ' , 0*g, 0*0) for any diffeomorphism 0 of R 4. Note that (15) makes the space-time metric an 'absolute object' in the sense that for any two solutions ( R ' , g,) and ( R ' , g2) o f (15), there is a diffeomorphism 0 of R ' such that 0*g, = g2. If one treats this absolute object not as being determined by the field equations but as being given once and for all by God, then the freedom to perform diffeomorphisms is curtailed - - then 0 must be an isometry of the fixed metric - - in this case, a Poincar6 (point) transformation. Parallel examples can also be constructed for Newtonian theories, so again nothing hinges on features peculiar to relativistic physics.
In general relativistic theories the freedom to perform diffeomorphisms also occurs in the initial value problem. For the sake of concreteness, consider the source-free initial value problem for the field equation (10-(1 1) o f the usual version of general relativity. An initial value set is a triple (S, h, X) where S is a three manifold, and h and Xare tensor fields on S which are interpreted respectively as the first and second fundamental forms of S (intuitively, S is an instantaneous time slice of space-time, h characterizes the intrinsic spatial geometry of S, and )~determines how S is to be imbedded into space-time). By a solution to the initial value problem is meant an object ((M, g), ~) where (M, g) is a solution to the (source free) field equations and u,J : S ~ M is an embedding such that ~(S) is a spacelike hypersurface of (M, g) whose first and second f u n d a m e n t a l f o r m s are h and )~ respectively. Again, if ((M, g), qJ) is a solution and 0 is a diffeomorphism of M, then ((M,0*g), 0tlJ) is also a solution. If S is given a priori or is somehow fixed observationally, then 0 is constrained to preserve S pointwise, i.e., tgO~-' = id; but for points in M t h a t are not in S, 0 may be chosen arbitrarily. A similar result holds for the initial value problem with sources and for an initial value problem in which the initial data are specified on a finite 'sandwich' of space-time rather than an instantaneous slice. Thus, one cannot hope to have causality in the sense that the laws determine a unique extension of the initial value set; the best one can hope for is uniqueness up to a diffeomorphism. 23
The situation is not quite the same for theories with an absolute space-time structure since this structure can be used to 'line-up' the corresponding solutions. In the example of (15)-(16), we know that for any two solutions (R', g,, 0,) and ( R ' , g2, O2), there is a diffeomorphism 0 of R4 such that 0*g, = g2. If 0*0 1 = O2 on a finite sandwich o f space-time, then 0*0, = O2 everywhere. In this sense, the scaffolding of the absolute space-time background permits one to come closer to a unique projection of the initial data than is possible in the case of general relativity.
23This is the geometric origin of the 'four undetermined functions' in the initial value problem of general relativity. Contrary to the impression sometimes given, these functions do not arise simplybecauseof general covariance; see the discussionbelow.
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 263
It is clear then that it is not general covariance itself which is responsible for the freedom to perform diffeomorphisms in the initial value problem of general relativity; rather, it is the combination of general covariance and the absence of any absolute objects. And further, although this freedom does entail a change in the formulation of the classical principle of Laplacian causality, it does not extract the teeth of the principle.
The discussion so far has admittedly not done complete justice to Einstein's idea that the distribution of mass energy should uniquely determine the metric.
But justice is hard to do here since it is difficult to formulate Einstein's idea in a coherent and precise fashion. In some cases -- for example, when matter exerts a pressure - - T 'j will itself explicitly contain the metric potentials. Even when T 'j does not explicitly depend upon the metric, its interpretation does. For example, the energy density as measured by an observer with four-velocity V is T,~Vi Vj. But the accuracy o f this measure presupposes that V is normalized, which in turn assumes the metric. Still, let us attempt to test Einstein's idea in the following way. Let
Lws(glj, T it) = 0
(17)
represent the field equation with sources. For fixed T, specified somehow or other as a tensor field on a manifold M, can there be more than one g which solves (17)? If (17) is generally covariant, we know that when (M, g, T) is a solution, so is (M, 0*g, 0*T) for any diffeomorphism 0 of M. But by fixing T, 0 is required to be a symmetry of T, i.e., 0*T = T, and in special cases there may be no non-trivial solutions of this condition. Of course, if (17) is local and if T vanishes on some open set D, we return to the situation discussed above. To obtain any additional information we must therefore consider specific instances. For the particular case of the field equations (10)-(11), the free-space solutions include ones which are more than mere diffeomorphic variants of one another -- e.g., they include both flat and curved metrics. A similar result obtains when a 'cosmological constant' is added to the field equations of general relativity.
In addition to his causality argument, Einstein offered a second reason for abandoning general covariance. He had shown that as a consequence of equation (8) and the Einstein-Grossman field equation, a conservation law for the combined matter and gravitational fields holds in the form
[x/--g(T4 + t'/)l, j = 0
(18)
where t~ is the object Einstein interpreted as characterizing the energy content of the gravitational field. In his appendix, Einstein stated that (18) is covariant only for linear coordinate transformations. But once again the reasoning was unsound. I f tj, transforms like a (1, 1) tensor, then in general (18) would be covariant only under linear transformations. But in general t~ transforms like a
264
Studies in History and Philosophy of Science
tensor only for linear coordinate transformations. Since (18) is a consequence of generally covariant equation (8) and the Einstein-Grossmann field equations, its covariance is just as wide as the covariance of the field equations. The point should have been intuitively evident to Einstein. The principle of equivalence implies that it is always possible to 'transform away' the gravitational field at any point; thus, whatever the state of the field, one should always be able to find a coordinate system such that t¢ = 0 at the chosen point. 24 Thus, t{ cannot behave like a tensor under non-linear transformations.
Once Einstein recognized his mistake, he set to work to find the extent to which the Einstein-Grossmann field equations are covariant. This matter will be discussed below in Sec. 5, but before turning to it we will take up two further developments in Einstein's argument against general covariance.
4. Further Developments in Einstein's Arguments Against General Covariance
Einstein had some reasons to be pleased with his new theory of gravitation; it
did incorporate a version of the principle of equivalence, and although it was not generally covariant, he believed it to be covariant under linear transformations and thus to satisfy a restricted principle of relativity. Furthermore, the new theory appeared to make the inertia of a masspoint depend on neighboring masses, and Einstein saw this result as an appealing realization of Mach's ideas. But the theory was not especially well-received by Einstein's contemporaries. The discussion of Einstein's paper at the Viennese Naturforscherversammlung was desultory at best, and dominated by Gustav Mie, who chided Einstein for failing to discuss Mie's own theory, and then proceeded to describe it at length. 2s Einstein wrote to Besso shortly afterwards complaining that
The physicists have a negative attitude towards my work on gravitation... A free and impartial vision is scarcely proper for the adult German. (Blinkers!). 2~
Even a free and impartial vision, however, might have had very serious reservations about the Einstein-Grossmann theory. It explained no known anomalies of Newtonian gravitational theory, 27 the principle of equivalence was
2~An explicit demonstration that this is indeed the case was given by E. Schrodinger, 'Die Energiekomponenten des Gravitationsfeldes', Physikalische Zeitschrift 19 (1918), 4 7.
25See A. Einstein, 'Zum gegenwfirtigen Stande des Gravitationsproblems', Physikalische Zeitsehr~ft 14 (1913), 1249-1266, and the discussion following Einstein's lecture on pp. 1262-1266. The reason given in the body of Einstein's paper for limiting the requirement of covariance to linear transformations was that the validity of the conservation law in the form (18) is not compatible with any wider covariance. But in a footnote (see p. 1257) which was probably added in proof, Einstein states that he has discovered 'in the last days' a demonstration showing that generally covariant field equations cannot be used.
~6Einstein/Besso Correspondance, p. 50. ~'Besides the anomalous advance of the perihelion of Mercury, other anomalies had been established by the end of the 19th century, the most notable one being the movement of the nodes of Venus. See H. Jeffreys, 'The Secular Perturbations of the Four Inner Planets', Monthly Notices q/'the Royal Astronomical Society, 77 (1916), pp. 112-118. See also P. Cohen, Relativity and the Ercess Advances of Perihelia in Planetary Orbits (University of Pennsylvania M. A. Thesis, 1971).
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 265
embodied in a strong form without any argument, and, finally, though the setting of the theory made general covariance almost irresistible, the field equations and the conservation laws were not generally covariant. Einstein did not help matters as he elaborated his views on covariance. In 1914 Mie published a critical review article" in which he attacked the Einstein-Grossmann theory for, among other things, falling to generalize the principle of relativity to accelerated reference frames. Einstein replied that he had been misunderstood, and rather than counterattacking, attempted to lay out more clearly the reasoning that led him to put credence in his theory." He found it unbelievable that the velocity of light should be entirely unaffected by any other physical phenomena, he wrote. Dropping the requirement that the velocity of light be constant permits one to use arbitrary coordinates; the equation of motion of a mass point is the usual Hamiltonian equation (1). Now however, the principle of equivalence seems to require that the metric components g,j also be the components of the gravitational field. One can, furthermore, give a generally covariant equation specifying the effects of the gravitational field on physical phenomena. 'It is important for the theory', Einstein wrote, 'that equation [(1)] is invariant under arbitrary transformations'?° What more is required is a generalization of Poisson's equation, that is, a field equation, and that, too, should be generally covariant.
Einstein's entire approach to the theory of gravity, then, demands a generally covariant theory, for it is exactly the coupling of the admissibility of arbitrary coordinates with the equivalence principle that leads Einstein to a tensor theory. Yet the field equations of the theory are not generally covariant, Einstein admits; still, that is not so disastrous as it seems. Here is Einstein's argument. Suppose a set of equations fail to be generally covariant; then one of two cases must obtain. Either there are generally covariant equations which in appropriate coordinates reduce to the original equations, or there are not. In the former case, there is no harm in using the non-covariant equations. In the latter case, however, the non-covariant equations must lack any physical significance whatsoever and can only be a complex way of specifying coordinate systems. But no one can doubt that the Einstein-Grossmann theory is physically significant. 31
The thrust of this argument seems plainly to be that it is of no consequence that the Einstein-Grossmann field equations are not covariant so long as there are covariant equations that reduce to them; and, furthermore, that the EinsteinGrossmann equations satisfy a sufficient condition for being special cases of
"G. Mie, 'Bemerkungen zu der Einsteinschen Gravitationstheorie', Physikalische Zeitschrift 14 (1914), 115-122, and 169-176.
29A. Einstein, 'PrinzipieUes zur verallgemeinerten Relativit~itstheorie und Gravitationstheorie', Physikalische Zeitschrift IS (1914), 176-180.
3°Ibid. p. 177. 3'Ibid, p. 177-178,
266
Studies in History and Philosophy of Science
covariant relations. After writing down the field equations Einstein continued:
It is indubitable that these equations conform with one, even perhaps a small number of generally covariant equations, the statement of which is neither of logical nor of physical interest...32
Einstein's argument is both puzzling and fascinating, and for several reasons. In the first place, the argument is given in conjunction with his earlier arguments to the conclusion that the field equations for gravitation cannot even in principle be covariant: If T is to determine g uniquely, then the equations cannot be generally covariant; further, laws which are of the form of the conservation law cannot be generaly covariant. It seems very much as though Einstein is saying both that his field equations must have and do have a covariant generalization, and furthermore that they cannot have them and do not have them. There is, perhaps, a consistent way to understand Einstein's point of view in this regard. It is that every physically significant equation which is not generally covariant must be the specialization (to special coordinates) of some generally covariant equation, but that in special coordinates a noncovariant equation must express, or somehow reflect, a feature of the world (causality or conservation, for example) that cannot be caught by any generally covariant relation.
In the second place, what reason did Einstein have for believing that every physically significant equation is equivalent, in the special coordinates used, to a generally covariant one? If we are dealing with a differential equation or system of differential equations using geometrical objects of the usual kinds, in a single coordinate system, then Einstein was quite correct. Ignoring tensor densities, one can always find corresponding covariant equations, for example, simply by introducing a flat affine connection which vanishes in the special coordinates and replacing all coordinate derivatives by covariant derivatives. But if, as is actually the case with the Einstein-Grossmann theory, one is given a set of equations which are to hold in not one but an entire collection of coordinate systems, then it is not clear under what conditions there exists a covariant equation which reduces to the original equations in a//of the coordinate systems in the collection. The trick of introducing a flat connection will not work if, for example, some of the coordinate system in the collection are nonlinearly related. An analogous trick will work provided there is an operator which is a derivation, and which is identical with coordinate derivation in all of the specified coordinates. We do not know for what group of coordinate transformations such operators exist.
In the third place, the argument Einstein gave in reply to Mie marks an important and fundamental change in his attitude toward general covariance.
321bid. p. 179. A similar point is made at the end of Walter Dallenbach's notes on Einstein's lectures at the ETH, presumably delivered during the winter semester, 1913-1914; see Einstein Papers, Princeton University, microfilm reel I.A.8, no. 4.
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 267
Heretofore, Einstein seems to have seen in general covariance the perfect and complete generalization of one aspect of the principle of relativity. Now, in the face of Mie's attack, Einstein denigrates general covariance, and sees the demand for it as physically vacuous; the gravitational theory he and Grossmann put forward was an intelligible generalization of special relativity and satisfied the equivalence principle; that is what mattered. Although the arguments Einstein advanced for his new view were unsound or incomplete, the view itself was closer to the truth of the matter than those Einstein had earlier held. Thus Einstein's account of covariance in reply to Mie is essentially the one given two years later by Kretschmann 33 in criticizing Einstein's 1916 paper on the f o u n dations of general relativity. 3' Yet, paradoxically, in forming a more accurate opinion of the physical significance of the demand for general covariance, Einstein may well have made it more difficult for himself to find a satisfactory gravitational theory. Certainly it seems that if he had held fast to general covariance at this point he might have been driven to re-evalute Grossmann's arguments against the Ricci tensor.
Though Einstein's rejection of general covariance did not waver during 1914, the argumentation did undergo a significant shift. In his communication to the Berlin Academy for October 1914, 3~Einstein's argument against general covariance appears in a revised form, reflecting a recognition of the main error of interpretation in the original form. 38 We are again asked to consider a domain D of space-time which is devoid of all material processes. Then the
total state of D is completely specified once the metric potentials g,i(x k) as
functions of some coordinate system {x~} are given. Again let {x~} be a second coordinate system which coincides with tx ~} outside D but diverges inside. Einstein now correctly states that the metric potentials T,, (~) as functions of
the new coordinates describe exactly the same physical situation as the g,(x~).
Here then was an opportunity to escape from past confusions and to reinstate the requirement of general covariance. But the opportunity was not seized, and Einstein was quickly seduced by a new form of the same misunderstandings that caused the original problem.
If the field equations are generally covariant, then it follows, Einstein
argues, that if the gli(x ~) are solutions then so are the ~,:i(x~). But ~,~(x~) and g~i(xk) describe different gravitational fields if the barred functions are different
from the unbarred ones, as can always be arranged by the choice of x '. Thus,
33E. Kretschmann, 'L)ner den physikalischen Sinn der Relativititspostulate, A. Einsteins neue und seine ursprtingliche Relativit~tstheorie', Annalen der Physik 53 (1917), 575-614.
~'Ref. 7. 3~A. Einstein, 'Die formale Grundlage der allgemeinen Relativititstheorie', Sitzungsberichte, der KOniglichen Preussischen A kademie der Wissenschaften zu Berlin, (1914), 1030 - 1085. 3eWe have not been able to determine either from Einstein's published papers or his correspondence who or what was responsible for this change. This is just one of the many unsolved mysteries about Einstein's odyssey from the special to the general theory of relativity.
268
Studies in History and Philosophy of Science
we arrive at the same conclusion: generally covariant field equations violate Causality. The promises of Einstein's argument are largely correct and amount to a coordinate representation of the freedom to perform diffeomorphisms discussed in Section 3, though Einstein's construction presupposes a nonstandard notion of 'dragging along'. In a more standard presentation, the construction proceeds as follows. Consider two subdomains D, and D2 o f D, and let 0 : D1 ~ D2 be a diffeomorphism. 0 can be given a coordinate representation by describing it as the map which takes a point in D, with coordinates x k to a point in D2 with coordinates f k ( x m ) . With this point m a p we can associate a coordinate m a p to a new coordinate system {x~} where x ~ (O(p)) - x k ( p ) , i.e., the new coordinates at the new point are numerically equal to the old coordinates at the old point. If F k is the inverse o f f ~ then the coordinate transformation is given by ~ ~ = Fk(xm). The metric 0*g dragged along by 0 can be defined in coordinate terms by the rule (0*g),~i-0rp~ = gi~lp i.e., the new metric at the new point has the same components in the new coordinate system as the old metric has in the old coordinate system at the old point? 7 If 0 is an isometry o f g, then (0*g),~jl0l~ = g~,ioc,,~, and combining this with the definition of 0*g, we have ~ , ~ = g,~il~.Since the barred coordinates at 0(p) are numerically the same as the unbarred coordinates at p, the ~ and the gi~ are the same functions. Conversely, if the functions are the same, then 0 is an isometry, and the coordinate transformation ~ = F ~ ( x "~) can be said to generate an isometry.
What Einstein did not notice, however, was that dropping the requirement of general covariance need not block the unwanted consequence. Let us suppose, as Einstein did, that the source free field equation is a differential equation constructed entirely from the metric potentials and their derivatives but that the equation need not be generally covariant. As a simple example, consider the equation
go,k = o
(19)
Suppose that the associated gravitational theory says that a metric g is admissible (in the absence of matter) just in case there exists a coordinate system in which (19) holds. This if g is admissible, so is 0*g for any diffeomorphism 0. For if {~} is the coordinate system that makes g admissible, then {~}, where x ~ = P ( x " ) , is the system that makes 0*g admissible - - 0*g'~,~ = 0 in ~ : Only if some particular coordinate system is singled out ahead of time will (19) uniquely determine the metric. The singling out could be done by ostention, but unless guided by a physical motivation such ostention is arbitrary. If there were additional quantities not determined through the field equations but which were given a priori, then the value of the coordinate components of these
37Einstein's construction seems to presuppose the alternative definition (0%o)ijlp=-gij10(p), i.e.,
the components of the new metric in the old coordinate system at the old point are numerically the same as the components of the old metric in the new coordinate system at the new point.
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 269
objects could be used to pick out a preferred coordinate system. But on Einstein's own hypotheses, such quantities are not available.
To summarize, Einstein believed that general covariance led to a violation of Causality. His original argument to this effect (discussed in Section 3) was invalid. His second argument, presented in his 1914 Academy paper, was essentially correct. But this argument, like the original one, assumes a naive and ultimately untenable interpretation of Causality. Finally, Einstein did not notice that the use of non-covariant field equations did not provide any easy way to resolve his alleged problem.
5. The Return to General Covariance
In a joint 1914 paper 38 Einstein and Grossmann reported what they took to be a major advance in the understanding and application of the covariance properties of their gravitational field equations. The advance proved to be largely illusory. But this illusion, or rather the piercing of it, was later to serve as the spur to a real advance; indeed, it proved to be one of the main reasons why Einstein dropped the Einstein-Grossmann theory and returned to general covariance. In the second section of this paper, Einstein's Causality argument is repeated in the original form of the 'Entwurf' paper, indicating that Einstein had not hit upon the refinement discussed above in Section 4. But tucked away in a footnote39is the admission that, contrary to the claim made in the 'Entwurf' paper, the validity of the conservation law (18) does not limit covariance to linear transformations. The covariance of (18) will be just as wide as that of the Einstein-Grossmann field equations. But then what is the covariance of the latter?
By way of answering this question, Einstein and Grossmann first show that the Einstein-Grossmann field equation together with the generally covariant conservation law (8) for matter fields entails
B. - (x/--g g'ig.k g*%j) ,i,m = o
(20)
A coordinate system satisfying the condition (20) is called an 'adapted' (angepasste) system and the transformation between two such systems is said to he 'justified (berechtigte).
Next, it is shown that the field equations are derivable from a variational principle.
6 f ( H - - 2Kx/--g T,jg'J)d ' x = o
(21)
where d denotes functional variation of the metric potentials and the Hamil-
aSA. Einstein and M. Grossmann, 'Kovarianzeigenschaftender Feldgleichungender auf die verallgemeinerteRelativit~tstheoriegegriindentenGravitationstheorie',Zeitschrift far Mathematik und Physik 63 (1914),2i5-225.
aglbid, p. 218.
270
Studies" in History and Philosophy o f Science
t o n i a n o f the g r a v i t a t i o n a l field is given by 4°
H = 1/2 ~/--g ~'Jgk,,,,,g~m,i
(22)
Using (21) and (22), Einstein and Grossmann present an argument designed to show that their field equations are covariant under any justified transformation. This argument and the parallel considerations Einstein offered in his 1914 Academy paper were the subject of an intensive correspondence with LeviCivita?' We will discuss this matter in another paper, and for present purposes we will concentrate on what Einstein took to be the significance of his result.
As Einstein saw it, the main advance of the 'Kovarianzeigenschaften' paper was the partial removal of the 'severe defect' -- the perceived limitation on the relativity p r i n c i p l e - - which he had a n n o u n c e d in his reply to Mie. 42 His reasoning now was that (a) justified transformations may be non-linear, and, therefore, (b) the theory allows the equivalence of accelerated and nonaccelerated frames. As he put it in a letter to Besso, written in March of 1914:
• . . the equations qfg,ravitation are valid f o r all coordinate systems sati~fyin~ these conditions Jeq (20)]• From this, it follows that there are transformations representing accelerations of a very varied nature (for example, rotations), so that the equivalence hypothesis is vouchsafed in its original f o r m ? 3
From the importance he attached to 'Mach's Paradox' (see Section 6 below). Einstein must have been very pleased by (the illusion!) that rotations were covered.
Because Einstein blurred the distinctions between point and coordinate transformations, reference frames and coordinate systems, and relativity and covariance principles, it is difficult to put his claims in the letter to Besso to a test; but one case which comes as close to a test as can be hoped concerns rotation in an empty region of space-time. Let D be a region where T = 0. Then in D, the Minkowski metric rl is a solution of the Einstein-Grossmann field equations (12). Choose in D a reference frame K which is inertial with respect to rl, and then c h o o s e an inertial c o o r d i n a t e system {X'} = {X, Y, Z, T} which is a d a p t e d to K, i.e., in {Xi},rl,:, = diag(1, 1, 1, -1) and the world lines o f the points o f K have t h e f o r m X ~ = c o n s t a n t (c~ = 1, 2, 3). O b v i o u s l y , B , ( X ' ) = 0 so the X ' is 'adapted' in the sense of Einstein and Grossmann. Next, consider a frame K' which is r o t a t i n g with c o n s t a n t a n g u l a r velocity co with respect to K. It is n o t so obvious what coordinate system to choose as representative of K', but one natural choice is to use polar coordinates {r, +, a, t} a d a p t e d to K ' . the relation between the two coordinate systems is
"°Interestingly, Paul Bernays is given credit for calling this fact to the attention of Einstein and Grossmann; see the footnote on p. 219. of Ref. 38.
"'Einstein Papers, Princeton University, microfilm reel I.B. 1, no. 16. "2SeeRef. 29, p. 176 and our discussion above in Section 4. "~Einstein/Besso Correspondence, p. 53. The italics are Einstein's.
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 271
S ~_- rcos (~ + cot) Y = rsin (+ + cot) Z= Z T= t
The non-vanishing metric components in the new coordinate system are
1,r/22 =r 2,rl33 = 1
~24 ~--- rl42 = r2, r/44 = (1 - co~r2) ( c ~ 1)
But as can easily be checked, the condition (20) does not hold for the new coordinate system. A similar result follows for other natural choices for a coordinate representative of K'.
It is quite likely that considerations of just this kind were in large part responsible for Einstein's subsequent loss of confidence in both the methods and results of the Einstein-Grossmann theory. In a letter to Sommerfeld, dated November 28, 1915, Einstein listed three reasons for having abandoned his previous theory. The first on the list is that 'I proved that the gravitational field for a uniformly rotating system does not satisfy the field equations'." The same point is also made in a letter to Lorentz written on the first day of January, 1916. '~ Learning that the 'severe defect' had not after all been removed must surely have been a serious blow; when added to the other two reasons -- the failure of the Einstein-Grossmann field equations to yield the observed value of the advance of perihelion of mercury and the collapse of an attempt to justify the choice of H in (22) -- it was sufficient to undermine whatever confidence Einstein still had in the Einstein-Grossman theory. '~
We have described elsewhere'r the final months of Einstein's struggle towards the covariant form of the general theory of relativity, and we will mention here only two episodes. Einstein did not immediately return to general covariant field equations; rather, he initially considered equations covariant for transformations for which the Jacobian is unity.'8 In his next attempt, '9 Einstein proposed the equations (4) which he and Grossmann had considered much earlier and whose rejection had set Einstein off on his painful odyssey. In November of 1915, however, Einstein's desire for general covariance became so strong that he was willing to tolerate the consequence that T -- constant
*'Einstein/Sommerfeld Briefwechsel, p. 32. "SEinstein Papers, Princeton University, Microfilm reel I.B.1, no. 16. In this letter the same three reasons are given, but the consideration about rotating systems appears second on the list. 'eFor more discussion of this matter, see our paper 'Einstein and Hilbert: Two Months in the History of General Relativity', op. cit. (Ref. 16). "Ibid. 'SA. Einstein, 'Zur allgemeinen Relativit~,tstheorie', Sitzungsberichte, der K6niglichen Preussischen Akademie der Wissenschaften zu Berlin 1915, 778-786. 'gRef. 15.
272
Studies in History and Philosophy of Science
and the even worse result that T = 0 and to support these consequences with a highly speculative hypothesis about the constitution of matter.
But this makes the obvious question even more puzzling: How could Einstein return to general convariance when he had 'proved' that generally covariant field equations are physically unacceptable? This is the puzzle we will try to resolve in the next section.
6. From Causality to Covariance
Physicists seldom seem content unless they have shown that what is possible is also necessary. Having obtained satisfactory covariant field equations late in 1915, Einstein proceeded to adduce arguments to show that satisfactory field equations must be covariant. These arguments are given both in his famous 1916 A n n a l e n der P h y s i k paper ~° and in his correspondence with Ehrenfest 51 and others s2 late in 1915 a n d early in 1916.
In his 1916 paper on the foundations of general relativity, Einstein presents a variety of arguments for general covariance. There is, in the first place, the consideration which we have earlier suggested attracted Einstein to covariant methods originally. Einstein first presents an epistemological argument, which he attributes to Mach, against privileged reference frames. Two fluid bodies at a great distance from each other and from all other masses are in rotation, each with respect to the other. Suppose measurements of each of the two bodies are carried out by means of instruments at rest with respect to the corresponding body, with the result that one body is an ellipsoid and the other a sphere. What is the cause of this asymmetry? The anwer given by Newtonian mechanics and special relativity theory - - that one of the bodies is really accelerated with respect to a privileged frame of reference (actually a class of privileged frames, the inertial ones) - - is unsatisfactory because the cause is not an 'observable fact of experience'. The only acceptable answer must be that the differing behavior is caused by the distant masses. Similarly, any privileged class of reference frames is open to the same epistemological objections; thus: 'The laws of physics must be of such a nature that they apply to systems of reference in any kind of m o t i o n ' , sa
Just what Einstein meant by this 'extended principle of relativity' is unclear. His formulation of it suggests that he meant that the laws of physics must be formulated so that they holcl in any system of reference but his argument sug-
S°Ref. 7. 5'Einstein Papers, Princeton University, microfilm reel I.B.I, no. 9. See especially the letters dated December26, 1915,December29, 1915,and January 5, 1916. s2SeeEinstein/Besso Correspondence, pp. 63-64. ~JRef. 7; translation from Perrett and Jeffrey, p. 113. For earlier expressions of Einstein's interest in 'Mach's paradox', see A. Einstein, 'Bases Physique d'une Theorie de la Gravitation', Archieves des Sciences Physiques et Naturelles 37, (1914), 5-12; 'Zum Relativit~its-Problem', Scientia 15 (1914), 337-348; and Ref. 5, pp. 1031- 1032.
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 273
gests instead that the laws must not require the existence of a privileged class of reference systems, privileged, that is, in the sense that motion with respect to this class of frames plays a special causal role. Quite likely, Einstein believed that these two formulations of the principle of relativity are equivalent, and he certainly implied, quite erroneously, that general covariance guarantees that the two formulations of the principle of relativity are satisfied, s' Einstein's argument, against Mie, that general covariance is physically vacuous is now utterly forgotten; general covariance not only guarantees that the laws 'apply' in every system of coordinates, it also abolishes the structure of space and time. Einstein's new discussion is in almost every way less accurate and more confused than his earlier discussion, given at a time when he thought general covariance could not be satisfied by gravitational theory.
Einstein had another, very different argument for general covariance. It is given in the 1916 Annalen paper, but its point is first made, so far as we can determine, i,-ta letter to Ehrenfest, dated December 26, 1915.~s
Section 12 of my last year's paper [the October 1914 Academy paper] is entirely correct (in the first 3 paragraphs) up to what is in italics at the end of the third paragraph. [This is the argument examined in detail in Section 4 above.] There is no incoherence to be derived from the fact that both systems G(x) and G'(x) [G(x) is the symbol Einstein used to denote the totality of the functions g,j(x~)], which refer to equivalent reference frames, satisfy the conditions on the gravitational field. The apparent force of this consideration is immediately dashed if one realizes that (1) the coordinate system signifies nothing real (2) the nature of the theory makes possible the simultaneous realization of two
(distinct) g-systems in the same region of the continuum. [Ehrenfest has a question mark in the margin here.] The following consideration must take the place of Section 12. The physically real events in the world (in contrast to those that depend on the choice of coordinates) are space-time coincidences.
And since (continuous one-one) coordinate transformations preserve all such space-time coincidences, they do not change the physical situation. Essentially the same paragraph is contained in a letter to Besso written about a week after the letter to Ehrenfest. 5e
From these two letters, the main outlines of Einstein's new strategy is clear: (i) keep the requirement of causality, but (ii) despite the previous argument, deny that causality is violated by generally covariant field equations. This is to be accomplished by (iii) maintaining that physical reality consists of spacetime coincidences and (iv) observing that coincidences are preserved by con-
~'Ref.7, Section 3; seeespeciallythe passagequoted above in Section 1above. SSEinsteinPapers, Princeton University,microfilmreel I.B. l, no. 9. ~*Theletter to Bessois dated January 3, 1916; see Einstein/Besso Correspondence, pp. 63-64.
274
Studies in History and Philosophy o f Science
tinuous transformations. What is not so clear is how well Einstein strategy succeeds; in particular, one wants to know how the crucial premise (iii) is to be justified and how (iii) and (iv) support (ii).
One of the circumstances in which Einstein thought that (iii) would be justified is mentioned briefly in the letter to Ehrenfest and in more detail in the letter to Besso:
If, for example, physical phenomena were built up from the movements of material points alone, then the encounters of these points, that is to say the intersections of their world lines, would be the on!y physical reality, the only observable.57
But even if we grant for the sake of argument what Einstein says regarding the case of point particles, how does a similar conclusion follow for the case at issue -- the gravitational field? Einstein answered this question indirectly by arguing that (iii) must be true for physical reality in general and, therefore, for the gravitational field in particular. From the equation of 'reality' with the 'observable' one can guess how the argument goes. In his 1916 paper on the foundations of general relativity, Einstein proceeds from the premise that 'all our space-time verifications invariably amount to a determination of spacetime coincidences' to 'all our physical experience can ultimately be reduced to such coincidences' and thence to the tacitly understood conclusion that all of physical reality can be so reduced? 8
The influence of Mach, who is explicitly mentioned in the introduction of the paper, is evident here in the emphasis on the means of verification and the use of epistemological considerations to establish constraints on ontology. But although Mach's searching skepticism and his critique of Newtonian mechanics had a deep influence on the early Einstein, orthodox Machian positivism was never Einstein's way. Einstein never saw the Machian atom of experience, the individual sensation, as the basic building block for nature or for scientific theories. And Einstein accepted Mach's insistence on the primacy of epistemological questions only when there was doubt or uncertainty; when doubt and uncertainty faded for Einstein, so did the importance of epistemological questions. Writing to Besso in 1914 he says:
I do not doubt any more the correctness of the whole system [the theory of gravitation] may the observation of the eclipse succeed or not. The sense of the thing is too evident. 59
And to a large extent, it was the general theory of relativity which turned Einstein consciously against Mach's philosophy of science. In the Herbert Spencer lecture at Oxford in 1933, Einstein stated that the general theory brought the realization that the fundamental concepts of physics cannot be
STEinstein/Besso Correspondence, p. 64. SSRef.7, Section3. 5~Einstein /Besso Correspondence, p. 53.
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 275
arrived at by 'abstraction' from experience.
Scientists . . . [of earlier times] were for the most part convinced that the basic concepts and laws of physics were not in a logical sense free inventions of the human mind, but rather that they were derivable by abstraction, i.e., by a logical process, from experiments. It was the General Theory of Relativity which showed in a convincing manner the incorrectness of this view.8°
Moreover, Einstein's growing hypostatization of space-time was directly contrary to Mach's picture of a world composed of sensations in ordinary space and time; indeed, one supposes that Mach would have perceived the spacetime of general relativity as just another form of the 'metaphysical monster' he had detected in N e w t o n ' s absolute space and time. 6'
Thus, the irony is doubly heavy: while the general theory of relativity led Einstein to an anti-Machian outlook, the theory itself was motivated in part by specific ideas and a general philosophical orientation derived from Mach, and at the crucial juncture in 1915, Machian positivism was used to legitimize the dual embrace of causality and general covariance.
The second notable feature of Einstein's strategy concerns a failure to distinguish sufficiently clearly between coordinate and point transformations. Because the strategy is most directly concerned with covariance, (iv) must be interpreted as referring to coordinate transformations. But then (iv) is trivially true, and it is true not just for an impoverished physical reality characterized by simple point coincidences of particles but equally for a richer reality characterized by any set of intrinsic geometric object fields, no matter how complicated or varied. On the other hand, the correct kernel of Einstein's causality argument concerns the freedom to perform diffeomorphisms -- point transformations, not coordinate transformations, though of course the point mapping can be given a coordinate representation as was done above in Section 4 -- and these mappings, do, primafacie, change the physical situation, be it simple or complicated.
One way to fill in the lacuna in Einstein's attempted reconciliation of causality and covariance is suggested by his remark to Ehrenfest that a coordinate system signifies 'nothing real'. If this attitude is extended to the underlying manifold of which the coordinate systems are representatives, then the following position can be entertained. The fact that generally covariant field equations determine the metric field only up to an arbitrary diffeomorphism (when T = 0) does not show that the equations fail to uniquely fix the total physical situation. The contrary impression is fostered by taking too literally the mathematical apparatus of an underlying differentiable manifold and various object fields on this manifold. For the space-time manifold is not
6°A. Einstein, On the Method o f Theoretical Physics, (Oxford University Press, (1933), p. 11. 6'For more on Einstein and Mach, see G. Holton, Thematic Origins of Scientific Thought (Cambridge: Harvard University Press, 1973).
276
Studies in History and Philosophy o f Science
something over and above the collection of all physical events; rather, the mathematician's M is only a formal device which allows a convenient description of physical events. Thus, two diffeomorphic variants - - (M, g) and (M, 0*g) -- are simply different but equivalent ways of describing the same physical content. As it stands, this interpretation is highly programmatic, and one naturally wants to know how the details are to be filled in before passing judgment. But at least the program does hold out the hope of a coherent means f o r implementing Einstein's strategy. 62
Unfortunately, the suggested implementation appears to be incompatible with Einstein's further remark to Ehrenfest to the effect that two distinct 'gsystems' can be 'equally realized' in the same region of the continuum. Not surprisingly, Ehrenfest has put a question mark in the margin beside this sentence of Einstein's letter, and again, not surprisingly, the correspondence on this topic continued with Einstein trying, both by means of abstract arguments and concrete examples, to win Ehrenfest over to his interpretation of general covariance. While Einstein was tenacious as always, he did concede in a letter dated January 5, 1916, that
I cannot blame you for not understanding the permissibility of generally covariant equations, because I myself needed so long to become clear about this point. 63
But much more time was needed before the dawning o f any geniune clarity. 84
7. Conclusion
Einstein's struggles with general covariance were both magnificent and paradoxical. They began, we believe, with an idea at once both right and wrong: that a generally covariant formulation of gravitational theory would permit the statement of laws that hold in every frame of reference, and that such a statement would be a physically important generalization of the principle of relativity, consistent with the principle of equivalence. From then on, Einstein produced physical and conceptual analyses of the requirement of covariance to fit his technical needs. Thus, convinced by Grossman that no second order
e2For some remarks on how a similar strategy might be used to explicate Leibniz's views on space
and time, see J. Earman, 'Leibnizian Algebras and Leibnizian Space-Times', in R. Butts and J. Hintikka (eds.), Proceedings of the Fifth International Congress o f Logic, Methodology, and Philosophy of Science (Dordrecht -Holland: D. Reidel, 1977).
63Einstein Papers, Princeton University, microfilm reel I.B. 1, no. 9. 8'The second part of David Hilbert's paper, 'Die Grundlagen der Physik', Nachrichten vonder KOniglichen Gesellschaft der Wissenschaften zu GOttingen, 1917, 53-76, contains a discussion o f covariance and causality which bears a certain resemblance to Einstein's treatment. Hilbert says that causality is not violated by generally covariant equations because from the appropriate initial data one can infer 'necessarily and unambiguously' all those propositions about the future behavior
of the system which 'have physical meaning' (p. 61). Physically meaningful propositions are taken to be ones about invariants, e.g., coincidences. Hilbert makes no reference to Einstein's 1916 paper of the foundations of general relativity theory, and we have not found any evidence of a direct link between Hilbert's views and Einstein's views on this matter.
Lost in the Tensors: Einstein's Struggles with Covariance Principles 1912-1916 277
tensor expression was available to generalize Poisson's equation, Einstein produced arguments to show that general covariance was impossible for gravitational theory. Pressed by Mie about the lack of general covariance for his 1913 field equations, he produced an argument to show that covariance was of no physical importance. When after a month's furious work, Einstein finally hit on satisfactory generally covariant equations in November of 1915, he rapidly produced arguments to show that covariance is essential. Almost none of these arguments are, from either a modern or a contemporary standpoint, satisfactory; typically they confuse coordinate systems and reference frames, coordinate transformations and point transformations, covariance and space-time structure. But they are also paradoxical. Einstein gave Machian arguments for the very theory which was to lead him to a more realist view, and he seems best to have understood the physical significance of covariance exactly when he was most in error about what could and could not be done within tensor calculus.
Shortly after Einstein's 1916 Annalen paper, Kretschmann wrote a paper on the principle of relativity/~ claiming that general covariance was of no physical significance and that any equation could be made to conform to a generally covariant one in a given coordinate system. The physical significance and precise sense of the principle of relativity had to lie elsewhere. Although Kretschmann had no better arguments than Einstein had for the same thesis two years before, he was correct enough. Einstein replied that general covariance has nonetheless a great heuristic value,e6and he was exactly right. 6r While there is little evidence that the arguments that Einstein gave for and against the possibility or necessity of generally covariant laws actually convinced him or anyone else of their conclusions, there is abundant evidence that general covariance was the key factor in Einstein's approaches to and recessions from an adequate gravitational theory. Both the principle of equivalence and the demand that gravitational theory generalize special relativity remained from 1907 on; what changed between 1912 and 1916 was Einstein's conviction regarding whether a generally covariant theory was possible at all. When he believed it was, given his other principles, he was bound to come close to the theory that appeared in late 1915.
The magnificence of Einstein's intellectual odyssey lies not only in the grandeur of its conclusion, but also in its chaos, in the indirectness of the paths
65Ref. 33. 6eA. Einstein, 'Prinzipielles zur aligemeinen Relativitatstcheorie', Annalen der Physik 55, (1918), 241-244. erHowever, in his reply to Kretschmann, Einstein did not mention his earlier correct contention that the requirement of general covariance when combined with other requirements, e.g., that the field equations be second order and derivable from an action principle, does have bite and may lead to a quite definite result. See A. Einstein, 'Hamiltonsches Prinzip und allgemeinen Relativitatstheofie', Sitzungsberichte,derKOniglichenPreussischenAkademie der Wissenschaftenzu Berlin, 1916, 1111-1116. We plan to discuss these matters in more detail in another paper.
278
Studies in Hisl gry and Philosophy o f Science
that led to home. One cannot read this history without amazement at Einstein's intellect; for much of the period between 1912 and 1916 he was truly lost in the tensors, quite completely on the wrong path, accompanied by erroneous reasons he claimed to be fundamental. And yet, quite singularly, in the course of a month be abondoned his errors and their justifications. The moral, perhaps, is that a certain fickleness is more conducive to theoretical progress than is any abundance o f conceptual clarity - - at least if one is Einstein.