9

I'm trying to grasp Einstein's special relativity theory and can't seem to find a clear answer as to why Einstein concluded that the speed of light is constant to observers in different inertial frames.

  1. Sometimes I read it's because of the Michelson-Morley experiment. But that only proves there's no aether/medium for light to travel through.

  2. Sometimes I read that he derived it from Maxwell's equations. I still need to study those, so I can't judge about that.

  3. And sometimes I read that it's just an axiom that we need to accept.

So what's the right answer?

Sam
  • 207
  • First chapter of this article gives some info https://arxiv.org/abs/1201.1828 –  Jul 01 '18 at 08:12
  • 1
    I tried to grasp SR using this paper. I think it's the original paper which introduced many of the concepts. –  Jul 01 '18 at 08:16
  • 1
    I deleted some comments which looked like they were answering the question. Please keep in mind that answers should be posted as answers, not comments. Incidentally, I'm torn about whether this should be considered a duplicate. It sounds like this question is asking about why the speed of light is invariant within SR-as-an-axiomatic-system, while the proposed duplicate target is asking why it's invariant in the real world. I'm not sure if the distinction between the two is enough that this shouldn't be marked as a duplicate of the other. – David Z Jul 01 '18 at 08:25
  • It's more like: hey, I'm Einstein, living in 1905 and I just came up with my second postulate. Here's why....

    That's the answer I'm looking for ;-)

    – Sam Jul 01 '18 at 08:30
  • 1
    Related: https://physics.stackexchange.com/q/134356/2451 , https://physics.stackexchange.com/q/245346/2451 and links therein. – Qmechanic Jul 01 '18 at 08:40
  • 1
  • 1
    I don't think it's a duplicate, for basically the reason David Z gave. (To me the distinction seems plenty.) – N. Virgo Jul 01 '18 at 12:24
  • "Axiom" is a rigorous mathematical term. The constancy of the speed of light is a stricter condition than what relativity requires. The actual requirement is the constancy of the two-way speed of light (roundtrip) only in the direction of motion. You could call this an axiom, but it also follows from the uniformity of spacetime. So you can postulate the speed of light and derive the uniformity of spacetime from it or you can postulate the uniformity of spacetime and derive the constancy of the speed of light. While equivalent, the latter seems more proper withouut raising questions like yours. – safesphere Jul 01 '18 at 13:23
  • Are you asking what the absolute foundation is that special relativity resides within ? – Sean Jul 01 '18 at 13:26
  • This may help. https://physics.stackexchange.com/questions/143576/is-it-a-postulate-or-a-well-proven-fact-that-speed-of-light-remains-constant-w-r/143645#143645 – Sean Jul 01 '18 at 13:39
  • Since "postulate" and "axiom" are synonyms, the trivial answer to the question raised in the title of this question is "yes". – David Hammen Jul 04 '18 at 23:08
  • 1
    I'm voting to close this question as off-topic because, as currently phrased, it asks for the historical aspects of why Einstein used the postulate and, as per site policy, it belongs on [hsm.se]. If the intention is to ask about the modern aspects of how we justify that postulate today, then the historical aspects of the question should be removed. – Emilio Pisanty Jul 05 '18 at 09:52

6 Answers6

5

All of the above. Let's examine them in turn:

  1. Sometimes I read it's because of the Michelson-Morley experiment. But that only proves there's no aether/medium for light to travel through.

This seems a slightly back-to-front way of looking at it. What the Michelson-Morley experiment shows is that the round-trip speed of light is constant in every direction and doesn't change as the Earth moves around the Sun. From this we conclude that the luminiferous ether theory is incorrect, since it predicts that the speed of light would be different depending on the Earth's motion relative to the ether. But the actual measurement essentially just says the speed of light doesn't depend on the reference frame, at least for observers fixed to the surface of the Earth.

(In fact one can't even really conclude that there is no ether, since the measurements could also be explained by the Earth dragging the ether around with it somehow. I believe this was seriously considered at the time. The ether theory is ruled out by the development of special and general relativity and the much better predictions they make, not by the Michelson-Morley experiment alone.)

It should be noted that Einstein himself said that the Michelson-Morley experiment was not a big part of his own personal motivation, as quoted in Grayscale's answer. Instead, his main motivation was to do with Maxwell's equations as described below. But clearly the idea of frame invariance was 'in the air' at the time, and both Michelson and Einstein were influenced by common ideas. (In fact, Lorentz had already worked out the mathematics of length contraction and time dilation, but interpreted them as physical phenomena rather than observer-dependent ones. So there were plenty of other people working on these kinds of ideas at that time as well.)

  1. Sometimes I read that it's derived from Maxwell's equations. I still need to study those, so I can't judge about that.

This is true too. The point is that if you assume Galilean invariance (i.e. if you don't assume relativity but you do assume that the laws of physics are the same for all observers) then it's possible to have an observer moving at the speed of light. If such an observer were to travel parallel to a light wave then they would see the electric and magnetic fields varying sinusoidally in space but not changing over time. But such a frozen light wave is not a solution to Maxwell's equations, so there is a paradox.

One of Einstein's main motivations was to resolve this, i.e. to express Maxwell's equations in such a way that they would be valid for all inertial observers, instead of only being valid for an observer "at rest".

  1. And sometimes I read that it's just an axiom that we need to accept.

This is true too. The theory of special relativity is very mathematically elegant in that it requires very few assumptions. If you make this assumption (plus a few other quite minimal ones), then the rest of the theory follows, much as if you assume Euclid's axioms then you can derive Pythagoras' theorem.

If you're a pure mathematician then that's all you need to know - you don't care about why you should make the assumptions you make, so you call them axioms. But in physics we generally don't just make assumptions for no reason - they are made because of measurements or because of holes in existing theories. Einstein made the assumption of a constant speed of light for several reasons, but the Michelson-Morley experiment and the observer-dependence of Maxwell's equations were probably the main ones.

So all three are true. You can treat it as an axiom if you want, and that's probably the best approach if your main goal is to understand the maths. But the axiom also has several physical justifications, and that's what the other points you listed are.

N. Virgo
  • 33,913
  • MM experiment says absolutely nothing about one way speed of light while SR claims that one way speed of light is c in all frames. Moreower, observer in the center of the circumfernce will alway see, that mesuring rod of rotating observer id contracted and rotating obsrerver will always see, that clock in the center of the circumference is ticking faster and measuring rod in the center of circumference is gamms times longer. That gives him a hint that his interferometer is contracted and his own clock tuns slower. –  Jul 01 '18 at 11:08
  • 2
    @Albert your comment makes no sense. The constancy of the speed of light across several different reference frames is the main result of the Michelson-Morley experiment. – N. Virgo Jul 01 '18 at 12:25
  • https://en.m.wikipedia.org/wiki/One-way_speed_of_light there is no experiment even in principle that confirms that one way speed of light is the same in all inertial frames. Einstein synchronization is softly speaking is a convention, or even a fantasy. MM experiment shows that roundtrip speed of light is c. Actually they have meaured phase shifts. Lorentz explained absence of phase shifts by contraction of one of the shoulders (Lorentz - FitzGerald contraction). –  Jul 01 '18 at 13:01
  • @Albert none of that contradicts anything I wrote in my answer. – N. Virgo Jul 01 '18 at 13:19
  • 1
    Oh, I see, you took "constant in every direction" to mean one-way instead of round trip. That's not what I meant of course. I'll clarify it. – N. Virgo Jul 01 '18 at 13:20
  • You're confusing me. You're saying the light source used by MM is the Sun? I always thought the light source was a lamp, attached to the earth. – Sam Jul 02 '18 at 15:17
  • But such a frozen light wave is not a solution to Maxwell's equations, so there is a paradox I don't quite follow why this wouldn't be a solution in Galilean relativity setting. If Maxwell's equations were equations of motion of the ether, this would still be a solution, for the same reason that moving along a tube with a running wave with the speed of sound gives you a "frozen sound wave" relative to you. – Ruslan Jul 02 '18 at 15:49
  • @sam the light source is indeed attached to the earth, not the sun. The point is that after half a year the earth has gone half way around the sun and so should be traveling in a different direction relative to the ether. So the ether theory predicts that the results will be different at different times of the year. – N. Virgo Jul 02 '18 at 22:18
  • @ruslan right, but that would mean Maxwell's equations are only valid in one reference frame, the one that's at rest relative to the ether. Einstein was trying to make them valid in all reference frames. (probably the MM experiment is the main motivation for wanting to do that.) – N. Virgo Jul 02 '18 at 22:23
  • @Nathaniel Thanks! But how is "no ether" then implying a constant speed of light to all reference frames? The experiment only involves one reference frame: Earth. Right? – Sam Jul 03 '18 at 11:26
  • @Sam no, it involves many. In summer (northern hemisphere) the Earth is travelling in one direction relative to the Sun, and in winter it's travelling in the opposite direction relative to the Sun. So the entire Earth is moving in a different inertial frame at different times of the year. – N. Virgo Jul 03 '18 at 11:44
  • Also, again, you have it the wrong way around. It is not that "no ether" implies a constant speed of light, it is that a constant speed of light implies no ether! – N. Virgo Jul 03 '18 at 11:44
  • 1
    Downvoted for three reasons. (1) The MM experiment had little, if any, impact on Einstein's thinking. (2) The MM experiment only proved that the luminiferous aether as conceived of at the time did not exist. (3) There are alternative axiomatizations that maintain the concept of an ether and that are indistinguishable from special relativity in terms of predictions of experimental outcomes. – David Hammen Jul 04 '18 at 17:26
  • @DavidHammen I added some text in my answer to address your point (1). (It does not contradict what I wrote already, but seemed worth clarifying.) Point (2) was already explicitly made in the answer, so I think there's no problem maybe? Point (3) is true but doesn't seem all that relevant to the question or my answer - if you can be more specific about what needs to be addressed on that point I might address it. – N. Virgo Jul 05 '18 at 06:34
4

None of the other answers seem to directly address the historical nature of the question, which asks why Einstein concluded that the speed of light is constant to observers in different inertial frames. Note this is clarified in the comments, where the asker states

"It's more like: hey, I'm Einstein, living in 1905 and I just came up with my second postulate. Here's why.... that's the answer I'm looking for."

So, to answer...

Among your three options, it is my understanding that the historical evidence suggests a mixture of the second and third options: Einstein had certain (axiomatic, if you like) convictions about absolute motion, and from the interplay of these convictions with (results derived from) Maxwell's equations, he was motivated to create his 1905 theory. Contrary to what some of the other answers suggest, Einstein was motivated much less by the Michelson-Morley result.

I think this quote from Einstein sums all this up fairly well:

In my own development Michelson’s result has not had a considerable influence. I even do not remember if I knew of it at all when I wrote my first paper on the subject (1905). The explanation is that I was, for general reasons, firmly convinced that there does not exist absolute motion and my problem was only how this could be reconciled with our knowledge of electro-dynamics. One can therefore understand why in my personal struggle Michelson’s experiment played no role or at least no decisive role.

(Emphasis added.)

However, the role that the Michelson-Morley experiment played in constributing to the ideas in Einstein's 1905 paper is a subject of debate. This paper discusses some of the conflicting lines of evidence as to how much Einstein knew about Michelson-Morley and how much it influenced his initial theory (note that the above quote is contained within the linked paper, as well as additional ones which may better expose some of the nuances of what contributed to Einstein's thinking).

Hope that helps!

Grayscale
  • 294
1

Nathaniel's answer accurately addresses your question. I would like to address your specific doubts.

  1. Sometimes I read it's because of the Michelson-Morley experiment. But that only proves there's no aether/medium for light to travel through.

True, and this in turn implies that the velocity of light must not be measured with respect to the aether. On the contrary, if $c$ has to have an absolute meaning in the Maxwell's equations, then it has to have an absolute value, i.e. it must be the same in all inertial frames.

  1. Sometimes I read that it's derived from Maxwell's equations. I still need to study those, so I can't judge about that.

The important point here is that it can be derived from Maxwell's equations provided that you know the correct transformation laws between inertial frames. In turn, Einstein derived these laws by postulating the constancy of $c$. But you could still postulate the transformation laws, and derive the constancy of $c$ through these laws or through Maxwell's equations. Or you could postulate (as Lorentz did) that Maxwell's equations should be the same in all inertial systems and use the postulate to derive the transformation laws.

  1. And sometimes I read that it's just an axiom that we need to accept.

This is usually what you do.

1

You need to be very careful as to what you mean by "the speed of light". There are indeed one-way speeds and two-way speeds and they have completely different a status.

  • One-way. A light signal departs from $A$ at time $t_A$ and arrives at $B$ at time $t_B$, as marked by clocks respectively located at $A$ and $B$. The one-way speed is then $AB/(t_B - t_A)$.
  • Two-way. A light signal departs from $A$ at time $t_A$, hits a mirror at $B$ and is reflected back to $A$ where it arrives at time $t'_A$, where those times are recorded by a clock sitting at $A$. The two-way speed is then $2AB/(t'_A-t_A)$.

The one-way speed of light is a convention as it depends on how the clocks at $A$ and $B$ are synchronised. The two-way speed of light, on the contrary, is a genuine physical observable as it can be measured. In his 1905 paper, Einstein chose a synchronisation procedure for the clocks such that the time of flight of a light signal from $A$ to $B$ is equal to the time of flight from $B$ to $A$. As a result the one-way speed from $A$ to $B$ is equal to the one-way speed from $B$ to $A$, and therefore also to the two-way speed.

Your question is therefore ill-posed. If you meant the one-way speed, then being a convention, we can't say more without choosing one such convention. If we go for Einstein's method, then it boils down to discuss the two-way speed. But we could choose anything else.

If you meant the two-way speed, then Michelson-Morley experiment shows that it is isotropic at the surface of Earth. Now consider the most popular Ether theory of that time past: the Ether is at rest in the geocentric frame, or even in the heliocentric frame. Then the M&M apparatus, thanks to the Earth spinning and revolving about the Sun moves with a great variety of speeds with respect to the Ether. So we can say that the isotropy of the two-way speed of light is observed in the corresponding great variety of inertial frames. Thus assuming that it is isotropic in any inertial frame is a legit assumption to make, to see where that leads when one tries to find space and time transformations between frames which would realise that. That's precisely what Einstein did in his 1905 paper.

  • @StéphaneRollandin yes, indeed, stupid me! Fixed. –  Jul 01 '18 at 15:40
  • @Albert Not going into the details of your proposal, you have just described one synchronisation procedure among an infinity of possibilities. On what ground should it be preferred? Going into details, I actually think the bar will not straighten up and furthermore that the wheel at the other end will oscillate. –  Jul 05 '18 at 05:56
1

I'll answer the third questions posed in the question in reverse order, and finally return to the third.

  1. And sometimes I read that it's just an axiom that we need to accept.

What's wrong with axioms? Einstein himself listed the constancy of the speed of light as a postulate (aka axiom; they're synonyms). Newton's laws of motion are axioms. Hilbert's sixth problem, announced in 1900, addressed the need to axiomatize physics. There's nothing wrong with axioms.

To the contrary; axioms in physics are a very good thing. They form the mathematical basis from which physicists predict the outcomes of experiments. One or more of the underlying assumptions (another name for axioms) are falsified if the outcome disagrees with the prediction.

  1. Sometimes I read that he derived it from Maxwell's equations. I still need to study those, so I can't judge about that.

At the time Einstein developed special relativity, Maxwell's equations were consistent with experiments of an electromagnetic nature while Newtonian mechanics appeared to be inconsistent with such experiments. This was one of the key conundrums of late 19th century physics. One of those axiomatizations appears to be incorrect.

One of the many face-value consequences of Maxwell's equation is that the one-way speed of electromagnetic radiation (which includes light) is the same to all inertial observers. In particular, the expression for the speed of a wave of electromagnetic wave in vacuum depends on neither the speed of the source nor the receiver. It's simply $c = 1/\sqrt{\mu_0 \varepsilon_0}$, where $\mu_0$ and $\varepsilon_0$ are the magnetic permeability and electrical permittivity of space. Einstein simply took Maxwell's equations at face value, that the speed of light in vacuum is the same to all observers.

  1. Sometimes I read it's because of the Michelson-Morley experiment. But that only proves there's no aether/medium for light to travel through.

The Michelson-Morley experiment disproves the existence of a luminiferous aether as conceived of at the time of that experiment. It does not disprove the existence of an ether. A competing axiomatization, Lorenz Ether Theory, developed at the same time Einstein developed his special theory of relativity, maintained the concept of a preferred frame of reference and of an ether. More on this later.

As Grayscale noted in his answer, the Michelson-Morley experiment had little if any influence on Einstein's thinking. Einstein was much more influenced by the Fizeau experiment, which looked at the speed of light in a moving medium such as flowing water. If Newtonian mechanics was true, such a medium should have a linear affect on the speed of electromagnetic waves, just as blowing air linearly effects the speed of sound waves. This was not what the Fizeau experiment found. It instead found a relationship that, after the fact, was found to be consistent with special relativity (and also Lorenz Ether Theory).

  1. And sometimes I read that it's just an axiom that we need to accept.

You don't need to accept the constancy of the one-way speed of light as axiomatic. You could accept Lorenz Ether Theory instead. It is indistinguishable from special relativity in terms of predicted experimental outcomes. The problem with Lorenz Ether Theory is that it is chock full of "where did that come from" (WTF) assumptions. It makes length contraction, time dilation, a preferred frame of reference, and an ether that acts as a medium for electromagnetic radiation in empty space axiomatic. Length contraction and time dilation nicely hide the preferred frame of reference and the ether from any experimental test designed to detect them. Nice. (That was a sarcastic "nice".)

Special relativity only has one "where did that come from" assumption, the constancy of the one-way speed of light, and that is only a ""where did that come from" assumption because we are too preconditioned by Newtonian thinking. The one-way constancy of the speed of light is a beautiful thing that speaks directly about the geometry of space-time. In contrast, Lorenz Ether Theory is a rather ugly thing that fails Occam's razor that does not speak directly to the mathematics that underlies space and time.

David Hammen
  • 41,359
  • I think that you miss one thing. SR is the LET. In SR observer simply cannot say, that he is in motion, nothing else. It is simply illogical - to think, that you cannot move, i.e. change spatial position in someone's frame. Please find a problem here: spiff.rit.edu/classes/phys200/lectures/doppler/doppler.html. The problem is about Joe and Aliens. Joe is "at rest" and Aliens fly relatively to him. What do you think, will the Aliens see dilation of Joe's clock (redshift of frequency?). Hint: aberration of light. –  Jul 04 '18 at 21:42
  • In SR ANY observer MUST think that he is at rest. Even in SR, as soon, as an observer thinks, that he is in motion, or ASCRIBES himself state of motion, he will see, that clock "at rest" is running faster and measuring rod "at rest" is longer, than his own. Look at relativistic aberration formula:https://en.wikipedia.org/wiki/Aberration_of_light. In the denominator you see the distance along the x axis. Why these distances $\gamma$ times increase ? Because measuring rod of the moving observer contracts. Look for relativistic aberration here: http://www.feynmanlectures.caltech.edu/I_34.html –  Jul 04 '18 at 22:13
  • @Albert - While the predictions of SR are indistinguishable from the predictions of LET, SR is anything but LET. LET is an ugly beast with ad hoc and unneeded metaphysical axioms. SR is a beautiful thing in comparison. – David Hammen Jul 04 '18 at 22:59
  • Does SR predicts, that due to dilation of his own clock, moving observer sees, that clock "at rest" is ticking $\gamma$ times faster? How should you synchronize clocks in your frame, so as to get this result? One - way speed of light will not be c, if you understand what role plays synchronization of clock in Special Relativity. Do you understand, that reciprocity is achieved only due to the same synchronization procedure? –  Jul 04 '18 at 23:02
  • Please look here: http://www.mathpages.com/home/kmath587/kmath587.htm (Since both emitter and receiver have the speed v relative to this system of reference, there is no differential time dilation.). How should you synchronize clocks, so as not to see any time dilation? I will tell you. You have to synchronize them with that assumption, that one way velocity of light is not c. One frame moves and another frame also moves. In a frame that is moving, one - way velocity of light is not c, it is obvious. Look at transverse Doppler shifts, they tell everything, –  Jul 04 '18 at 23:15
  • Mein gott. If there is no reciprocity of observations by means of observing transverse Doppler shift, then Einstein's synchronization procedure must be changed, with assumption that one way speed of light is not c. It i very simple. If you think, that you are moving, take you own velocity in a frame, in which you are moving and re-synchronize clocks . It is the same, as to take time from "stationary frame". To assume that one - way speed of light is c in each frame is RUBBISH. –  Jul 04 '18 at 23:21
  • @Albert - To assume that one - way speed of light is c in each frame is RUBBISH. Okay then. That is your opinion, to which you are entitled (but you are not entitled to your own facts). After the fact, photons do not need a medium, which alone annihilates one of the two key motivating factors that underlying LET, which is that a medium is needed to transmit electromagnetic waves. After the fact, general relativity annihilates the other key motivating factor, which is that there exists a preferred universal frame of reference. There is no universal frame of reference, preferred or otherwise. – David Hammen Jul 05 '18 at 05:32
-4

Einstein "borrowed" his constant-speed-of-light postulate from the ether theory:

Albert Einstein: "...I introduced the principle of the constancy of the velocity of light, which I borrowed from H. A. Lorentz's theory of the stationary luminiferous ether..." https://en.wikipedia.org/wiki/Lorentz_ether_theory

In 1887 the Michelson-Morley experiment UNEQUIVOCALLY confirmed the variable speed of light posited by Newton's emission theory of light and refuted the constant (independent of the speed of the light source) speed of light posited by the ether theory and later adopted by Einstein as his 1905 second postulate:

John Norton: "The Michelson-Morley experiment is fully compatible with an emission theory of light that CONTRADICTS THE LIGHT POSTULATE." http://philsci-archive.pitt.edu/1743/2/Norton.pdf

Banesh Hoffmann, Relativity and Its Roots, p.92: "Moreover, if light consists of particles, as Einstein had suggested in his paper submitted just thirteen weeks before this one, the second principle seems absurd: A stone thrown from a speeding train can do far more damage than one thrown from a train at rest; the speed of the particle is not independent of the motion of the object emitting it. And if we take light to consist of particles and assume that these particles obey Newton's laws, they will conform to Newtonian relativity and thus automatically account for the null result of the Michelson-Morley experiment without recourse to contracting lengths, local time, or Lorentz transformations. Yet, as we have seen, Einstein resisted the temptation to account for the null result in terms of particles of light and simple, familiar Newtonian ideas, and introduced as his second postulate something that was more or less obvious when thought of in terms of waves in an ether. If it was so obvious, though, why did he need to state it as a principle? Because, having taken from the idea of light waves in the ether the one aspect that he needed, he declared early in his paper, to quote his own words, that "the introduction of a 'luminiferous ether' will prove to be superfluous." https://www.amazon.com/Relativity-Its-Roots-Banesh-Hoffmann/dp/0486406768