2

(i) In Dirac’s book, The Principles of Quantum Mechanics, there is something quite baffling about the observed velocity of a particle. On p. 262 of the fourth edition, he wrote,

'We can conclude that a measurement of a component of the velocity of a free electron is certain to lead to the result $\pm c$. …. We shall find upon further examination of the equations of motion that the velocity is not at all constant, but oscillates rapidly about a mean value which agrees with the observed value.'

If the observed value is $c$, then the oscillation about this value will involve particle speed greater than $c$. This is highly problematic and suggests there is something wrong with his derivation which is based on the Heisenberg picture to get the expressions for position and velocity of the particle. He seemed to have ignored the problem.

(ii) The following problem also suggests that there is something wrong with his derivation based on the Heisenberg picture. On p. 267, he noted, 'We were led to the value of $\hbar/2$ for the spin of the electron by an argument depending simply on general principles of quantum theory and relativity. One could apply the same argument to other kinds of elementary particle and one would be led to the same conclusion, that the spin angular momentum is half a quantum. This would be satisfactory for the proton and the neutron, but there are some kinds of elementary particle (e.g., the photon and certain kinds of meson) whose spins are known experimentally to be different from $\hbar/2$, so we have a discrepancy between our theory and experiment.'

He then went on to seek to explain the discrepancy, based on his assumption that particles whose spins are other than $\hbar/2$ do not have their position as an observable. However, the explanation is short of being convincing.

(iii) How do we deal with these two difficulties in Dirac's derivation of the velocity and spin of a particle?

Damon
  • 197

2 Answers2

2

Zitterbewegung is an artifact of the Dirac lagrangian. The conclusion that electrons move at velocity $c$ is manifestly inconsistent with special relativity.

It is interesting that this inconsistency has found general acceptance in physics. It shows that the physics community is fallible. I should explain the problem and its solution in a paper but have not done so yet. The reason is that such a paper will likely be generally ignored unless the author has a very strong affiliation.

my2cts
  • 24,097
1

I have the fourth edition to hand, so let's go through your questions.

The speed issue

Your truncated quotation ends with a paragraph worth quoting in full:

Since electrons are observed in practice to have velocities considerably less than that of light, it would seem that we have here a contradiction with experiment. The contradiction is not real, though, since the theoretical velocity in the above conclusion is the velocity at one instant of time while observed velocities are always average velocities through appreciable time intervals. We shall find upon further examination of the equations of motion that the velocity is not at all constant, but oscillates rapidly about a mean value which agrees with the observed value.

Still confused? Well, here's the crucial point: velocities are vectors; speeds are magnitudes of vectors. Notice Dirac never once talks of speeds in that paragraph! We can unpack his point a little:

  • A velocity is "less than" a given value when its magnitude is small, so technically this is a comment about speed;
  • The instantaneous speed doesn't have to vary for the speed averaged over an observational period to be much smaller than $c$, as the direction of motion can vary;
  • Although there is a classical conservation of each component of the velocity vector, the quantized velocity vector is not "conserved between observations" because each velocity measurement determines a velocity in a specific direction, and in other directions the component is now randomly $\pm c$ viz. a superposition.

So when Dirac talks about fluctuations around an average velocity, he doesn't mean fluctuations around an average speed, e.g. from $9c/10$ to $11c/10$. If you consider speed-$c$ velocities as the surface of a radius-$c$ sphere in velocity space, he means fluctuations in a small patch on the spherical surface around a central point. Since a ball is convex, averaging gives observational speeds less than $c$. (Dirac also discusses this in terms of momenta: each speed-$c$ state has infinite momentum, but this averaging effect gives a finite empirical momentum much larger than $m_ec$.)

Well, what he actually means is the quantum-mechanical version of that, where there's a probability distribution over the surface, with the eigenvalue implications Dirac mentioned.

The spin issue

Again, let's quote his explanation:

The answer is to be found in a hidden assumption in our work. Our argument is valid only provided the position of the particle is an observable. If this assumption holds, the particle must have a spin angular momentum of half a quantum. For those particles that have a different spin the assumption must be false and any dynamical variables $x_1,\,x_2,\,x_3$ that may be introduced to describe the position of the particle cannot be observables in accordance with our general theory. For such particles there is no true Schrödinger representation. One might be able to introduce a quasi wave function involving the dynamical variables $x_1,\,x_2,\,x_3$, but it would not have the correct physical interpretation of a wave function–that the square of its modulus gives the probability density. For such particles there is still a momentum representation, which is sufficient for practical purposes.

Dirac's calculation of $\dot{m}_1+\frac12\hbar\dot{\sigma}_1$ requires $m_1$ to be an operator, so we can calculate its commutator with $H$, and similarly with $\sigma_1$. While each of these requirements relies on momentum being an observable, for $m_1$ there is also a reliance on position being an observable. If it is, the proof that the spin is $\frac12$ succeeds; contrapositively, other spins imply position and/or momentum aren't observables. It's actually position that loses out here, and not momentum, because velocity is conserved, i.e. momentum is a good quantum number. (Dirac noted this conservation is due to $H$ being linear in the momentum, which is unlike the quadratic relation in Newton's mechanics.) By contrast, position isn't conserved because the momentum is nonzero.

All observed particles that aren't of spin $\frac12$ are integer-spin quanta of force-carrying fields. Energy-momentum is stored in these fields, but the quanta don't need specific positions to interact with spin-$\frac12$ matter particles such as electrons. We see the effects on such fermions of their interactions with the integer-spin fields, or with other fermions via these fields, and can calculate these effects with Feynman diagrams (or something similar) that use the bosons. But there isn't something analogous to the electron's wavefunction for photons in modern QFT. What we have instead is a quantized version of Maxwell's vector field $A^\mu$.

Minor detail

Of course, Dirac's writing wasn't the final word on everything. The first and fourth editions were respectively published in 1930 and 1958, both long before we understood what mesons really are. Ever wondered why Dirac only said "some" mesons aren't spin $\frac12$? In those days, muons were considered mesons, because they were of medium mass. By contrast, today only quark-antiquark pairs are considered mesons, so the spin must be $0$ or $1$.

J.G.
  • 24,837
  • Thank you for the clarification of the variation of the particle's direction of velocity, i.e., the variation is not in its speed. My immediate question is related to what you reported from Dirac, 'Dirac also discusses this in terms of momenta: each speed- state has infinite momentum.' The particle's speed of c implies its infinite momentum and its infinite energy. Is this not a problem? – Damon Feb 08 '20 at 23:56
  • @Damon Physical infinities are only a problem if they're predicted to be observable. The empirical momentum is effectively determined by the subluminal averaged velocity. – J.G. Feb 09 '20 at 00:05
  • As you have mentioned, indeed on p.266 the momentum referenced is an operator. Hence, the sigma refers to the three sigma matrices which work with the momentum operators. But later he used sigma as a vector, for which he should have given the reader a little explanation. Also, what is the magnitude of this vector? Presumably, it is 1. Again, he did not explain or discuss. – Damon Feb 09 '20 at 00:14
  • @Damon That's standard; it's a vector whose entries are matrices, so $\sigma\cdot a:=\sum_i\sigma_i\hat{a}_i$. – J.G. Feb 09 '20 at 08:11
  • Indeed, this is how the three sigma matrices work with the three momentum operators. If $\sigma$ stands for a vector of matrices, then $\frac {\hbar}{2} \sigma$ is a vector of matrices. How then did he arrive at the scalar spin value of $\frac {\hbar}{2} $ from this vector of matrices? – Damon Feb 09 '20 at 13:32
  • Ah, sorry: the $\sigma_i$ never denote the Pauli matrices in this chapter, but rather $2\times2$ diagonal block matrices expressible in terms of them (i.e. $4\times4$ matrices that can act on spinors, which makes more sense), viz. his explanation of the chapter's Eq. (9). Then $\dot{m}_1+\frac12\hbar\dot{\sigma}_1$ is shorthand for the operator $\dot{m}_1I_4+\frac12\hbar\sigma_1\partial_t$ on spinors. – J.G. Feb 09 '20 at 13:42
  • Thanks for further clarification. With that understanding, there are still some steps to take before one can arrive at Dirac's conclusion in the third paragraph of p.267, 'We were led to the value $\frac {\hbar}{2} $ for the spin of the electron ...' It is not clear what these steps are. – Damon Feb 09 '20 at 17:22
  • Coming back to the nature of $\sigma$ again, in the second paragraph of p.267, he seemed to use it as an ordinary vector, not a vector of matrices, since $\sigma$ gives the (same) direction for the spin angular momentum and the spin magnetic moment. It is rather confusing whether $\sigma$ at that point is used as a vector of matrices or an ordinary vector. – Damon Feb 09 '20 at 17:23