Quantized Dirac fields' plane wave component— better, deeper understanding

Question

How do we digest the fact that the Dirac field, creation $\psi$ and annihilation $\bar\psi$ operators, have their plain wave component in this form (see Peskin QFT p.58):

$$\psi(x)=\int \frac{d^3p}{(2\pi)^3} \frac{1}{\sqrt{2E_\mathbf{p}}}\sum_s \left( a_\mathbf{p}^s u^s(p)e^{-ip\cdot x}+ b_\mathbf{p}^{s\dagger}v^s(p)e^{ip\cdot x} \right); \tag{3.99}$$ $$\bar{\psi}(x)=\int \frac{d^3p}{(2\pi)^3} \frac{1}{\sqrt{2E_\mathbf{p}}}\sum_s \left( b_\mathbf{p}^s\bar{v}^s(p)e^{-ip\cdot x}+ a_\mathbf{p}^{s\dagger}\bar{u}^s(p)e^{ip\cdot x} \right). \tag{3.100}$$ The creation and annihilation operators obey the anticommutation rules $$\{a_\mathbf{p}^r,a_\mathbf{q}^{s\dagger}\} =\{b_\mathbf{p}^r,b_\mathbf{q}^{s\dagger}\} =(2\pi)^3\delta^{(3)}(\mathbf{p}-\mathbf{q})\delta^{rs}, \tag{3.101}$$

It is clear from the earlier plain wave solutions in section 3.3., the particle plain wave is $u e^{-ipx}$ and the antiparticle plain wave is $v e^{+ipx}$. However, when we use them into the field operator, creation or annihilation, we can apply the complex conjugation, such that we can choose also from, $u^* e^{ipx}$ and $v^* e^{-ipx}$.

However, questions are:

How the fermion annihilation operator contains

the annihilation of particle $a_p$ with the plain wave $u e^{-ipx}$
the creation of antiparticle $b_p^\dagger$ with the plain wave $v e^{+ipx}$

Why not the annhilation of particle $a_p$ has $u^* e^{ipx}$?

Why not the creation of antiparticle $b_p^\dagger$ has $v^* e^{-ipx}$?

Once the fermion annihilation operator is fixed, the fermion creation operator is the complex conjugation transpose of the above. So similar, we can ask

Why not the annhilation of antiparticle $b_p$ has $v^T e^{+ipx}$?

Why not the creation of particle $b_p^\dagger$ has $u^T e^{-ipx}$?

Are these just matters of conventions, or do they imply something important, for these linear combinations with particular propagating mode $$ e^{-ipx} \text{ or } e^{+ipx}? $$

They explain all of this in the text starting from equation (3.87) onwards, they even purposely use some bad notation in (3.87) they then correct in (3.99) to emphasize it. If you don't appreciate how they actually got the time-independent equation (3.87), even though the Dirac equation and it's solutions are time-dependent, which means going directly from the solutions in section 3.3 to equation (3.87) involves a big jump (please actually try doing) they assume in writing down (3.87), then read this and adapt to the Dirac equation case. — bolbteppa, Sep 23 '21 at 22:41
thanks, but I am asking a shorter and an intuitive understanding, instead of following a long chain of mathematical arguments. — Марина Marina S, Sep 24 '21 at 01:34
maybe you can contribute an answer in a clear conceptual way. I certainly knew how to follow the equations. But it is not yet a clear explanation — Марина Marina S, Sep 24 '21 at 04:36
Take a look at the Complex Fourier series. I guess your question is only a matter of mathematics, it doesn't involve a significant physical interpretation. https://en.wikipedia.org/wiki/Fourier_series Start from the real and then get complex and you see that it's just a separation of possible terms into two pieces in the real case, but in the complex case, it's a more economic representation of the Fourier series(otherwise it would include four terms instead of two). And about the second question don't forget about the ${\gamma}_0$ matrix. — Bastam Tajik, Sep 24 '21 at 09:55

bolbteppa · Answer 1 · 2021-09-24T23:22:04.167

Here is a hopefully more conceptual perspective from [1] and [2] on why one chooses e.g. $\hat{a}_p u e^{-ipx}$ rather than $\hat{a}_{\mathbf{p}} \overline{u}_{\mathbf{p}} e^{+ipx}$ or something to represent the annihilation of a particle in the wave operator expansions.

In non-relativistic quantum mechanics, if $$\psi_n(x,t) = \psi_n(x) e^{- i E_n t}$$ are the stationary states of the $\hbar = 1$ Schrodinger equation, and $\hat{A}$ is some operator, then the matrix element of $\hat{A}$ for a transition from $|i>$ to $|f>$ is given by $$A_{fi}(t) = \int \psi_f^*(x) \hat{A} \psi_i(x) dx e^{-i(E_i - E_f)t} = A_{fi} e^{- i \omega_{if}t}$$ where we set $$\omega_{if} = E_i - E_f.$$ If we then do second quantization for a system of identical particles such that the stationary states of a single such particle are the above stationary states, and assume $E_i, E_f,E_k$ are all non-negative (important for comparison with the discussion below), then a transition where the number of particles in the $k$'th stationary state increases by one has a final energy $$E_f = E_i + E_k$$ so that $\omega_{if} = E_i - E_f = - E_k$ implies the time dependence of the matrix element of this transition is $$A_{fi}(t) = A_{fi} e^{+i E_k t},$$ Thus single-particle operators with a $e^{+i E_k t}$ time dependence represent transitions where the particle number increases by one (creation operators). Similarly if $$E_f = E_i - E_k$$ is the energy of a transition where the particle number in the $k$'th stationary state decreased by one we have $\omega_{if} = E_i - E_f = + E_k$ so that the time dependence of the matrix element is $$A_{fi}(t) = A_{fi} e^{-i E_k t}.$$ so single-particle operators with a $e^{- i E_k t}$ time dependence represent transitions where the particle number decreases by one (annihilation operators).

Since in second quantization of a system of identical particles one promotes the general expansion of a single-particle wave function in terms of it's stationary states $\psi(x,t) = \sum_n a_n \psi_n(x) e^{-i E_n t}$ to a quantum field operator $$\hat{\psi}(x,t) = \sum_n \hat{a}_n \psi_n(x) e^{-i E_n t}$$ we can interpret a non-relativistic quantum field operator as a sum of single-particle annihilation operators $\hat{a}_n$ where $\hat{a}_n$ is an operator annihilates a particle in the $n$'th stationary state, and their adjoint $$\hat{\psi}^{\dagger}(x,t) = \sum_n \hat{a}_n^{\dagger} \psi_n^* e^{+i E_n t}$$ as a sum of single-particle creation operators, where $\hat{a}_n^{\dagger}$ creates a particle in the $n$'th stationary state. We can guess based off the above what the 'things' that the $\hat{\psi}$'s are supposed to act on will look like, i.e. states as creation operators on a vacuum, and it's a bit more work to determine commutation relations or anti-commutation relations from this set-up. Notice also that we really needed to use the Heisenberg picture perspective to achieve this interpretation.

When we go to relativistic quantum mechanics the above interpretations should not only still hold in the non-relativistic limit directly, but there is no reason why we can't consider all stationary states to be of the above form $\psi_n(x) e^{-i E_n t}$ as before, and so directly carry over the previous interpretation. The only difference here is that, due to relativity, even for free particles we now find that $E = \pm |E|$ is possible, i.e. energy seems to be either positive or negative, while in the non-relativistic case the energy of a free particle is very importantly a positive quantity. In non-relativistic quantum mechanics this happens all the time, negative energy eigenvalues just indicate a discrete spectrum (while positive energy eigenvalues indicate the continuous spectrum), but the negative energy discrete spectrum actually only occurs in a non-relativistic potential and the discrete spectrum conclusion for negative energies explicitly relies on the fact we are not studying a free particle because it's energy is positive.

Thus, for relativistic quantum mechanics, when we try to consider even the case of a free particle, we still unavoidably seem to find positive and negative energy eigenvalues, and simply can't ignore the stationary states associated to the negative energy eigenvalues. The most common approach is to always interpret the energy $E$ of a free particle as a positive quantity, and instead interpret the quantum field operator associated to the general expansion of a single-particle wave function in the stationary states $$\hat{\psi}(\mathbf{r},t) = \sum_{\mathbf{p}} \hat{a}_{\mathbf{p}}^{(-)} \psi_{(E_{\mathbf{p}},\mathbf{p})}(\mathbf{r}) e^{-i E_{\mathbf{p}}t} + \sum_{\mathbf{p}} \hat{a}_{\mathbf{p}}^{(+)\dagger} \psi_{(-E_{\mathbf{p}},\mathbf{p})}(\mathbf{r}) e^{+i E_{\mathbf{p}}t}$$ as not only just annihilating particles (the first sum) as in the non-relativistic case, but also creating particles (the second sum). The sum really means a sum over $\mathbf{p}, s, ...$ whatever the stationary states depend on (i.e. an integral over momentum $\mathbf{p}$ and a sum over spin $\sigma$ etc...).

Also, although we are setting up a second quantized theory of a system of identical particles, there is some freedom in that $\hat{a}_{\mathbf{p}}^{(-)}$ can in general be annihilating one type of particle, and the operator $\hat{a}_{\mathbf{p}}^{(+)\dagger}$ can be (related to an operator that is) creating a different type of particle (anti-particles), but the two types of systems of identical particles should have the same mass as they came from the same Klein-Gordon/Dirac/... equation. This also allows the special case where one is in fact only deaaling with one species of particle, thus the field is called a neutral field ([2] Sec. 2, 12, 14, 21).

One can further argue on symmetry grounds that a free particle stationary state should always have, using the $(+,-,-,-)$ metric, a $$e^{-i p x} = e^{-i(E_{\mathbf{p}} t - \mathbf{p} \cdot \mathbf{r})}$$ dependence in them, where the overall $-i$ sign is just a choice to agree with the non-relativistic case of having $e^{-iEt}$ time dependence, so that the above general expansion can be written as $$\hat{\psi}(\mathbf{r},t) = \sum_{\mathbf{p}} \hat{a}_{\mathbf{p}}^{(-)} \psi_{(E,\mathbf{p})} e^{-i(E_{\mathbf{p}} t - \mathbf{p} \cdot \mathbf{r})} + \sum_{\mathbf{p}} \hat{a}_{\mathbf{p}}^{(+)\dagger} \psi_{(-E,\mathbf{p})} e^{-i(- E_{\mathbf{p}} t - \mathbf{p} \cdot \mathbf{r})}$$ Here the $\psi_{(E,\mathbf{p})}$ and $\psi_{(-E,\mathbf{p})}$ are just the amplitudes of the stationary states, e.g. the bispinor amplitudes $u,v$ and associated normalization factors (note one should ask why we're allowed to even use normalization factors in a continuous spectrum free particle problem where the norm of an individual eigenfunction is technically infinite, see my final comments below) in the Dirac equation case. Also, the terms in the exponentials in the second sum are not even apparently relativistic as written we should really send $\mathbf{p} \to - \mathbf{p}$ $$\hat{\psi}(\mathbf{r},t) = \sum_{\mathbf{p}} \hat{a}_{\mathbf{p}}^{(-)} \psi_{p} e^{-i(E_{\mathbf{p}} t - \mathbf{p} \cdot \mathbf{r})} + \sum_{\mathbf{p}} \hat{a}_{-\mathbf{p}}^{(+)\dagger} \psi_{-p} e^{i(E_{\mathbf{p}} t - \mathbf{p} \cdot \mathbf{r})}$$ Thus we find the second term represents the creation of particles with momentum $-\mathbf{p}$ that can be interpreted as anti-particles if they are a different type of particle, or a 'neutral field' if they are the same type off particle as referenced above. In the anti-particle case they are given the new labels $\hat{a}_{\mathbf{p}}^{(+)\dagger} = \hat{b}_{-\mathbf{p}}^{\dagger}$ and the first term is similarly re-labelled $\hat{a}_{\mathbf{p}}^{(-)} = \hat{a}_{\mathbf{p}}$ to give $$\hat{\psi}(\mathbf{r},t) = \sum_{\mathbf{p}} \hat{a}_{\mathbf{p}} \psi_{p} e^{-ipx} + \sum_{\mathbf{p}} \hat{b}_{\mathbf{p}}^{\dagger} \psi_{-p} e^{ip x}$$ or in the neutral case something like $\hat{a}_{\mathbf{p}}^{(+)\dagger} = \hat{c}_{-\mathbf{p}}^{\dagger}$ and $\hat{a}_{\mathbf{p}}^{(-)} = \hat{c}_{\mathbf{p}}$ so that $$\hat{\psi}(\mathbf{r},t) = \sum_{\mathbf{p}} \hat{c}_{\mathbf{p}} \psi_{p} e^{-ipx} + \sum_{\mathbf{p}} \hat{c}_{\mathbf{p}}^{\dagger} \psi_{-p} e^{ip x}$$

In the case of the Dirac equation, from the explicit form of the stationary states we can re-write the anti-particle case as $$\hat{\psi}(x)=\int \frac{d^3p}{(2\pi)^3} \frac{1}{\sqrt{2E_\mathbf{p}}}\sum_s \left( \hat{a}_\mathbf{p}^s u^s(p)e^{-ip\cdot x} + \hat{b}_\mathbf{p}^{s\dagger}v^s(p)e^{ip\cdot x} \right); \tag{3.99}$$

Hopefully it's completely obvious now that arbitrarily deciding to interpret whatever operator is attached to say $e^{+ipx}$ (in the $(+,-,-,-)$ metric) in $\hat{\psi}$ as an annihilation operator would result in a time dependence that just completely disagrees with the discussion I started from - the usual second quantization time dependence for quantum mechanical single particle transitions that increase the number of particles in that stationary state.

You should be able to figure out the interpretation of $\hat{\psi}^{\dagger}$ based off this, or rather $\hat{\overline{\psi}} = \hat{\psi}^{\dagger} \gamma^0$. The notation/conventions with the $u$'s and $v$'s is to do with choices made in finding the stationary states of the Dirac equation in the first place, it is hopefully clear that it is the propagating modes $e^{\pm i px}$ (and the choice of metric if one uses this notation) that is vitally important in the interpretation we are being consistent.

Note how the relativistic free particle comments are inherently based in the continuous spectrum, every step of it involves physically interpreting a free particle as having the energy associated to a given stationary state. Yet it is a very common belief, even on this site, that a rigorous approach says one cannot physically interpret free particle eigenfunctions.

If taken seriously, this is saying that it's okay to throw out a physical interpretation for every single continuous spectrum free particle eigenfunction, but presumably also saying it's not okay to throw away a physical interpretation of the 'negative energy' free particle eigenfunctions in the relativistic case because of their absolutely historic importance, one should judge for themselves.

References:

Landau and Lifshitz, "Quantum Mechanics", 3rd Ed.
Landau and Lifshitz, "Quantum Electrodynamics", 2nd Ed.

This is a fantastic answer, kudos! I have a comment about the passage where you say we give "a new label $\hat a_p^\dagger = \hat b_{-p}^\dagger$. That is not exactly what happens right? In 1st quantisation there are constraints on the $c$-number coefficients in front of the plane wave solutions, and those $c$-number constraints get translated into operator constraints. For example, for a real scalar, you have that the coefficients in front of +ve energy solutions are the complex conjugate of the coefficients in front of the -ve energy solutions, and this translates to no antiparticles. — Andrea, Sep 24 '21 at 08:39
Thanks a lot. As it was written above, if we don't re-label there we're going to have a $-\mathbf{p}$ floating around no? From the above perspective, if I understand, the question seems to be is this re-labelling 'adding' a creation operator for a new particle (the anti-particle $b^{\dagger}$ case) or the same particle (neutral $c^{\dagger}$ case) - I'd suggest comparing how [2] introduces Maxwell (i.e. classical real field constraint assumed in advance) vs. how it's re-derived in the 'spin one' section as a special case of assuming the field is 'strictly neutral' on this 'constraint' point. — bolbteppa, Sep 24 '21 at 09:37
Mh.. not sure I follow you. Let me try to rephrase what I'm saying. To be blunt, in general you have no reason to assume, in the first formula where you have two sums, that the operators in the second sums are related to those in the first sum. You should already start with $\hat a_p$ and $\hat b_p$ and then in case re-label $\hat b_p$ to $\hat b_{-p}^\dagger$ or whatever when you want to interpret those operators as creating particles with positive energy. — Andrea, Sep 24 '21 at 13:08
@Andrea Great catch, used bad notation there initially and should have emphasized the different possibilities more, hopefully fixed now. — bolbteppa, Sep 24 '21 at 21:53

Quantized Dirac fields' plane wave component— better, deeper understanding

1 Answers1

Linked