16

I've been wondering about the standard argument that the only possible identical particles in three dimensions are bosons or fermions. The argument goes like this:

Consider exchanging the positions of two identical particles in 3D. Because the particles are identical, $|\Psi|^2$ remains the same after having performed the exchange, so we must have $$\Psi \to \Psi e^{i\theta}.$$ If we perform the exchange twice, the path described is homotopic to the trivial path. This is easiest to see by realizing that the double exchange of particles A and B is the same as having particle A encircle particle B completely, and this path can just be lifted into the third dimension and shrunk down to a point. Alternatively, it's because $\pi_1(S^2/\mathbb{Z}_2) = \mathbb{Z_2}$.

Because the phase picked up along a path is a homotopy invariant, and the path of a double exchange is homotopic to the trivial path, $$e^{2 i \theta} = 1$$ which implies that under a single exchange, $\Psi\to\pm\Psi$, corresponding to bosons and fermions.

This argument sounds good, but it sneaks in the crucial physical input without justification: why should the phase be a topological invariant? Why can't it change under a deformation of the path? Not only do I not know how to prove this, it doesn't even appear to be true; for example, if there were a magnetic field, changing the path would change the magnetic flux through it, and hence the phase.

knzhou
  • 101,976
user94624
  • 341
  • 1
    Your last comment seems to be the answer you are looking for. Moreover, only in 3 or more dimensions you will be able to "lift" the loop and continually deform it to the trivial path. Notice that you cannot do that for the simple exchange since this path is not a closed path. By the way, where does the $S^2/\mathbb Z_2$ in your fundamental group come from? – Diracology Feb 12 '16 at 10:26
  • I've rephrased my question a bit.

    The $S^2/\mathbb{Z}_2$ comes from going to relative coordinates and and assuming that the particles have a constant separation during exchange (this is the $S^2$ part), while the quotient comes from identifying opposite points on the sphere. Due to this I think a single exchange actually does correspond to a closed path on this space. For this space it holds that a double exchange is again closed, but contractible to a point.

    – user94624 Feb 12 '16 at 14:33
  • Have a look at this site about parastatistics: https://en.wikipedia.org/wiki/Parastatistics – Lewis Miller Feb 12 '16 at 15:53
  • I don't really see how that relates to my question - is there something in there related to the phase being topological? – user94624 Feb 22 '16 at 13:03
  • As far as I know, the wavefunction is a notion not applicable to topological phases; that's what makes them topoligical, there is no local order parameter possible in description (unlike Landau-Ginzburg phase transitions) – Ilja Mar 26 '16 at 21:04
  • A thought: one of the standard conditions for proofs of the spin-statistics connection, as I understand it, is that there be a Lorentz-invariant vacuum state. It is not clear to me that this would be the case in this situation. In particular, if in one particular inertial frame the phase a particle acquires only depends on the spatial path that is taken, this will not hold in any other inertial frame, and the vacuum thus picks out one frame as "special." – Rococo Mar 28 '16 at 22:37
  • I actually addressed this very question in a previous answer: http://physics.stackexchange.com/a/168728/6461

    Hope you find it useful.

    – Ronak M Soni Mar 29 '16 at 23:22

2 Answers2

6

The phase picked up during identical-particle exchange actually isn't a topological invariant. It comes from the combined contribution of many different factors:

  1. The standard dynamical phase $\sim e^{-i E t}$ resulting from Schrodinger time evolution and the fact that the exchange takes a positive amount of time

  2. The Berry's phase, which does not depend on the exchange duration but does depend on its specific geometric (not merely topological) path

  3. The instantaneous energy eigenbasis transformation (in the nonabelian case)

  4. Diabatic effects resulting from the fact that the exchange process is not infinitely slow

  5. Thermal effects if the system is at finite temperature

  6. Finite-size interaction effects from the fact that the anyons are not infinitely far-separated

These various factors do not always separate cleanly, but roughly speaking, the combined effects of the second and third contributions only depend on the braid's topology. For a gapped topological phase, the last two are exponentially small in the appropriate regimes, but they're always there. But for any physical system, there is actually only a window of exchange times, which is bounded in both directions, in which the topological phase dominates. If the exchange is too fast then diabatic errors mess things up, but if it's too slow than finite-size errors mess things up. In practice, the former issue is more problematic, because diabatic errors are generically only suppressed algebraically while finite-size errors are suppressed exponentially. Some of these phase shifts - particularly the dynamical one - can (hopefully) be ignored, because they are the same for every anyon that participates in the braid, and only phase differences are physically relevant.

For abelian anyons the topological phase is basically just an Aharanov-Bohm phase that the particle acquires (in addition to an ordinary dynamical phase) from circling magnetic flux, as you say. The key difference from the regular Aharanov-Bohm setup is that the flux is very tightly localized around the other anyon. So basically, if the other anyon is in the loop then the moving anyon circles all the flux and picks up the full exchange phase, and if it's out of the loop then it doesn't encircle any flux. In reality, for a system with a finite correlation length the anyons are not perfectly localized and some flux will leak into or out of the loop, but since the effective magnetic field around an anyon decays exponentially in a gapped phase, we can neglect this effect if the anyons stay separated much farther than the correlation length. Part of the reason why the importance of non-topological contributions to anyon phases isn't fully appreciated is that most people build up their intuition on the toric code, which is highly non-generic because it has zero correlation length, so anyons are "well-separated" even on adjacent lattice sites.

See the introduction to this paper for a discussion of these various subtleties.

In practice, theorists often sweep all these subtleties under the rug and only focus on the topological phase, although in reality the other contributions are always there. For example, topological quantum field theories can only capture the physics of the topological phase contributions, so any results derived from that formalism will necessarily fail to describe the other effects in a real system.

tparker
  • 47,418
  • 3
    This is an absolutely perfect answer, to me. I’ll keep the bounty up for now so it can get more publicity. – knzhou Jun 19 '18 at 18:42
  • This is a great answer, indeed. But when it comes to the topological part of the phase, why is it the "magnetic flux" is localized to the other particle (in the ideal scenario)? Isn't that equivalent to the condition $f_{kl}=0$ everywhere except for $x=0$ in the other answer? – user94624 Jun 25 '18 at 11:20
  • @user94624 Yes, in reality the value of $f_{kl}$ decays exponentially with distance from the other particle. By assuming that the particles stay much farther away from each other than the correlation length, we can approximate $f_{kl}$ as being perfectly localized at the other position, but this breaks down if they get close together. As to why it's localized to the other particle: as with many "why" questions in physics, it's difficult to give any answer better than "because the math says so." I could mumble something about emergent gauge fields but that would really just restate... – tparker Jun 26 '18 at 18:26
  • ... the question. You can think of an anyon as a bound state of an electric charge and a magnetic flux tube, so that the magnetic flux only appears at the particle's location. And the particle's wavefunction is exponentially localized, as is generic for the low-lying excitations of a gapped many-body phase. – tparker Jun 26 '18 at 18:29
3

I think an answer can be found in the great paper On the theory of identical particles (1977). I'll write down the argument as I understand it.

Consider 2 identical particles in 3D. The configuration space is $C=(\mathbb{R}^3 \times \mathbb{R}^3)/\mathbb{Z}_2$, where we quotient by $\mathbb{Z}_2$ since the particles are identical. By going to relative coordinates, $C=\mathbb{R}^3 \times (\mathbb{R}^3/\mathbb{Z}_2)$, where $\mathbb{R}^3$ is associated with the center of mass, and where the second is the space of relative coordinates $x\in \mathbb{R}^3$ which are identified with $-x$. This gives a singular point at $x=0$, and we remove it (we keep the particles apart). Thus, the relative part is $(\mathbb{R}^3-{0})/\mathbb{Z}_2=]0,\infty[\times (S^2/\mathbb{Z}_2)$.

First focus on the part $S^2/\mathbb{Z}_2$. As I mentioned above in the question, closed loops in this space fall in two classes. If we think of the sphere $S^2$ before identifying antipodal points, these classes are classes of paths that 1. begin and end on the same point 2. begin and end on antipodal points.

The paths in class 1, viewed in the total space $(\mathbb{R}^3-{0})/\mathbb{Z}_2$, do not encircle the removed point at 0 (or can be continuously deformed to a path that doesn't). The paths in class 2 encircle the origin once (or can be continuously deformed to a path that encircles it once).

The actual argument can now be made using parallel transport of vectors along closed loops in the configuration space. For each $x$ we define a Hilbert space $h_x$ such that $\Psi(x)\in h_x$ is the value of the wave function at $x$. Thus we think of $\Psi(x)$ as a vector that we will parallel transport. Defining a basis $\chi_x$ for $h_x$, we can write $\Psi(x)=\psi(x)\chi_x$. We have a choice of basis, which is a gauge freedom.

We then define a linear, unitary operator $P(x',x) : h_x \to h_{x'}$ which parallel transports vectors of $h_x$ to vectors of $h_{x'}$. It is assumed that it takes the form $P(x+dx,x)\chi_x = (1+i dx^k b_k (x) ) \chi_{x+dx}$ for infinitesimal parallel displacement. The $b_k$ are functions analogous to the vector potential $A_\mu$, and define gauge-invariant differentiation $D_k = \partial_k -i b_k (x) $.

In turn, this defines a "curvature" tensor $f_{kl} = i [D_k ,D_l]$ analogous to $F_{\mu \nu}$ in EM or $R_{\mu \nu}$ in GR. The idea, I think, is that parallel transport is trivial along paths which enclose no curvature, but that it can have an effect if the region enclosed is curved. In our case, $f_{kl}$ is chosen zero for all points in the configuration space, but it is not defined for the excluded point at the origin.

Suppose we consider a path that does not encircle the origin. In that case, the enclosed "curvature" is zero, and we have $P(x,x)\equiv P(x) = 1$. This holds for any path that does not encircle the origin - deforming a path continuously will retain this property. If we consider a path that encircles the origin once, we write $P(x) = \exp[i\zeta]$, since $h_x$ is one-dimensional. Again, if we continuously deform such a path, we cannot change the enclosed "curvature", so the phase will be the same for all such paths.

Then a path corresponding to a "double exchange" yields $P(x) = \exp[2i\zeta]$. Now we continuously deform it back to the trivial path, which we can do without passing through the origin. The "curvature" enclosed for the "double exchange" path is therefore the same as that of the trivial path, namely zero (there should be some nicer way of looking at this). Since for all such paths $P(x)=1$, we have $\exp[2i\zeta]=1$.

If we consider the 2D case, we would get infinitely many classes of paths, since the configuration space would be $\mathbb{R}^2 \times (]0,\infty[\times S^1/\mathbb{Z}_2)$. Yet, the same idea should apply.

Quillo
  • 4,972
user94624
  • 341
  • 1
    "In our case, $f_{kl}$ is chosen zero for all points in the configuration space"- isn't this the whole point, though? This reads to me like a fancier way of saying that the answer to "why is the phase not path-dependent" is "we choose to make the phase not path-dependent." But maybe I am missing something... – Rococo Mar 29 '16 at 15:59
  • That's a valid point - basically the authors only offer the explanation that they don't want $b_k$ to describe a force field, so they let $f_{kl}=0$. I'm not quite convinced by this, but I do feel there should be some explanation for it. – user94624 Mar 29 '16 at 19:45
  • Choosing $f_{kl} \neq 0$ has implications for the dynamics. If it were non-zero there would be dynamics that just isn't there in the Hamiltonian, and all sorts of quantum mechanics calculations just break then. The standard way of not folding up the configuration space would be a completely useless way to think of QM. However, we know that that isn't the case. (Of course, it could be non-zero at a small enough scale that we don't do QM calculations there. Which is basically the same as saying it's singular at a point.) – Ronak M Soni Mar 29 '16 at 23:27
  • @RonakMSoni yes, we can say that this possibility is experimentally ruled out. Having $f_{kl}$ be nonzero is somewhat like having a residual magnetic field that is "baked into" the vacuum state. This would certainly ruin the topological argument, but I'm not so sure that it would ruin a more rigorous proof of the spin-statistics connection. After all, fermions in a uniform magnetic field (in 3D) don't cease being fermions. So I wonder if this type of proof actually makes more assumptions than are strictly necessary. – Rococo Mar 30 '16 at 01:31