Why regularization?

Question

In quantum field theory when dealing with divergent integrals, particularly in calculating corrections to scattering amplitudes, what is often done to render the integrals convergent is to add a regulator, which is some parameter $\Lambda$ which becomes the upper limit of the integrals instead of $\infty$.

The physical explanation of this is that the quantum field theory is just an approximation of the "true" theory (if one exists), and it is only valid at low energies. We are ignorant of the processes that occur at high energies, and so it makes no sense to extend the theory to that regime. So we cut off the integral to include the low energy processes we are familiar with only.

My problem with this explanation is, even if we don't know what happens at high energies, is it okay to just leave those phenomena out of our calculations? In effect we are rewriting our integral as $\int_{-\infty}^{\infty} = \int_{-\infty}^{\Lambda} + \int_{\Lambda}^{\infty}$, and then ignoring the second term. Is this really okay? And even if we were able to ascertain the "true" theory and calculated the contribution to the correction from high energy processes, I feel like it would come out to be large, since looking at it from our effective field theory viewpoint it already appears to be infinity. Do we have any right to say it is negligible?

score 15 · Accepted Answer · answered Dec 02 '11 at 11:55

15

It took the insights of Wilson and Kadanoff to answer this question. Universality. It doesn't matter all that much what the precise details in the ultraviolet are. Under the renormalization group, only a small number of parameters are either relevant or marginal. All the rest are irrelevant. As long as you take care to match up the relevant and marginal parameters, the precise regulator you choose doesn't matter. Even if it differs from the actual underlying physics, in the infrared, it still gives the same answers.

answered Dec 02 '11 at 11:55

Fred

174

1

"...the insights of Wilson and Kadanoff..." Would you mind providing a reference? – bright-star Aug 03 '16 at 15:16
@bright-star Aside from (rather brutal) expositions in statistical field theory texts, you can try Kadanoff’s “Theories of matter: Infinities and renormalization” in the Oxford handbook of philosophy of physics (arXiv:1002.2985) for a leisurely introduction. – Alex Shpilkin Jul 31 '18 at 11:42

score 13 · Answer 2 · answered Dec 02 '11 at 09:37

I'm not sure about it, but my understanding of this is that the $\int_\Lambda^\infty$ term is essentially constant between different processes, because whatever physics happens at high energies should not be affected by the low-energy processes we are able to control. That way, we can meaningfully calculate differences between two integrals, and the high-energy portions cancel out. This will be the case regardless of whether the high-energy contributions are bounded (as they should be in a true theory) or unbounded (as QFT calculations seem to indicate).

score 9 · Answer 3 · answered Dec 02 '11 at 11:48

I'll just adress that point:

... $\int_{-\infty}^{\infty} = \int_{-\infty}^{\Lambda} + \int_{\Lambda}^{\infty}$, and then ignoring the second term. Is this really okay?

The integrals we are dealing with look like this one: $$\int\frac{f(p_\mu)}{g(p_\mu)}d^4p$$ Or, more explicitly: $$ \int^{+\infty}_{-\infty}dp_x\int^{+\infty}_{-\infty}dp_y\int^{+\infty}_{-\infty}dp_z\int^{+\infty}_{-\infty}dE\,\,\frac{f(E,p_x,p_y,p_z)}{g(E,p_x,p_y,p_z)} $$ And then you just use 4D spherical coordinates: $$\int\frac{f(p_\mu)}{g(p_\mu)}d^4p = \int d\Omega_4 \int_{0}^{\infty} d{p_{eu}}\frac{f(p_\mu)}{g(p_\mu)}$$ Where $p_{eu}^2 = E^2 + p_x^2+p_y^2+p_z^2$ is the Euclidean norm of the vector, and $\Omega_4$ is a 4D solid angle.

Short digression: why Euclidean norm? We are supposed to deal with Minkowski space, do we? The crucial point here is the use of the Wick rotation: the time-components (in that case the energy $E$) of your vectors are considered to be complex numbers, the integration is performed along the imaginary axis and the result is analytically continued to back to the real values of the energy.

So, it is the last integral that we are usually cutting off: $$\int_0^\infty dp_{eu} \to \int_0^\Lambda dp_{eu}$$ and that procedure is not exactly the same as cutting every integrals in Cartesian coordinates at $\pm\infty$. But the results must be the same -- the difference if just either you cut large contributions with a sphere or with a box.

Slightly more important point is that more detailed study of the analytic properties of the integrals we are dealing with in QFT shows, that the "nature" of the UV divergences is always Euclidean -- it "comes" from large values of $E^2 + p_x^2+p_y^2+p_z^2$ -- not just "large energy". So, the use of this coordinates is rather natural.

Last but not least -- this coordinates lead you to the idea if dimensional regularization, which turns out to be very convenient when doing actual calculations.

score 7 · Answer 4 · answered Jul 16 '12 at 06:38

The reason is that the continuum involves a notion of limit where you get an infinity of points in every little region. This is different from the potential infinity of say, sentences, where the sentence can get longer and longer and then you get an infinite number of sentences, but only for very long sentences which are further away from the realizable ones. In this case, in every box, you have an infinite number of distinct points, each of which is as accessible as any other (at least in a naive view of the continuum).

When you are doing mathematics, you are describing a continuous object with a string of symbols. This means that the only way to define quantities on a continuum is to define them on some approximate notion of a continuum, and then take the limit that the approximation becomes dense. When you first defined real numbers, in grade school, you defined numbers with a finite number of decimals, or perhaps rational numbers, and then abstracted the notion of real numbers as infinite sequences of decimals, or as limiting values of Cauchy sequences of rationals. In either case, you have a discrete structure with only a potential infinity, and the real numbers emerge when you take the continuum limit, either by allowing the decimals to grow arbitrarily long, or by allowing the denominators of the rational number to grow arbitrarily large.

The reason we don't sweat this is because we are intuitively familiar with geometry, so we have an immediate understanding that this process makes sense. But it is far from intuitive that the real numbers make sense. In quantum field theory, one has to deal squarely with the fact that the real numbers are actually a sophisticated idea, since you are defining quantum fluctuations in fields, which have separate degrees of freedom at every one of continuously many points. There is a cutoff at every energy, since exciting the short-distance modes requires high energy, so physically, the behavior of fields at low energies should not depend on the high-energy field modes. But if this is true, then you should be able to define the field theory as a continuum theory, as a limit as continuous space emerges from a discrete structure.

When formulating quantum field theory, you start with a regularization because that's the way every continuum theory is defined, it's a limit. This is no different in principle than defining a differential equation. If somebody asked you "what does $\dot{x} = \alpha \sqrt{x} $ mean?" You would have to say that the limit of the next value of x, minus the current value of x, is the time increment times the $\sqrt{x}$, in the limit that the stepsize is small. We write it as a differential equation without a stepsize, because the limit makes sense. The definition of derivative as a quotient is designed to ensure that this is so, you divide by the step-size to the first power, because this is how the increment of a differentiable function scales with step-size.

If you are doing stochastic calculus, random walks on continuous time, the increment of distance you move scales as the square root of the stepsize. This means that the ordinary notion of derivative diverges when you take the small step-size limit. The definition of stochastic calculus defines derivatives of random walks as a distribution, so that only its integral makes sense, not its value at one point, and you get some funny commutation relations, like

$$ x(t+\epsilon)\dot{x}(t) - x(t)\dot{x}(t) = 1$$

where the equality is understood as a distribution identity, it is saying that the integral of the randomly fluctuating quantity on the left over any interval is the same as the integral of 1 over the same interval, and the fluctuations are zero over a finite size interval in the limit of small steps. This is the stochastic version of the Heisenberg commutation relation in quantum mechanics.

For quantum fields, you have to analogously make an approximation to the continuum, say a lattice with step-size $\epsilon$. If the results make sense as field theory, if they have a continuum limit, they don't depend on the step-size (the inverse cutoff) you choose, as long as it is small. The only difference is that the scaling laws for the parameters is different from ordinary calculus (or from stochastic calculus).

To see explicitly how a lattice limit produces a continuous field theory, you can consider the case of the Ising model. If you make the lattice small while simultaneously bringing the temperature closer to the critical temperature, so that the correlation length stays fixed as the lattice shrinks, you end up with long-ranged fluctuations in the spin described by a continuous field. The field is the number of up spins minus the number of down spins in a ball containing many lattice sites, where the ball size shrinks relative to the correlation length, but grows relative to the lattice spacing. You rescale the field by the lattice spacing to a certain power and the ball-size to a certain power (you choose the powers to get a finite limit in the small $\epsilon$ limit, independent of the ball size), and then you have defined a field theory. In this case, it is a scalar field theory with quartic self interactions, and in three dimensions or in two, it converges to a sensible unique limit which depends only on the correlation length (the coupling flows to a fixed point in the long-distance theory). In five dimensions and above, the theory converges to a free field theory. In four dimensions, it converges to a free field theory, but very very slowly, the coupling only goes to zero as the log of the lattice spacing, and if you see a scalar quartically interacting field in nature with a nonzero interaction, you can conclude that the cutoff scale is above the lattice size which would make the coupling smaller than what you observe.

I gave a qualitative description because you asked for this, but the full rigorous description of the limiting process is not yet fully worked out, although it is known heuristically for most cases of interest. This is an important open problem in mathematical physics.

Vladimir Kalitvianski · Answer 5 · 2011-12-02T14:00:47.227

Yes, as twistor59 said, just keeping $\Lambda$ finite does not help: the calculation results still violently disagree with experiments. So these integrals are modified by hand in a more radical way: some part of the integrand is subtracted. Such a subtraction is carried out until one gets a finite integral within infinite limits. Such a subtraction is sometimes equivalent (and implemented) as mass and charge renormalizations (redefinitions). The cut-off parameter $\Lambda$ gets into the "definition" of the "physical" parameters via "bare" ones. Nothing fixes the value of $\Lambda$ in this subtraction procedure. In other words, the subtraction result is still ambiguous.

One choses the "fixing condition" in an obvious way: the result should be in agreement with experiment. In particular, mass and charge values in our formulas should be equal to the measured ones. But after subtraction, the result is rather insensitive to $\Lambda$ if the latter is large enough. There is, however, a whole science about relationships between "bare" and "physical" parameters called sloppily Renormalization Group transformations (flows).

It turns out that after renormalizations and soft diagram summation the calculation results can be "put in touch" with experimental data. So the mainstream opinion is now that those cut-offs and subtractions (renormalizations) have some underlying physics. They say, it cannot be just an accidental success.

But it was not so in the early years of QFT. Many QFT fathers pointed out that we apparently worked with wrong Hamiltonians due to wrong conceptions and agreement with experiment was occasional. I designed a simple toy model to support this original opinion. See this and that.

score 2 · Answer 6 · edited Apr 13 '17 at 12:40

My answer may be rather naive because I am not too familiar with this, however due to the same reason it may be more transparent.

As far as I understand, the whole idea of regularization is to get rid of $\Lambda$ as an explicit parameter of the theory and put the unknown to observable values. This is done by rather tricky limit $\Lambda\to\infty$ which leaves observable values (divergent without the procedure) finite.

Due to many different reasons leaving $\Lambda$ as an additional parameter of the theory is not satisfactory.

For details it is better to read references given in this community wiki.

score 1 · Answer 7 · answered Dec 02 '11 at 11:57

1

The ideas of renormalization and regularization are explained very well in this paper by Delamotte. Arxiv version. Perhaps this will help you.

Edit: same article is mentioned in the community wiki

answered Dec 02 '11 at 11:57

Vijay Murthy

2,316

Why regularization?

7 Answers7

Linked