28

In the second chapter of Peskin and Schroeder, An Introduction to Quantum Field Theory, it is said that the action is invariant if the Lagrangian density changes by a four-divergence. But if we calculate any change in Lagrangian density we observe that under the conditions of equation of motion being satisfied, it only changes by a four-divergence term.

If ${\cal L}(x) $ changes to $ {\cal L}(x) + \alpha \partial_\mu J^{\mu} (x) $ then action is invariant. But isn't this only in the case of extremization of action to obtain Euler-Lagrange equations.

Comparing this to $ \delta {\cal L}$

$$ \alpha \delta {\cal L} = \frac{\partial {\cal L}}{\partial \phi} (\alpha \delta \phi) + \frac{\partial {\cal L}}{\partial \partial_{\mu}\phi} \partial_{\mu}(\alpha \delta \phi) $$

$$= \alpha \partial_\mu \left(\frac{\partial {\cal L}}{\partial \partial_{\mu}\phi} \delta \phi \right) + \alpha \left[ \frac{\partial {\cal L}}{\partial \phi} - \partial_\mu \left(\frac{\partial {\cal L}}{\partial \partial_{\mu}\phi} \right) \right] \delta \phi. $$

Getting the second term to zero assuming application of equations of motion. Doesn't this imply that the noether's current itself is zero, rather than its derivative? That is:

$$J^{\mu} (x) = \frac{\partial {\cal L}}{\partial \partial_{\mu}\phi} \delta \phi .$$

I add that my doubt is why changing ${\cal L}$ by a four divergence term lead to invariance of action globally when that idea itself was derived while extremizing the action which I assume is a local extremization and not a global one.

JamalS
  • 19,254
  • Right there! j^mu = {(del L)/(del del_mu phi)}(delta phi) - J^mu. So j^mu itself seems to be zero. Good question to ask – MycrofD Aug 29 '15 at 03:52

4 Answers4

63

Here's what I perceive to be a mathematically and logically precise presentation of the theorem, let me know if this helps.

Mathematical Preliminaries

First let me introduce some precise notation so that we don't encounter any issues with "infinitesimals" etc. Given a field $\phi$, let $\hat\phi(\alpha, x)$ denote a smooth one-parameter family of fields for which $\hat \phi(0, x) = \phi(x)$. We call this family a deformation of $\phi$ (in a previous version I called this a "flow"). Then we can define the variation of $\phi$ under this deformation as the first order approximation to the change in $\phi$ as follows:

Definition 1. (Variation of field) $$ \delta\phi(x) = \frac{\partial\hat\phi}{\partial\alpha}(0,x) $$

This definition then implies the following expansion $$ \hat\phi(\alpha, x) = \phi(x) + \alpha\delta\phi(x) + \mathcal O(\alpha^2) $$ which makes contact with the notation in many physics books like Peskin and Schroeder.

Note: In my notation, $\delta\phi$ is NOT an "infinitesimal", it's the coefficient of the parameter $\alpha$ in the first order change in the field under the deformation. I prefer to write things this way because I find that it leads to a lot less confusion.

Next, we define the variation of the Lagrangian under the deformation as the coefficient of the change in $\mathcal L$ to first order in $\alpha$;

Definition 2. (Variation of Lagrangian density) $$ \delta\mathcal L(\phi(x), \partial_\mu\phi(x)) = \frac{\partial}{\partial\alpha}\mathcal L(\hat\phi(\alpha, x), \partial_\mu\hat\phi(\alpha, x))\Big|_{\alpha=0} $$

Given these definitions, I'll leave it to you to show

Lemma 1. For any variation of the fields $\phi$, the variation of the Lagrangian density satisfies \begin{align} \delta\mathcal L &= \left(\frac{\partial \mathcal L}{\partial\phi} - \partial_\mu\frac{\partial\mathcal L}{\partial(\partial_\mu\phi)}\right)\delta\phi + \partial_\mu K^\mu,\qquad K^\mu = \frac{\partial\mathcal L}{\partial(\partial_\mu\phi)}\delta\phi \end{align} You'll need to use (1) The chain rule for partial differentiation, (2) the fact $\delta(\partial_\mu\phi) = \partial_\mu\delta\phi$ which can be proven from the above definition of $\delta\phi$ and (3) the product rule for partial differentiation.

Noether's theorem in steps

  1. Let a particular flow $\hat\phi(\alpha, x)$ be given.

  2. Assume that for this particular deformation, there exists some vector field $J^\mu\neq K^\mu$ such that $$ \delta\mathcal L = \partial_\mu J^\mu $$

  3. Notice, that for any field $\phi$ that satisfies the equation of motion, Lemma 1 tells us that $$ \delta \mathcal L = \partial_\mu K^\mu $$

  4. Define a vector field $j^\mu$ by $$ j^\mu = K^\mu - J^\mu $$

  5. Notice that for any field $\phi$ satisfying the equations of motion steps 2+ 3 + 4 imply $$ \partial_\mu j^\mu = 0 $$

Q.E.D.

Important Notes!!! If you follow the logic carefully, you'll see that $\delta \mathcal L = \partial_\mu K^\mu$ only along the equations of motion. Also, part of the hypothesis of the theorem was that we found a $J^\mu$ that is not equal to $K^\mu$ for which $\delta\mathcal L = \partial_\mu J^\mu$. This ensures that $j^\mu$ defined in the end is not identically zero! In order to find such a $J^\mu$, you should not be using the equations of motion. You should be applying the given deformation to the field and seeing what happens to it to first order in the "deformation parameter" $\alpha$.

Addendum. 2020-07-02 (Free scalar field example.)

A concrete example helps clarify the theorem and the remarks made afterward. Consider a single real scalar field $\phi:\mathbb R^{1,3}\to\mathbb R$. Let $m\in\mathbb R$ and $\xi\in\mathbb R^{1,3}$, and consider the following Lagrangian density and deformation (often called spacetime translation): $$ \mathcal L(\phi, \partial_\mu\phi) = \frac{1}{2}\partial_\mu\phi\partial^\mu\phi - \frac{1}{2}m^2\phi, \qquad \hat\phi(\alpha, x) = \phi(x + \alpha\xi) $$ Computation using the definition of $\delta\mathcal L$ (plug the deformed field into $\mathcal L$, take the derivative with respect to $\alpha$, and set $\alpha = 0$ at the end) but without ever invoking the equation of motion (Klein-Gordon equation) for the field gives $$ \delta \mathcal L = \partial_\mu(\xi^\nu\delta^\mu_\nu \mathcal L), \qquad \frac{\partial\mathcal L}{\partial(\partial_\mu\phi)}\delta\phi = \xi^\nu\partial_\nu\phi\partial^\mu\phi $$ It follows that $$ J^\mu = \xi^\nu\delta^\mu_\nu \mathcal L, \qquad K^\mu = \xi^\nu\partial_\nu\phi\partial^\mu\phi $$ and therefore $$ j^\mu = \xi^\nu(\partial_\nu\phi\partial^\mu\phi -\delta^\mu_\nu\mathcal L) $$ If e.g. one chooses $\tau > 0$ and sets $\xi = (\tau, 0, 0, 0)$, then the deformation is time translation, and conservation of $j^\mu$ yields conservation of the Hamiltonian density associated with $\mathcal L$ as the reader can check.

Suppose, instead, that in the process of computing $\delta \mathcal L$, one were to further invoke the following equation of motion which is simply the Euler-Lagrange equation for the Lagrangian density $\mathcal L$: $$ \partial^\mu\partial_\mu\phi = -m^2\phi, $$ Then one finds that $$ \delta\mathcal L = \partial_\mu(\xi^\nu\partial_\nu\phi\partial_\mu\phi) $$ so $J^\mu = K^\mu$ and therefore $j^\mu = 0$, which is uninformative.

joshphysics
  • 57,120
  • What is the difference between a flow and a Homotopy? Apart from the fact that in the Homotopy the parameter lies on a compact interval and the "final" field is given (we usually say Homotopy between functions f and g) – Prastt Mar 15 '13 at 04:16
  • Well there really are a lot of things that are, in general, different; have you looked at http://en.wikipedia.org/wiki/Flow_(mathematics) and http://en.wikipedia.org/wiki/Homotopy and compared the definitions? – joshphysics Mar 15 '13 at 05:04
  • 1
    Thanks joshphysics for such a clear detailed answer. You say we assume that there is some $J^\mu$ such that $\delta L = \partial_\mu J^\mu $ and then we get a noether's current. So incase we can't see such four divergence change in lagrangian we don't have any nice noether current expression anymore right? – excitedaboutphysics Mar 15 '13 at 19:48
  • 3
    Yeah that's exactly right. The assumption of the existence of such a $J$ for which $\delta \mathcal L = \partial_\mu J^\mu$ is precisely what one means when on says that the variation being considered is a symmetry. Note also that $J^\mu = 0$ is a totally fine option. This would happen, for example, if the Lagrangian is invariant under the transformation under consideration. – joshphysics Mar 15 '13 at 20:58
  • 1
    hi joshphysics, would you be able to tell some texts which could quickly introduce me to the kind of language you are using about flows etc, otherwise a math prof at my university asked me to go through a whole set of coursework from basic analysis to diff geometry to do anything in qft. i aim to work in condensed matter physics. – excitedaboutphysics Mar 16 '13 at 14:24
  • It seems like noether didn't presume fixing the variation of fields at the end points and assumed the equations of motion and then got a current. Seems a different way of doing variational methods. – excitedaboutphysics Mar 18 '13 at 06:14
  • 1
    @excitedaboutphysics I'm not sure about texts; I kind of learned about that stuff piecemeal from lots of different places both in book format, and online. I'll let you know if I think of anything more specific. – joshphysics Mar 18 '13 at 07:01
  • 14
    This is the best derivation of Nother's theorem for physicists I've seen on the web. Most are hopelessly vague about what's being held constant, what a deformation is, and the difference between the quantities called $J^\mu$ and $K^\mu$ here. The others overwhelm physicists with needless technical complications, and lack generality. Well done. – Jess Riedel Jan 30 '14 at 20:06
  • 8
    @JessRiedel Thanks very much for the praise; it means a lot to me to hear that. It took many years for me wade through the logically and mathematically opaque treatments I saw online and in textbooks, so I hope this sort of post will save others time and frustration. – joshphysics Jan 30 '14 at 21:01
  • 5
    @joshphysics the government should give you a salary to keep you writing physics books. The world would be a better place – Yossarian Oct 28 '14 at 12:42
  • 2
    @silvrfück If you can find a government agency that's willing to make such an arrangement with me, then send me an email. Thanks for the praise; it means a lot. – joshphysics Oct 28 '14 at 20:02
  • Thank you very much. Most references are very sloppy about Noether's theorem. I don't understand what an "infinitesimal variation" and I never know what to set to 0 because "it's very small". Being precise is not that difficult. – mlainz Oct 26 '15 at 01:45
  • 1
    @mlainz Glad to be of service. I agree that being sufficiently precise to understand the mathematics underlying Noether's Theorem is not that difficult -- I've always been somewhat confused as to why so much imprecision and confusion surrounds the descriptions of "infinitesimals" and "variations" that abound in physics, but I'd speculate it's partly a cultural thing, and partly a historical thing. – joshphysics Oct 26 '15 at 16:32
  • Hi, do you have any idea of my question about Noether's theorem about spacetime symmetry: https://physics.stackexchange.com/q/327504 Greatly thanks. – 346699 Apr 19 '17 at 01:33
  • 1
    "In order to find such a $J_\mu$, you should not be using the equations of motion. You should be applying the given flow to the field and seeing what happens to it to first order in the "flow parameter" $\alpha$." I don't understand how this sentence is meant. Do you mean that I should insert $\hat{\phi}=\phi+\alpha\delta\phi$ in $$\delta \mathcal{L}=\partial_\alpha \mathcal{L}(\hat{\phi},\partial_\mu\hat{\phi})|_{\alpha=0}?$$ But this just produces the result of Lemma 1. Cloud you maybe elaborate, or give a reference to a simple example (like translations or something like that..)? – Sito Jun 02 '20 at 08:04
  • also, can your result be generalized to multiple fields? i.e assume $\mathcal{L}(\phi_a,\partial_\mu\phi_a)$ for $a=1,\dots, N$. Can one in the same manner show that $$K^\mu = \frac{\partial\mathcal{L}}{\partial \phi_{a,\mu}}\delta \phi_a ?$$ I think it can, it worked at least in a couple examples I tried, but I'd like a confirmation from someone who is more versed in the topic, if possible.. – Sito Jun 03 '20 at 13:48
  • 1
    @Sito Added an example. I didn't include the gory computational details for the sake of conciseness, but I trust you can work the details out yourself. – joshphysics Jul 03 '20 at 04:09
1

Lagrangian invariant upto a overall 4-divergence and Euler Lagrange equation they together give you $\partial_{\mu}\left(\frac{\partial L}{\partial\partial_{\mu}\phi}\delta\phi\right)=\partial_{\mu}\left(J^{\mu}(x)\right)$

Now if I understood you correctly you are saying essentially if $\dfrac{df}{dx}=\dfrac{dg}{dx}$ then $f=g$ which in general is not true all one can say is $\dfrac{d(f-g)}{dx}=0$ i.e. $f-g=constant$.

Similarly here $\partial_{\mu}\left(J^{\mu}(x)-\frac{\partial L}{\partial\partial_{\mu}\phi}\delta\phi\right)=0$ would imply

$j^{\mu}(x)=J^{\mu}(x)-\frac{\partial L}{\partial\partial_{\mu}\phi}\delta\phi$ such that $\partial_{\mu}(j^{\mu}(x))=0$

Abrin
  • 74
  • Well that is fine, I understand that but I don't understand why one should write $J_\mu$ at all in the change of lagrangian and then equate it and then define a noether's current – excitedaboutphysics Mar 14 '13 at 14:13
0

The key point is that the on-shell solutions only extremize the action when boundary conditions are left unchanged, arbitrary transformations on the field in general do not leave the boundary conditions unchanged and this is why the term $\partial_\mu \left(\frac{\partial {\cal L}}{\partial \partial_{\mu}\phi} \delta \phi \right) \neq 0$ since the deformation need not be zero on the boundaries. In the case of boundary conditions at infinity, the deformations need not be regular at infinity and thus can give finite boundary terms.

Also, adding a four-divergence term to the Lagrangian does not leave the action invariant in general, only the physical solutions.

AfterShave
  • 1,780
-2

First, the differential operation is called "four-divergence" (the four-dimensional divergence), not "fourth divergence".

Second, the action obviously does change under a generic change of the fields i.e. if the change of the Lagrangian is not a four-divergence. It is a completely general functional of the fields so it does change.

Third, the action is stationary when the equations of motion are satisfied. These two conditions are ultimately equivalent. But when deriving the equations of motion, you can't assume that the equations of motion are satisfied. That would be a circular reasoning and you couldn't derive anything.

Fourth, yes, the equations of motion are being used when one derives $\partial_\mu J^\mu=0$ (i.e. action is stationary) but no, the derivation of the Noether's current does not imply that $J^\mu=0$. Your mistake is to confuse what is extremized. The equations of motion only mean $\delta S =0$, not $\delta L=0$ or $\delta{\mathcal L}=0$.

Fifth, your last equation is completely meaningless because the left hand side is finite but the right hand side is infinitesimal. Much like with problems of dimensional analysis (incompatible units), a manipulation with these expressions that obey the basic rules can never end up with a similar mismatch. Your previous "calculation" is also off because you're writing some bizarre expressions that are of second-order. In the variations, $\alpha$ itself is supposed to be infinitesimal, and in valid derivations, there is never a product of $\alpha$ with another infinitesimal quantity such as $\delta \phi$. In effect, your terms are of second-order (doubly infinitesimal) but your analysis doesn't have this higher-order precision so it's wrong.

I think it's a better idea to follow the actual correct derivation instead of your personal attempts to revise the functional calculus that you haven't mastered yet.

Luboš Motl
  • 179,018
  • Well I knew that it shouldn't be an infinitesimal but if we assume a significant change in the field, wouldn't that raise an issue with using the equations of motion since they are valid in the case of a local extremization. – excitedaboutphysics Mar 14 '13 at 14:17
  • OK, I don't fully understand what you're trying to ask but you're failing. But both the equations of motion and the derivation of Noether's current only depend on infinitesimal variations of the fields. That doesn't mean that fields can't vary by a finite amount, but a finite (greater than infinitesimal variations) are irrelevant for the derivation of equations of motion; and they're irrelevant for the derivation of Noether's current (the current is linked to the Lie algebra of symmetries which correspond to group elements infinitesimally close to the identity). – Luboš Motl Mar 15 '13 at 07:02
  • thanks Luboš Motl, this is exactly the kind of answer that i was aiming for: "the current is linked to the Lie algebra of symmetries which correspond to group elements infinitesimally close to the identity" . But that again pinches back on my insufficient understanding of math while trying to do physics. So though i appreciate the answer in some way but I don't completely understand what is meant by symmetries corresponding to group elements inf close to identity – excitedaboutphysics Mar 15 '13 at 15:31
  • 1
    @excitedaboutphysics, you don't need that much math to understand what he's saying. Modern treatments of QFT include an introductory chapter/appendices about group theory, Lie groups/algebras, representation theory, Clifford algebras, etc. Without that you are not going to get very far on QFT unless you only want to learn the rules to calculate amplitudes, etc. – Prastt Mar 15 '13 at 19:25
  • hi Barefeg, any texts you might refer to which give a decent introduction to such math required for qft. – excitedaboutphysics Mar 16 '13 at 14:26
  • @excitedaboutphysics: a random tip, try Lie Algebras In Particle Physics: from Isospin To Unified Theories (Frontiers in Physics) by Howard Georgi (Oct 22, 1999). Otherwise "symmetries corresponding to group elements inf. close to identity" means, in the case of rotations, rotations by a tiny angle around any axis, or generalizations to other transformations that are not quite ordinary rotations but they're still "generalized rotations". For those rotations, one can write the matrix of the transformation as 1+epsilon where epsilon is infinitesimal and epsilon is an element of the Lie algebra. – Luboš Motl Mar 17 '13 at 07:06
  • It seems like noether didn't presume fixing the variation of fields at the end points and assumed the equations of motion and then got a current. Seems a different way of doing variational methods. – excitedaboutphysics Mar 18 '13 at 06:14