Primary constraints for constrained Hamiltonian systems

Question

I would be most thankful if you could help me clarify the setting of primary constraints for constrained Hamiltonian systems. I am reading Classical and quantum dynamics of constrained Hamiltonian systems by H. Rothe and K. Rothe, and Quantization of gauge system by Henneaux and Teitelboim.

Consider a system with Lagrangian $L(q,\dot{q})$ and define the momenta $$p_j = \partial L(q,\dot{q},t)/\partial{\dot{q}_j}, \tag{3.1}$$ with $j$ from 1 to $n$ (for $n$ degrees of freedom), and the Hessian $$W_{ij}(q,\dot{q}) = \partial^2 L/\partial{\dot{q}_i}\partial{\dot{q}_j}\tag{2.2}.$$ Let $R_W$ be the rank of the Hessian $W_{ij}$.

Assume that $L$ is singular and $R_W < n$. In order to ask the question, I'm now quoting how the Rothes describe the setting of primary constraints on pages 26 and 27 of their book.

Let $W_{ab}$, ($a,b = 1,...,R_W$) be the largest invertible submatrix of $W_{ij}$, where a suitable rearrangement of the components has been carried out. We can then solve the eqs. (3.1) for $R_W$ velocities $\dot{q}_a$ in terms of the coordinates $q_i$, the momenta $\{p_a\}$ and the remaining velocities $\{\dot{q}_\alpha\}$: $$\dot{q}_a = f_a(q,\{p_b\},\{\dot{q}_\beta\}), \qquad a,b = 1,...,R_W, \qquad\beta = R_W + 1,...,n.\tag{3.6}$$

Inserting this expression into the definition of $p_j$, one arrives at a relation of the form $$p_j = h_j(q,\{p_a\},\{\dot{q}_\alpha\}).\tag{1}$$ For $j = a$ ($a = 1,...,R_W$) this relation must reduce to an identity. The remaining equations read $$p_\alpha = h_\alpha (q,\{p_a\},\{\dot{q}_\beta\}).\tag{2}$$ But the rhs cannot depend on the velocities $\dot{q}_\beta$, since otherwise we could express more velocities from the set $\{\dot{q}_\alpha\}$ in terms of the coordinates, the momenta, and the remaining velocities.

This is where the presentation from Rothe stops, and my concern is that equations of the form (2) with all $\{\dot{q}_\beta\}$ present can still be possible, and yet one cannot solve for more velocities from the set $\{\dot{q}_\alpha\}$ in terms of the coordinates, the momenta and the remaining velocities if the conditions stipulated in the Implicit Function Theorem are not met, for not all equations of the type (2) can be solved implicitly for $\{\dot{q}_\alpha\}$. Therefore it is not proven that there are $(n - R_W)$ primary constraints of the form $\phi_\alpha (q,p) = 0$.

Henneaux and Teteilboim even state that these $(n - R_W)$ constraints of the form $\phi_\alpha (q,p) = 0$ are functionally independent, but give no justification to this statement.

I would be most thankful if you could help clarify my above concern and also if you could clarify the statement by Henneaux and Teitelboin as to the fact that the constraints are functionally independent.

Qmechanic · Answer 1 · 2023-10-03T12:57:37.070

I) Let us suppress position dependence $q^i$ and explicit time dependence $t$ in the following, and also assume that the Lagrangian $L=L(v)$ is a smooth function of the velocities $v^i$, where $i=1, \ldots, n$. The Hessian matrix is defined as

$$ H_{ij}~:=~\frac{\partial^2 L}{\partial v^i \partial v^j}.\tag{1}$$

Let us consider an open neighborhood$^1$ $V$ around a fixed point $v_{(0)}$. Now a very important assumption is the so-called regularity condition, cf. Refs. 1 and 2. This means that the rank of the Hessian $H_{ij}$ should not depend on the point $v$. In other words, the Hessian $H_{ij}$ should have constant rank $r$. (Ref. 3 implicitly assumes this crucial point without stressing its importance.)

We now permute/rename the velocities $(v^1,\ldots, v^n)$ such that the $r\times r$ minor $A_{ab}$ is invertible in the top left corner of the Hessian

$$ H~=~ \begin{bmatrix} A & B \\ C & D \end{bmatrix} ~=~ \underbrace{\begin{bmatrix} 1 & 0 \\ CA^{-1} & 1 \end{bmatrix}}_{\text{invertible}} \begin{bmatrix} A & 0 \\ 0 & D-CA^{-1}B \end{bmatrix} \underbrace{\begin{bmatrix} 1 & A^{-1}B \\ 0 & 1 \end{bmatrix}}_{\text{invertible}}. \tag{2} $$

The renaming is assumed to be done in the same way in the whole neighborhood $V$. This is possibly by going to a smaller open neighborhood (which we also call $V$) if necessary. (Later when we apply inverse function theorem below we might have to implicitly restrict $V$ further.) It follows from the constant rank condition that

$$ D~=~CA^{-1}B. \tag{3}$$

II) We next perform the singular Legendre transformation $v\leadsto p$. Define functions

$$ g_i(v)~:=~\frac{\partial L(v)}{\partial v^i}, \qquad i=1, \ldots, n. \tag{4} $$

The momenta are defined in the Lagrangian theory as

$$ p_i~:=~g_i(v), \qquad i=1, \ldots, n. \tag{5}$$

The velocities

$$ v^i~\longrightarrow~ (u^a,w^{\alpha}) \tag{6} $$

split into two sets of velocity coordinates

$$ u^a, \quad a=1, \ldots, r, \qquad\text{and}\qquad w^{\alpha}, \quad \alpha=1, \ldots, n-r, \tag{7}$$

which we shall call primary expressible (unexpressible) velocities, respectively [2]. Similarly, the momenta

$$ p_i~\longrightarrow~ (\pi_a,\rho_{\alpha}) \tag{8}$$

split into two sets of momentum coordinates

$$ \pi_a, \quad a=1, \ldots, r, \qquad\text{and}\qquad \rho_{\alpha}, \quad \alpha=1, \ldots, n-r. \tag{9}$$

The primary expressible velocities

$$ u^a~=f^a(\pi,w), \qquad a=1, \ldots, r. \tag{10}$$

are extracted from the $r$ first momentum relations (5) via the inverse function theorem with the $w$-variables as passive spectator parameters.

III) Next define composite functions

$$ h_i(\pi,w) ~:=~ g_i(f(\pi,w),w), \qquad i=1, \ldots, n. \tag{11}$$

It follows immediately that

$$ h_a(\pi,w) ~=~ \pi_a, \qquad a=1, \ldots, r. \tag{12}$$

because the functions $g$ and $f$ are each other's inverse for fixed $w$. Differentiation of (12) wrt. $w^{\alpha}$ leads to

$$ \begin{align} 0~\stackrel{(12)}{=}~&\frac{\partial h_a(\pi,w)}{\partial w^{\alpha}} \cr ~\stackrel{(11)}{=}~&\left.\frac{\partial g_a(u,w)}{\partial w^{\alpha}}\right|_{u=f(\pi,w)} +\left. \frac{\partial g_a(u,w)}{\partial u^b} \right|_{u=f(\pi,w)} \frac{\partial f^b(\pi,w)}{\partial w^{\alpha}} \cr ~\stackrel{(1)+(2)+(4)}{=}&\left.B_{a\alpha}\right|_{u=f(\pi,w)} +\left. A_{ab} \right|_{u=f(\pi,w)} \frac{\partial f^b(\pi,w)}{\partial w^{\alpha}} . \end{align}\tag{13} $$

Theorem 1. The $h_i$-functions (11) do not depend on the $w$-variables.

Proof of theorem 1:

$$ \begin{align} \frac{\partial h_{\alpha}(\pi,w)}{\partial w^{\beta}} ~\stackrel{(11)}{=}~&\left.\frac{\partial g_{\alpha}(u,w)}{\partial w^{\beta}}\right|_{u=f(\pi,w)} +\left. \frac{\partial g_{\alpha}(u,w)}{\partial u^a} \right|_{u=f(\pi,w)} \frac{\partial f^a(\pi,w)}{\partial w^{\beta}} \cr ~\stackrel{(1)+(2)+(4)}{=}&\left.D_{\alpha\beta}\right|_{u=f(\pi,w)}+\left. C_{\alpha a} \right|_{u=f(\pi,w)} \frac{\partial f^a(\pi,w)}{\partial w^{\beta}} \cr ~\stackrel{(13)}{=}~&\left. \left(D_{\alpha\beta}-C_{\alpha a}(A^{-1})^{ab}B_{b\beta} \right)\right|_{u=f(\pi,w)}\cr ~\stackrel{(3)}{=}~&0.\end{align}\tag{14} $$

End of proof.

Theorem 1 answers OP's question. Let us for fun continue with a few more steps of the Dirac-Bergmann analysis.

IV) The $n-r$ last momentum relations (5) now become $n-r$ functionally independent primary constraints

$$ \phi_{\alpha}(p)~:=~\rho_{\alpha}- h_{\alpha}(\pi)~\approx~0,\tag{15} $$

where the $\approx$ symbol means equality modulo constraints or EOMs. The primary constraints (15) are clearly functionally independent as each of them depend on different $\rho$ momenta.

V) The Lagrangian energy function is defined as

$$h(v)~\stackrel{(4)}{:=}~g_i(v) v^i -L(v).\tag{16}$$

Define the canonical Hamiltonian as a composite function

$$\begin{align} H_{\rm can}(\pi,w)~:=~~~~~&\left.h(v)\right|_{u=f(\pi,w)}~=~h(f(\pi,w),w)\cr ~\stackrel{(10)+(11)+(16)}{=}&\underbrace{h_a(\pi)}_{=\pi_a} f^a(\pi,w) +h_{\alpha}(\pi) w^{\alpha} - L(f(\pi,w),w).\end{align}\tag{17}$$

Theorem 2. The canonical Hamiltonian (17) does not depend on the $w$-variables.

Remark. The canonical Hamiltonian (17) also does not depend on the $\rho$-momenta by construction.

Proof of theorem 2:

$$\begin{align} \frac{\partial H_{\rm can}}{\partial w^{\alpha}} ~\stackrel{(17)}{=}~&\left(h_a(\pi) -\left. \frac{\partial L(u,w)}{\partial u^a} \right|_{u=f(\pi,w)} \right) \frac{\partial f^a(\pi,w)}{\partial w^{\alpha}} + h_{\alpha}(\pi) -\left.\frac{\partial L(u,w)}{\partial w^{\alpha}} \right|_{u=f(\pi,w)} \cr ~\stackrel{(4)+(11)}{=}0.\end{align} \tag{18}$$

End of proof.

VI) Let us define the extended Hamiltonian [2]

$$ H_E(v,p)~:=~p_iv^i-L(v), \tag{19}$$

and the primary Hamiltonian

$$ H_{\rm prim}(w,p)~:=~\left. H_E(v,p)\right|_{u=f(\pi,w)}~\stackrel{(15)+(17)}{=}~H_{\rm can}(\pi)+w^{\alpha}\phi_{\alpha}(p).\tag{20} $$

Note that the $w$-variables act as Lagrange multipliers.

VII) One may show that Lagrange equations

$$\left. \frac{d}{dt}\frac{\partial L}{\partial v^j}\right|_{v=\dot{q}}~\approx~\frac{\partial L}{\partial q^j} \tag{21}$$

are equivalent to Hamilton's equation in the following form

$$ \begin{align} \dot{q}^i~\approx~&\frac{\partial H_{\rm prim}}{\partial p_i}\cr -\dot{p}_j~\approx~&\frac{\partial H_{\rm prim}}{\partial q^j}\cr \phi_{\alpha}(p)~\approx~&0. \end{align} \tag{22}$$

This completes in principle the singular Legendre transformation in the sense that we no longer need to refer to the Lagrangian.

VIII) From this point we assume that there are no explicit time dependence, and we will stop suppressing the $q$-dependence in the notation. We still need to check the consistency condition$^2$

$$\begin{align} 0~\stackrel{?}{\approx}~&\dot{\phi}_{\alpha}~\approx~\{\phi_{\alpha},H_{\rm prim}\}\cr ~\approx~&\{\phi_{\alpha},H_{\rm can}\}+\{\phi_{\alpha},\phi_{\beta}\}w^{\beta}\cr ~=:~&b_{\alpha}+A_{\alpha\beta}w^{\beta} ,\end{align}\tag{23}$$

which is an affine equation in the $w$-variables. If

$$ b~\notin~{\rm ran}(A)~\equiv~{\rm im}(A) \tag{24}$$

(in a weak sense) this lead to secondary constraints in the $(q,p)$-phase space.

Note that (some of) the secondary constraints might not be new. [This happens e.g. in E&M where the secondary constraint is Gauss' law, which is already there implicitly]. Interestingly, if new secondary constraints (that are not already implicitly present) are needed, this must have a counterpart in the Lagrangian formulation.

Let $\Phi_A$ denote all primary and (new and old) secondary constraint functions, and let $\lambda^A$ denote the corresponding Lagrange multipliers. Let us assume that the Hamiltonian is modified into the form

$$ H(q,p,\lambda)~=~ H_0(q,p)+\lambda^A\Phi_A(q,p).\tag{25} $$

Next repeat the consistency check for the Hamiltonian (25) in search of tertiary constraints, and so forth. The final form of $H$ in eq. (25) is called the total Hamiltonian.

Note that we could in principle shift $\lambda^A\to \lambda^A+c^A(q,p)$ and reparametrize the constraints $\Phi_A(q,p)\to \Lambda_A{}^B(q,p)\Phi_B(q,p),$ where $\Lambda_A{}^B$ is an invertible matrix.

IX) Example with $n=2$ and $r=1$. Let the Lagrangian be

$$ L ~=~\frac{1}{2}\frac{u^2}{1-w}~=~\frac{u^2}{2}\sum_{n=0}^{\infty}w^n, \qquad |w|~<~1. \tag{26}$$

The Hessian

$$ H_{ij}~=~\begin{bmatrix} \frac{1}{1-w} & \frac{u}{(1-w)^2} \\ \frac{u}{(1-w)^2} & \frac{u^2}{(1-w)^3}\end{bmatrix} \tag{27}$$

has two eigenvalues $\frac{1}{1-w}+ \frac{u^2}{(1-w)^3}>0$ and $0$, i.e. it has constant rank $r=1$ when $|w|< 1$.

The first momentum relation

$$ \pi~=~\frac{\partial L}{\partial u}~=~ \frac{u}{1-w} \tag{28}$$

can be inverted to yield

$$ u~=~f(\pi,w)~=~(1-w)\pi. \tag{29}$$

The second momentum relation

$$ \rho~=~\frac{\partial L}{\partial w}~=~ \frac{u^2}{2(1-w)^2} ~=~\frac{1}{2}\pi^2\tag{30}$$

leads to a primary constraint

$$ \phi~:=~ \rho -\frac{1}{2}\pi^2~\approx~0. \tag{31}$$

The canonical Hamiltonian (17) becomes

$$ H_{\rm can}~=~ \pi \left. u\right|_{u=f(\pi,w)} + \underbrace{\frac{1}{2}\pi^2}_{\approx\rho} w -\left.L\right|_{u=f(\pi,w)} ~=~\frac{1}{2}\pi^2. \tag{32}$$

It is easy to check that there is no secondary constraint. End of example.

References:

M. Henneaux and C. Teitelboim, Quantization of Gauge Systems, (1994), p. 5-7.
D.M. Gitman and I.V. Tyutin, Quantization of fields with constraints, (1990), p. 13-16.
H. Rothe and K. Rothe, Classical and quantum dynamics of constrained Hamiltonian systems, (2010), p. 24-27.

--

$^1$ We will only make a local argument, i.e. ignore global issues.

$^2$ For more details, see my Phys.SE answer here.

I asked almost this same question here over a year ago, and it's been in the back of my mind ever since. Thank you so much for finally clearing this up for me. I only have one small question still. How are we sure that we can rename the coordinates locally and not just pointwise? — Charles Hudgins, Mar 19 '20 at 23:59
We use the assumption that the rank of the Hessian doesn't jump. — Qmechanic, Mar 20 '20 at 05:45
Looking at it again, it seems clear. 1. Did you draw on a particular reference for this answer, or did you come up with it on your own? Asking since I think I would prefer that reference's exposition to Rothe's on this topic. 2. Also, out of curiosity, does the Lagrangian in that post have some significance, or is it just a good example? — Charles Hudgins, Mar 21 '20 at 02:25

Jia Yiyang · Accepted Answer · 2013-04-07T17:31:43.017

1

OP wrote

This is where the presentation from Rothe stops, and my concern is that equations of the form $p_\alpha = h_\alpha (q, \{p_a\},\{\dot{q}_\beta\})$ (2) with all $\{\dot{q}_\beta\}$ present can still be possible, and yet one cannot solve for more velocities from the set $\{\dot{q}_\alpha\}$ in terms of the coordinates, the momenta and the remaining velocities if the conditions stipulated in the Implicit Function Theorem are not met, for not all equations of the type (2) can be solved implicitly for $\{\dot{q}_\alpha\}$. Therefore it is not proven that there are $(n - R_W)$ primary constraints of the form $\phi_\alpha (q,p) = 0$

I sensed a misunderstanding of Rothe's statements, ignore me if I'm not understanding OP correctly:

Rothe is arguing at least one of the $\dot{q}_\beta$'s can be expressed as a function of $p_a,p_\alpha$ and remaining $\dot{q}_\alpha$'s. For any particular $\alpha$ in your equation (2), by implicit function theorem applied to one-variable($\dot{q}_\beta$) function, it's always doable unless $\frac{\partial h_\alpha}{\partial \dot{q}_\beta}=0$ for our chosen $\alpha$ and $\beta$, but the latter case simply means $h_\alpha$ does not depend on $\dot{q}_\beta$, so either case Rothe's statement is correct.

Updates:

As for the functional independence part, if I'm not mistaken, is quite trivial. It's because in your equation (2), all the $p_\alpha$'s are on LHS and RHS doesn't contain any $p_\alpha$ thus it's impossible to find a inter-relation among these equations(plural in the sense that $\alpha$ can take many values, from $1$ to $M'$). Or in differential calculus language, the constraint functions from (2) will be $\phi_\alpha(q,p)=p_\alpha-h_\alpha(q,\{p_a\})$, thus the Jacobian of these functions will simply be

$\frac{\partial \phi_\beta}{\partial \{p_\alpha,p_a, q\}}= \begin{bmatrix} 1 & 0 & \cdots &0&\cdots&\frac{\partial h_1}{\partial p_a}&\cdots&\frac{\partial h_1}{\partial q_i}&\cdots\\ 0 & 1 & \cdots&0&\cdots&\frac{\partial h_2}{\partial p_a}&\cdots&\frac{\partial h_2}{\partial q_i}&\cdots\\ \vdots & \vdots & \ddots&\vdots&\vdots&\vdots&\vdots&\vdots&\vdots\\ 0 &0 &\cdots&1&\cdots&\frac{\partial h_{M'}}{\partial p_a}&\cdots&\frac{\partial h_{M'}}{\partial q_i}&\cdots\end{bmatrix}$

And this Jacobian is of maximal rank because of the identity submatrix on the left, and this is the same as saying these contraint functions are functionally independent.

P.S.: Now I'm not quite sure what situation H&T were referring to when they said $M<M'$.

edited Apr 07 '13 at 17:31

answered Apr 06 '13 at 02:47

Jia Yiyang

3,985

You are absolutely right, and your argument is flawless. Thank you very much! Can you, please, also indicate as to why the (n - R_W) constraints are functionally independent? Can you prove that? Thanks! – user22208 Apr 06 '13 at 07:54
Where did Henneaux and Teteilboim say that? I actually read a contrary statement on page 5, somewhere below equation (1.6) – Jia Yiyang Apr 06 '13 at 14:10
On page 5 HT write: "If the rank of $\partial^2 L/\partial{\dot{q}n}\partial{\dot{q}{n^\prime}}$ is equal to $N - M^\prime$ there are $M^\prime$ independent equations among (1.6)." In our case the rank of the Hessian is $R_W$ and there should be $(n - R_W)$ functionally independent constraints, but I don't know how to show that. – user22208 Apr 06 '13 at 15:14
But they said M' can be strictly smaller than M. In other words, M' can be strictly smaller than n-R_w in Rothe's notation. – Jia Yiyang Apr 06 '13 at 16:24
In Rothe's notation $R_W = N - M^\prime$ and $n = N$. Therefore $n - R_W = N - (N - M^{\prime}) = M^{\prime}.$ – user22208 Apr 06 '13 at 16:45
From where you conclude $R_W=N-M'$ in Rothe's notation? I think $R_W=N-M$ in Rothe's notation. – Jia Yiyang Apr 07 '13 at 15:36
I concluded that $R_W = N - M^\prime$ because Rothe says that $R_W$ is the rank of the Hessian $\partial^2 L/\partial{\dot{q}i}\partial{\dot{q}_j}$ in Rothe's notation and also HT say, in their book, on page 5 that $(N - M^\prime)$ is the rank of the Hessian $\partial^2 L/\partial{\dot{q}_n}\partial{\dot{q}{n^\prime}}$ in HT's notation. I just equated the ranks of the Hessians in the two books, in a consistent manner, namely $R_W = N - M^\prime$. I also equated the number of degrees of freedom $n = N$. – user22208 Apr 07 '13 at 16:02
Yes now I agree with you, I'll update my answer. – Jia Yiyang Apr 07 '13 at 16:44
Thank you very much for the update! Your argument is again flawless! My understanding of the statement in HT that there can be $M$ constraints with $M > M^\prime$ is that one can write down as many constraints as one wants, namely $M$, starting from the $M^\prime$ functionally independent ones. I would very much appreciate it if you could recommend me a good advanced calculus book in English where the topic of functional independence of m functions of n variables is thoroughly discussed. Thank you! – user22208 Apr 07 '13 at 18:31
Seems any decent multivariable calculus textbook will do, and I don't have any particular one in mind, as for functional independence, I actutally googled the definition before answering your definition. – Jia Yiyang Apr 07 '13 at 18:58

Primary constraints for constrained Hamiltonian systems

2 Answers2

Linked