11

Let us choose to postulate (motivated, e.g., by the analogy with the Hamiltonian being the generator of time evolution in classical mechanics) $$ i\hbar \frac{d\hat{U}}{dt}=\hat{H}\hat{U}\tag{1} $$ where $\hat{U}$ is the (unitary, linear) evolution operator and $\hat{H}$ the Hamiltonian (in its most general form, i.e. time-dependent, with instances at different times not necessarily commuting).

In the Schrödinger picture (S-picture), one can easily show that (1) is equivalent to $$ i\hbar \frac{d}{dt}|\psi_S(t)\rangle=\hat{H}_S|\psi_S(t)\rangle\tag{2} $$ where $\psi$ is a state, $|\psi_S(t)\rangle = \hat{U}(t)|\psi_S(0)\rangle \equiv \hat{U}(t)|\psi_H\rangle$ and $\hat{H}_S:=\hat{H}$.

In the Heisenberg picture (H-picture), it is straightforward to show that (1) implies $$ \frac{d\hat{A}_H}{dt} = \frac{\partial\hat{A}_H}{\partial t}+\frac{1}{i\hbar}[\hat{A}_H,\hat{H}_H]\tag{3} $$ where $A$ is an observable and $\hat{A}_H(t)=\hat{U}(t)^\dagger \hat{A}_H(0)\hat{U}(t)\equiv\hat{U}(t)^\dagger \hat{A}_S\hat{U}(t)$ (and also $[\hat{H}_H,\hat{H}_H]=0$, implying that the time dependence of $\hat{H}_H$ is purely explicit; in particular, for $\hat H$ with no explicit time dependence, $\hat{H}_H=\hat{H}_S\equiv \hat{H}$ and $[\hat{H},\hat{U}]=0$).
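To fix ideas, here is a minimal numerical sketch of this setup (just a sketch, assuming $\hbar=1$ and an arbitrarily chosen time-independent two-level $\hat H$), checking that $\hat U(t)=e^{-it\hat H}$ satisfies (1) and that the two pictures give the same expectation values:

```python
# Minimal sketch (hbar = 1, time-independent 2x2 Hamiltonian chosen arbitrarily):
# check that U(t) = exp(-i t H) satisfies (1), and that the Schroedinger-picture
# expectation <psi_S(t)|A_S|psi_S(t)> equals the Heisenberg-picture <psi_H|A_H(t)|psi_H>.
import numpy as np
from scipy.linalg import expm

H = np.array([[1.0, 0.3], [0.3, -0.5]], dtype=complex)   # Hermitian Hamiltonian
A = np.array([[0.0, 1.0], [1.0, 0.0]], dtype=complex)    # some observable
psi0 = np.array([1.0, 0.0], dtype=complex)                # |psi_H> = |psi_S(0)>

U = lambda s: expm(-1j * s * H)
t, dt = 1.3, 1e-6

# (1): i dU/dt = H U, checked with a central finite difference
lhs = 1j * (U(t + dt) - U(t - dt)) / (2 * dt)
print(np.max(np.abs(lhs - H @ U(t))))                     # small, consistent with (1)

# (2) vs (3): both pictures give the same expectation value
psi_t = U(t) @ psi0                                        # Schroedinger-picture state
A_H = U(t).conj().T @ A @ U(t)                             # Heisenberg-picture operator
print(abs(np.vdot(psi_t, A @ psi_t) - np.vdot(psi0, A_H @ psi0)))  # ~0
```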

My question: is it possible to obtain (1) from (3), i.e. to show that (1) is equivalent to (3)?

Some thoughts on this: it is extensively mentioned in the literature that both pictures yield the same answers. Therefore, it should be possible to obtain (1) from (3), since (1) and (2) are equivalent. Assuming (3), the best I can get is that, given an observable $A$, the operator $$ \hat{C}:= \hat{A}_S\left(\frac{d\hat{U}}{dt} \hat{U}^\dagger - \frac{\hat{H}}{i\hbar}\right) $$ must be skew-Hermitian.

Kubav
  • LaTeX tip: use, e.g. \tag{1} to label your equations. For other configurations, it may be \eqno(1). – JamalS Jul 01 '14 at 21:53

4 Answers

9

The result can be proved in a general way using, well, math. In particular, the theory of semigroups of linear operators on Banach spaces (I know that seems advanced and maybe not physical, but it is an elegant way of proving the result you seek ;-) ).

Define the Banach space $\mathscr{L}^1(\mathscr{H})$ of trace class operators over a separable Hilbert space as the set of all bounded operators $u$ such that $\mathrm{Tr}\lvert u\rvert<\infty$. The fact that (3) holds for all observables (I will not discuss domains here for the sake of simplicity) implies that it also holds for trace class operators that do not depend explicitly on time. In this case (3) is equivalent to the following Cauchy problem on the Banach space $\mathscr{L}^1(\mathscr{H})$: $$\frac{du(t)}{dt}=L u(t)\;, \; u(0)=x$$ where $\mathscr{L}^1(\mathscr{H})\ni x\equiv \hat{A}_0$ does not depend on time, $u(t)\equiv \hat{A}_H(t)$ and $L$ is the linear operator that acts as $i[\hat{H},\cdot\,]$ (I am assuming $\hslash=1$). If $x\in D(L)$, i.e. $x$ is such that $\mathrm{Tr}\lvert [\hat{H},x]\rvert <\infty$, the solution of the Cauchy problem above is unique, because $\hat H$ is self-adjoint. We know an explicit solution by Stone's theorem, namely $$u(t)=\hat{U}^\dagger(t)\,x\, \hat{U}(t)\; ,$$ where $\hat{U}(t)=e^{-it\hat{H}}$ is the unitary group generated by $\hat{H}$, which satisfies (1) on $D(\hat{H})$. That solution is unique, so assuming (3) for all observables implies that the operator $\hat{U}$ you used to define Heisenberg-picture operators has to be exactly the group generated by $\hat{H}$, i.e. it satisfies (1).
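As a quick finite-dimensional illustration of this uniqueness statement (just a sketch, using a random Hermitian matrix as $\hat H$, $\hslash=1$; in finite dimension the domain issues are absent), one can integrate the Cauchy problem numerically and compare with the Stone-theorem solution:

```python
# Sketch: integrate du/dt = i [H, u] and compare with u(t) = U(t)^dagger x U(t),
# U(t) = exp(-i t H), for a random finite-dimensional Hermitian H and Hermitian x.
import numpy as np
from scipy.integrate import solve_ivp
from scipy.linalg import expm

rng = np.random.default_rng(0)
n = 4
M = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
H = (M + M.conj().T) / 2                       # Hermitian "Hamiltonian"
X = rng.normal(size=(n, n))
x = ((X + X.T) / 2).astype(complex)            # Hermitian initial observable

def rhs(t, y):                                 # the generator L u = i [H, u]
    u = y.reshape(n, n)
    return (1j * (H @ u - u @ H)).ravel()

t_final = 2.0
sol = solve_ivp(rhs, (0.0, t_final), x.ravel(), rtol=1e-10, atol=1e-12)
u_ode = sol.y[:, -1].reshape(n, n)

U = expm(-1j * t_final * H)
u_exact = U.conj().T @ x @ U                   # Heisenberg-picture evolution

print(np.max(np.abs(u_ode - u_exact)))         # agreement up to integration error
```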

Just for the sake of completeness: once you have solved the Cauchy problem for $x\in D(L)$, you can extend the solution to trace class operators or compact operators or bounded operators; also to unbounded operators (provided $\hat{U}^\dagger(t)\,x\, \hat{U}(t)$ makes sense on some dense domain). This is what is called a mild solution of the Cauchy problem, because we don't know a priori if we are allowed to take the derivative. However, uniqueness is usually proved under general assumptions and for mild or even weak solutions, so I think it is quite safe to conclude that $\hat{U}^\dagger(t)\,x\,\hat{U}(t)$ is the unique solution of (3) in a suitable sense.

yuggib
  • Does Stone's theorem in the above mentioned form apply in the general case, where $\hat H$ is time-dependent and the Hamiltonians at different times need not commute? Also, are trace class operators that do not depend explicitly on time guaranteed to be self-adjoint (so that they really satisfy (3))? (This goes beyond my current knowledge, apologies if my questions are rubbish.) – Kubav Jul 02 '14 at 10:09
  • Or, is there a general result saying that if (3) is satisfied by all observables, then it is satisfied by all trace class operators with no explicit time dependence? – Kubav Jul 02 '14 at 10:29
  • No, Stone's theorem does not hold for time-dependent generators; however, your equation (1) is also not precise when the generators are time-dependent. In particular, you would then obtain a two-parameter group, which is well defined only under some assumptions on the aforementioned generators, and you have a right derivative different from the left derivative. Everything gets very very technical in that case, but a similar reasoning could in principle be applied. – yuggib Jul 02 '14 at 10:32
  • As you wrote, the trace class operators are general, and everything holds in particular for self-adjoint trace class operators. Since the solution is self-adjoint, provided the initial $x$ is self-adjoint, you have a closed result that preserves self-adjointness. – yuggib Jul 02 '14 at 10:32
  • Interesting, I have lived in a belief that (1) holds even for the most general case of $\hat H$ (and that the solution can then be written in terms of path-ordered exponential). I know it is slightly off topic, but could you at least briefly support your argument? – Kubav Jul 02 '14 at 10:46
  • Well, the evolution can be rigorously defined by means of the Dyson expansion for bounded $H(t)$. When, as usual in QM, the $H(t)$ are not bounded, a lot of problems may arise in defining a two-parameter unitary group generated by them. Most of the problems are linked to the domains of definition of $H(t)$ for each $t$. For example, it is a priori possible that $D(H(t_1))\cap D(H(t_1+1))=\{0\}$. In this case you cannot apply $H(t_1+1)H(t_1)$ in succession because it is not defined, and so you see that it would not be possible to define the time-ordered exponential. – yuggib Jul 02 '14 at 11:13
  • It is anyways possible to define the unitary evolution for certain families of time dependent generators $H(t)$ that satisfy some suitable assumptions (that are really technical and usually complicated to prove). – yuggib Jul 02 '14 at 11:15
  • +1 Awesome. I was walking home thinking the real way to prove this probably invoked Stone's thm., but I wasn't sure exactly how. I'm glad you saw this post! – joshphysics Jul 02 '14 at 14:48
4

CAUTION - ANSWER INCOMPLETE There is a gap in my argument (see the end); it relies on the claim that \begin{align} - \hat O^\dagger \hat A = \hat A\hat O \end{align} for all hermitian $\hat A$ implies $\hat O = 0$, which may not be true. Please comment if you know how to prove this or know of a counterexample.

Update. Actually the claim above is definitely false in one dimension, so the ensuing argument is certainly incomplete.


Some notational clarifications.

Let me first note that (3), as you wrote it, although very much standard, is really a rather severe abuse of notation.

The difference between the "total" and "partial" derivatives in the equation is that the partial derivative term is supposed to reference the time-dependence carried by the Schrodinger picture operator itself, while the total derivative refers to that plus the additional time-dependence introduced by conjugating the operator by $\hat U(t)$.

To see this, note that if as usual we define \begin{align} \hat A_H(t) = \hat U^\dagger(t) \hat A_S(t) \hat U(t) \tag{$\star$} \end{align} then differentiating both sides with respect to time and invoking (1) yields \begin{align} \frac{d\hat A_H}{dt}(t) = \hat U^\dagger(t)\frac{d \hat A_S}{dt}(t)\hat U(t) + \frac{1}{i\hbar} [\hat U^\dagger(t)\hat A_S(t) \hat U(t), \hat H] \tag{$\star\star$} \end{align} so if we feel like good physicists who like using partial derivative symbols in rather odd ways and define \begin{align} \frac{\partial \hat A_H}{\partial t}(t) = \hat U^\dagger(t)\frac{d \hat A_S}{dt}(t)\hat U(t) \end{align} then we get precisely your equation (3).
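A minimal numerical sketch of $(\star\star)$, assuming $\hbar = 1$, a time-independent $\hat H$ (so that $\hat U(t) = e^{-it\hat H}$ solves (1)), and a toy explicitly time-dependent $\hat A_S(t)$:

```python
# Finite-difference check of (**): d A_H/dt equals
# U^+ (dA_S/dt) U + (1/(i*hbar)) [U^+ A_S U, H], with hbar = 1.
import numpy as np
from scipy.linalg import expm

H = np.array([[0.7, 0.2j], [-0.2j, -0.4]])                 # Hermitian, time-independent
A_S = lambda t: np.array([[np.cos(t), 0.1], [0.1, np.sin(t)]], dtype=complex)
dA_S = lambda t: np.array([[-np.sin(t), 0.0], [0.0, np.cos(t)]], dtype=complex)
U = lambda t: expm(-1j * t * H)                             # solves (1) for constant H
A_H = lambda t: U(t).conj().T @ A_S(t) @ U(t)               # definition (*)

t, dt = 0.8, 1e-6
lhs = (A_H(t + dt) - A_H(t - dt)) / (2 * dt)                # total derivative, numerically
rhs = (U(t).conj().T @ dA_S(t) @ U(t)
       + (A_H(t) @ H - H @ A_H(t)) / 1j)                    # (1/i) [A_H, H]
print(np.max(np.abs(lhs - rhs)))                            # small (finite-difference error)
```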

Proof that (3) $\implies$ (1).

All right, so now we know what that equation is really saying. Let's try to use it to prove (1), as you desire. We start with $(\star)$ and $(\star\star)$ and try to prove (1). In fact, plugging $(\star)$ into $(\star\star)$ and canceling the common term yields \begin{align} \frac{d \hat U^\dagger}{dt}(t) \hat A_S(t)\hat U(t) + \hat U^\dagger(t) \hat A_S(t) \frac{d\hat U}{dt}(t) = \frac{1}{i\hbar} [\hat U^\dagger(t)\hat A_S(t) \hat U(t), \hat H] \end{align} Expanding out the commutator, and multiplying both sides by $\hat U(t)$ on the left and $\hat U^\dagger(t)$ on the right, we find that \begin{align} \hat U(t) \frac{d\hat U^\dagger(t)}{dt} \hat A_S(t) + \hat A_S(t) \frac{d \hat U}{dt}(t) \hat U^\dagger(t) = -\frac{1}{i\hbar}\hat U(t)\hat H \hat U^\dagger (t) \hat A_S(t) + \frac{1}{i\hbar}\hat A_S(t) \hat U(t) \hat H \hat U^\dagger (t), \end{align} which, upon some rearrangement, gives \begin{align} \left(\hat U(t) \frac{d\hat U^\dagger(t)}{dt} + \frac{1}{i\hbar}\hat U(t)\hat H \hat U^\dagger (t)\right)\hat A_S(t) =\hat A_S(t)\left(\frac{1}{i\hbar}\hat U(t) \hat H \hat U^\dagger (t)-\frac{d \hat U}{dt}(t) \hat U^\dagger(t)\right) \end{align} Now let the term in parentheses on the right be called $\hat O(t)$; then, using the fact that $\hat A_S(t)$ is hermitian, this equation can be written as \begin{align} -\hat O^\dagger(t) \hat A_S(t) = \hat A_S(t) \hat O(t) \end{align} This holds for all hermitian $\hat A_S(t)$, so $\hat O(t) = 0$, which is to say that \begin{align} \frac{1}{i\hbar}\hat U(t) \hat H \hat U^\dagger (t)-\frac{d \hat U}{dt}(t) \hat U^\dagger(t) =0 \end{align} and (1) follows upon multiplying both sides on the right by $\hat U(t)$ and using $[\hat H,\hat U]=0$ (as noted in the question).

joshphysics
  • Thank you for your answer. Unfortunately, I don't see how it follows from $-\hat O^\dagger(t) \hat A_S^\dagger(t) = \hat A_S(t) \hat O(t)$ that $2(\hat A_S(t)\hat O(t))^\dagger = 0$ unless $\hat A_S(t)\hat O(t)$ is Hermitian, which I don't think is the case in general. Could you please clarify? Thanks. – Kubav Jul 02 '14 at 01:03
  • @Kuba You're completely right; that was sloppy. Let me think about this for a moment. If I can't salvage the answer, then I will delete it. – joshphysics Jul 02 '14 at 01:07
  • @joshphysics This might just be me missing something obvious, but does $AO+(AO)^\dagger =0$ really imply $O=0$? – Danu Jul 02 '14 at 01:49
  • @Danu Definitely not missing something obvious; my answer is currently incomplete and perhaps wrong. That equation holding for all hermitian $A$ may give the result, but I'm not sure. I wrote a "caution" to point this out in the beginning of the answer. I hesitate to delete it since someone may use what I wrote to figure this out. – joshphysics Jul 02 '14 at 01:51
  • @joshphysics Thanks a lot for responding (and I see now that you edited the answer accordingly as well). We all know how scary it is not knowing for sure whether one is asking a smart question or a really stupid one! I would certainly advise against deleting; it is quite interesting to me already! – Danu Jul 02 '14 at 01:53
  • @Danu I think it works for dimension $2$. I checked this by requiring that $O^\dagger A + AO = 0$ for four specific hermitian matrices. It seems hard to replicate my argument for higher dimensions though. Any ideas? – joshphysics Jul 02 '14 at 02:10
  • @joshphysics If the reasoning I present in the question is correct, it should hold that $[\hat H, \hat U(t)]=0$, i.e. $\hat U(t)^\dagger \hat H \hat U(t) = \hat U(t)\hat H \hat U(t)^\dagger=\hat H$, which would simplify $\hat O(t)$. – Kubav Jul 02 '14 at 02:11
  • Josh, Unfortunately I would say that my linear algebra is mediocre at best, so I'm drawing a blank. The best idea that springs to mind immediately is pinging @V.Moretti, or posting to math.SE... – Danu Jul 02 '14 at 02:16
  • @Danu I actually just realized that it's obviously false in dimension 1 since in that case any matrix O (complex number in 1d) that is the negative of its conjugate satisfies the expression for all hermitian (real in 1d) A. – joshphysics Jul 02 '14 at 02:22
  • @joshphysics Darn it, that was an obvious one after all! Hah. Could it be dimension-dependent? That does not seem very likely to me. Then again, this does seem like it is (at least partially) the right way to derive the Schrodinger eqn from the Heisenberg eqn... – Danu Jul 02 '14 at 02:24
  • @Danu Yeah hmmm. I think this is the right way...but clearly something is missing. I hope people see the cautionary remark and find a way to complete the argument. – joshphysics Jul 02 '14 at 02:27
  • @Kuba Perhaps that would help, I'm not sure. My apologies for possibly deterring others from answering this question. If we can't figure it out, then hopefully someone else can. Nice question btw. – joshphysics Jul 02 '14 at 02:28
  • @joshphysics In the finite dimensional case from dimension 2 I think it works: a real basis of all Hermitian matrices is formed by $H_{k\ell} = i|e_k\rangle\langle e_\ell| - i|e_\ell\rangle\langle e_k|$ along with the $|e_k\rangle\langle e_k|$ which we don't need. If $a_{ij}$ are the coordinates of $\widehat O$, then $\langle e_p|\widehat O^\dagger H_{k\ell}|e_q\rangle = ia_{kp}^\ast\delta_{\ell q} - ia_{\ell p}^\ast\delta_{kq}$ and $\langle e_p|H_{k\ell}\widehat O|e_q\rangle = ia_{\ell q}\delta_{pk} - ia_{kq}\delta_{p\ell}$. – doetoe Jul 02 '14 at 09:43
  • Their sum is zero. To show that $a_{kp} = 0$ with these, we need $k\ne p$, and we have to be able to choose $q = \ell$ unequal to $k$, for which we need dimension at least 2. – doetoe Jul 02 '14 at 09:51
  • For the diagonal terms you can use the $|e_k\rangle\langle e_k|$ (which we do actually need), and we see that the diagonal terms have to be purely imaginary. Not necessarily 0. From the earlier relations we see that they all have to be equal. So the only finite dimensional matrices satisfying this are the purely imaginary scalar matrices, in retrospect the obvious candidates. – doetoe Jul 02 '14 at 09:58
  • Now your final expression is equal to a purely imaginary number. Right-multiplying by $\widehat U(t)$ shows that this corresponds to adding a real scalar to the Hamiltonian. That just corresponds to changing the 0 of energy in (1) (it shifts the eigenenergies at least), which doesn't have physical significance. – doetoe Jul 02 '14 at 13:39
  • @joshphysics A much better argument: $\widehat O^\dagger A + A\widehat O = 0$ for every Hermitian operator $A$ implies (for $A = I$) that $\widehat O$ is skew hermitian, and the condition becomes $[\widehat O, A] = 0$ for every Hermitian $A$. Hence the skew-Hermitian $\widehat O$ has a common basis of eigenvectors with every Hermitian $A$. Since we are free in our choice of $A$, every vector is an eigenvector of $\widehat O$, so $\widehat O$ has to be a scalar, purely imaginary because it is skew-Hermitian, and corresponds to a constant shift in potential energy in the Hamiltonian. – doetoe Jul 02 '14 at 16:19
  • @doetoe Now that is a nice argument. I was stuck at having used the $A=I$, never thinking to plug the resulting skew-hermitian condition back into the original equation. This also neatly explains what's going on in dimension 1 where clearly any imaginary scalar satisfies the equation. Well done. – joshphysics Jul 02 '14 at 22:11
  • @doetoe Nice argument indeed. It seems to rely on the following two claims. First: if $\hat O$ has a common basis of e-vectors with every $\hat A$ hermitian, then every vector is an e-vector of $\hat O$. Second: if for some $\hat X\in L(\mathcal{H})$ have $\hat X |\psi\rangle = \lambda(\psi)|\psi\rangle$ for all $|\psi\rangle\in\mathcal{H}$, then $\lambda(\psi)=\lambda=const.$ (plus clearly have $\hat{X}^{\dagger}=-\hat{X}\implies \lambda(\psi)^\ast = -\lambda(\psi)$). – Kubav Jul 03 '14 at 00:42
  • To show the first claim: let $|\psi\rangle$ be any vector, and let $P$ be the projection onto $|\psi\rangle$, which is Hermitian. Then $\widehat O|\psi\rangle = \widehat OP|\psi\rangle = P\widehat O|\psi\rangle = |\psi\rangle\langle\psi|\widehat O|\psi\rangle$, so $|\psi\rangle$ is an eigenvector with eigenvalue $\langle\psi|\widehat O|\psi\rangle$ – doetoe Jul 03 '14 at 06:05
  • To show the second: if $v$ and $w$ are eigenvectors with different eigenvalues of some operator, then $v + w$ cannot be an eigenvector. – doetoe Jul 03 '14 at 06:10
  • @doetoe I eventually managed to get the second one too, but huge thanks for the first! This essentially completes the argument, so it might be worth editing the original answer. – Kubav Jul 03 '14 at 10:31
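A minimal numerical illustration of doetoe's argument (just a sketch, in a finite dimension $n\ge 2$): impose $\hat O^\dagger \hat A + \hat A\hat O = 0$ for a real basis of Hermitian matrices and check that the solution space consists exactly of the purely imaginary multiples of the identity.

```python
# Sketch: solutions of O^dagger A + A O = 0 for every Hermitian A (dimension n >= 2)
# should be exactly the purely imaginary multiples of the identity, per doetoe's argument.
import numpy as np

n = 3
# Real basis of n x n Hermitian matrices: E_kk, (E_kl + E_lk), i(E_kl - E_lk)
basis = []
for k in range(n):
    for l in range(k, n):
        if k == l:
            D = np.zeros((n, n), dtype=complex); D[k, k] = 1.0
            basis.append(D)
        else:
            S = np.zeros((n, n), dtype=complex); S[k, l] = S[l, k] = 1.0
            T = np.zeros((n, n), dtype=complex); T[k, l], T[l, k] = 1j, -1j
            basis.extend([S, T])

# The map O -> (O^dagger A + A O for all basis A) is real-linear in (Re O, Im O);
# build its matrix and look at the dimension of its null space.
cols = []
for idx in range(2 * n * n):
    O = np.zeros(n * n, dtype=complex)
    O[idx % (n * n)] = 1.0 if idx < n * n else 1j
    O = O.reshape(n, n)
    out = np.concatenate([(O.conj().T @ A + A @ O).ravel() for A in basis])
    cols.append(np.concatenate([out.real, out.imag]))
M = np.array(cols).T

print(2 * n * n - np.linalg.matrix_rank(M))         # null-space dimension: expected 1
O = 1j * np.eye(n)                                   # ... spanned by O = i * identity
print(max(np.max(np.abs(O.conj().T @ A + A @ O)) for A in basis))  # 0.0
```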
1

Just calculate (3) for $\hat{A}_H=\hat{U}_H$. With $\hat{U}_H(t)=\hat{U}^{\dagger}(t)\hat{U}(t)\hat{U}(t)=\hat{U}(t)$ and $\hat{H}_H=\hat{H}$ (as you said) you get:

$\frac{d\hat{U}_H}{dt}=\frac{\partial \hat{U}}{\partial t}+\frac{1}{i\hbar}\underbrace{[\hat{H},\hat{U}]}_{=0 \text{(as you said)}}\stackrel{\text{$\hat{U}=e^{\hat{H}t/(i\hbar)}$}}{=}\frac{1}{i\hbar}\hat{H}\hat{U}$

In words: $\hat{U}$ commutes with $\hat{H}$ and thus the total derivative of $\hat{U}$ equals its partial derivative, which I can calculate to be $\frac{1}{i\hbar}\hat{H}\hat{U}$. So (1) is implied by (3).

PeMa
  • It does not seem sufficient, because a priori you do not know that $U=e^{-itH}$ (if else is trivial). And $[H,U]$ is zero whenever $U$ is a function of $H$, as defined by the spectral theorem. – yuggib Jul 02 '14 at 08:06
  • But then eqn. (1) and (3) aren't connected, as long as I don't choose U to be defined like that. Or how else would you find (3)? – PeMa Jul 02 '14 at 08:26
  • You assume (3) to hold for all observables, without knowing the form of $U(t)$. – yuggib Jul 02 '14 at 08:46
  • This is not an assumption. This you can prove. That's what the OP called "straightforward", and in this proof you already use $\partial_t\hat{U}$. – PeMa Jul 02 '14 at 09:20
  • This is one side of the proof, namely (1)$\Rightarrow$(3). If you want to prove the converse, i.e. (3)$\Rightarrow$(1), you have to assume (3) without knowing (1)!! – yuggib Jul 02 '14 at 09:22
  • @yuggib I completely agree. In particular, one cannot identify $\hat{A}_H=\hat{U}_H$ in (3), because we require $A$ to be an observable, i.e. $\hat{A}_H^\dagger = \hat{A}_H$ which it is not guaranteed to be the case for $\hat{U}_H$ (in fact, $\hat{U}_H$ Hermitian iff $\hat{U}$ Hermitian, which is certainly not true e.g. when $\hat{U}=e^{-it\hat H}$, where $\hat H$ is const. in time). – Kubav Jul 02 '14 at 09:48
  • @yuggib: I don't know about (1) in the proof. There is a difference between the total and the partial derivative. – PeMa Jul 02 '14 at 11:46
  • @Kuba: To prove (3), $\hat{A}$ doesn't need to be Hermitian. – PeMa Jul 02 '14 at 11:49
0

I think you need to add a postulate to (3) in order to obtain (1). This postulate is actually assumed in every picture of the dynamics, and it is that the evolution operator forms a one-parameter group. It is such a natural assumption that no harm comes from taking it to be true. Even in classical Hamiltonian mechanics the flow on phase space forms a group.

Eq. (3) on its own, as previously stated, essentially tells you that $[U,H] = 0$, and then you can put $U(t) = f(\hat{H},t)$ for a suitable function $f$, via the spectral theorem. This of course may be expressed in full mathematical rigour at will.

Now if you ask $U$ to be a group (and take $H$ time-independent for the sake of simplicity), it should be easy to prove that $f( - , t)$ must be $\exp(i (-) t)$ (the $i$ comes from the requirement of unitarity). After all, if you write your problem in an eigenbasis for $U$ and $H$, you are looking for a function $f(E,t)$ such that $f(E,t) f(E,t') = f(E,t+t')$, for each eigenvalue $E$ of $\hat{H}$ and each $t,t'$. (Note that now the product is the one in $\mathbb{C}$, not the composition of operators!)
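Concretely, a sketch of that functional-equation step (assuming $f(E,\cdot)$ is differentiable): $$f(E,t)\,f(E,t')=f(E,t+t'),\qquad f(E,0)=1,\qquad \lvert f(E,t)\rvert=1 .$$ Differentiating with respect to $t'$ at $t'=0$ gives $$\frac{\partial f}{\partial t}(E,t)=g(E)\,f(E,t),\qquad g(E):=\frac{\partial f}{\partial t'}(E,0),$$ so $f(E,t)=e^{g(E)t}$, and unitarity ($\lvert f\rvert=1$ for all $t$) forces $g(E)=i\alpha(E)$ with $\alpha(E)$ real. Identifying $\alpha(E)$ with $-E/\hbar$ (so that $U(t)=e^{-it\hat H/\hbar}$ and (1) holds) is the remaining step, which is where the Heisenberg-picture input enters (see the comments below).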

  • As I explained in my answer, the unique solution of (3) (for trace class operators, but actually in any Banach space) has to be the unitary group generated by $H$; it is not necessary to assume the group property a priori. – yuggib Jul 02 '14 at 09:54
  • Only now have I fully understood your answer, which seems decisive to me now. Do you think that my answer answers the question "How to obtain (1) from the Heisenberg picture, i.e. $A_H(t) = U(t) A_H(0) U^{\dagger}(t)$, with some physical assumptions: $H$ does not evolve, $U$ is a group"? This is not the OP's question, which instead is "how to obtain (1) in the Heisenberg picture if we assume (3) (and nothing else)", right? – giulio bullsaver Jul 02 '14 at 10:41
  • Well, there is a one-to-one correspondence between groups and the self-adjoint operators that generate them (Stone's theorem). So asking that $U$ be a group means that $U$ satisfies (1), only with its generator on the r.h.s. In some sense, then, postulating the Heisenberg picture lets you know that the sought generator is effectively $H$. – yuggib Jul 02 '14 at 11:07