I have a conceptual question regarding the time evolution amplitude in the path integral formulation of quantum mechanics. I apologize for this long post, but I think this long text helps with stating the problem and making the actual questions clear.
Since the question is about the subtleties of the time evolution amplitude, let me introduce the "time evolution amplitude" [1], which is also called "time evolution kernel" [2], as it is laid out in some of the literature:
Definition of Kleinert (Ref. [1], page 44 (page number of PDF file: 87))
Kleinert uses in Ref. [1] the definition
$$ (\mathbf{x}_b t_b | \mathbf{x}_a t_a) \equiv \langle\mathbf{x}_b | \hat{U}(t_b,t_a) | \mathbf{x}_a \rangle \qquad\qquad (1)$$
and notes that in the case of a time-independent Hamiltonian this can be written as
$$ (\mathbf{x}_b t_b | \mathbf{x}_a t_a) = \langle\mathbf{x}_b | \exp(-\mathrm{i}\hat{H}(t_b-t_a)/\hbar | \mathbf{x}_a \rangle \,. \qquad\qquad (2)$$
Definition of Wipf (Ref. [2], page 10 (page number of PDF file: 3))
Wipf calls this quantity in Ref. [2] "time evolution kernel" and introduces it in the context of analyzing the time dependence of the wave function as
$$K(t,q,q') = \langle q| e^{-\mathrm{i}t H / \hbar} | q'\rangle \,, \qquad\qquad (3)$$
where he denotes the position eigenstates by $|q\rangle$.
Interestingly, he notes afterwards "It is the probability amplitude for the particle to propagate from $q'$ at time 0 to $q$ at time $t$ and is occasionally denoted by
$$K(t,q,q') \equiv \langle q,t | q',0\rangle \quad \textrm{"} \,. \qquad\qquad (4)$$
First thoughts about this
When interpreting this, we face the little difficulty that both authors use for definitions the symbol $\equiv$ and not $:=$ or $=:$, which have in contrast to the first one the advantage of being directed. Nevertheless, in the case of Kleinert's exposition it is clear that he uses the matrix element of the time evolution operator $\hat{U}$ to define $(\mathbf{x}_b t_b | \mathbf{x}_a t_a)$. It is important to note that for some reason, which he does not tell the reader :-), he avoids using the standard Dirac brackets here and uses round brackets instead.
In the case of the script of Wipf this interpretation can be a bit more difficult: However, because of the order of Wipf's discussion it is in my opinion not far fetched to claim that he first defines the time evolution kernel via matrix elements of the time evolution operator and then proceeds to define the overlap $\langle q,t | q',0\rangle$ based on this kernel function.
In this context, it is interesting to explore what happens, if one reads the round brackets of Kleinert's exposition as Dirac brackets: Accordingly, we now want to evaluate the expression $ \langle \mathbf{x}_b t_b | \mathbf{x}_a t_a\rangle $. We note that both, Kleinert and Wipf, use the same sign convention for the time evolution of states in the Schrödinger picture: $$ | \psi_S(t) \rangle = e^{-\mathrm{i}\hat{H}t/\hbar} | \psi_S(0) \rangle \qquad\qquad (5) \,,$$ see page 10 (page 3 of the PDF file) of Ref. [2]. The analog can be found on page 34 (page 77 of the PDF file) of Ref. [1]. If we time evolve the localized state $| \mathbf{x}_a\rangle$ in the same way as a generic state $|\psi\rangle$, we obtain $$ \langle \mathbf{x}_b, t_b | \mathbf{x}_a, t_a\rangle = \langle \mathbf{x}_b, 0 | (e^{-\mathrm{i}\hat{H}t_b/\hbar})^\dagger \, e^{-\mathrm{i}\hat{H}t_a/\hbar} | \mathbf{x}_a, 0\rangle = \langle \mathbf{x}_b | e^{-\mathrm{i}\hat{H}(t_a-t_b)/\hbar}) | \mathbf{x}_a \rangle \,. \qquad\qquad (6) $$ So, with this short calculation we arrived at an expression with the opposite sign as Eq. (2) by ignoring the fact that the brackets on the left side of Eq. (2) are not Dirac brackets. I assume that this calculation is bascially the reason, why in Eq. (2) round brackets are used and why in Eq. (4) the overlap of the localized states at different times is defined via the kernel function. Reading the discussion of the time evolution amplitude in Ref. [3] strengthens this impression.
Definition of Herzog (Ref. [3], page 2)
Herzog is similar in the notation to Wipf and gives the relation
$$ \langle x_f, t_f | x_i, t_i \rangle = \langle x_f | e^{-\mathrm{i} (t_f -t_i)/\hbar} | x_i \rangle \,, \qquad \qquad (7) $$
whereby he made some important statements regarding $|x,t\rangle$ before.
He uses for the time evolution the same sign convention as the others: $|\psi(t)\rangle_S = e^{-\mathrm{i} H t /\hbar } |\psi\rangle_H$ ($S$ for Schrödinger picture and $H$ for Heisenberg picture).
In the next step he uses $\langle{x} | \psi(t) \rangle_S = \psi(x,t)$ and $\psi(x,t) = \langle x | e^{-\mathrm{i} H t/\hbar} | \psi \rangle_H$ to motivate the definition
$$ | x, t\rangle \equiv e^{\mathrm{i} H t/\hbar} | x\rangle \,. \qquad \qquad (8)$$
So, this discussion is very clear. The statement regarding the transition amplitude is basically the same as in the scripts of Kleinert and Wipf, however, the problem with unusual time evolution of $|x,t\rangle$ is directly visible.
Final discussion
Reading the correspondings parts of the scripts of Kleinert, Wipf, and Herzog makes it clear that there is some subtlety in the time evolution of the localized state here: It has a different sign than the time evolution of the other state, what can be seen by comparing Eq. (8) with Eq. (5). While Wipf and Herzog determine this time evolution of localized states with defintions, Kleinert circumvents this problem by using round brackets implying that $|\mathbf{x}_a t_a)$ might be not directly comparable with $|\mathbf{x}_a t_a \rangle$.
While some things might have become clear, there are also some questions:
We have here the localized states which time evolve with the opposite sign than the normal states. But are also localized states with the "normal" time evolution, i.e., localized states obeying Eq. (5), thinkable?
Here are some thoughts: If yes, we would distinguish $| x_a, t_a )$ evolving with a plus (combining notation from Kleinert with the more explicit equations from Herzog) from $| x_a, t_a \rangle$ evolving with a minus. $| x_a, t_a )$ would be useful for describing the time evolution amplitude, while $| x_a, t_a \rangle$ could be useful for analyzing the time evolution of a physical system with a localized state as initial state. One could calculate, e.g., $\langle n | x_0,t \rangle$, the probability amplitude for finding the system initially localized at $x_0$ after some time $t$ in the eigenstate $n$ of some operator.
Some claim, that $ | x, t ) = e^{\mathrm{i}\hat{H} t /\hbar} | x, 0 )$ is a Heisenberg state. See, e.g., this Physics SE answer. I wonder whether this is correct. In my eyes ${}_H\langle x_f, t_f | x_i, t_i \rangle{}_H$ has basically, two problems:
- First, Heisenberg states are states, which are not time-evolving. They are usually only defined for one reference time, e.g.. $t=0$. See for example Ref. [3]. If you need the state at a different time, evolve it with the operator or use the Schrödinger state at a fixed time, which is an argument of the state. This understanding of Heisenberg states is violated by the interpretation of these states as Heisenberg states.
- Secondly, if one uses a Heisenberg state and evolves it to some other point in time, one has just the usual time evolution with the usual sign, see page 1 of Ref. [3].
In some context I can make some sense of this relation to the Heisenberg picture. If I see $ | x, t )$ as part of a projection operator $P_x(t) = | x, t ) ( x, t | $, in the Heisenberg picture this operator evolves according to $P_x(t) = e^{\mathrm{i}\hat{H} t /\hbar} | x, 0 ) ( x, 0 | e^{-\mathrm{i}\hat{H} t /\hbar} = | x, t ) ( x, t | $ yielding the correct behavior for $| x, t )$.
[1] H. Kleinert, "Path Integrals - in Quantum Mechanics, Statistics, Polymer Physics, and Financial Markets", http://users.physik.fu-berlin.de/~kleinert/b5/psfiles/pi.pdf
[2] A. Wipf, "Path Integrals", URL of chapter 2: https://www.tpi.uni-jena.de/~wipf/lectures/pfad/pfad2.pdf
[3] C. Herzog, "Notes on the Path Integral - Physics 305", http://insti.physics.sunysb.edu/~cpherzog/phys405fall2017/pathintegral.pdf