6

I know how the measurement postulate in quantum mechanics works, in regard to hermitian operators, but what if an operator is non-hermitian? Can i apply the following reasoning?

If an operator is represented by a non-hermitian matrix, i know that i can't apply the same postulate for hermitian matrices because the eigenvalues might not be real, and the eigenvalues corresponding to different eigenvalues might not be orthogonal, but if when i try to find the eigenvalues of this matrix, i find that some are real, and to those real eigenvalues, some of them have eigenvectors that are orthogonal to eachother. Given a eigenstate of a system, can i say that the possible values of a measurement of that operator in that state are the eigenvalues corresponding to each orthogonal eigenvector that i found and that each probability is the sum of the inner products of the eigenstate with the eigenvectors corresponding to a eigenvalue?

Or is there some other procedure to find expected values and probabilities of a non hermitian operator?

Qmechanic
  • 201,751
orochi
  • 323

1 Answers1

4

Many texts say that an observable should be represented by a hermitian operator. That is sufficient, but not necessary. More generally, we can use any operator that can be expressed as a linear combination of mutually commuting projection operators. Such an operator is called a normal operator. A normal operator $N$ is most simply characterized by the fact that it commutes with its own adjoint: $N^*N = NN^*$. Examples of normal operators include hermitian operators, individual projection operators, and unitary operators.

Here's an example to illustrate this idea. If $P_1,P_2,...$ are mutually orthogonal projection operators, then the operator $$ A = a_1 P_1 + a_2 P_2 + \cdots $$ is a normal operator for any choice of (possibly complex) coefficients $a_k$. The eigenvectors of this operator are mutually orthogonal because the projection operators $P_k$ are. If the coefficients are real, then $A$ is self-adjoint (hermitian). But what really matters in quantum theory is the projection operators $P_k$. These are what determine the various possible outcomes of the measurement and the relative frequencies of those outcomes. The coefficients $a_k$ are just convenient labels for the outcomes, making it possible to define statistics like mean values and standard deviations.

Using only self-adjoint operators is sufficient, because allowing complex coefficients is only allowing a more general way of labelling the various implied projection operators. Nature doesn't care how we label things.

In the first paragraph, I said "mutually commuting projection operators", which is more general than "mutually orthogonal projection operators." The latter implies the former, but not conversely. The former is needed in order to include observables like the position operator in non-relativistic quantum mechanics, which does not have (normalizable) eigenvectors. However, it still implicitly defines projection operators like $$ P\psi(x)=\begin{cases} \psi(x)&\text{ if }x\in R\\ 0 &\text{otherwise}, \end{cases} $$ where $R$ is some region of space. We can think of the usual position operator $X$ as a convenient single-operator representation of this whole algebra of mutually commuting projection operators. It's the projection operators that we use in the measurement postulates. The fact that any normal operator implicitly defines such a set of mutually commuting projection operators is the subject of the spectral decomposition theorem.

Given any observable $A$, if $P$ is one of the projection operators that it implicitly defines (through the spectral decomposition theorem), then a measurement of $A$ will result in a state $|\psi'\rangle$ that satisfies either $P|\psi'\rangle = |\psi'\rangle$ or $(1-P)|\psi'\rangle=|\psi'\rangle$. (I'm not trying to advocate any particular interpretation of quantum theory here; I'm just trying to be concise.) In terms of the state $|\psi\rangle$ prior to the measurement, the relative frequencies of these two possible outcomes are $\psi(P)$ and $\psi(1-P)$, respectively, using the abbreviation $$ \psi(\cdots)\equiv\frac{\langle\psi|\cdots|\psi\rangle}{\langle\psi|\psi\rangle}. $$ The point here is that we don't need to worry if $A$ doesn't have a complete set of (normalizable) eigenstates. As long as $A$ is a normal operator, we can still use the corresponding projection operators to make useful predictions, because each of the projection operators has (normalizable) eigenvectors. As long as they all commute with each other, we can think of this whole set of projection operators as a bunch of mutually compatible observables, each of which has only two possible outcomes.

  • A caveat (see Weinberg vol.1 p.52): a symmetry transformation is of form $U=1+i \epsilon T$ with T Hermitian. Then Weinberg says "Indeed, most (and perhaps all) of the observables of physics, arise in this way from symmetry transformations." This is the deepest reason I have read why quantum textbooks restrict to Hermitian observables. – isometry Nov 04 '18 at 09:33
  • 1
    Actually a normal operator can uniquely be decomposed into a linear combination of two mutually commuting selfadjoint operators, so selfadjoint operators are also necessary...Normal operstors viewed as observables are nothing but a pair of compatible (seladjoint) observables. – Valter Moretti Nov 04 '18 at 14:03
  • 1
    Necessity is due to the fact that the spectrum must be real if it represents the outcomes of measurements. A normal operator with real spectrum is selfadjoint necessarily. – Valter Moretti Nov 04 '18 at 14:07
  • @ValterMoretti You made a good point that any normal operator can be written as a linear combination of commuting selfadjoint operators. But the essence of an observable is its projection-valued measure (PVM), not its spectrum. We are free to label the outcomes with complex numbers, or even with words. If words are used, then we can't encode the labels into a single operator, but using a single operator to represent an observable is only for convenience. We could define an observable to be a PVM, with no coefficients, and we'd still be doing standard quantum theory. – Chiral Anomaly Nov 04 '18 at 17:45
  • It depends on your starting point. Instruments automatically give rise to real numbers, so the most natural view is considering PVMs over the Borel sets of $\mathbb R$. Complex combinations of selfadjoint operators exhaust all normal (also unbounded) operators. In this sense, physically speaking and dealing with observables normal operators are a posteriori constructions. PVMs can be constructed over any measurable space (also without any specific topology) but itdoes not mean that they automatically have physical meaning. – Valter Moretti Nov 04 '18 at 17:55
  • 1
    See also this answer of mine for my general viewpoint https://physics.stackexchange.com/questions/116595/why-is-the-application-of-probability-in-qm-fundamentally-different-from-applica/116609#116609 which I suspect to be quite close to yours. – Valter Moretti Nov 04 '18 at 18:01
  • @ValterMoretti That's another good point: we're talking about ideal observables/measurements here. I didn't specify that before, but I should have. Thanks for the clarifications! – Chiral Anomaly Nov 04 '18 at 18:05
  • @Dan Yand Indeed, otherwise POVMs are moreuseful... – Valter Moretti Nov 04 '18 at 18:06
  • 2
    @Valter Moretti "Instruments automatically give rise to real numbers". Sorry, I have discussed this before on SE and I still don't get why everyone says this. Probably because I'm not an experimentalist but doesn't a measurement just distinguish outcomes - our choice to use numbers to do this is sometimes useful (such as spin) and sometimes natural (such as distance and momentum) but not essential. I seem to be missing something obvious here. – isometry Nov 05 '18 at 07:32
  • @Valter Moretti I do get your first point though. Am I right to say this is saying that we could have an observable $p_x + i p_y$ but in practice we split this into two separate observables $ (p_x , p_y ) $ because they commute. – isometry Nov 05 '18 at 07:44
  • @Bruce Greetham Yes, you are right it was exactly what I meant regarding $p_x+ip_y$... all normal "observables" can be written that way. – Valter Moretti Nov 05 '18 at 08:03
  • Regarding instruments and real numbers, what I mean is that outcomes are usually real numbers because mass, angular momentum, spin, momentum and so on are real numbers. However you can also use complex numbers, but I think that in this case we can always re-interpret these extended observables as a linear complex combination of mutually compatible real observables as above. Mathematically this is a normal (generally unbounded) operator. – Valter Moretti Nov 05 '18 at 08:11
  • What would one make out of operators that are not even normal? – Quantumwhisp Jun 30 '22 at 15:29
  • @ChiralAnomaly yes I know they are useful for other purposes, but since the scope the question was to relate non-hermitean operators to something physical, I thought maybe there is some ontological meaning (like measurement outcomes in case of the normal operator) for non-normal operators as well, and that's the reason I asked this question. – Quantumwhisp Jul 02 '22 at 16:00
  • @Quantumwhisp I deleted my earlier comment that missed the mark. Maybe this is closer to what you're asking: some non-normal operators can be used as the Kraus operators in a generalized measurement. – Chiral Anomaly Jul 02 '22 at 19:08
  • @ChiralAnomaly thank you. Especially the generalized measurement is interesting, as one operator that might qualify for this purpose is the fermionic field operator from qft (at least it looks to me like that) – Quantumwhisp Jul 03 '22 at 17:16