Statistical mechanics definition of temperature as the average kinetic energy

Question

In most text books of statistical physics, temperature is defined as :

$$T=\left(\frac{\partial E}{\partial S}\right)_V$$

where $V$ stands for any external variable(s) the system's Hamiltonian depends on.

However, it is often suggested that the temperature is also the average kinetic energy per degree of freedom as in the Generalized Helmotz theorem. Is this second definition always valid? If so, how can you prove or explain they are the same? Are there some condition for it? How do you relate it to the definition above?

The wiki page you link to makes it clear that they are defining $T$ to be the average kinetic energy per degree of freedom. This only coincides with the statistical definition for quadratic degrees of freedom. — jacob1729, Sep 02 '20 at 12:21
Does this answer your question? When is temperature not a measure of the average kinetic energy of the particles in a substance? — BioPhysicist, Sep 02 '20 at 13:31
The question is similar but the answers I get on this thread are much closer to what I'm looking for. Less focusses on ideal gases, more on general foundations of statistical mechanics. — Benoit, Sep 02 '20 at 13:40

jacob1729 · Answer 1 · 2020-09-02T14:53:08.427

No, they're not always the same. Temperature is not a measure of energy but of uncertainty. This is made clear by a two level system with possible energies $0,\epsilon$. The mean energy $\langle E \rangle := U \in [0,\epsilon]$ is confined to a bounded set whereas the temperature can be anything you like (including negative values, corresponding to $U > \epsilon/2$).

In this setup, the temperature is a measure of how uncertain we are about which of the two states the system is in. The probabilities of the two states are:

$$ p_\alpha = \frac{e^{-E_\alpha/T}}{1+e^{-\epsilon/T}}$$

Perfect uncertainty ($p_\alpha=1/2$) corresponds to $T=\infty$. Meanwhile perfect certainty corresponds to $T=0$ - you know exactly what state the system is in.

(Note: there are two cases of 'perfect certainty' - either its definitely in the ground state or definitely excited. These both correspond to $T=0$ but from different directions. Taking a positive temperature and lowering it to $T=0^+$, you reach $U=0$ and taking a negative temperature and raising to $T=0^-$ you reach $U=\epsilon$.)

@AlekseyDruggist yep I lost a minus sign. As for the rest of your comment, that's what I said - $T=0^+$ corresponds to certainty that $\alpha=0$ ($T=0^-$ is $\alpha=1$). Meanwhile $T=\infty$ in that formula gives $p=1/2$ for both states. — jacob1729, Sep 02 '20 at 14:29

Daniel · Accepted Answer · 2020-09-03T18:51:27.813

You ask when it is true that $$ \left\langle \frac{{p}_i^2}{2m_i} \right\rangle_{t} = \frac{1}{2}k_B \left(\frac{\partial S}{\partial E}\right)^{-1} \tag{1}$$ for a Hamiltonian system.

If the Hamiltonian can be written as $$\mathcal{H} = U(\mathbf{q}) + \sum_i \frac{p_i^2}{2m_i} \tag{2} $$ the equipartition theorem implies that $$ \left\langle \frac{{p}_i^2}{2m_i} \right\rangle_{eq} = \frac{1}{2}k_B \left(\frac{\partial S}{\partial E}\right)^{-1} \tag{3}$$ where the average is taken for a microcanonical ensemble. There are thus two reasons that (1) might fail to hold:

We might have a Hamiltonian which cannot be written in the form of (2)
We might find that $ \left\langle \cdot\right\rangle_{eq} \ne \left\langle \cdot\right\rangle_{t} $

One can show that $ \left\langle \cdot\right\rangle_{eq} = \left\langle \cdot\right\rangle_{t} $ whenever both averages are well-defined and all conserved quantities can be written as functions of the energy. The latter is related to the requirement in the generalized Helmholtz theorem that certain hypersurfaces be metrically indecomposable.

Examples:

To see an example of the first reason (1) might not hold, consider a charged particle in an electromagnetic field. We have $$ \mathcal{H} = q\phi(\mathbf{r}) + \frac{1}{2m}\sum_i (p_i - qA_i(\mathbf{r}))^2 $$ This has terms linear in $\mathbf{p}$, so instead of something like (3) the equipartition theorem leaves us with $$ \left\langle p_i \cdot \frac{1}{m}(p_i - qA_i(t,\mathbf{r}))\right\rangle_{eq} = k_B T $$ or $$ \left\langle \frac{p_i^2}{2m} \right\rangle_{eq} = \frac{1}{2} k_B T + \left\langle \frac{p_i \cdot qA_i(\mathbf{r})}{2m} \right\rangle_{eq}$$

To see the second reason in action, consider the case of a particle in a one-dimensional potential $U(q)$ with two local minima $U_1 \gg U_2$ separated by a barrier of height $U_3$ at $q = 0$. Consider a particle with energy $E$ such that $U_1 < E < U_3 $. The canonical ensemble is mostly concentrated in well 2, so $$\left\langle \frac{p^2}{2m}\right\rangle_{eq} \approx E - U_2 $$ The equipartition theorem does apply here, so this tells us $$ \frac{1}{2}k_B \left(\frac{\partial S}{\partial E}\right)^{-1} \approx E - U_2 $$ If the particle is initially located in well 1 it will stay there for all time, and so $$\left\langle \frac{p^2}{2m}\right\rangle_{t} \approx E - U_1 $$ and thus (1) does not hold. This observation corresponds to the fact that we can define a conserved quantity like $$ C(q,p) = \begin{cases} 1 & \frac{p^2}{2m} + U(q) < U_3 \wedge q < 0 \\ 2 & \frac{p^2}{2m} + U(q) < U_3 \wedge q > 0 \\ 3 & \frac{p^2}{2m} + U(q) \geq U_3 \end{cases} $$ which cannot be written as a function of energy alone.

This conserved quantity is rather strange. Another, more natural example is that of two noninteracting particles in a one-dimensional box. $p_1^2$ and $p_2^2$ are each conserved separately, and so it turns out that $\langle p_1^2 \rangle_{eq} \ne \langle p_1^2 \rangle_t $.

Thanks a lot. For classical mechanics, can the Hamiltonian always be written with this form? — Benoit, Sep 02 '20 at 13:40
If you mean "classical" as opposed to "quantum", then no. One can define any function $\mathcal{H}(\mathbf{q},\mathbf{p})$ and still be studying classical mechanics. If you're looking for a realistic example, the Hamiltonian for a charged particle in electromagnetic fields has terms linear in the $p_i$. — Daniel, Sep 02 '20 at 13:44
Thanks. That would be nice to include in your answer if you wish. — Benoit, Sep 02 '20 at 14:12
My understanding of the equipartiton theorem is shallow, but I do recall that it applies to any quadratic degree of freedom. What changes if potential energy quadratic degrees of freedom are added to the Hamiltonian? In the "average energy" definition of temperature, should potential energy (say, vibrations of a diatomic molecule) be included as well as kinetic? It seems to be so. (This question has bothered me for years. Decades. I'd be very happy to have it nailed down.) — garyp, Sep 02 '20 at 15:07
@garyp, yes (under similar conditions). The equipartition theorem implies that $$\left\langle \frac{1}{2} \omega q_i^2\right\rangle_{eq} = \left\langle \frac{p_j^2}{2m} \right\rangle_{eq} $$ if $ q_i $ and $ p_j $ enter only quadratically in the Hamiltonian. Notice that this doesn't change the definition of temperature, it only provides another equivalent definition. — Daniel, Sep 02 '20 at 16:16
My thinking here is that in the case of a polyatomic molecule there is more energy than the kinetic energy that is available to transfer heat to another system in order to find a new equilibrium. In a polyatomic molecule is it the case that the average kinetic energy equal to the potential energy averaged over all degrees of freedom? That is, all of the PE is lumped together for the purposes here? — garyp, Sep 02 '20 at 17:43
It might make more sense to ask this as a question in its own right. It's pretty distinct from the question above. — Daniel, Sep 02 '20 at 17:45
You're welcome! Also, I just realized I mixed up the canonical and microcanonical ensembles before, which is why I got confused about the conditions for equivalence to time averages - fixed now. — Daniel, Sep 02 '20 at 18:58

A413 · Answer 3 · 2020-09-02T13:32:23.513

Let $\Omega(E,N,V)$ be the number of microstates with energy $E$, $N$ particles and volume $V$. Recall that $S=k \ln \Omega$. It is actually more convenient to define temperature as $\frac{1}{T}=(\frac{\partial S}{\partial E})_{V,N}$.

Let $\Phi(E)$ denote the total number of possible quantum states of a system which are characterised by energies less then $E$. Assume that we have now a system described by $f$ quantum numbers, then the energy per degree of freedom $\epsilon$ is roughly $\epsilon \approx E/f$.

Let now, $\Phi_1(\epsilon)$ be the total number of possible values which can be assumed by one particular quantum number when it contributes an amount $\epsilon$ or less to a system. It can be shown $\Phi_1(\epsilon)$ that is roughly propositional to $\epsilon$, and $\Phi(E) \approx \Phi_1(\epsilon)^f$.

Moreover in the range of energies between $E$ and $E+\delta E$, we have $\Omega(E)\approx \Phi(E)-\Phi(E+\delta E)\approx \frac{\partial \Phi}{\partial E} \delta E \approx \Phi_1^{f-1}\frac{\partial \Phi_1}{\partial \epsilon} \delta E$. Since $f$ is usually very large number (about $10^{24}$), we get $\ln \Omega \approx f \ln \Phi_1$, it follows that $\Omega \approx \Phi_1^f$ which is propositional to $E^f$. Thus $\ln \Omega \approx f\ln E + constant$. Thus when $E\approx \overline{E}$ (the average energy), we get $1/T = kf /\overline{E}$. So the temperature is indeed proportional to the average energy per degree of freedom. If the system has negligible interactions then $\overline{E}$ is the average kinetic energy.

It is important to keep in mind that in the derivation above we assumed that our system has no upper bound on the possible energy. In fact this is the case for the systems where one takes kinetic energy of particles into account. However if one focuses instead only on spin degrees of freedom, then when all the spins are lined up anti-parallel to the field a maximum of energy is reached.

A good exercise would be to consider the case of an ideal gas. In this case you get very simple analytic expressions. If you are interested you to learn more you may take a look at "Fundamentals of statistical and thermal physics" by Reif and also "Statistical Mechanics" by Pathria and Beale.

Can you be clearer about what assumptions are necessary for this result? It's clearly not true for eg an ensemble of two level systems which has a clear upper energy limit. — jacob1729, Sep 02 '20 at 12:16
@jacob1729 Thank you for the remark. A clarification has been added. — A413, Sep 02 '20 at 13:48

Statistical mechanics definition of temperature as the average kinetic energy

3 Answers3

Linked