Processing math: 66%
Skip to Tutorial Content

Chapter 5 Derivation of Jarzynski Equality

Click Start Over at the left bottom to start Back to Contents


1 Jacobian determinant

Given two vectors α=(α1,α2) and β=(β1,β2) as shown in Figure 1. The area of the parallelogram dA can be calculated as

dA=|β||α|sinθ=α1β2α2β1

So

dA=|α×β|=|α1α2β1β2|

For an irregular infinitesimal area, the slope directions at the start point are chosen to represent the parallelogram sides.

Figure 1: Relation between area of a parallelogram and cross product of two vectors.

Suppose that there exists a bijective transformation T:u=u(x,y),v=v(x,y) mapping the rectangular area in the xy plane to a parallelogram in the uv plane as shown in Figure 2. Since

du=uxdx+uydydv=vxdx+vydy

or in maxtrix form

[dudv]=[uxuyvxvy][dxdy],

the vectors dx=(dx,0) and dy=(dy,0) are transformed to du and dv as

du=[uxuyvxvy][dx0]=[uxdxvxdx]

and

dv=[uxuyvxvy][0dy]=[uydyvydy]

Using equation (1), we obtain the new area as

dAuv=|du×dv|=|[uxdxvxdx]×[uydyvydy]|=|J|dxdy

where J=|(u,v)(x,y)|=|uxvxuyvy| is the Jacobian determinant. |J| is the absolute value.

Figure 2: Schematic drawing of area transformation.

2 Liouville’s theorem

Let us start with two examples.

Example 1. Consider a free particle with mass m moving in constant velocity v in one dimension. It can be described by its position q and momenta p. Its time dependent trajectory in phase space Γ=(q,p) follows the Hamilton’s equations

˙p=Hq=0

˙q=Hp=pm

where the Hamiltonian is H=12mp2. The time evolution of the trajectory from t to t is given by the solution to the equations (3) and (4):

p=p

q=q+pm(tt)

Therefore the Jacobian determinant of the transformation from (q,p) to (q,p) is

J(t,t)=(q,p)(q,p)=|1ttm01|=1

Equation (2) yields dpdq=dpdq.

Example 2. Consider a particle of mass m subject to a linear restoring force F=kx, corresponding to the quadratic potential

V(x)=12kx2=12mω2x2

where ω=k/m. In the Hamiltonian description of classical mechanics, the system is described by the dynamical variables (x,p), and the evolution is governed by the Hamilton’s equations of motion

˙x=Hp=pm

˙p=Hx=mω2x

where H is the Hamiltonian:

H=T+V=p22m+12mω2x2

Upon taking a second derivative, equations (8) and (9) become

¨x+ω2x=0¨p+ω2p=0

The general solutions to equations (11) are

x(t)=Asin(ωt+δ)

p(t)=Amωcos(ωt+δ)

The microstate of the system at some other time t can be written down from equations (12) and (13) as

x(t)=1mωsin(ω(tt))p(t)+cos(ω(tt))x(t)

p(t)=cos(ω(tt))p(t)mωsin(ω(tt))x(t)

The Jacobian determinant of the transformation defined by equations (14) and (15) is

J(t,t)=(q,p)(q,p)=|cos(ω(tt))1mωsin(ω(tt))mωsin(ω(tt))cos(ω(tt))|=1

So we have again dxdp=dxdp.

In general, let us consider a system consisting of N particles in 3 dimensions. At any time t, each particle j is described by its position vector qj=(qj,x,qj,y,qj,z) and momenta vector pj=(pj,x,pj,y,pj,z). Thus the whole system at any time is described by a 6N dimensional vector X=(q1,p1;q2,p2;,qN,pN;) in a 6N dimensional phase space. If the transformation from time t to another time t is defined by the Hamilton’s equations of motion

˙qj,α=Hpj,α˙pj,α=Hqj,α,α=x,y,z,

then the absolute value of Jacobian determinant of the transformation is 1. This is the Liouville’s theorem.

Proof. According to equation (2), it is enough to prove the volume of the region

ΔX={Xq1Δq1,p1Δp1;;qNΔqN,pNΔpN}

is a constant with respect to t.

Using Taylor expansion, we obtain that

Δqj,α(t+δt)Δqj,α(t)+δtΔ˙qj,α(t)=Δqj,α(t)(1+δtΔ˙qj,α(t)Δqj,α(t))

Δpj,α(t+δt)Δpj,α(t)+δtΔ˙pj,α(t)=Δpj,α(t)(1+δtΔ˙pj,α(t)Δpj,α(t))

The volume of ΔX is

|ΔX(t+δt)|=ΠjΠα(Δqj,α(t+δt)Δpj,α(t+δt))|ΔX(t)|[1+δtjα(Δ˙qj,α(t)Δqj,α(t)+Δ˙pj,α(t)Δpj,α(t))]

Therefore,

ddt|ΔX(t)|=lim

This implies that \left|\Delta X(t)\right| is a constant. Thus the absolute value of the Jacobian determinant is 1.

3 Jarzynski equality

Assume the equilibrium state inside the test tube is changed with time from its initial state A to final state B along some path \gamma because external work W is done on the test tube system. The work W depend on the initial and final microscopic conditions of the system. Since the initial state of the system is a random variable, the work is a random variable. Jarzynski has proved that the average of the Boltzmann weight of the work is related to the Helmholtz free energy by the following equality (Jarzynski 1997)

\begin{equation} \overline{\exp\left(-\beta W\right)} = \exp\left(-\beta \Delta F\right) \label{eq:JE} \end{equation}

where \beta \equiv 1/k_BT. This equality tells us that the Helmholtz free energy (equilibrium information) can be extracted from the ensemble average of the Boltmann weighted work performed on the system to change the system equilibrium state from A to B. The result is independent of both the path \gamma from A to B, and the rate at which the system is changed along the path.

For simplicity, we assume that the system’s trajectory from state A to state B along a path \gamma can be characterized by a time-dependent parameter \lambda(t), 0 \le t \leq \tau. Let \boldsymbol{z} \equiv \left(\boldsymbol{q}, \boldsymbol{p}\right) denote a point in the phase space of the system, and let H(\boldsymbol{z}, \lambda) the Hamiltonian for the system, parameterized by \lambda. Let us prove equality (\ref{eq:JE}).

Proof. The work done on the system along the path \gamma from state A to state B is

\begin{equation} \begin{split} W(\boldsymbol{z}(0), \boldsymbol{z}(\tau)) & = \int_{0}^{\tau} \frac{\partial H(\boldsymbol{z}, \lambda)}{\partial \lambda} \frac{d\lambda}{dt}dt \\ & = \int_{0}^{\tau} \frac{d H(\boldsymbol{z}(t), \lambda(t))}{dt} dt \\ & = H(\boldsymbol{z}(\tau), B) - H(\boldsymbol{z}(0), A) \end{split} \label{eq:jarzW} \end{equation}

Let Z_{A} and Z_{B} be the partition function for state A and state B, and F_{A}=-\beta^{-1}\ln Z_{A} and F_{B}=-\beta^{-1}\ln Z_{B} be the free energy. Since \boldsymbol{z}_0 \equiv \boldsymbol{z}(0) is a random variable, \boldsymbol{z}_{\tau} \equiv \boldsymbol{z}(\tau) is also a random variable.

\begin{equation} \begin{split} <e^{-\beta W}> &= \int d\boldsymbol{z}_0 \frac{1}{Z_{A}}e^{-\beta H(\boldsymbol{z}_0, A)} e^{-\beta W(\boldsymbol{z}_0)} \\ & = \frac{1}{Z_{A}}\int d\boldsymbol{z}_0 e^{-\beta H(\boldsymbol{z}_{\tau}, B)} \\ & = \frac{1}{Z_{A}}\int d\boldsymbol{z}_{\tau}\left|\frac{\partial \boldsymbol{z}_{\tau}}{\partial \boldsymbol{z}_{0}}\right|^{-1} e^{-\beta H(\boldsymbol{z}_{\tau}, B)} \\ & = \frac{Z_{B}}{Z_{A}} \\ & = e^{-\beta \Delta F} \end{split} \label{eq:proofJE} \end{equation}

where \left|\frac{\partial \boldsymbol{z}_{\tau}}{\partial \boldsymbol{z}_{0}}\right|=1 is due to the Liouville’s theorem.

Proposition 3.1 \left<W\right> \geq \Delta F, which is the inequality of Clausius.

Figure 3: Exponential function is concavity.

Since exponential function is a concavity function, any secant line lies above the curve as shown in Figure 3. Therefore e^{\left<X\right>} \le \left<e^X\right>, which is called the Jensen’s inequality. Applying the Jensen’s inequality to equality (\ref{eq:JE}), we obtain that \left<W\right> \geq \Delta F.

References

Jarzynski, C. 1997. “Nonequilibrium Equality for Free Energy Differences.” Phys. Rev. Lett. 78 (14): 2690–3. https://doi.org/10.1103/PhysRevLett.78.2690.