Chapter 5 Derivation of Jarzynski Equality
Click Start Over at the left bottom to start Back to Contents
1 Jacobian determinant
Given two vectors →α=(α1,α2) and →β=(β1,β2) as shown in Figure 1. The area of the parallelogram dA can be calculated as
dA=|→β|⋅|→α|sinθ=α1β2−α2β1
So
dA=|→α×→β|=|α1α2β1β2|
For an irregular infinitesimal area, the slope directions at the start point are chosen to represent the parallelogram sides.
Figure 1: Relation between area of a parallelogram and cross product of two vectors.
Suppose that there exists a bijective transformation T:u=u(x,y),v=v(x,y) mapping the rectangular area in the xy plane to a parallelogram in the uv plane as shown in Figure 2. Since
du=uxdx+uydydv=vxdx+vydy
or in maxtrix form
[dudv]=[uxuyvxvy][dxdy],
the vectors →dx=(dx,0) and →dy=(dy,0) are transformed to →du and →dv as
→du=[uxuyvxvy][dx0]=[uxdxvxdx]
and
→dv=[uxuyvxvy][0dy]=[uydyvydy]
Using equation (1), we obtain the new area as
dAuv=|→du×→dv|=|[uxdxvxdx]×[uydyvydy]|=|J|dxdy
where J=|∂(u,v)∂(x,y)|=|uxvxuyvy| is the Jacobian determinant. |J| is the absolute value.
Figure 2: Schematic drawing of area transformation.
2 Liouville’s theorem
Let us start with two examples.
Example 1. Consider a free particle with mass m moving in constant velocity v in one dimension. It can be described by its position q and momenta p. Its time dependent trajectory in phase space Γ=(q,p) follows the Hamilton’s equations
˙p=−∂H∂q=0
˙q=∂H∂p=pm
where the Hamiltonian is H=12mp2. The time evolution of the trajectory from t to t′ is given by the solution to the equations (3) and (4):
p′=p
q′=q+pm(t′−t)
Therefore the Jacobian determinant of the transformation from (q,p) to (q′,p′) is
J(t,t′)=∂(q′,p′)∂(q,p)=|1t′−tm01|=1
Equation (2) yields dp′dq′=dpdq.
Example 2. Consider a particle of mass m subject to a linear restoring force F=−kx, corresponding to the quadratic potential
V(x)=12kx2=12mω2x2
where ω=√k/m. In the Hamiltonian description of classical mechanics, the system is described by the dynamical variables (x,p), and the evolution is governed by the Hamilton’s equations of motion
˙x=∂H∂p=pm
˙p=−∂H∂x=−mω2x
where H is the Hamiltonian:
H=T+V=p22m+12mω2x2
Upon taking a second derivative, equations (8) and (9) become
¨x+ω2x=0¨p+ω2p=0
The general solutions to equations (11) are
x(t)=Asin(ωt+δ)
p(t)=Amωcos(ωt+δ)
The microstate of the system at some other time t′ can be written down from equations (12) and (13) as
x(t′)=1mωsin(ω(t′−t))p(t)+cos(ω(t′−t))x(t)
p(t′)=cos(ω(t′−t))p(t)−mωsin(ω(t′−t))x(t)
The Jacobian determinant of the transformation defined by equations (14) and (15) is
J(t,t′)=∂(q′,p′)∂(q,p)=|cos(ω(t′−t))1mωsin(ω(t′−t))mωsin(ω(t′−t))−cos(ω(t′−t))|=−1
So we have again dx′dp′=dxdp.
In general, let us consider a system consisting of N particles in 3 dimensions. At any time t, each particle j is described by its position vector →qj=(qj,x,qj,y,qj,z) and momenta vector →pj=(pj,x,pj,y,pj,z). Thus the whole system at any time is described by a 6N dimensional vector →X=(→q1,→p1;→q2,→p2;⋯,→qN,→pN;) in a 6N dimensional phase space. If the transformation from time t to another time t′ is defined by the Hamilton’s equations of motion
˙qj,α=∂H∂pj,α˙pj,α=−∂H∂qj,α,α=x,y,z,
then the absolute value of Jacobian determinant of the transformation is 1. This is the Liouville’s theorem.
Proof. According to equation (2), it is enough to prove the volume of the region
ΔX={→X∣→q1∈Δq1,→p1∈Δp1;⋯;→qN∈ΔqN,→pN∈ΔpN}
is a constant with respect to t.
Using Taylor expansion, we obtain that
Δqj,α(t+δt)≈Δqj,α(t)+δtΔ˙qj,α(t)=Δqj,α(t)(1+δtΔ˙qj,α(t)Δqj,α(t))
Δpj,α(t+δt)≈Δpj,α(t)+δtΔ˙pj,α(t)=Δpj,α(t)(1+δtΔ˙pj,α(t)Δpj,α(t))
The volume of ΔX is
|ΔX(t+δt)|=ΠjΠα(Δqj,α(t+δt)Δpj,α(t+δt))≈|ΔX(t)|[1+δt∑j∑α(Δ˙qj,α(t)Δqj,α(t)+Δ˙pj,α(t)Δpj,α(t))]
Therefore,
ddt|ΔX(t)|=lim
This implies that \left|\Delta X(t)\right| is a constant. Thus the absolute value of the Jacobian determinant is 1.
3 Jarzynski equality
Assume the equilibrium state inside the test tube is changed with time from its initial state A to final state B along some path \gamma because external work W is done on the test tube system. The work W depend on the initial and final microscopic conditions of the system. Since the initial state of the system is a random variable, the work is a random variable. Jarzynski has proved that the average of the Boltzmann weight of the work is related to the Helmholtz free energy by the following equality (Jarzynski 1997)
\begin{equation} \overline{\exp\left(-\beta W\right)} = \exp\left(-\beta \Delta F\right) \label{eq:JE} \end{equation}
where \beta \equiv 1/k_BT. This equality tells us that the Helmholtz free energy (equilibrium information) can be extracted from the ensemble average of the Boltmann weighted work performed on the system to change the system equilibrium state from A to B. The result is independent of both the path \gamma from A to B, and the rate at which the system is changed along the path.
For simplicity, we assume that the system’s trajectory from state A to state B along a path \gamma can be characterized by a time-dependent parameter \lambda(t), 0 \le t \leq \tau. Let \boldsymbol{z} \equiv \left(\boldsymbol{q}, \boldsymbol{p}\right) denote a point in the phase space of the system, and let H(\boldsymbol{z}, \lambda) the Hamiltonian for the system, parameterized by \lambda. Let us prove equality (\ref{eq:JE}).
Proof. The work done on the system along the path \gamma from state A to state B is
\begin{equation} \begin{split} W(\boldsymbol{z}(0), \boldsymbol{z}(\tau)) & = \int_{0}^{\tau} \frac{\partial H(\boldsymbol{z}, \lambda)}{\partial \lambda} \frac{d\lambda}{dt}dt \\ & = \int_{0}^{\tau} \frac{d H(\boldsymbol{z}(t), \lambda(t))}{dt} dt \\ & = H(\boldsymbol{z}(\tau), B) - H(\boldsymbol{z}(0), A) \end{split} \label{eq:jarzW} \end{equation}
Let Z_{A} and Z_{B} be the partition function for state A and state B, and F_{A}=-\beta^{-1}\ln Z_{A} and F_{B}=-\beta^{-1}\ln Z_{B} be the free energy. Since \boldsymbol{z}_0 \equiv \boldsymbol{z}(0) is a random variable, \boldsymbol{z}_{\tau} \equiv \boldsymbol{z}(\tau) is also a random variable.
\begin{equation} \begin{split} <e^{-\beta W}> &= \int d\boldsymbol{z}_0 \frac{1}{Z_{A}}e^{-\beta H(\boldsymbol{z}_0, A)} e^{-\beta W(\boldsymbol{z}_0)} \\ & = \frac{1}{Z_{A}}\int d\boldsymbol{z}_0 e^{-\beta H(\boldsymbol{z}_{\tau}, B)} \\ & = \frac{1}{Z_{A}}\int d\boldsymbol{z}_{\tau}\left|\frac{\partial \boldsymbol{z}_{\tau}}{\partial \boldsymbol{z}_{0}}\right|^{-1} e^{-\beta H(\boldsymbol{z}_{\tau}, B)} \\ & = \frac{Z_{B}}{Z_{A}} \\ & = e^{-\beta \Delta F} \end{split} \label{eq:proofJE} \end{equation}
where \left|\frac{\partial \boldsymbol{z}_{\tau}}{\partial \boldsymbol{z}_{0}}\right|=1 is due to the Liouville’s theorem.
Proposition 3.1 \left<W\right> \geq \Delta F, which is the inequality of Clausius.
Figure 3: Exponential function is concavity.
Since exponential function is a concavity function, any secant line lies above the curve as shown in Figure 3. Therefore e^{\left<X\right>} \le \left<e^X\right>, which is called the Jensen’s inequality. Applying the Jensen’s inequality to equality (\ref{eq:JE}), we obtain that \left<W\right> \geq \Delta F.
References
Jarzynski, C. 1997. “Nonequilibrium Equality for Free Energy Differences.” Phys. Rev. Lett. 78 (14): 2690–3. https://doi.org/10.1103/PhysRevLett.78.2690.