In
statistical mechanics
In physics, statistical mechanics is a mathematical framework that applies statistical methods and probability theory to large assemblies of microscopic entities. It does not assume or postulate any natural laws, but explains the macroscopic be ...
, a semi-classical derivation of
entropy that does not take into account the
indistinguishability of particles yields an expression for entropy which is not
extensive (is not proportional to the amount of substance in question). This leads to a
paradox known as the Gibbs paradox, after
Josiah Willard Gibbs, who proposed this
thought experiment in 1874‒1875.
The paradox allows for the entropy of closed systems to decrease, violating the
second law of thermodynamics. A related paradox is the "
mixing paradox
In statistical mechanics, a semi-classical derivation of entropy that does not take into account the indistinguishability of particles yields an expression for entropy which is not extensive (is not proportional to the amount of substance in que ...
". If one takes the perspective that the definition of entropy must be changed so as to ignore particle permutation, in the
thermodynamic limit, the paradox is averted.
Illustration of the problem
Gibbs himself considered the following problem that arises if the ideal gas entropy is not extensive.
[ Reprinted in
and in ] Two identical containers of an ideal gas sit side-by-side. The gas in container #1 is identical in every respect to the gas in container #2 (i.e. in volume, mass, temperature, pressure, etc). There is a certain entropy ''S'' associated with each container which depends on the volume of each container. Now a door in the container wall is opened to allow the gas particles to mix between the containers. No macroscopic changes occur, as the system is in equilibrium. The entropy of the gas in the two-container system can be easily calculated, but if the equation is not extensive, the entropy would not be 2''S''. In fact, the non-extensive entropy quantity defined and studied by Gibbs would predict additional entropy. Closing the door then reduces the entropy again to ''S'' per box, in supposed violation of the
Second Law of Thermodynamics.
As understood by Gibbs,
and reemphasized more recently, this is a misapplication of Gibbs' non-extensive entropy quantity. If the gas particles are distinguishable, closing the doors will not return the system to its original state - many of the particles will have switched containers. There is a freedom in what is defined as ordered, and it would be a mistake to conclude the entropy had not increased. In particular, Gibbs' non-extensive entropy quantity for an ideal gas was not intended for varying numbers of particles.
The paradox is averted by concluding the
indistinguishability (at least effective indistinguishability) of the particles in the volume. This results in the
extensive Sackur–Tetrode equation for entropy, as derived next.
Calculating the entropy of ideal gas, and making it extensive
In classical mechanics, the state of an
ideal gas of energy ''U'', volume ''V'' and with ''N'' particles, each particle having mass ''m'', is represented by specifying the
momentum
In Newtonian mechanics, momentum (more specifically linear momentum or translational momentum) is the product of the mass and velocity of an object. It is a vector quantity, possessing a magnitude and a direction. If is an object's mass an ...
vector ''p'' and the position vector ''x'' for each particle. This can be thought of as specifying a point in a 6N-dimensional
phase space
In dynamical system theory, a phase space is a space in which all possible states of a system are represented, with each possible state corresponding to one unique point in the phase space. For mechanical systems, the phase space usually ...
, where each of the axes corresponds to one of the momentum or position coordinates of one of the particles. The set of points in phase space that the gas could occupy is specified by the constraint that the gas will have a particular energy:
:
and be contained inside of the volume V (let's say ''V'' is a cube of side ''X'' so that ''V''=''X''
3):
:
for
and
The first constraint defines the surface of a 3N-dimensional
hypersphere of radius (2''mU'')
1/2 and the second is a 3N-dimensional
hypercube
In geometry, a hypercube is an ''n''-dimensional analogue of a square () and a cube (). It is a closed, compact, convex figure whose 1- skeleton consists of groups of opposite parallel line segments aligned in each of the space's dimensions, ...
of volume ''V''
N. These combine to form a 6N-dimensional hypercylinder. Just as the area of the wall of a cylinder is the
circumference of the base times the height, so the area φ of the wall of this hypercylinder is:
:
The entropy is proportional to the logarithm of the number of states that the gas could have while satisfying these constraints. In classical physics, the number of states is infinitely large, but according to quantum mechanics it is finite. Before the advent of quantum mechanics, this infinity was regularized by making phase space discrete. Phase space was divided up in blocks of volume
. The constant h thus appeared as a result of a mathematical trick and thought to have no physical significance. However, using quantum mechanics one recovers the same formalism in the semi-classical limit, but now with h being Planck's constant. One can qualitatively see this from
Heisenberg's uncertainty principle; a volume in N phase space smaller than ''h''
3N (''h'' is Planck's constant) cannot be specified.
To compute the number of states we must compute the volume in phase space in which the system can be found and divide that by
. This leads us to another problem: The volume seems to approach zero, as the region in phase space in which the system can be is an area of zero thickness. This problem is an artifact of having specified the energy U with infinite accuracy. In a generic system without symmetries, a full quantum treatment would yield a discrete non-degenerate set of energy eigenstates. An exact specification of the energy would then fix the precise state the system is in, so the number of states available to the system would be one, the entropy would thus be zero.
When we specify the internal energy to be ''U'', what we really mean is that the total energy of the gas lies somewhere in an interval of length
around U. Here
is taken to be very small, it turns out that the entropy doesn't depend strongly on the choice of
for large ''N''. This means that the above "area"
must be extended to a shell of a thickness equal to an uncertainty in momentum
, so the entropy is given by:
:
where the constant of proportionality is ''k'',
Boltzmann's constant. Using
Stirling's approximation for the
Gamma function
In mathematics, the gamma function (represented by , the capital letter gamma from the Greek alphabet) is one commonly used extension of the factorial function to complex numbers. The gamma function is defined for all complex numbers except ...
which omits terms of less than order ''N'', the entropy for large ''N'' becomes:
:
This quantity is not extensive as can be seen by considering two identical volumes with the same
particle number
The particle number (or number of particles) of a thermodynamic system, conventionally indicated with the letter ''N'', is the number of constituent particles in that system. The particle number is a fundamental parameter in thermodynamics which is ...
and the same energy. Suppose the two volumes are separated by a barrier in the beginning. Removing or reinserting the wall is reversible, but the entropy increases when the barrier is removed by the amount
:
which is in contradiction to thermodynamics if you re-insert the barrier. This is the Gibbs paradox.
The paradox is resolved by postulating that the gas particles are in fact indistinguishable. This means that all states that differ only by a permutation of particles should be considered as the same state. For example, if we have a 2-particle gas and we specify ''AB'' as a state of the gas where the first particle (''A'') has momentum p
1 and the second particle (''B'') has momentum p
2, then this state as well as the ''BA'' state where the ''B'' particle has momentum p
1 and the ''A'' particle has momentum p
2 should be counted as the same state.
For an ''N''-particle gas, there are ''N''! states which are identical in this sense, if one assumes that each particle is in a different single particle state. One can safely make this assumption provided the gas isn't at an extremely high density. Under normal conditions, one can thus calculate the volume of phase space occupied by the gas, by dividing Equation 1 by ''N''!. Using the
Stirling approximation again for large ''N'', ln(''N!'') ≈ ''N'' ln(''N'') - ''N'', the entropy for large ''N'' is:
:
which can be easily shown to be extensive. This is the
Sackur–Tetrode equation.
The mixing paradox
A closely related paradox to the Gibbs paradox is the ''mixing paradox''. In fact the Gibbs paradox is a special case of the "mixing paradox" which contains all the salient features. The difference is that the mixing paradox deals with ''arbitrary'' distinctions in the two gases, not just distinctions in particle ordering as Gibbs had considered. In this sense, it is a straightforward generalization to the argument laid out by Gibbs. Again take a box with a partition in it, with gas A on one side, gas B on the other side, and both gases are at the same temperature and pressure. If gas A and B are different gases, there is an entropy that arises once the gases are mixed. If the gases are the same, no additional entropy is calculated. The additional entropy from mixing does not depend on the character of the gases; it only depends on the fact that the gases are different. The two gases may be arbitrarily similar, but the entropy from mixing does not disappear unless they are the same gas - a paradoxical discontinuity.
This "paradox" can be explained by carefully considering the definition of entropy. In particular, as concisely explained by
Edwin Thompson Jaynes,
definitions of entropy are arbitrary.
As a central example in Jaynes' paper points out, one can develop a theory that treats two gases as similar even if those gases may in reality be distinguished through sufficiently detailed measurement. As long as we do not perform these detailed measurements, the theory will have no internal inconsistencies. (In other words, it does not matter that we call gases A and B by the same name if we have not yet discovered that they are distinct.) If our theory calls gases A and B the same, then entropy does not change when we mix them. If our theory calls gases A and B different, then entropy ''does'' increase when they are mixed. This insight suggests that the ideas of "thermodynamic state" and of "entropy" are somewhat subjective.
The differential increase in entropy (dS) as a result of mixing dissimilar gases, multiplied by the temperature (T), equals the minimum amount of work we must do to restore the gases to their original separated state. Suppose that two gases are different, but that we are unable to detect their differences. If these gases are in a box, segregated from one another by a partition, how much work does it take to restore the system's original state after we remove the partition and let the gases mix?
None – simply reinsert the partition. Even though the gases have mixed, there was never a detectable change of state in the system, because by hypothesis the gases are experimentally indistinguishable.
As soon as we can distinguish the difference between gases, the work necessary to recover the pre-mixing macroscopic configuration from the post-mixing state becomes nonzero. This amount of work does not depend on how different the gases are, but only on whether they are distinguishable.
This line of reasoning is particularly informative when considering the concepts of
indistinguishable particles and
correct Boltzmann counting. Boltzmann's original expression for the number of states available to a gas assumed that a state could be expressed in terms of a number of energy "sublevels" each of which contain a particular number of particles. While the particles in a given sublevel were considered indistinguishable from each other, particles in different sublevels were considered distinguishable from particles in any other sublevel. This amounts to saying that the exchange of two particles in two different sublevels will result in a detectably different "exchange macrostate" of the gas. For example, if we consider a simple gas with ''N'' particles, at sufficiently low density that it is practically certain that each sublevel contains either one particle or none (i.e. a Maxwell–Boltzmann gas), this means that a simple container of gas will be in one of ''N!'' detectably different "exchange macrostates", one for each possible particle exchange.
Just as the mixing paradox begins with two detectably different containers, and the extra entropy that results upon mixing is proportional to the average amount of work needed to restore that initial state after mixing, so the extra entropy in Boltzmann's original derivation is proportional to the average amount of work required to restore the simple gas from some "exchange macrostate" to its original "exchange macrostate". If we assume that there is in fact no experimentally detectable difference in these "exchange macrostates" available, then using the entropy which results from assuming the particles are indistinguishable will yield a consistent theory. This is "correct Boltzmann counting".
It is often said that the resolution to the Gibbs paradox derives from the fact that, according to the quantum theory, like particles are indistinguishable in principle. By Jaynes' reasoning, if the particles are experimentally indistinguishable for whatever reason, Gibbs paradox is resolved, and quantum mechanics only provides an assurance that in the quantum realm, this indistinguishability will be true as a matter of principle, rather than being due to an insufficiently refined experimental capability.
Non-extensive entropy of two ideal gases and how to fix it
In this section, we present in rough outline a purely classical derivation of the non-extensive entropy for an ideal gas considered by Gibbs before "correct counting" (indistinguishability of particles) is accounted for. This is followed by a brief discussion of two standard methods for making the entropy extensive. Finally, we present a third method, due to R. Swendsen, for an extensive (additive) result for the entropy of two systems if they are allowed to exchange particles with each other.
[
]
Setup
We will present a simplified version of the calculation. It differs from the full calculation in three ways:
# The ideal gas consists of particles confined to one spatial dimension.
# We keep only the terms of order
, dropping all terms of size ''n'' or less, where ''n'' is the number of particles. For our purposes, this is enough, because this is where the Gibbs paradox shows up and where it must be resolved. The neglected terms play a role when the number of particles is not very large, such as in
computer simulation
Computer simulation is the process of mathematical modelling, performed on a computer, which is designed to predict the behaviour of, or the outcome of, a real-world or physical system. The reliability of some mathematical models can be dete ...
and
nanotechnology
Nanotechnology, also shortened to nanotech, is the use of matter on an atomic, molecular, and supramolecular scale for industrial purposes. The earliest, widespread description of nanotechnology referred to the particular technological goal o ...
. Also, they are needed in deriving the
Sackur–Tetrode equation.
# The subdivision of phase space into units of
Planck's constant (h) is omitted. Instead, the entropy is defined using an integral over the "accessible" portion of phase space. This serves to highlight the purely
classical nature of the calculation.
We begin with a version of
Boltzmann's entropy in which the integrand is all of accessible
phase space
In dynamical system theory, a phase space is a space in which all possible states of a system are represented, with each possible state corresponding to one unique point in the phase space. For mechanical systems, the phase space usually ...
:
:
The integral is restricted to a contour of available regions of phase space, subject to conservation of energy. In contrast to the one-dimensional
line integrals encountered in elementary physics, the contour of constant energy possesses a vast number of dimensions. The justification for integrating over phase space using the canonical measure involves the assumption of equal probability. The assumption can be made by invoking the
ergodic hypothesis
In physics and thermodynamics, the ergodic hypothesis says that, over long periods of time, the time spent by a system in some region of the phase space of microstates with the same energy is proportional to the volume of this region, i.e., th ...
as well as the
Liouville's theorem of
Hamiltonian systems.
(The ergodic hypothesis underlies the ability of a physical system to reach
thermal equilibrium, but this may not always hold for computer simulations (see the
Fermi–Pasta–Ulam–Tsingou problem) or in certain real-world systems such as
non-thermal plasmas.)
Liouville's theorem assumes a fixed number of dimensions that the system 'explores'. In calculations of entropy, the number dimensions is proportional to the number of particles in the system, which forces phase space to abruptly change dimensionality when particles are added or subtracted. This may explain the difficulties in constructing a clear and simple derivation for the dependence of entropy on the number of particles.
For the ideal gas, the accessible phase space is an
(n-1)-sphere (also called a hypersphere) in the
dimensional
space:
:
To recover the paradoxical result that entropy is not extensive, we integrate over phase space for a gas of
monatomic particles confined to a single spatial dimension by