Soft Configuration Model
   HOME

TheInfoList



OR:

In applied mathematics, the soft configuration model (SCM) is a
random graph In mathematics, random graph is the general term to refer to probability distributions over graphs. Random graphs may be described simply by a probability distribution, or by a random process which generates them. The theory of random graphs l ...
model subject to the
principle of maximum entropy The principle of maximum entropy states that the probability distribution which best represents the current state of knowledge about a system is the one with largest entropy, in the context of precisely stated prior data (such as a proposition ...
under constraints on the expectation of the
degree sequence In graph theory, the degree (or valency) of a vertex of a graph is the number of edges that are incident to the vertex; in a multigraph, a loop contributes 2 to a vertex's degree, for the two ends of the edge. The degree of a vertex v is denot ...
of sampled
graphs Graph may refer to: Mathematics *Graph (discrete mathematics), a structure made of vertices and edges **Graph theory, the study of such graphs and their properties * Graph (topology), a topological space resembling a graph in the sense of discre ...
. Whereas the configuration model (CM) uniformly samples random graphs of a specific degree sequence, the SCM only retains the specified degree sequence on average over all network realizations; in this sense the SCM has very relaxed constraints relative to those of the CM ("soft" rather than "sharp" constraints). The SCM for graphs of size n has a nonzero probability of sampling any graph of size n, whereas the CM is restricted to only graphs having precisely the prescribed connectivity structure.


Model formulation

The SCM is a
statistical ensemble In physics, specifically statistical mechanics, an ensemble (also statistical ensemble) is an idealization consisting of a large number of virtual copies (sometimes infinitely many) of a system, considered all at once, each of which represents a ...
of random graphs G having n vertices (n=, V(G), ) labeled \_^n=V(G), producing a
probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
on \mathcal_n (the set of graphs of size n). Imposed on the ensemble are n constraints, namely that the
ensemble average In physics, specifically statistical mechanics, an ensemble (also statistical ensemble) is an idealization consisting of a large number of virtual copies (sometimes infinitely many) of a system, considered all at once, each of which represents a ...
of the degree k_j of vertex v_j is equal to a designated value \widehat_j, for all v_j\in V(G). The model is fully parameterized by its size n and expected degree sequence \_^n. These constraints are both local (one constraint associated with each vertex) and soft (constraints on the ensemble average of certain observable quantities), and thus yields a
canonical ensemble In statistical mechanics, a canonical ensemble is the statistical ensemble that represents the possible states of a mechanical system in thermal equilibrium with a heat bath at a fixed temperature. The system can exchange energy with the hea ...
with an extensive number of constraints. The conditions \langle k_j \rangle = \widehat_j are imposed on the ensemble by the method of Lagrange multipliers (see
Maximum-entropy random graph model Maximum-entropy random graph models are random graph models used to study complex networks subject to the principle of maximum entropy under a set of structural constraints, which may be global, distributional, or local. Overview Any random graph ...
).


Derivation of the probability distribution

The probability \mathbb_\text(G) of the SCM producing a graph G is determined by maximizing the
Gibbs entropy The concept entropy was first developed by German physicist Rudolf Clausius in the mid-nineteenth century as a thermodynamic property that predicts that certain spontaneous processes are irreversible or impossible. In statistical mechanics, entrop ...
S /math> subject to constraints \langle k_j \rangle = \widehat_j, \ j=1,\ldots,n and normalization \sum_\mathbb_\text(G)=1. This amounts to
optimizing Mathematical optimization (alternatively spelled ''optimisation'') or mathematical programming is the selection of a best element, with regard to some criteria, from some set of available alternatives. It is generally divided into two subfiel ...
the multi-constraint Lagrange function below: : \begin & \mathcal\left(\alpha,\_^n\right) \\ pt= & -\sum_\mathbb_\text(G)\log\mathbb_\text(G) + \alpha\left(1-\sum_\mathbb_\text(G) \right)+\sum_^n\psi_j\left(\widehat_j-\sum_\mathbb_\text(G)k_j(G)\right), \end where \alpha and \_^n are the n+1 multipliers to be fixed by the n+1 constraints (normalization and the expected degree sequence). Setting to zero the derivative of the above with respect to \mathbb_\text(G) for an arbitrary G\in \mathcal_n yields : 0 = \frac= -\log \mathbb_\text(G) -1-\alpha-\sum_^n\psi_j k_j(G) \ \Rightarrow \ \mathbb_\text(G)=\frac\exp\left \sum_^n\psi_jk_j(G)\right the constant Z:=e^=\sum_\exp\left \sum_^n\psi_jk_j(G)\right\prod_\left(1+e^\right) being the partition function normalizing the distribution; the above exponential expression applies to all G\in\mathcal_n, and thus is the probability distribution. Hence we have an
exponential family In probability and statistics, an exponential family is a parametric set of probability distributions of a certain form, specified below. This special form is chosen for mathematical convenience, including the enabling of the user to calculate ...
parameterized by \_^n, which are related to the expected degree sequence \_^n by the following equivalent expressions: : \langle k_q \rangle = \sum_k_q(G)\mathbb_\text(G) = -\frac =\sum_\frac = \widehat_q, \ q=1,\ldots,n.


References

{{Reflist Random graphs