In applied mathematics, the soft configuration model (SCM) is a

random graph In mathematics, random graph is the general term to refer to probability distributions over graphs. Random graphs may be described simply by a probability distribution, or by a random process which generates them. The theory of random graphs l ...

model subject to the

principle of maximum entropy The principle of maximum entropy states that the probability distribution which best represents the current state of knowledge about a system is the one with largest entropy, in the context of precisely stated prior data (such as a proposition ...

under constraints on the expectation of the

degree sequence In graph theory, the degree (or valency) of a vertex of a graph is the number of edges that are incident to the vertex; in a multigraph, a loop contributes 2 to a vertex's degree, for the two ends of the edge. The degree of a vertex v is denot ...

of sampled

graphs Graph may refer to: Mathematics *Graph (discrete mathematics), a structure made of vertices and edges **Graph theory, the study of such graphs and their properties * Graph (topology), a topological space resembling a graph in the sense of discre ...

. Whereas the configuration model (CM) uniformly samples random graphs of a specific degree sequence, the SCM only retains the specified degree sequence on average over all network realizations; in this sense the SCM has very relaxed constraints relative to those of the CM ("soft" rather than "sharp" constraints). The SCM for graphs of size

n

has a nonzero probability of sampling any graph of size

n

, whereas the CM is restricted to only graphs having precisely the prescribed connectivity structure.

Model formulation

The SCM is a

statistical ensemble In physics, specifically statistical mechanics, an ensemble (also statistical ensemble) is an idealization consisting of a large number of virtual copies (sometimes infinitely many) of a system, considered all at once, each of which represents a ...

of random graphs

G

having

n

vertices (

n=, V(G),

) labeled

\_^n=V(G)

, producing a

probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...

\mathcal_n

(the set of graphs of size

n

). Imposed on the ensemble are

n

constraints, namely that the

ensemble average In physics, specifically statistical mechanics, an ensemble (also statistical ensemble) is an idealization consisting of a large number of virtual copies (sometimes infinitely many) of a system, considered all at once, each of which represents a ...

of the degree

k_j

of vertex

v_j

is equal to a designated value

\widehat_j

, for all

v_j\in V(G)

. The model is fully parameterized by its size

n

and expected degree sequence

\_^n

. These constraints are both local (one constraint associated with each vertex) and soft (constraints on the ensemble average of certain observable quantities), and thus yields a

canonical ensemble In statistical mechanics, a canonical ensemble is the statistical ensemble that represents the possible states of a mechanical system in thermal equilibrium with a heat bath at a fixed temperature. The system can exchange energy with the hea ...

with an extensive number of constraints. The conditions

\langle k_j \rangle = \widehat_j

are imposed on the ensemble by the method of Lagrange multipliers (see

Maximum-entropy random graph model Maximum-entropy random graph models are random graph models used to study complex networks subject to the principle of maximum entropy under a set of structural constraints, which may be global, distributional, or local. Overview Any random graph ...

Derivation of the probability distribution

The probability

\mathbb_\text(G)

of the SCM producing a graph

G

is determined by maximizing the

Gibbs entropy The concept entropy was first developed by German physicist Rudolf Clausius in the mid-nineteenth century as a thermodynamic property that predicts that certain spontaneous processes are irreversible or impossible. In statistical mechanics, entrop ...

S /math> subject to constraints \langle k_j \rangle = \widehat_j, \ j=1,\ldots,n and normalization \sum_\mathbb_\text(G)=1 . This amounts to

optimizing Mathematical optimization (alternatively spelled ''optimisation'') or mathematical programming is the selection of a best element, with regard to some criteria, from some set of available alternatives. It is generally divided into two subfiel ...

the multi-constraint Lagrange function below: :

= & -\sum_\mathbb_\text(G)\log\mathbb_\text(G) + \alpha\left(1-\sum_\mathbb_\text(G) \right)+\sum_^n\psi_j\left(\widehat_j-\sum_\mathbb_\text(G)k_j(G)\right), \end

where

\alpha

and

\_^n

are the

n+1

multipliers to be fixed by the

n+1

constraints (normalization and the expected degree sequence). Setting to zero the derivative of the above with respect to

\mathbb_\text(G)

for an arbitrary

G\in \mathcal_n

yields :

0 = \frac= -\log \mathbb_\text(G) -1-\alpha-\sum_^n\psi_j k_j(G) \ \Rightarrow \ \mathbb_\text(G)=\frac\exp\left \sum_^n\psi_jk_j(G)\right

the constant

\prod_\left(1+e^\right)

being the partition function normalizing the distribution; the above exponential expression applies to all

G\in\mathcal_n

, and thus is the probability distribution. Hence we have an

exponential family In probability and statistics, an exponential family is a parametric set of probability distributions of a certain form, specified below. This special form is chosen for mathematical convenience, including the enabling of the user to calculate ...

parameterized by

\_^n

, which are related to the expected degree sequence

\_^n

by the following equivalent expressions: :

\langle k_q \rangle = \sum_k_q(G)\mathbb_\text(G) = -\frac =\sum_\frac = \widehat_q, \ q=1,\ldots,n.

References

{{Reflist Random graphs