In
combinatorics
Combinatorics is an area of mathematics primarily concerned with counting, both as a means and as an end to obtaining results, and certain properties of finite structures. It is closely related to many other areas of mathematics and has many ...
, the twelvefold way is a systematic classification of 12 related enumerative problems concerning two finite sets, which include the classical problems of
counting
Counting is the process of determining the number of elements of a finite set of objects; that is, determining the size of a set. The traditional way of counting consists of continually increasing a (mental or spoken) counter by a unit for ever ...
permutation
In mathematics, a permutation of a set can mean one of two different things:
* an arrangement of its members in a sequence or linear order, or
* the act or process of changing the linear order of an ordered set.
An example of the first mean ...
s,
combinations,
multiset
In mathematics, a multiset (or bag, or mset) is a modification of the concept of a set that, unlike a set, allows for multiple instances for each of its elements. The number of instances given for each element is called the ''multiplicity'' of ...
s, and partitions either
of a set or
of a number. The idea of the classification is credited to
Gian-Carlo Rota
Gian-Carlo Rota (April 27, 1932 – April 18, 1999) was an Italian-American mathematician and philosopher. He spent most of his career at the Massachusetts Institute of Technology, where he worked in combinatorics, functional analysis, proba ...
, and the name was suggested by
Joel Spencer.
Overview
Let and be
finite set
In mathematics, particularly set theory, a finite set is a set that has a finite number of elements. Informally, a finite set is a set which one could in principle count and finish counting. For example,
is a finite set with five elements. Th ...
s. Let
and
be the
cardinalities of the sets. Thus is a set with elements, and is a set with elements.
The general problem we consider is the enumeration of
equivalence class
In mathematics, when the elements of some set S have a notion of equivalence (formalized as an equivalence relation), then one may naturally split the set S into equivalence classes. These equivalence classes are constructed so that elements ...
es of
functions .
The functions are subject to one of the three following restrictions:
* No condition: each in may be sent by to any in , and each may occur multiple times.
* is
injective
In mathematics, an injective function (also known as injection, or one-to-one function ) is a function that maps distinct elements of its domain to distinct elements of its codomain; that is, implies (equivalently by contraposition, impl ...
: each value
for in must be distinct from every other, and so each in may occur at most once in the
image
An image or picture is a visual representation. An image can be Two-dimensional space, two-dimensional, such as a drawing, painting, or photograph, or Three-dimensional space, three-dimensional, such as a carving or sculpture. Images may be di ...
of .
* is
surjective
In mathematics, a surjective function (also known as surjection, or onto function ) is a function such that, for every element of the function's codomain, there exists one element in the function's domain such that . In other words, for a f ...
: for each in there must be at least one in such that
, thus each will occur at least once in the image of .
(The condition " is
bijective
In mathematics, a bijection, bijective function, or one-to-one correspondence is a function between two sets such that each element of the second set (the codomain) is the image of exactly one element of the first set (the domain). Equival ...
" is only an option when
; but then it is equivalent to both " is injective" and " is surjective".)
There are four different
equivalence relation
In mathematics, an equivalence relation is a binary relation that is reflexive, symmetric, and transitive. The equipollence relation between line segments in geometry is a common example of an equivalence relation. A simpler example is equ ...
s which may be defined on the set of functions from to :
* equality;
* equality
up to Two Mathematical object, mathematical objects and are called "equal up to an equivalence relation "
* if and are related by , that is,
* if holds, that is,
* if the equivalence classes of and with respect to are equal.
This figure of speech ...
a
permutation
In mathematics, a permutation of a set can mean one of two different things:
* an arrangement of its members in a sequence or linear order, or
* the act or process of changing the linear order of an ordered set.
An example of the first mean ...
of ;
* equality up to a permutation of ;
* equality up to permutations of and .
The three conditions on the functions and the four equivalence relations can be paired in ways.
The twelve problems of counting equivalence classes of functions do not involve the same difficulties, and there is not one systematic method for solving them. Two of the problems are trivial (the number of equivalence classes is 0 or 1), five problems have an answer in terms of a multiplicative formula of and , and the remaining five problems have an answer in terms of combinatorial functions (
Stirling numbers and the
partition function for a given number of parts).
The incorporation of classical enumeration problems into this setting is as follows.
* Counting -permutations (i.e.,
partial permutations or sequences without repetition) of is equivalent to counting
injective functions .
* Counting -combinations of is equivalent to counting
injective functions up to permutations of .
* Counting permutations of the set is equivalent to counting injective functions when = , and also to counting
surjective functions when .
* Counting multisets of size (also known as -combinations with repetitions) of elements in is equivalent to counting all
functions up to permutations of .
* Counting partitions of the set into subsets is equivalent to counting all
surjective functions up to permutations of .
* Counting
composition
Composition or Compositions may refer to:
Arts and literature
*Composition (dance), practice and teaching of choreography
* Composition (language), in literature and rhetoric, producing a work in spoken tradition and written discourse, to include ...
s of the number into parts is equivalent to counting all
surjective functions up to permutations of .
Viewpoints
The various problems in the twelvefold way may be considered from different points of view.
Balls and boxes
Traditionally many of the problems in the twelvefold way have been formulated in terms of placing balls in boxes (or some similar visualization) instead of defining functions. The set can be identified with a set of balls, and with a set of boxes; the function
then describes a way to distribute the balls into the boxes, namely by putting each ball into box
. A function ascribes a unique image to each value in its domain; this property is reflected by the property that any ball can go into only one box (together with the requirement that no ball should remain outside of the boxes), whereas any box can accommodate an arbitrary number of balls. Requiring in addition
to be injective means to forbid putting more than one ball in any one box, while requiring
to be surjective means insisting that every box contain at least one ball.
Counting
modulo
In computing and mathematics, the modulo operation returns the remainder or signed remainder of a division, after one number is divided by another, the latter being called the '' modulus'' of the operation.
Given two positive numbers and , mo ...
permutations of or is reflected by calling the balls or the boxes, respectively, "indistinguishable". This is an imprecise formulation, intended to indicate that different configurations are not to be counted separately if one can be transformed into the other by some interchange of balls or of boxes. This possibility of transformation is formalized by the action by permutations.
Sampling
Another way to think of some of the cases is in terms of
sampling, in
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
. Imagine a population of items (or people), of which we choose . Two different schemes are normally described, known as "
sampling with replacement" and "
sampling without replacement". In the former case (sampling with replacement), once we've chosen an item, we put it back in the population, so that we might choose it again. The result is that each choice is
independent
Independent or Independents may refer to:
Arts, entertainment, and media Artist groups
* Independents (artist group), a group of modernist painters based in Pennsylvania, United States
* Independentes (English: Independents), a Portuguese artist ...
of all the other choices, and the set of samples is technically referred to as
independent identically distributed. In the latter case, however, once we have chosen an item, we put it aside so that we can not choose it again. This means that the act of choosing an item has an effect on all the following choices (the particular item can not be seen again), so our choices are dependent on one another.
A second distinction among sampling schemes is whether ordering matters. For example, if we have ten items, of which we choose two, then the choice (4, 7) is different from (7, 4) if ordering matters; on the other hand, if ordering does not matter, then the choices (4, 7) and (7, 4) are equivalent.
The first two rows and columns of the table below correspond to sampling with and without replacement, with and without consideration of order. The cases of sampling with replacement are found in the column labeled "Any
", while the cases of sampling without replacement are found in the column labeled "Injective
". The cases where ordering matters are found in the row labeled "Distinct," and the cases where ordering does not matter are found in the row labeled "
orbits". Each table entry indicates how many different sets of choices there are, in a particular sampling scheme. Three of these table entries also correspond to
probability distributions
In probability theory and statistics, a probability distribution is a function that gives the probabilities of occurrence of possible events for an experiment. It is a mathematical description of a random phenomenon in terms of its sample spac ...
. Sampling with replacement where ordering matters is comparable to describing the
joint distribution of separate
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
s, each with an -fold
categorical distribution
In probability theory and statistics, a categorical distribution (also called a generalized Bernoulli distribution, multinoulli distribution) is a discrete probability distribution that describes the possible results of a random variable that can ...
. Sampling with replacement where ordering does not matter, however, is comparable to describing a single
multinomial distribution of draws from an -fold category, where only the number seen of each category matters. Sampling without replacement where ordering does not matter is comparable to a single
multivariate hypergeometric distribution. Sampling without replacement where order does matter does not seem to correspond to a probability distribution. In all the injective cases (sampling without replacement), the number of sets of choices is zero unless . ("Comparable" in the above cases means that each element of the
sample space
In probability theory, the sample space (also called sample description space, possibility space, or outcome space) of an experiment or random trial is the set of all possible outcomes or results of that experiment. A sample space is usually den ...
of the corresponding distribution corresponds to a separate set of choices, and hence the number in the appropriate box indicates the size of the sample space for the given distribution.)
From the perspective of sampling, the column labeled "Surjective
" is somewhat strange: Essentially, we keep sampling with replacement until we have chosen each item at least once. Then, we count how many choices we have made, and if it is not equal to , throw out the entire set and repeat. This is vaguely comparable to the
coupon collector's problem, where the process involves "collecting" (by sampling with replacement) a set of coupons until each coupon has been seen at least once. In all surjective cases, the number of sets of choices is zero unless .
Labelling, selection, grouping
A function
can be considered from the perspective of or of . This leads to different views:
* the function
''labels'' each element of by an element of .
* the function
''selects'' (chooses) an
element of the set for each element of , a total of choices.
* the function
''groups'' the elements of together that are mapped to the same element of .
These points of view are not equally suited to all cases. The labelling and selection points of view are not well compatible with permutation of the elements of , since this changes the labels or the selection; on the other hand the grouping point of view does not give complete information about the configuration ''unless'' the elements of may be freely permuted. The labelling and selection points of view are more or less equivalent when is not permuted, but when it is, the selection point of view is more suited. The selection can then be viewed as an unordered selection: a single choice of a (multi-)set of elements from is made.
Labelling and selection with or without repetition
When viewing
as a labelling of the elements of , the latter may be thought of as arranged in a sequence, and the labels from as being successively assigned to them. A requirement that
be injective means that no label can be used a second time; the result is a sequence of labels ''without repetition''. In the absence of such a requirement, the terminology "sequences with repetition" is used, meaning that labels may be used more than once (although sequences that happen to be without repetition are also allowed).
When viewing
as an unordered selection of the elements of , the same kind of distinction applies. If
must be injective, then the selection must involve distinct elements of , so it is a subset of of size , also called an -
combination. Without the requirement, one and the same element of may occur multiple times in the selection, and the result is a
multiset
In mathematics, a multiset (or bag, or mset) is a modification of the concept of a set that, unlike a set, allows for multiple instances for each of its elements. The number of instances given for each element is called the ''multiplicity'' of ...
of size of elements from , also called an -
multicombination or -combination with repetition.
The requirement that
be surjective, from the viewpoint of labelling elements of , means that every label is to be used at least once; from the viewpoint of selection from , it means that every element of must be included in the selection at least once. Labelling with surjection is equivalent to a grouping of elements of followed by labeling each group by an element of , and is accordingly somewhat more complicated to describe mathematically.
Partitions of sets and numbers
When viewing
as a grouping of the elements of (which assumes one identifies under permutations of ), requiring
to be surjective means the number of groups must be exactly . Without this requirement the number of groups can be at most . The requirement of injective
means each element of must be a group in itself, which leaves at most one valid grouping and therefore gives a rather uninteresting counting problem.
When in addition one identifies under permutations of , this amounts to forgetting the groups themselves but retaining only their sizes. These sizes moreover do not come in any definite order, while the same size may occur more than once; one may choose to arrange them into a weakly decreasing list of numbers, whose sum is the number . This gives the combinatorial notion of a
partition of the number , into exactly (for surjective
) or at most (for arbitrary
) parts.
Formulas
Formulas for the different cases of the twelvefold way are summarized in the following table; each table entry links to a subsection below explaining the formula.
The particular notations used are:
* the
falling factorial power ,
* the
rising factorial power ,
* the
factorial
In mathematics, the factorial of a non-negative denoted is the Product (mathematics), product of all positive integers less than or equal The factorial also equals the product of n with the next smaller factorial:
\begin
n! &= n \times ...
* the
Stirling number of the second kind , denoting the number of ways to
partition a set of elements into non-empty subsets
* the
binomial coefficient
In mathematics, the binomial coefficients are the positive integers that occur as coefficients in the binomial theorem. Commonly, a binomial coefficient is indexed by a pair of integers and is written \tbinom. It is the coefficient of the t ...
* the
Iverson bracket encoding a truth value as 0 or 1
* the number
of
partitions of into parts
Intuitive meaning of the rows and columns
This is a quick summary of what the different cases mean. The cases are described in detail below.
Think of a set of numbered items (numbered from 1 to ), from which we choose , yielding an ordered list of the items: e.g. if there are
items of which we choose
, the result might be the list (5, 2, 10). We then count how many different such lists exist, sometimes first transforming the lists in ways that reduce the number of distinct possibilities.
Then the columns mean:
; Any : After we choose an item, we put it back, so we might choose it again.
; Injective : After we choose an item, we set it aside, so we can't choose it again; hence we'll end up with distinct items. Necessarily, then, unless
, no lists can be chosen at all.
; Surjective : After we choose an item, we put it back, so we might choose it again — but at the end, we have to end up having chosen each item at least once. Necessarily, then, unless
, no lists can be chosen at all.
And the rows mean:
; Distinct: Leave the lists alone; count them directly.
;
orbits: Before counting, sort the lists by the item number of the items chosen, so that order doesn't matter, e.g., (5, 2, 10), (10, 2, 5), (2, 10, 5) → (2, 5, 10).
;
orbits: Before counting, renumber the items seen so that the first item seen has number 1, the second 2, etc. Numbers may repeat if an item was seen more than once, e.g., (3, 5, 3), (5, 2, 5), (4, 9, 4) → (1, 2, 1) while (3, 3, 5), (5, 5, 3), (2, 2, 9) → (1, 1, 2).
;
×
orbits: Two lists count as the same if it is possible to both reorder and relabel them as above and produce the same result. For example, (3, 5, 3) and (2, 9, 9) count as the same because they can be reordered as (3, 3, 5) and (9, 9, 2) and then relabeling both produces the same list (1, 1, 2).
Intuitive meaning of the chart using balls and boxes scenario
The chart below is similar to the chart above, but instead of showing the formulas, it gives an intuitive understanding of their meaning using the familiar balls and boxes example. The rows represent the distinctness of the balls and boxes. The columns represent if multi-packs (more than one ball in one box), or empty boxes are allowed. The cells in the chart show the question that is answered by solving the formula given in the formula chart above.
Details of the different cases
The cases below are ordered in such a way as to group those cases for which the arguments used in counting are related, which is not the ordering in the table given.
Functions from to
This case is equivalent to counting sequences of elements of with no restriction: a function is determined by the images of the elements of , which can each be independently chosen among the elements of . This gives a total of
possibilities.
Example:
Injective functions from to
This case is equivalent to counting sequences of ''distinct'' elements of , also called -permutations of , or sequences without repetitions; again this sequence is formed by the images of the elements of . This case differs from the one of unrestricted sequences in that there is one choice fewer for the second element, two fewer for the third element, and so on. Therefore instead of by an ordinary power of , the value is given by a
falling factorial power of , in which each successive factor is one fewer than the previous one. The formula is
:
Note that if then one obtains a factor zero, so in this case there are no injective functions at all; this is just a restatement of the
pigeonhole principle
In mathematics, the pigeonhole principle states that if items are put into containers, with , then at least one container must contain more than one item. For example, of three gloves, at least two must be right-handed or at least two must be l ...
.
Example:
Injective functions from to , up to a permutation of
This case is equivalent to counting subsets with elements of , also called -combinations of : among the sequences of distinct elements of , those that differ only in the order of their terms are identified by permutations of . Since in all cases this groups together exactly ! different sequences, we can divide the number of such sequences by ! to get the number of -combinations of . This number is known as the
binomial coefficient
In mathematics, the binomial coefficients are the positive integers that occur as coefficients in the binomial theorem. Commonly, a binomial coefficient is indexed by a pair of integers and is written \tbinom. It is the coefficient of the t ...
, which is therefore given by
:
Example:
Functions from to , up to a permutation of
This case is equivalent to counting multisets with elements from (also called -
multicombinations). The reason is that for each element of it is determined how many elements of are mapped to it by , while two functions that give the same such "multiplicities" to each element of can always be transformed into another by a permutation of . The formula counting all functions is not useful here, because the number of them grouped together by permutations of varies from one function to another. Rather, as explained under
combinations
In mathematics, a combination is a selection of items from a set that has distinct members, such that the order of selection does not matter (unlike permutations). For example, given three fruits, say an apple, an orange and a pear, there are t ...
, the number of -multicombinations from a set with elements can be seen to be the same as the number of -combinations from a set with elements. This reduces the problem to
another one in the twelvefold way, and gives as result
:
Example:
Surjective functions from to , up to a permutation of
This case is equivalent to counting
multiset
In mathematics, a multiset (or bag, or mset) is a modification of the concept of a set that, unlike a set, allows for multiple instances for each of its elements. The number of instances given for each element is called the ''multiplicity'' of ...
s with elements from , for which each element of occurs at least once. This is also equivalent to counting the
compositions of with (non-zero) terms, by listing the multiplicities of the elements of in order. The correspondence between functions and multisets is the same as in the previous case, and the surjectivity requirement means that all multiplicities are at least one. By decreasing all multiplicities by 1, this reduces to the previous case; since the change decreases the value of by , the result is
:
Note that when < there are no surjective functions at all (a kind of "empty pigeonhole" principle); this is taken into account in the formula, by the convention that binomial coefficients are always 0 if the lower index is negative. The same value is also given by the expression
:
except in the extreme case , where with the former expression correctly gives
, while the latter incorrectly gives
.
The form of the result suggests looking for a manner to associate a class of surjective functions directly to a subset of elements chosen from a total of , which can be done as follows. First choose a
total ordering
In mathematics, a total order or linear order is a partial order in which any two elements are comparable. That is, a total order is a binary relation \leq on some set X, which satisfies the following for all a, b and c in X:
# a \leq a ( re ...
of the sets and , and note that by applying a suitable permutation of , every surjective function can be transformed into a unique
weakly increasing (and of course still surjective) function. If one connects the elements of in order by arcs into a
linear graph, then choosing any subset of arcs and removing the rest, one obtains a graph with connected components, and by sending these to the successive elements of , one obtains a weakly increasing surjective function ; also the sizes of the connected components give a composition of into parts. This argument is basically the one given at
stars and bars, except that there the complementary choice of "separations" is made.
Example:
Injective functions from to , up to a permutation of
In this case we consider sequences of distinct elements from , but identify those obtained from one another by applying to each element a permutation of . It is easy to see that two different such sequences can always be identified: the permutation must map term of the first sequence to term of the second sequence, and since no value occurs twice in either sequence these requirements do not contradict each other; it remains to map the elements not occurring in the first sequence bijectively to those not occurring in the second sequence in an arbitrary way. The only fact that makes the result depend on and at all is that the existence of any such sequences to begin with requires , by the pigeonhole principle. The number is therefore expressed as