HOME

TheInfoList



OR:

In modern computer science and
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
, the complexity index of a function denotes the level of informational content, which in turn affects the difficulty of learning the function from examples. This is different from
computational complexity In computer science, the computational complexity or simply complexity of an algorithm is the amount of resources required to run it. Particular focus is given to computation time (generally measured by the number of needed elementary operations) ...
, which is the difficulty to compute a function. Complexity indices characterize the entire class of functions to which the one we are interested in belongs. Focusing on Boolean functions, the ''detail'' of a class \mathsf C of Boolean functions ''c'' essentially denotes how deeply the class is articulated.


Technical definition

To identify this index we must first define a ''sentry function'' of \mathsf C. Let us focus for a moment on a single function ''c'', call it a ''concept'' defined on a set \mathcal X of elements that we may figure as points in a Euclidean space. In this framework, the above function associates to ''c'' a set of points that, since are defined to be external to the concept, prevent it from expanding into another function of \mathsf C. We may dually define these points in terms of sentinelling a given concept ''c'' from being fully enclosed (invaded) by another concept within the class. Therefore, we call these points either ''sentinels'' or ''sentry points''; they are assigned by the sentry function \boldsymbol S to each concept of \mathsf C in such a way that: # the sentry points are external to the concept ''c'' to be sentineled and internal to at least one other including it, # each concept c' including ''c'' has at least one of the sentry points of ''c'' either in the gap between ''c'' and c', or outside c' and distinct from the sentry points of c', and # they constitute a minimal set with these properties. The technical definition coming from is rooted in the inclusion of an augmented concept c^+ made up of ''c'' plus its sentry points by another \left(c'\right)^+ in the same class.


Definition of sentry function

For a concept class \mathsf C on a space \mathfrak X, a ''sentry function'' is a total function \boldsymbol S: \mathsf C\cup\\mapsto 2^ satisfying the following conditions: # Sentinels are outside the sentineled concept (c\cap(c)=\emptyset for all c\in \mathsf C). # Sentinels are inside the invading concept (Having introduced the sets c^+=c\cup\boldsymbol S(c), an invading concept c'\in \mathsf C is such that c'\not\subseteq c and c^+\subseteq \left(c'\right)^+. Denoting \mathrm(c) the set of concepts invading ''c'', we must have that if c_2\in\mathrm(c_1), then c_2\cap(c_1)\neq\emptyset). # (c) is a minimal set with the above properties (No '\neq exists satisfying (1) and (2) and having the property that \boldsymbol S'(c)\subseteq \boldsymbol S(c) for every c\in \mathsf C). # Sentinels are honest guardians. It may be that c\subseteq \left(c'\right)^+ but (c)\cap c'=\emptyset so that c'\not\in\mathrm(c). This however must be a consequence of the fact that all points of (c) are involved in really sentineling ''c'' against other concepts in \mathrm(c) and not just in avoiding inclusion of c^+ by (c')^+. Thus if we remove c', (c) remains unchanged (Whenever c_1 and c_2 are such that c_1\subset c_2\cup(c_2) and c_2\cap(c_1)=\emptyset, then the restriction of to \\cup\mathrm(c_1)-\ is a sentry function on this set). (c) is the ''frontier'' of ''c'' upon \boldsymbol S. With reference to the picture on the right, \ is a candidate frontier of c_0 against c_1,c_2,c_3,c_4. All points are in the gap between a c_i and c_0. They avoid inclusion of c_0\cup\ in c_3, provided that these points are not used by the latter for sentineling itself against other concepts. ''Vice versa'' we expect that c_1 uses x_1 and x_3 as its own sentinels, c_2 uses x_2 and x_3 and c_4 uses x_1 and x_2 analogously. Point x_4 is not allowed as a c_0 sentry point since, like any diplomatic seat, it should be located outside all other concepts just to ensure that it is not occupied in case of invasion by c_0.


Definition of detail

The frontier size of the most expensive concept to be sentineled with the least efficient sentineling function, i.e. the quantity :\mathrm D_=\sup_\#(c), is called ''detail'' of \mathsf C. \boldsymbol S spans also over sentry functions on subsets of \mathfrak X sentineling in this case the intersections of the concepts with these subsets. Actually, proper subsets of \mathfrak X may host sentineling tasks that prove harder than those emerging with \mathfrak X itself. The detail \mathrm D_ is a complexity measure of concept classes dual to the VC dimension \mathrm D_. The former uses points to separate sets of concepts, the latter concepts for partitioning sets of points. In particular the following inequality holds :\mathrm D_\leq \mathrm D_+1 See also Rademacher complexity for a recently introduced class complexity index.


Example: continuous spaces

Class ''C'' of circles in \mathbb R^2 has detail \mathrm D_=2, as shown in the picture on left below. Similarly, for the class of segments on \mathbb R, as shown in the picture on right.


Example: discrete spaces

The class \mathsf C=\ on \mathfrak X=\ whose concepts are illustrated in the following scheme, where "+" denotes an element x_j belonging to c_i, "-" an element outside c_i, and ⃝ a sentry point: This class has \mathrm D_=2. As usual we may have different sentineling functions. A worst case , as illustrated, is: \mathbf S(c_1)=\, \mathbf S(c_2)=\, \mathbf S(c_3)=\, \mathbf S(c_4)=\emptyset. However a cheaper one is \mathbf S(c_1)=\, \mathbf S(c_2)=\, \mathbf S(c_3)=\, \mathbf S(c_4)=\emptyset:


References

* * {{cite journal , doi=10.1016/S0304-3975(95)00240-5 , author=Apolloni, B. , author2=Chiaravalli, S. , title=PAC learning of concept classes through the boundaries of their items , journal=Theoretical Computer Science , volume=172 , issue=1–2 , year=1997 , pages=91–120 , doi-access=free , ref={{harvid, Apolloni, 1997 Computational complexity theory Algorithmic inference