Primal Shatter Function
   HOME

TheInfoList



OR:

A class of sets is said to shatter another set if it is possible to "pick out" any element of that set using
intersection In mathematics, the intersection of two or more objects is another object consisting of everything that is contained in all of the objects simultaneously. For example, in Euclidean geometry, when two lines in a plane are not parallel, their ...
. The concept of shattered sets plays an important role in Vapnik–Chervonenkis theory, also known as VC-theory. Shattering and VC-theory are used in the study of
empirical process In probability theory, an empirical process is a stochastic process that characterizes the deviation of the empirical distribution function from its expectation. In mean field theory, limit theorems (as the number of objects becomes large) are con ...
es as well as in statistical
computational learning theory In computer science, computational learning theory (or just learning theory) is a subfield of artificial intelligence devoted to studying the design and analysis of machine learning algorithms. Overview Theoretical results in machine learning m ...
.


Definition

Suppose ''A'' is a
set Set, The Set, SET or SETS may refer to: Science, technology, and mathematics Mathematics *Set (mathematics), a collection of elements *Category of sets, the category whose objects and morphisms are sets and total functions, respectively Electro ...
and ''C'' is a
class Class, Classes, or The Class may refer to: Common uses not otherwise categorized * Class (biology), a taxonomic rank * Class (knowledge representation), a collection of individuals or objects * Class (philosophy), an analytical concept used d ...
of sets. The class ''C'' shatters the set ''A'' if for each subset ''a'' of ''A'', there is some element ''c'' of ''C'' such that : a = c \cap A. Equivalently, ''C'' shatters ''A'' when their
intersection In mathematics, the intersection of two or more objects is another object consisting of everything that is contained in all of the objects simultaneously. For example, in Euclidean geometry, when two lines in a plane are not parallel, their ...
is equal to ''As
power set In mathematics, the power set (or powerset) of a set is the set of all subsets of , including the empty set and itself. In axiomatic set theory (as developed, for example, in the ZFC axioms), the existence of the power set of any set is po ...
: ''P''(''A'') = . We employ the letter ''C'' to refer to a "class" or "collection" of sets, as in a Vapnik–Chervonenkis class (VC-class). The set ''A'' is often assumed to be
finite Finite may refer to: * Finite set, a set whose cardinality (number of elements) is some natural number * Finite verb, a verb form that has a subject, usually being inflected or marked for person and/or tense or aspect * "Finite", a song by Sara Gr ...
because, in empirical processes, we are interested in the shattering of finite sets of data points.


Example

We will show that the class of all discs in the
plane Plane most often refers to: * Aero- or airplane, a powered, fixed-wing aircraft * Plane (geometry), a flat, 2-dimensional surface * Plane (mathematics), generalizations of a geometrical plane Plane or planes may also refer to: Biology * Plane ...
(two-dimensional space) does not shatter every set of four points on the
unit circle In mathematics, a unit circle is a circle of unit radius—that is, a radius of 1. Frequently, especially in trigonometry, the unit circle is the circle of radius 1 centered at the origin (0, 0) in the Cartesian coordinate system in the Eucli ...
, yet the class of all
convex set In geometry, a set of points is convex if it contains every line segment between two points in the set. For example, a solid cube (geometry), cube is a convex set, but anything that is hollow or has an indent, for example, a crescent shape, is n ...
s in the plane does shatter every finite set of points on the
unit circle In mathematics, a unit circle is a circle of unit radius—that is, a radius of 1. Frequently, especially in trigonometry, the unit circle is the circle of radius 1 centered at the origin (0, 0) in the Cartesian coordinate system in the Eucli ...
. Let ''A'' be a set of four points on the unit circle and let ''C'' be the class of all discs. To test where ''C'' shatters ''A'', we attempt to draw a disc around every subset of points in ''A''. First, we draw a disc around the subsets of each isolated point. Next, we try to draw a disc around every subset of point pairs. This turns out to be doable for adjacent points, but impossible for points on opposite sides of the circle. Any attempt to include those points on the opposite side will necessarily include other points not in that pair. Hence, any pair of opposite points cannot be isolated out of ''A'' using intersections with class ''C'' and so ''C'' does not shatter ''A''. As visualized below: Image:shattering02.png, Each individual point can be isolated with a disc (showing all four). Image:shattering03.png, Each subset of adjacent points can be isolated with a disc (showing one of four). Image:shattering04.png, A subset of points on opposite sides of the unit circle can ''not'' be isolated with a disc. Because there is some subset which can ''not'' be isolated by any disc in ''C'', we conclude then that ''A'' is not shattered by ''C''. And, with a bit of thought, we can prove that no set of four points is shattered by this ''C''. However, if we redefine ''C'' to be the class of all ''elliptical discs'', we find that we can still isolate all the subsets from above, as well as the points that were formerly problematic. Thus, this specific set of 4 points is shattered by the class of elliptical discs. Visualized below: Image:shattering05.png, Opposite points of ''A'' are now separable by some ellipse (showing one of two) Image:shattering06.png, Each subset of three points in ''A'' is also separable by some ellipse (showing one of four) With a bit of thought, we could generalize that any set of finite points on a unit circle could be shattered by the class of all
convex set In geometry, a set of points is convex if it contains every line segment between two points in the set. For example, a solid cube (geometry), cube is a convex set, but anything that is hollow or has an indent, for example, a crescent shape, is n ...
s (visualize connecting the dots).


Shatter coefficient

To quantify the richness of a collection ''C'' of sets, we use the concept of ''shattering coefficients'' (also known as the ''growth function''). For a collection ''C'' of sets s \subset \Omega, \Omega being any space, often a
sample space In probability theory, the sample space (also called sample description space, possibility space, or outcome space) of an experiment or random trial is the set of all possible outcomes or results of that experiment. A sample space is usually den ...
, we define the ''n''th ''shattering coefficient'' of ''C'' as : S_C(n) = \max_ \operatorname \ where \operatorname denotes the
cardinality The thumb is the first digit of the hand, next to the index finger. When a person is standing in the medical anatomical position (where the palm is facing to the front), the thumb is the outermost digit. The Medical Latin English noun for thum ...
of the set and x_1,x_2,\dots,x_n \in \Omega is any set of ''n'' points,. S_C(n) is the largest number of subsets of any set ''A'' of ''n'' points that can be formed by intersecting ''A'' with the sets in collection ''C''. For example, if set ''A'' contains 3 points, its power set, P(A), contains 2^3=8 elements. If ''C'' shatters ''A'', its shattering coefficient(3) would be 8 and S_C(2) would be 2^2=4. However, if one of those sets in P(A) cannot be obtained through intersections in ''c'', then S_C(3) would only be 7. If none of those sets can be obtained, S_C(3) would be 0. Additionally, if S_C(2)=3, for example, then there is an element in the set of all 2-point sets from ''A'' that cannot be obtained from intersections with ''C''. It follows from this that S_C(3) would also be less than 8 (i.e. ''C'' would not shatter ''A'') because we have already located a "missing" set in the smaller power set of 2-point sets. This example illustrates some properties of S_C(n): # S_C(n)\leq 2^n for all ''n'' because \\subseteq P(A) for any A\subseteq \Omega. # If S_C(n)=2^n, that means there is a set of cardinality ''n'', which can be shattered by ''C''. # If S_C(N)<2^N for some N>1 then S_C(n)<2^n for all n\geq N. The third property means that if ''C'' cannot shatter any set of cardinality ''N'' then it can not shatter sets of larger cardinalities.


Vapnik–Chervonenkis class

If ''A'' cannot be shattered by ''C'', there will be a smallest value of ''n'' that makes the shatter coefficient(n) less than 2^n because as ''n'' gets larger, there are more sets that could be missed. Alternatively, there is also a largest value of ''n'' for which the S_C(n) is still 2^n, because as ''n'' gets smaller, there are fewer sets that could be omitted. The extreme of this is S_C(0) (the shattering coefficient of the empty set), which must always be 2^0=1. These statements lends themselves to defining the
VC dimension VC may refer to: Military decorations * Victoria Cross, a military decoration awarded by the United Kingdom and other Commonwealth nations ** Victoria Cross for Australia ** Victoria Cross (Canada) ** Victoria Cross for New Zealand * Victorious ...
of a class ''C'' as: :VC(C)=\underset\\, or, alternatively, as :VC_0(C)=\underset\.\, Note that VC(C)=VC_0(C)+1.. The VC dimension is usually defined as VC_0, the largest cardinality of points chosen that will still shatter ''A'' (i.e. ''n'' such that S_C(n)=2^n). Altneratively, if for any ''n'' there is a set of cardinality ''n'' which can be shattered by ''C'', then S_C(n)=2^n for all ''n'' and the VC dimension of this class ''C'' is infinite. A class with finite VC dimension is called a ''Vapnik–Chervonenkis class'' or ''VC class''. A class ''C'' is uniformly Glivenko–Cantelli if and only if it is a VC class.


See also

*
Sauer–Shelah lemma In combinatorial mathematics and extremal set theory, the Sauer–Shelah lemma states that every family of sets with small VC dimension consists of a small number of sets. It is named after Norbert Sauer and Saharon Shelah, who published it inde ...
, relating the cardinality of a
family of sets In set theory and related branches of mathematics, a family (or collection) can mean, depending upon the context, any of the following: set, indexed set, multiset, or class. A collection F of subsets of a given set S is called a family of su ...
to the size of its largest shattered set


References

*. * *{{citation, authorlink=J. Michael Steele, last=Steele, first=J. M., year=1978, title=Empirical discrepancies and subadditive processes, journal=
Annals of Probability ''Annals of Probability'' is a leading peer-reviewed probability journal published by the Institute of Mathematical Statistics, which is the main international society for researchers in the areas probability and statistics. The journal was started ...
, volume=6, pages=118–227, issue=1, jstor=2242865, doi=10.1214/aop/1176995615, doi-access=free, url=https://repository.upenn.edu/bitstreams/b57f8ebe-7b2d-4b10-b507-f2e9215f8511/download.


External links


Origin of "Shattered sets" terminology
by J. Steele Empirical process Computational learning theory