computational learning theory In computer science, computational learning theory (or just learning theory) is a subfield of artificial intelligence devoted to studying the design and analysis of machine learning algorithms. Overview Theoretical results in machine learning m ...

in mathematics, a concept over a domain ''X'' is a total

Boolean function In mathematics, a Boolean function is a function whose arguments and result assume values from a two-element set (usually , or ). Alternative names are switching function, used especially in older computer science literature, and truth function ...

over ''X''. A concept class is a class of concepts. Concept classes are a subject of

. Concept class terminology frequently appears in

model theory In mathematical logic, model theory is the study of the relationship between formal theories (a collection of sentences in a formal language expressing statements about a mathematical structure), and their models (those structures in which the ...

associated with probably approximately correct (PAC) learning.Chase, H., & Freitag, J. (2018). ''Model Theory and Machine Learning''. arXiv preprint arXiv:1801.06566
In this setting, if one takes a set ''Y'' as a set of (classifier output) labels, and ''X'' is a set of examples, the map

c: X\to Y

, i.e. from examples to classifier labels (where

Y = \

and where ''c'' is a subset of ''X''), ''c'' is then said to be a ''concept''. A ''concept class''

C

is then a collection of such concepts. Given a class of concepts ''C'', a subclass ''D'' is reachable if there exists a sample ''s'' such that ''D'' contains exactly those concepts in ''C'' that are extensions to ''s''. Not every subclass is reachable.

Background

A ''sample''

s

is a

partial function In mathematics, a partial function from a set to a set is a function from a subset of (possibly itself) to . The subset , that is, the domain of viewed as a function, is called the domain of definition of . If equals , that is, if is ...

from

X

\

. Identifying a concept with its characteristic function mapping

X

\

, it is a special case of a sample. Two samples are ''consistent'' if they agree on the intersection of their domains. A sample

s'

''extends'' another sample

s

if the two are consistent and the domain of

s

is contained in the domain of

s'

Examples

Suppose that

C = S^+(X)

. Then: * the subclass

\

is reachable with the sample

s = \

; * the subclass

S^+(Y)

for

Y\subseteq X

are reachable with a sample that maps the elements of

X - Y

to zero; * the subclass

S(X)

, which consists of the singleton sets, is ''not'' reachable.

Applications

Let

C

be some concept class. For any concept

c\in C

, we call this concept

1/d

''-good'' for a positive integer

d

if, for all

x\in X

, at least

1/d

of the concepts in

C

agree with

c

on the classification of

x

. The ''fingerprint dimension''

FD(C)

of the entire concept class

C

is the least positive integer

d

such that every reachable subclass

C'\subseteq C

contains a concept that is

1/d

-good for it. This quantity can be used to bound the minimum number of equivalence queries needed to learn a class of concepts according to the following inequality:

FD(C) - 1\leq \#EQ(C)\leq \lceil FD(C)\ln(, C, )\rceil

References

Computational learning theory {{Compu-AI-stub