Categorization is the ability and activity of recognizing shared features or similarities between the elements of the

experience Experience refers to conscious events in general, more specifically to perceptions, or to the practical knowledge and familiarity that is produced by these conscious processes. Understood as a conscious event in the widest sense, experience invol ...

of the world (such as objects, events, or

idea In common usage and in philosophy, ideas are the results of thought. Also in philosophy, ideas can also be mental representational images of some object. Many philosophers have considered ideas to be a fundamental ontological category of be ...

s), organizing and classifying experience by associating them to a more abstract group (that is, a category, class, or type), on the basis of their traits, features, similarities or other criteria that are universal to the group. Categorization is considered one of the most fundamental

cognitive abilities Cognitive skills, also called cognitive functions, cognitive abilities or cognitive capacities, are brain-based skills which are needed in acquisition of knowledge, manipulation of information and reasoning. They have more to do with the mechanisms ...

, and as such it is studied particularly by

psychology Psychology is the scientific study of mind and behavior. Psychology includes the study of conscious and unconscious phenomena, including feelings and thoughts. It is an academic discipline of immense scope, crossing the boundaries betwe ...

and

cognitive linguistics Cognitive linguistics is an interdisciplinary branch of linguistics, combining knowledge and research from cognitive science, cognitive psychology, neuropsychology and linguistics. Models and theoretical accounts of cognitive linguistics are c ...

. Categorization is sometimes considered synonymous with classification (cf., Classification synonyms). Categorization and classification allow humans to organize things, objects, and ideas that exist around them and simplify their understanding of the world. Categorization is something that humans and other organisms ''do'': "doing the right thing with the right ''kind'' of thing." The activity of categorizing things can be nonverbal or verbal. For humans, both concrete objects and abstract ideas are recognized, differentiated, and understood through categorization. Objects are usually categorized for some adaptive or pragmatic purposes. Categorization is

grounded Grounding or grounded may refer to: Science and philosophy * Grounding (metaphysics), a topic of wide philosophical interest * Grounding (psychology), a strategy for coping with stress or other negative emotions * Grounding in communication, th ...

in the features that distinguish the category's members from nonmembers. Categorization is important in learning, prediction,

inference Inferences are steps in reasoning, moving from premises to logical consequences; etymologically, the word '' infer'' means to "carry forward". Inference is theoretically traditionally divided into deduction and induction, a distinction that ...

decision making In psychology, decision-making (also spelled decision making and decisionmaking) is regarded as the cognitive process resulting in the selection of a belief or a course of action among several possible alternative options. It could be either ra ...

, language, and many forms of organisms' interaction with their environments.

Overview of categorization

Categories are distinct collections of concrete or abstract instances (category members) that are considered equivalent by the cognitive system. Using category knowledge requires one to access mental representations that define the core features of category members (cognitive psychologists refer to these category-specific mental representations as

concept Concepts are defined as abstract ideas. They are understood to be the fundamental building blocks of the concept behind principles, thoughts and beliefs. They play an important role in all aspects of cognition. As such, concepts are studied by s ...

s). To categorization theorists, the categorization of objects is often considered using taxonomies with three hierarchical levels of

abstraction Abstraction in its main sense is a conceptual process wherein general rules and concepts are derived from the usage and classification of specific examples, literal ("real" or " concrete") signifiers, first principles, or other methods. "An a ...

. For example, a plant could be identified at a high level of abstraction by simply labeling it a flower, a medium level of abstraction by specifying that the flower is a rose, or a low level of abstraction by further specifying this particular rose as a dog rose. Categories in a taxonomy are related to one another via class inclusion, with the highest level of abstraction being the most inclusive and the lowest level of abstraction being the least inclusive. The three levels of abstraction are as follows: * Superordinate level, Genus (e.g., Flower) - The highest and most inclusive level of abstraction. Exhibits the highest degree of generality and the lowest degree of within-category similarity. * Basic Level, Species (e.g., Rose) - The middle level of abstraction. Rosch and colleagues (1976) suggest the basic level to be the most cognitively efficient. Basic level categories exhibit high within-category ''similarities'' and high between-category ''dissimilarities''. Furthermore, the basic level is the most inclusive level at which category exemplars share a generalized identifiable shape. Adults most-often use basic level object names, and children learn basic object names first. * Subordinate level (e.g., Dog Rose) - The lowest level of abstraction. Exhibits the highest degree of specificity and within-category similarity.

Theories of categorization

Classical view

The classical theory of categorization, is a term used in

to denote the approach to categorization that appears in Plato and Aristotle and that has been highly influential and dominant in Western culture, particularly in philosophy, linguistics and psychology. Aristotle's categorical method of analysis was transmitted to the

scholastic Scholastic may refer to: * a philosopher or theologian in the tradition of scholasticism * ''Scholastic'' (Notre Dame publication) * Scholastic Corporation, an American publishing company of educational materials * Scholastic Building, in New Y ...

medieval university through Porphyry's

Isagoge The ''Isagoge'' ( el, Εἰσαγωγή, ''Eisagōgḗ''; ) or "Introduction" to Aristotle's "Categories", written by Porphyry in Greek and translated into Latin by Boethius, was the standard textbook on logic for at least a millennium after his ...

. The classical view of categories can be summarized into three assumptions: a category can be described as a list of necessary and sufficient features that its membership must have, categories are discrete in that they have clearly defined boundaries (either an element belongs to one or not, with no possibilities in between), and all the members of a category have the same status. (There are no members of the category which belong more than others). In the classical view, categories need to be clearly defined, mutually exclusive and collectively exhaustive; this way, any entity in the given classification universe belongs unequivocally to one, and only one, of the proposed categories. The classical view of categories first appeared in the context of

Western Philosophy Western philosophy encompasses the philosophical thought and work of the Western world. Historically, the term refers to the philosophical thinking of Western culture, beginning with the ancient Greek philosophy of the pre-Socratics. The wo ...

in the work of

Plato Plato ( ; grc-gre, Πλάτων ; 428/427 or 424/423 – 348/347 BC) was a Greek philosopher born in Athens during the Classical period in Ancient Greece. He founded the Platonist school of thought and the Academy, the first institutio ...

, who, in his Statesman dialogue, introduces the approach of grouping objects based on their similar properties. This approach was further explored and systematized by

Aristotle Aristotle (; grc-gre, Ἀριστοτέλης ''Aristotélēs'', ; 384–322 BC) was a Greek philosopher and polymath during the Classical Greece, Classical period in Ancient Greece. Taught by Plato, he was the founder of the Peripatet ...

in his Categories treatise, where he analyzes the differences between classes and objects. Aristotle also applied intensively the classical categorization scheme in his approach to the classification of living beings (which uses the technique of applying successive narrowing questions such as "Is it an animal or vegetable?", "How many feet does it have?", "Does it have fur or feathers?", "Can it fly?"...), establishing this way the basis for natural taxonomy. Examples of the use of the classical view of categories can be found in the western philosophical works of Descartes,

Blaise Pascal Blaise Pascal ( , , ; ; 19 June 1623 – 19 August 1662) was a French mathematician, physicist, inventor, philosopher, and Catholic writer. He was a child prodigy who was educated by his father, a tax collector in Rouen. Pascal's earlies ...

, Spinoza and John Locke, and in the 20th century in

Bertrand Russell Bertrand Arthur William Russell, 3rd Earl Russell, (18 May 1872 – 2 February 1970) was a British mathematician, philosopher, logician, and public intellectual. He had a considerable influence on mathematics, logic, set theory, linguistics, ar ...

, G.E. Moore, the logical positivists. It has been a cornerstone of

analytic philosophy Analytic philosophy is a branch and tradition of philosophy using analysis, popular in the Western world and particularly the Anglosphere, which began around the turn of the 20th century in the contemporary era in the United Kingdom, United ...

and its conceptual analysis, with more recent formulations proposed in the 1990s by Frank Cameron Jackson and Christopher Peacocke. At the beginning of the 20th century, the question of categories was introduced into the empirical social sciences by Durkheim and Mauss, whose pioneering work has been revisited in contemporary scholarship. The classical model of categorization has been used at least since the 1960s from linguists of the structural semantics paradigm, by Jerrold Katz and

Jerry Fodor Jerry Alan Fodor (; April 22, 1935 – November 29, 2017) was an American philosopher and the author of many crucial works in the fields of philosophy of mind and cognitive science. His writings in these fields laid the groundwork for the mo ...

in 1963, which in turn have influenced its adoption also by psychologists like Allan M. Collins and

M. Ross Quillian ( ; ; pl. ; ; 1512, from Middle French , literally "my lord") is an honorific title that was used to refer to or address the eldest living brother of the king in the French royal court. It has now become the customary French title of respec ...

. Modern versions of classical categorization theory study how the brain learns and represents categories by detecting the features that distinguish members from nonmembers.

Prototype theory

The pioneering research by psychologist Eleanor Rosch and colleagues since 1973, introduced the prototype theory, according to which categorization can also be viewed as the process of grouping things based on prototypes. This approach has been highly influential, particularly for

. It was in part based on previous insights, in particular the formulation of a category model based on

family resemblance Family resemblance (german: Familienähnlichkeit, link=no) is a philosophical idea made popular by Ludwig Wittgenstein, with the best known exposition given in his posthumously published book '' Philosophical Investigations'' (1953). It argues t ...

by Wittgenstein (1953), and by Roger Brown's ''How shall a thing be called?'' (1958). Prototype theory has been then adopted by cognitive linguists like

George Lakoff George Philip Lakoff (; born May 24, 1941) is an American cognitive linguist and philosopher, best known for his thesis that people's lives are significantly influenced by the conceptual metaphors they use to explain complex phenomena. The co ...

. The prototype theory is an example of a similarity-based approach to categorization, in which a stored category representation is used to assess the similarity of candidate category members. Under the prototype theory, this stored representation consists of a summary representation of the category's members. This prototype stimulus can take various forms. It might be a central tendency that represents the category's average member, a modal stimulus representing either the most frequent instance or a stimulus composed of the most common category features, or, lastly, the "ideal" category member, or a caricature that emphasizes the distinct features of the category. An important consideration of this prototype representation is that it does not necessarily reflect the existence of an actual instance of the category in the world. Furthermore, prototypes are highly sensitive to context. For example, while one's prototype for the category of beverages may be soda or seltzer, the context of brunch might lead them to select mimosa as a prototypical beverage. The prototype theory claims that members of a given category share a

, and categories are defined by sets of typical features (as opposed to all members possessing necessary and sufficient features).

Exemplar theory

Another instance of the similarity-based approach to categorization, the exemplar theory likewise compares the similarity of candidate category members to stored memory representations. Under the exemplar theory, all known instances of a category are stored in memory as exemplars. When evaluating an unfamiliar entity's category membership, exemplars from potentially relevant categories are retrieved from memory, and the entity's similarity to those exemplars is summed to formulate a categorization decision. Medin and Schaffer's (1978)

Context model A context model (or context modeling) defines how context data are structured and maintained (It plays a key role in supporting efficient context management). It aims to produce a formal or semi-formal description of the context information that is ...

employs a nearest neighbor approach which, rather than summing an entity's similarities to relevant exemplars, multiplies them to provide weighted similarities that reflect the entity's proximity to relevant exemplars. This effectively biases categorization decisions towards exemplars most similar to the entity to be categorized.

Conceptual clustering

Conceptual clustering is a

machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...

paradigm for

unsupervised classification Unsupervised learning is a type of algorithm that learns patterns from untagged data. The hope is that through mimicry, which is an important mode of learning in people, the machine is forced to build a concise representation of its world and t ...

that was defined by

Ryszard S. Michalski Ryszard S. Michalski (May 7, 1937 – September 20, 2007) was a Polish-American computer scientist. Michalski was Professor at George Mason University and a pioneer in the field of machine learning. Biography Michalski was born in Kalusz near Lv ...

in 1980. It is a modern variation of the classical approach of categorization, and derives from attempts to explain how knowledge is represented. In this approach, classes (clusters or entities) are generated by first formulating their conceptual descriptions and then classifying the entities according to the descriptions. Conceptual clustering developed mainly during the 1980s, as a machine paradigm for unsupervised learning. It is distinguished from ordinary data clustering by generating a concept description for each generated category. Conceptual clustering is closely related to fuzzy set theory, in which objects may belong to one or more groups, in varying degrees of fitness. A

cognitive Cognition refers to "the mental action or process of acquiring knowledge and understanding through thought, experience, and the senses". It encompasses all aspects of intellectual functions and processes such as: perception, attention, thought ...

approach accepts that natural categories are graded (they tend to be fuzzy at their boundaries) and inconsistent in the status of their constituent members. The idea of necessary and sufficient conditions is almost never met in categories of naturally occurring things.

Category learning

''While an exhaustive discussion of category learning is beyond the scope of this article, a brief overview of category learning and its associated theories is useful in understanding formal models of categorization.'' If categorization research investigates how categories are maintained and used, the field of category learning seeks to understand how categories are acquired in the first place. To accomplish this, researchers often employ novel categories of arbitrary objects (e.g., dot matrices) to ensure that participants are entirely unfamiliar with the stimuli. Category learning researchers have generally focused on two distinct forms of category learning. Classification learning tasks participants with predicting category labels for a stimulus based on its provided features. Classification learning is centered around learning between-category information and the diagnostic features of categories.Higgins, E., & Ross, B. (2011). Comparisons in category learning: How best to compare for what. In Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 33, No. 33). In contrast, inference learning tasks participants with inferring the presence/value of a category feature based on a provided category label and/or the presence of other category features. Inference learning is centered on learning within-category information and the category's prototypical features. Category learning tasks can generally be divided into two categories, supervised and unsupervised learning. Supervised learning tasks provide learners with category labels. Learners then use information extracted from labeled example categories to classify stimuli into the appropriate category, which may involve the

of a rule or concept relating observed object features to category labels. Unsupervised learning tasks do not provide learners with category labels. Learners must therefore recognize inherent structures in a data set and group stimuli together by similarity into classes. Unsupervised learning is thus a process of generating a classification structure. Tasks used to study category learning take various forms: * Rule-based tasks present categories that participants can learn through explicit reasoning processes. In these kinds of tasks, classification of stimuli is accomplished via the use of an acquired rule (i.e., if stimulus is large on dimension x, respond A). * Information-integration tasks require learners to synthesize perceptual information from multiple stimulus dimensions prior to making categorization decisions. Unlike rule-based tasks, information-integration tasks do not afford rules that are easily articulable. Reading an X-ray and trying to determine if a tumor is present can be thought of as a real-world instantiation of an information-integration task. * Prototype distortion tasks require learners to generate a prototype for a category. Candidate exemplars for the category are then produced by randomly manipulating the features of the prototype, which learners must classify as either belonging to the category or not.

Category learning theories

Category learning researchers have proposed various theories for how humans learn categories. Prevailing theories of category learning include the prototype theory, the exemplar theory, and the decision bound theory. The prototype theory suggests that to learn a category, one must learn the category's prototype. Subsequent categorization of novel stimuli is then accomplished by selecting the category with the most similar prototype. The exemplar theory suggests that to learn a category, one must learn about the exemplars that belong to that category. Subsequent categorization of a novel stimulus is then accomplished by computing its similarity to the known exemplars of potentially relevant categories and selecting the category that contains the most similar exemplars. Decision bound theory suggests that to learn a category, one must either learn the regions of a stimulus space associated with particular responses or the boundaries (the decision bounds) that divide these response regions. Categorization of a novel stimulus is then accomplished by determining which response region it is contained within.

Formal models of categorization

Computational models of categorization have been developed to test theories about how humans represent and use category information. To accomplish this, categorization models can be fit to experimental data to see how well the predictions afforded by the model line up with human performance. Based on the model's success at explaining the data, theorists are able to draw conclusions about the accuracy of their theories and their theory's relevance to human category representations. To effectively capture how humans represent and use category information, categorization models generally operate under variations of the same three basic assumptions. First, the model must make some kind of assumption about the internal representation of the stimulus (e.g., representing the perception of a stimulus as a point in a multi-dimensional space). Second, the model must make an assumption about the specific information that needs to be accessed in order to formulate a response (e.g., exemplar models require the collection of all available exemplars for each category). Third, the model must make an assumption about how a response is selected given the available information. Though all categorization models make these three assumptions, they distinguish themselves by the ways in which they represent and transform an input into a response representation. The internal knowledge structures of various categorization models reflect the specific representation(s) they use to perform these transformations. Typical representations employed by models include exemplars, prototypes, and rules. * Exemplar models store all distinct instances of stimuli with their corresponding category labels in memory. Categorization of subsequent stimuli is determined by the stimulus' collective similarity to all known exemplars. * Prototype models store a summary representation of all instances in a category. Categorization of subsequent stimuli is determined by selecting the category whose prototype is most similar to the stimulus. * Rule-based models define categories by storing summary lists of the necessary and sufficient features required for category membership. Boundary models can be considered as atypical rule models, as they do not define categories based on their content. Rather, boundary models define the edges (boundaries) between categories, which subsequently serve as determinants for how a stimulus gets categorized.

Examples of categorization models

Prototype models

Weighted Features Prototype Model An early instantiation of the prototype model was produced by Reed in the early 1970s. Reed (1972) conducted a series of experiments to compare the performance of 18 models on explaining data from a categorization task that required participants to sort faces into one of two categories. Results suggested that the prevailing model was the weighted features prototype model, which belonged to the family of average distance models. Unlike traditional average distance models, however, this model differentially weighted the most distinguishing features of the two categories. Given this model's performance, Reed (1972) concluded that the strategy participants used during the face categorization task was to construct prototype representations for each of the two categories of faces and categorize test patterns into the category associated with the most similar prototype. Furthermore, results suggested that similarity was determined by each categories most discriminating features.

Exemplar models

Generalized Context Model Medin and Schaffer's (1978)

context model A context model (or context modeling) defines how context data are structured and maintained (It plays a key role in supporting efficient context management). It aims to produce a formal or semi-formal description of the context information that is ...

was expanded upon by Nosofsky (1986) in the mid-1980's, resulting in the production of the Generalized Context Model (GCM). The GCM is an exemplar model that stores exemplars of stimuli as exhaustive combinations of the features associated with each exemplar. By storing these combinations, the model establishes contexts for the features of each exemplar, which are defined by all other features with which that feature co-occurs. The GCM computes the similarity of an exemplar and a stimulus in two steps. First, the GCM computes the psychological distance between the exemplar and the stimulus. This is accomplished by summing the absolute values of the dimensional difference between the exemplar and the stimulus. For example, suppose an exemplar has a value of 18 on dimension X and the stimulus has a value of 42 on dimension X; the resulting dimensional difference would be 24. Once psychological distance has been evaluated, an exponential decay function determines the similarity of the exemplar and the stimulus, where a distance of 0 results in a similarity of 1 (which begins to decrease exponentially as distance increases). Categorical responses are then generated by evaluating the similarity of the stimulus to each category's exemplars, where each exemplar provides a "vote" to their respective categories that varies in strength based on the exemplar's similarity to the stimulus and the strength of the exemplar's association with the category. This effectively assigns each category a selection probability that is determined by the proportion of votes it receives, which can then be fit to data.

Rule-based models

RULEX (Rule-Plus-Exception) Model While simple logical rules are ineffective at learning poorly defined category structures, some proponents of the rule-based theory of categorization suggest that an imperfect rule can be used to learn such category structures if exceptions to that rule are also stored and considered. To formalize this proposal, Nosofsky and colleagues (1994) designed the RULEX model. The RULEX model attempts to form a decision tree composed of sequential tests of an object's attribute values. Categorization of the object is then determined by the outcome of these sequential tests. The RULEX model searches for rules in the following ways: * Exact Search for a rule that uses a single attribute to discriminate between classes without error. * Imperfect Search for a rule that uses a single attribute to discriminate between classes with few errors * Conjunctive Search for a rule that uses multiple attributes to discriminate between classes with few errors. * Exception Search for exceptions to the rule. The method that RULEX uses to perform these searches is as follows: First, RULEX attempts an exact search. If successful, then RULEX will continuously apply that rule until misclassification occurs. If the exact search fails to identify a rule, either an imperfect or conjunctive search will begin. A sufficient, though imperfect, rule acquired during one of these search phases will become permanently implemented and the RULEX model will then begin to search for exceptions. If no rule is acquired, then the model will attempt the search it did not perform in the previous phase. If successful, RULEX will permanently implement the rule and then begin an exception search. If none of the previous search methods are successful RULEX will default to only searching for exceptions, despite lacking an associated rule, which equates to acquiring a random rule.

Hybrid models

SUSTAIN (Supervised and Unsupervised Stratified Adaptive Incremental Network) It is often the case that learned category representations vary depending on the learner's goals, as well as how categories are used during learning. Thus, some categorization researchers suggest that a proper model of categorization needs to be able to account for the variability present in the learner's goals, tasks, and strategies. This proposal was realized by Love and colleagues (2004) through the creation of SUSTAIN, a flexible clustering model capable of accommodating both simple and complex categorization problems through incremental adaptation to the specifics of problems. In practice, the SUSTAIN model first converts a stimulus' perceptual information into features that are organized along a set of dimensions. The representational space that encompasses these dimensions is then distorted (e.g., stretched or shrunk) to reflect the importance of each feature based on inputs from an attentional mechanism. A set of clusters (specific instances grouped by similarity) associated with distinct categories then compete to respond to the stimulus, with the stimulus being subsequently assigned to the cluster whose representational space is closest to the stimulus'. The unknown stimulus dimension value (e.g., category label) is then predicted by the winning cluster, which, in turn, informs the categorization decision. The flexibility of the SUSTAIN model is realized through its ability to employ both supervised and unsupervised learning at the cluster level. If SUSTAIN incorrectly predicts a stimulus as belonging to a particular cluster, corrective feedback (i.e., supervised learning) would signal sustain to recruit an additional cluster that represents the misclassified stimulus. Therefore, subsequent exposures to the stimulus (or a similar alternative) would be assigned to the correct cluster. SUSTAIN will also employ unsupervised learning to recruit an additional cluster if the similarity between the stimulus and the closest cluster does not exceed a threshold, as the model recognizes the weak predictive utility that would result from such a cluster assignment. SUSTAIN also exhibits flexibility in how it solves both simple and complex categorization problems. Outright, the internal representation of SUSTAIN contains only a single cluster, thus biasing the model towards simple solutions. As problems become increasingly complex (e.g., requiring solutions consisting of multiple stimulus dimensions), additional clusters are incrementally recruited so SUSTAIN can handle the rise in complexity.

Social categorization

Social categorization consists of putting human beings into groups in order to identify them based on different criteria. Categorization is a process studied by scholars in cognitive science but can also be studied as a social activity. Social categorization is different from the categorization of other things because it implies that people create categories for themselves and others as human beings. Groups can be created based on ethnicity, country of origin, religion, sexual identity, social privileges, economic privileges, etc. Various ways to sort people exist according to one's schemas. People belong to various social groups because of their ethnicity, religion, or age. Social categories based on age, race, and gender are used by people when they encounter a new person. Because some of these categories refer to physical traits, they are often used automatically when people don't know each other. These categories are not objective and depend on how people see the world around them. They allow people to identify themselves with similar people, and to identify people who are different. They are useful in one's identity formation with the people around them. One can build their own identity by identifying themselves in a group or by rejecting another group. Social categorization is similar to other types of categorization since it aims at simplifying the understanding of people. However, creating social categories implies that people will position themselves in relation to other groups. A hierarchy in group relations can appear as a result of social categorization. Scholars argue that the categorization process starts at a young age when children start to learn about the world and the people around them. Children learn how to know people according to categories based on similarities and differences. Social categories made by adults also impact their understanding of the world. They learn about social groups by hearing generalities about these groups from their parents. They can then develop prejudices about people as a result of these generalities. Another aspect of social categorization is mentioned by Stephen Reicher and Nick Hopkins and is related to political domination. They argue that political leaders use social categories to influence political debates.

Negative aspects

The activity of sorting people according to subjective or objective criteria can be seen as a negative process because of its tendency to lead to violence from a group to another. Indeed, similarities gather people who share common traits but differences between groups can lead to tensions and then the use of violence between those groups. The creation of social groups by people is responsible of a hierarchization of relations between groups. These hierarchical relations participate in the promotion of stereotypes about people and groups, sometimes based on subjective criteria. Social categories can encourage people to associate stereotypes to groups of people. Associating stereotypes to a group, and to people who belong to this group, can lead to forms of discrimination towards people of this group. The perception of a group and the stereotypes associated with it have an impact on social relations and activities. Some social categories have more weight than others in society. For instance, in history and still today, the category of "race" is one of the first categories used to sort people. However, only a few categories of race are commonly used such as "Black", "White", "Asian" etc. It participates in the reduction of the multitude of ethnicities to a few categories based mostly on people's skin color. The process of sorting people creates a vision of the other as 'different', leading to the dehumanization of people. Scholars talk about intergroup relations with the concept of

social identity theory Social identity is the portion of an individual's self-concept derived from perceived membership in a relevant social group. As originally formulated by social psychologists Henri Tajfel and John Turner in the 1970s and the 1980s, social ...

developed by H. Tajfel. Indeed, in history, many examples of social categorization have led to forms of domination or violence from a dominant group to a dominated group. Periods of colonisation are examples of times when people from a group chose to dominate and control other people belonging to other groups because they considered them as inferior. Racism, discrimination and violence are consequences of social categorization and can occur because of it. When people see others as different, they tend to develop hierarchical relation with other groups.

Miscategorization

There cannot be categorization without the possibility of

miscategorization A category mistake, or category error, or categorical mistake, or mistake of category, is a semantic or ontological error in which things belonging to a particular category are presented as if they belong to a different category, or, alternativel ...

. To do "the right thing with the right ''kind'' of thing.", there has to be both a right and a wrong thing to do. Not only does a category of which "everything" is a member lead logically to the Russell paradox ("is it or is it not a member of itself?"), but without the possibility of error, there is no way to detect or define what distinguishes category members from nonmembers. An example of the absence of nonmembers is the problem of the

poverty of the stimulus Poverty of the stimulus (POS) is the controversial argument from linguistics that children are not exposed to rich enough data within their linguistic environments to acquire every feature of their language. This is considered evidence contrary to ...

in language learning by the child: children learning the language do not hear or make errors in the rules of

Universal Grammar Universal grammar (UG), in modern linguistics, is the theory of the genetic component of the language faculty, usually credited to Noam Chomsky. The basic postulate of UG is that there are innate constraints on what the grammar of a possible h ...

(UG). Hence they never get corrected for errors in UG. Yet children's speech obeys the rules of UG, and speakers can immediately detect that something is wrong if a linguist generates (deliberately) an utterance that violates UG. Hence speakers can categorize what is UG-compliant and UG-noncompliant. Linguists have concluded from this that the rules of UG must be somehow encoded innately in the human brain. Ordinary categories, however, such as "dogs," have abundant examples of nonmembers (cats, for example). So it is possible to learn, by trial and error, with error-correction, to detect and define what distinguishes dogs from non-dogs, and hence to correctly categorize them. This kind of learning, called reinforcement learning in the behavioral literature and supervised learning in the computational literature, is fundamentally dependent on the possibility of error, and error-correction. Miscategorization—examples of nonmembers of the category—must always exist, not only to make the category learnable, but for the category to exist and be definable at all.

References

External links

To Cognize is to Categorize: Cognition is Categorization

Wikipedia Categories Visualizer

Interdisciplinary Introduction to Categorization: Interview with Dvora Yanov (political sciences), Amie Thomasson (philosophy) and Thomas Serre (artificial intelligence)
* {{philosophy of language Cognition Concepts in epistemology Knowledge representation Semantics