HOME

TheInfoList



OR:

Lumpers and splitters are opposing factions in any
academic discipline An academic discipline or academic field is a subdivision of knowledge that is taught and researched at the college or university level. Disciplines are defined (in part) and recognized by the academic journals in which research is published, a ...
that has to place individual examples into rigorously defined
categories Category, plural categories, may refer to: General uses *Classification, the general act of allocating things to classes/categories Philosophy *Category of being * ''Categories'' (Aristotle) *Category (Kant) *Categories (Peirce) *Category (Vais ...
. The lumper–splitter problem occurs when there is the desire to create classifications and assign examples to them, for example, schools of
literature Literature is any collection of Writing, written work, but it is also used more narrowly for writings specifically considered to be an art form, especially novels, Play (theatre), plays, and poetry, poems. It includes both print and Electroni ...
,
biological Biology is the scientific study of life and living organisms. It is a broad natural science that encompasses a wide range of fields and unifying principles that explain the structure, function, growth, origin, evolution, and distribution of ...
taxa In biology, a taxon (back-formation from ''taxonomy''; : taxa) is a group of one or more populations of an organism or organisms seen by taxonomists to form a unit. Although neither is required, a taxon is usually known by a particular name and ...
, and so on. A "lumper" is a person who assigns examples broadly, judging that differences are not as important as signature similarities. A "splitter" makes precise definitions, and creates new categories to classify samples that differ in key ways.


Origin of the terms

The earliest known use of these terms was thought to be
Charles Darwin Charles Robert Darwin ( ; 12 February 1809 – 19 April 1882) was an English Natural history#Before 1900, naturalist, geologist, and biologist, widely known for his contributions to evolutionary biology. His proposition that all speci ...
, in a letter to
Joseph Dalton Hooker Sir Joseph Dalton Hooker (30 June 1817 – 10 December 1911) was a British botanist and explorer in the 19th century. He was a founder of geographical botany and Charles Darwin's closest friend. For 20 years he served as director of the Ro ...
in 1857: "It is good to have hair-splitters & lumpers". But according to research done by the deputy director at NCSE, Glenn Branch, the credit is due to naturalist Edward Newman who wrote in 1845, "The time has arrived for discarding imaginary species, and the duty of doing this is as imperative as the admission of new ones when such are really discovered. The talents described under the respective names of 'hair-splitting' and 'lumping' are unquestionably yielding their power to the mightier power of Truth." They were then introduced more widely by George G. Simpson in his 1945 work ''The Principles of Classification and a Classification of Mammals''. As he put it: A later use can be found in the title of a 1969 paper "On lumpers and splitters ..." by the medical geneticist
Victor McKusick Victor Almon McKusick (October 21, 1921 – July 22, 2008) was an American internist and medical geneticist, and Professor of Medicine at the Johns Hopkins Hospital, Baltimore. He was a proponent of the mapping of the human genome due to its ...
. Reference to lumpers and splitters in the humanities appeared in a debate in 1975 between J. H. Hexter and Christopher Hill, in the ''
Times Literary Supplement ''The Times Literary Supplement'' (''TLS'') is a weekly literary review published in London by News UK, a subsidiary of News Corp. History The ''TLS'' first appeared in 1902 as a supplement to ''The Times'' but became a separate publication ...
''. It followed from Hexter's detailed review of Hill's book ''Change and Continuity in Seventeenth Century England'', in which Hill developed
Max Weber Maximilian Carl Emil Weber (; ; 21 April 186414 June 1920) was a German Sociology, sociologist, historian, jurist, and political economy, political economist who was one of the central figures in the development of sociology and the social sc ...
's argument that the rise of capitalism was facilitated by
Calvinist Reformed Christianity, also called Calvinism, is a major branch of Protestantism that began during the 16th-century Protestant Reformation. In the modern day, it is largely represented by the Continental Reformed Protestantism, Continenta ...
Puritanism. Hexter objected to Hill's "mining" of sources to find evidence that supported his theories. Hexter argued that Hill plucked quotations from sources in a way that distorted their meaning. Hexter explained this as a mental habit that he called "lumping". According to him, "lumpers" rejected differences and chose to emphasise similarities. Any evidence that did not fit their arguments was ignored as aberrant. Splitters, by contrast, emphasised differences, and resisted simple schemes. While lumpers consistently tried to create coherent patterns, splitters preferred incoherent complexity.


Usage in various fields


Biology

The categorisation and naming of a particular species should be regarded as a ''hypothesis'' about the
evolution Evolution is the change in the heritable Phenotypic trait, characteristics of biological populations over successive generations. It occurs when evolutionary processes such as natural selection and genetic drift act on genetic variation, re ...
ary relationships and distinguishability of that group of organisms. As further information comes to hand, the hypothesis may be confirmed or refuted. Sometimes, especially in the past when communication was more difficult, taxonomists working in isolation have given two distinct names to individual
organism An organism is any life, living thing that functions as an individual. Such a definition raises more problems than it solves, not least because the concept of an individual is also difficult. Many criteria, few of them widely accepted, have be ...
s later identified as the same species. When two named species are agreed to be of the same species, the older species name is almost always retained dropping the newer species name honouring a convention known as "priority of nomenclature". This form of lumping is technically called synonymisation. Dividing a taxon into multiple, often new, taxa is called splitting. Taxonomists are often referred to as "lumpers" or "splitters" by their colleagues, depending on their personal approach to recognizing differences or commonalities between organisms. For example, the number of
genera Genus (; : genera ) is a taxonomic rank above species and below family as used in the biological classification of living and fossil organisms as well as viruses. In binomial nomenclature, the genus name forms the first part of the binomial s ...
used in
Pteridophyte Phylogeny Group The Pteridophyte Phylogeny Group (PPG) is an informal international group of systematic botanists who collaborate to establish on the classification of pteridophytes ( lycophytes and ferns) that reflects knowledge about plant relationships discove ...
I (PPG I) has proved controversial. PPG I uses 18
lycophyte The lycophytes, when broadly circumscribed, are a group of vascular plants that include the clubmosses. They are sometimes placed in a division Lycopodiophyta or Lycophyta or in a subdivision Lycopodiophytina. They are one of the oldest lineag ...
and 319
fern The ferns (Polypodiopsida or Polypodiophyta) are a group of vascular plants (plants with xylem and phloem) that reproduce via spores and have neither seeds nor flowers. They differ from mosses by being vascular, i.e., having specialized tissue ...
genera. The earlier system put forward by Smith et al. (2006) had suggested a range of 274 to 312 genera for ferns alone. By contrast, the system of Christenhusz & Chase (2014) used 5 lycophyte and about 212 fern genera. The number of fern genera was further reduced to 207 in a subsequent publication. Defending PPG I, Schuettpelz et al. (2018) argue that the larger number of genera is a result of "the gradual accumulation of new collections and new data" and hence "a greater appreciation of fern diversity and ... an improved ability to distinguish taxa". They also argue that the number of species per genus in the PPG I system is already higher than in other groups of organisms (about 33 species per genus for ferns as opposed to about 22 species per genus for
angiosperms Flowering plants are plants that bear flowers and fruits, and form the clade Angiospermae (). The term angiosperm is derived from the Greek words (; 'container, vessel') and (; 'seed'), meaning that the seeds are enclosed within a fruit. T ...
) and that reducing the number of genera as Christenhusz and Chase propose yields the excessive number of about 50 species per genus for ferns. In response, Christenhusz AND Chase (2018) argue that the excessive splitting of genera destabilises the usage of names and will lead to greater instability in future, and that the highly split genera have few if any characters that can be used to recognise them, making identification difficult, even to generic level. They further argue that comparing numbers of species per genus in different groups is "fundamentally meaningless".


History

In history, lumpers are those who tend to create broad definitions that cover large periods of time and many disciplines, whereas splitters want to assign names to tight groups of inter-relationships. Lumping tends to create a more and more unwieldy definition, with members having less and less mutually in common. This can lead to definitions which are little more than conventionalities, or groups which join fundamentally different examples. Splitting often leads to " distinctions without difference", ornate and fussy categories, and failure to see underlying similarities. For example, in the arts, " Romantic" can refer specifically to a period of
German German(s) may refer to: * Germany, the country of the Germans and German things **Germania (Roman era) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizenship in Germany, see also Ge ...
poetry roughly from 1780 to 1810, but would exclude the later work of
Goethe Johann Wolfgang (von) Goethe (28 August 1749 – 22 March 1832) was a German polymath who is widely regarded as the most influential writer in the German language. His work has had a wide-ranging influence on Western literature, literary, Polit ...
, among other writers. In music it can mean every composer from Hummel through
Rachmaninoff Sergei Vasilyevich Rachmaninoff; in Russian pre-revolutionary script. (28 March 1943) was a Russian composer, virtuoso pianist, and conductor. Rachmaninoff is widely considered one of the finest pianists of his day and, as a composer, one of ...
, plus many that came after.


Software modelling

Software engineering Software engineering is a branch of both computer science and engineering focused on designing, developing, testing, and maintaining Application software, software applications. It involves applying engineering design process, engineering principl ...
often proceeds by building models (sometimes known as
model-driven architecture Model-driven architecture (MDA) is a software design approach for the development of software systems. It provides a set of guidelines for the structuring of specifications, which are expressed as models. Model Driven Architecture is a kind of doma ...
). A lumper is keen to generalise, and produces models with a small number of broadly defined objects. A splitter is reluctant to generalise, and produces models with a large number of narrowly defined objects. Conversion between the two styles is not necessarily symmetrical. For example, if error messages in two narrowly defined classes behave in the same way, the classes can be easily combined. But if some messages in a broad class behave differently, every object in the class must be examined before the class can be split. This illustrates the principle that "splits can be lumped more easily than lumps can be split".


Language classification

There is no agreement among
historical linguists Historical linguistics, also known as diachronic linguistics, is the scientific study of how language change, languages change over time. It seeks to understand the nature and causes of linguistic change and to trace the evolution of language ...
about what amount of evidence is needed for two languages to be safely classified in the same
language family A language family is a group of languages related through descent from a common ancestor, called the proto-language of that family. The term ''family'' is a metaphor borrowed from biology, with the tree model used in historical linguistics ...
. For this reason, many proposed language families have had lumper–splitter controversies, including
Altaic The Altaic () languages are a group of languages comprising the Turkic, Mongolic and Tungusic language families, with some linguists including the Koreanic and Japonic families. These languages share agglutinative morphology, head-final ...
, Pama–Nyungan,
Nilo-Saharan The Nilo-Saharan languages are a proposed family of around 210 African languages spoken by somewhere around 70 million speakers, mainly in the upper parts of the Chari and Nile rivers, including historic Nubia, north of where the two tributari ...
, and most of the larger families of the Americas. At a completely different level, the splitting of a
mutually intelligible In linguistics, mutual intelligibility is a relationship between different but related language varieties in which speakers of the different varieties can readily understand each other without prior familiarity or special effort. Mutual intellig ...
dialect continuum A dialect continuum or dialect chain is a series of Variety (linguistics), language varieties spoken across some geographical area such that neighboring varieties are Mutual intelligibility, mutually intelligible, but the differences accumulat ...
into different languages, or lumping them into one, is also an issue that continually comes up, though the consensus in contemporary linguistics is that there is no completely objective way to settle the question. Splitters regard the
comparative method In linguistics, the comparative method is a technique for studying the development of languages by performing a feature-by-feature comparison of two or more languages with common descent from a shared ancestor and then extrapolating backwards ...
(meaning not comparison in general, but only reconstruction of a common ancestor or
protolanguage In the tree model of historical linguistics, a proto-language is a postulated ancestral language from which a number of attested languages are believed to have descended by evolution, forming a language family. Proto-languages are usually unattest ...
) as the only valid proof of kinship, and consider genetic relatedness to be the question of interest. American linguists of recent decades tend to be splitters. Lumpers are more willing to admit techniques like
mass lexical comparison Mass comparison is a method developed by Joseph Greenberg to determine the level of genetic relatedness between languages. It is now usually called multilateral comparison. Mass comparison has been referred to as a "methodological deception" an ...
or
lexicostatistics Lexicostatistics is a method of comparative linguistics that involves comparing the percentage of lexical cognates between languages to determine their relationship. Lexicostatistics is related to the comparative method but does not reconstruct a ...
, and mass typological comparison, and to tolerate the uncertainty of whether relationships found by these methods are the result of linguistic divergence (descent from common ancestor) or
language convergence Language convergence is a type of linguistic change in which languages come to resemble one another structurally as a result of prolonged language contact and mutual interference, regardless of whether those languages belong to the same language ...
(borrowing). Much long-range comparison work has been from Russian linguists belonging to the
Moscow School of Comparative Linguistics The Moscow School of Comparative Linguistics (also called the Nostratic languages, Nostratic School) is a school of linguistics based in Moscow, Russia that is known for its work in . Formerly based at Moscow State University, it is currently cente ...
, most notably
Vladislav Illich-Svitych Vladislav Markovich Illich-Svitych (, also transliterated as Illič-Svityč; 12 September 1934 – 22 August 1966) was a Soviet linguist and accentologist. He was a founding father of comparative Nostratic linguistics and the Moscow School o ...
and
Sergei Starostin Sergei Anatolyevich Starostin (; March 24, 1953 – September 30, 2005) was a Russian historical linguistics, historical linguist and philology, philologist, perhaps best known for his reconstructions of hypothetical proto-languages, including hi ...
. In the United States,
Greenberg Greenberg is a surname common in North America and Anglosphere, with anglicized spelling of the German Grünberg (''green mountain'') or the Jewish Ashkenazi Yiddish Grinberg, an artificial surname.Beider, Alexander (1993). ''A Dictionary of Jewi ...
and Ruhlen's work has been met with little acceptance from linguists. Earlier American linguists like
Morris Swadesh Morris Swadesh ( ; January 22, 1909 – July 20, 1967) was an American linguist who specialized in comparative and historical linguistics, and developed his mature career at UNAM in Mexico. Swadesh was born in Massachusetts to Bessarabian Jewi ...
and
Edward Sapir Edward Sapir (; January 26, 1884 – February 4, 1939) was an American anthropologist-linguistics, linguist, who is widely considered to be one of the most important figures in the development of the discipline of linguistics in the United States ...
also pursued large-scale classifications like Sapir's 1929 scheme for the Americas, accompanied by controversy similar to that today.


Religious studies

Paul F. Bradshaw suggests that the same principles of lumping and splitting apply to the study of early Christian
liturgy Liturgy is the customary public ritual of worship performed by a religious group. As a religious phenomenon, liturgy represents a communal response to and participation in the sacred through activities reflecting praise, thanksgiving, remembra ...
. Lumpers, who tend to predominate in this field, try to find a single line of successive texts from the
apostolic age Christianity in the 1st century covers the formative history of Christianity from the start of the ministry of Jesus (–29 AD) to the death of the last of the Twelve Apostles () and is thus also known as the Apostolic Age. Early Christianity ...
to the fourth century (and later). Splitters see many parallel and overlapping strands which intermingle and flow apart so that there is not a single coherent path in the development of liturgical texts. Liturgical texts must not be taken solely at face value; often there are hidden agendas in texts.Bradshaw, Paul F., ''The Search for the Origins of Christian Worship'', Oxford University Press, 2002, p. ix. . The idea of a single Hindu religion is essentially a lumper's concept, sometimes also known as
Smartism The ''Smarta'' tradition (, ) is a movement in Hinduism that developed and expanded with the Puranas genre of literature. It reflects a synthesis of four philosophical strands, namely Uttara Mīmāṃsā, Advaita, Yoga, and theism. The Sm ...
(on the basis of the Smārta synthesis). Hindu splitters, and individual adherents, often identify themselves on the other hand as adherents of a religion such as
Shaivism Shaivism (, , ) is one of the major Hindu denominations, Hindu traditions, which worships Shiva as the Para Brahman, supreme being. It is the Hinduism#Demographics, second-largest Hindu sect after Vaishnavism, constituting about 385 million H ...
,
Vaishnavism Vaishnavism () ), also called Vishnuism, is one of the major Hindu denominations, Hindu traditions, that considers Vishnu as the sole Para Brahman, supreme being leading all other Hindu deities, that is, ''Mahavishnu''. It is one of the majo ...
, or
Shaktism Shaktism () is a major Hindu denomination in which the God in Hinduism, deity or metaphysics, metaphysical reality is considered metaphorically to be a woman. Shaktism involves a galaxy of goddesses, all regarded as different aspects, mani ...
, according to which deity they believe to be the supreme creator of the universe. Various "holistic" approaches to religion can prioritise themes such as individual spirituality, the New-Age-style essential oneness of multiple religious traditions, or
religious fundamentalism Fundamentalism is a tendency among certain groups and individuals that are characterized by the application of a strict literal interpretation to scriptures, dogmas, or ideologies, along with a strong belief in the importance of distinguis ...
.


Philosophy

Physicist and philosophy writer
Freeman Dyson Freeman John Dyson (15 December 1923 – 28 February 2020) was a British-American theoretical physics, theoretical physicist and mathematician known for his works in quantum field theory, astrophysics, random matrix, random matrices, math ...
has suggested that one can broadly, if over-simplistically, divide "observers of the philosophical scene" into splitters and lumpers – roughly corresponding to
materialists Materialism is a form of philosophical monism according to which matter is the fundamental substance in nature, and all things, including mental states and consciousness, are results of material interactions. According to philosophical materia ...
(who imagine the world as divided into atoms) and
Platonists Platonism is the philosophy of Plato and philosophical systems closely derived from it, though contemporary Platonists do not necessarily accept all doctrines of Plato. Platonism has had a profound effect on Western thought. At the most fundam ...
(who regard the world as made up of ideas).


Psychiatry

In
psychiatry Psychiatry is the medical specialty devoted to the diagnosis, treatment, and prevention of deleterious mental disorder, mental conditions. These include matters related to cognition, perceptions, Mood (psychology), mood, emotion, and behavior. ...
, the
'splitters' and the 'lumpers' have fundamentally different approaches to psychiatric diagnosis and classification. First, 'splitters' emphasise the heterogeneity within the diagnostic categories and argue that this heterogeneity drives the 'splitting' process'. 'Lumpers', on the other hand, point to the similarities ''between'' the diagnostic categories, and suggest that these similarities justify the creation of broader entities.
Thus lumpers might see "stress" where splitters could identify (say)
worry Worrying is the mental distress or agitation resulting from anxiety, usually coming from a place of anticipatory fear (terror) or fear coming from a present threat (horror). With more understanding of the situation, worry becomes concern, the ...
,
grief Grief is the response to the loss of something deemed important, particularly to the death of a person to whom or animal to which a Human bonding, bond or affection was formed. Although conventionally focused on the emotional response to loss, ...
, or some sort of
anxiety disorder Anxiety disorders are a group of mental disorders characterized by significant and uncontrollable feelings of anxiety and fear such that a person's social, occupational, and personal functions are significantly impaired. Anxiety may cause phys ...
.


Neuroscience

In neuroscience, "uncertainty aversion" and "uncertainty tolerance" in semantic representations appear to correlate with the terms "splitters" and "lumpers" respectively. As neuroscientist Marc-Lluís Vives observes:
Our survival is possible because every day we make use of previously acquired categories to navigate the world. Every single mug we encounter is distinct, but fundamentally the same. Thanks to this powerful capacity to classify distinct stimuli under the same category, we can generalize our knowledge from the previously encountered subset of mugs to a future subset of mugs. However, this also posits a dilemma: Is a glass mug still a mug? That is, what are the defining principles that make something a "mug"? Establishing this is fundamental since it also affects its relationship with its close-neighbors. Conceptualizing a mug as very different from a glass creates a more clear-cut mapping between the input—that is, the stimulus perceived—and the output that a person needs to generate—that is, the response, such as drinking coffee. Classical work in cognitive science demonstrates that the more similar two stimuli are, the harder it is to discriminate them and respond with different behavior.


Artificial intelligence and linguistics

Natural language processing Natural language processing (NLP) is a subfield of computer science and especially artificial intelligence. It is primarily concerned with providing computers with the ability to process data encoded in natural language and is thus closely related ...
, using algorithmic approaches such as Word2Vec, provides a way to quantify the overlap or distinguish between semantic categories between words. This can provide a sense of how often the contexts of words overlap or are dissimilar in general usage.


See also

*
Evolutionary biology Evolutionary biology is the subfield of biology that studies the evolutionary processes such as natural selection, common descent, and speciation that produced the diversity of life on Earth. In the 1930s, the discipline of evolutionary biolo ...
*
Heterarchy A heterarchy is a system of organization where the elements of the organization are unranked (non- hierarchical) or where they possess the potential to be ranked a number of different ways. Definitions of the term vary among the disciplines: in soc ...
* No true Scotsman *
Prototype theory Prototype theory is a theory of categorization in cognitive science, particularly in psychology and cognitive linguistics, in which there is a graded degree of belonging to a conceptual category, and some members are more central than others. It ...
*
Sorites paradox The sorites paradox (), sometimes known as the paradox of the heap, is a paradox that results from vague predicates. A typical formulation involves a heap of sand, from which grains are removed individually. With the assumption that removing a s ...


References

{{Reflist


External links


"Lumpers and Splitters"
''Veryard Projects''. July 2003. Knowledge representation Taxonomy Classification of people