A concordance is an
alphabetical
Alphabetical order is a system whereby character strings are placed in order based on the position of the characters in the conventional ordering of an alphabet. It is one of the methods of collation. In mathematics, a lexicographical order is ...
list of the principal
word
A word is a basic element of language that carries semantics, meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consensus among linguist ...
s used in a book or body of work, listing every instance of each word with its immediate
context. Historically, concordances have been compiled only for works of special importance, such as the
Vedas
FIle:Atharva-Veda samhita page 471 illustration.png, upright=1.2, The Vedas are ancient Sanskrit texts of Hinduism. Above: A page from the ''Atharvaveda''.
The Vedas ( or ; ), sometimes collectively called the Veda, are a large body of relig ...
,
Bible
The Bible is a collection of religious texts that are central to Christianity and Judaism, and esteemed in other Abrahamic religions such as Islam. The Bible is an anthology (a compilation of texts of a variety of forms) originally writt ...
,
Qur'an
The Quran, also romanized Qur'an or Koran, is the central religious text of Islam, believed by Muslims to be a revelation directly from God ('' Allāh''). It is organized in 114 chapters (, ) which consist of individual verses ('). Besides ...
or the works of
Shakespeare
William Shakespeare ( 23 April 1564 – 23 April 1616) was an English playwright, poet and actor. He is widely regarded as the greatest writer in the English language and the world's pre-eminent dramatist. He is often called England's natio ...
,
James Joyce
James Augustine Aloysius Joyce (born James Augusta Joyce; 2 February 1882 – 13 January 1941) was an Irish novelist, poet, and literary critic. He contributed to the modernist avant-garde movement and is regarded as one of the most influentia ...
or classical Latin and Greek authors, because of the time, difficulty, and expense involved in creating a concordance in the pre-
computer
A computer is a machine that can be Computer programming, programmed to automatically Execution (computing), carry out sequences of arithmetic or logical operations (''computation''). Modern digital electronic computers can perform generic set ...
era.

A concordance is more than an
index
Index (: indexes or indices) may refer to:
Arts, entertainment, and media Fictional entities
* Index (''A Certain Magical Index''), a character in the light novel series ''A Certain Magical Index''
* The Index, an item on the Halo Array in the ...
, with additional material such as commentary, definitions and topical
cross-indexing which makes producing one a labor-intensive process even when assisted by computers.
In the precomputing era,
search
Searching may refer to:
Music
* "Searchin', Searchin", a 1957 song originally performed by The Coasters
* Searching (China Black song), "Searching" (China Black song), a 1991 song by China Black
* Searchin' (CeCe Peniston song), "Searchin" (C ...
technology was unavailable, and a concordance offered readers of long works such as the Bible something comparable to search results for every word that they would have been likely to search for. Today, the ability to combine the result of queries concerning multiple terms (such as searching for words near other words) has reduced interest in concordance publishing. In addition, mathematical techniques such as
latent semantic indexing
Latent semantic analysis (LSA) is a technique in natural language processing, in particular distributional semantics, of analyzing relationships between a set of documents and the terms they contain by producing a set of concepts related to the d ...
have been proposed as a means of automatically identifying linguistic information based on word context.
A bilingual concordance is a concordance based on
aligned parallel text.
A topical concordance is a list of subjects that a book covers (usually The Bible), with the immediate context of the coverage of those subjects. Unlike a traditional concordance, the indexed word does not have to appear in the verse. The best-known topical concordance is
Nave's Topical Bible.
The first
Bible concordance
A Bible concordance is a Concordance (publishing), concordance, or verbal index, to the Bible. A simple form lists Biblical words alphabetically, with indications to enable the inquirer to find the passages of the Bible where the words occur.
Con ...
was compiled for the
Vulgate
The Vulgate () is a late-4th-century Bible translations into Latin, Latin translation of the Bible. It is largely the work of Saint Jerome who, in 382, had been commissioned by Pope Damasus I to revise the Gospels used by the Diocese of ...
Bible by
Hugh of St Cher (d.1262), who employed 500 friars to assist him. In 1448, Rabbi Mordecai Nathan completed a concordance to the Hebrew Bible. It took him ten years. A concordance to the Greek New Testament was published in 1546 by
Sixt Birck, and the
Septuagint
The Septuagint ( ), sometimes referred to as the Greek Old Testament or The Translation of the Seventy (), and abbreviated as LXX, is the earliest extant Greek translation of the Hebrew Bible from the original Biblical Hebrew. The full Greek ...
was done a by Conrad Kircher in 1602. The first concordance to the English Bible was published in 1550 by Mr Marbeck. According to Cruden, it did not employ the verse numbers devised by
Robert Stephens in 1545, but "the pretty large concordance" of Mr Cotton did. Then followed
Cruden's Concordance and
Strong's Concordance
''The Exhaustive Concordance of the Bible'', generally known as Strong's Concordance, is a Bible concordance, an index of every word in the King James Version (KJV), constructed under the direction of American theologian James Strong. Strong fi ...
.
Use in linguistics
Concordances are frequently used in
linguistics
Linguistics is the scientific study of language. The areas of linguistic analysis are syntax (rules governing the structure of sentences), semantics (meaning), Morphology (linguistics), morphology (structure of words), phonetics (speech sounds ...
, when studying a text. For example:
* comparing different usages of the same word
* analysing keywords
* analysing
word frequencies
A word list is a list of words in a lexicon, generally sorted by frequency of occurrence (either by graded levels, or as a ranked list). A word list is compiled by lexical frequency analysis within a given text corpus, and is used in corpus li ...
* finding and analysing phrases and
idioms
An idiom is a phrase or expression that largely or exclusively carries a figurative or non-literal meaning, rather than making any literal sense. Categorized as formulaic language, an idiomatic expression's meaning is different from the lit ...
* finding
translation
Translation is the communication of the semantics, meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The English la ...
s of subsentential elements, e.g.
terminology
Terminology is a group of specialized words and respective meanings in a particular field, and also the study of such terms and their use; the latter meaning is also known as terminology science. A ''term'' is a word, Compound (linguistics), com ...
, in
bitexts and translation memories
* creating indexes and word lists (also useful for publishing)
Concordancing techniques are widely used in national
text corpora
In linguistics and natural language processing, a corpus (: corpora) or text corpus is a dataset, consisting of natively digital and older, digitalized, language resources, either annotated or unannotated.
Annotated, they have been used in cor ...
such as
American National Corpus (ANC),
British National Corpus
The British National Corpus (BNC) is a 100-million-word text corpus of samples of written and spoken English from a wide range of sources. The corpus covers British English of the late 20th century from a wide variety of genres, with the intention ...
(BNC), and
Corpus of Contemporary American English (COCA) available on-line. Stand-alone applications that employ concordancing techniques are known as
concordancers or more advanced
corpus manager
A corpus manager (corpus browser or corpus query system) is a tool for multilingual corpus analysis, which allows effective searching in corpora.
A corpus manager usually represents a complex tool that allows one to perform searches for language ...
s. Some of them have integrated
part-of-speech taggers (POS taggers) and enable the user to create their own
POS-annotated corpora to conduct various types of searches adopted in corpus linguistics.
Inversion
The reconstruction of the text of some of the
Dead Sea Scrolls
The Dead Sea Scrolls, also called the Qumran Caves Scrolls, are a set of List of Hebrew Bible manuscripts, ancient Jewish manuscripts from the Second Temple period (516 BCE – 70 CE). They were discovered over a period of ten years, between ...
involved a concordance.
Access to some of the scrolls was governed by a "secrecy rule" that allowed only the original International Team or their designates to view the original materials. After the death of
Roland de Vaux in 1971, his successors repeatedly refused to even allow the publication of photographs to other scholars. This restriction was circumvented by
Martin Abegg in 1991, who used a computer to "invert" a concordance of the missing documents made in the 1950s which had come into the hands of scholars outside of the International Team, to obtain an approximate reconstruction of the original text of 17 of the documents.
This was soon followed by the release of the original text of the scrolls.
See also
*
A Vedic Word Concordance
*
Bible concordance
A Bible concordance is a Concordance (publishing), concordance, or verbal index, to the Bible. A simple form lists Biblical words alphabetically, with indications to enable the inquirer to find the passages of the Bible where the words occur.
Con ...
*
Cross-reference
The term cross-reference (abbreviation: xref) can refer to either:
* An instance within a document which refers to related information elsewhere in the same document. In both printed and online dictionaries cross-references are important because ...
*
Key Word in Context
*
Index
Index (: indexes or indices) may refer to:
Arts, entertainment, and media Fictional entities
* Index (''A Certain Magical Index''), a character in the light novel series ''A Certain Magical Index''
* The Index, an item on the Halo Array in the ...
*
Text mining
Text mining, text data mining (TDM) or text analytics is the process of deriving high-quality information from text. It involves "the discovery by computer of new, previously unknown information, by automatically extracting information from differe ...
*
Termbase
References
External links
Shakespeare concordance- A concordance of Shakespeare's complete works (from Open Source Shakespeare)
Online Concordance to the Complete Works of Hryhorii Skovoroda- A concordance to
Hryhorii Skovoroda's complete works (University of Alberta, Edmonton, Canada)
Alex Catalogue of Electronic Texts- The Alex Catalogue is a collection of public domain electronic texts from American and English literature as well as Western philosophy. Each of the 14,000 items in the Catalogue are available as full-text but they are also complete with a concordance. Consequently, you are able to count the number of times a particular word is used in a text or list the most common (10, 25, 50, etc.) words.
Hyper-Concordance, Mitsu Matsuoka, Nagoya University- The Hyper-Concordance is written in C++, a program that scans and displays lines based on a command entered by the user. Includes Victorian, British & Irish, and American literatures.
- Page includes link to Concord, an on-the-fly KWIC concordance generator. Works with at least some non-Latin scripts (modern Greek, for instance). Multiple choices for sorting results; multi-platform; Open Source.
ConcorDance- A concordance interface to the WorldWideWeb, it uses Google's or Yahoo's search engine to find concordances and can be used directly from the browser.
Chinese Text Project Concordance Tool- Concordance lookup and discussion of the continued importance of printed concordances in
Sinology
Sinology, also referred to as China studies, is a subfield of area studies or East Asian studies involved in social sciences and humanities research on China. It is an academic discipline that focuses on the study of the Chinese civilization p ...
-
Chinese Text Project
KH Coder- A free software for KWIC concordance and collocation stats generation. Various statistical analysis functions are also available such as
co-occurrence network, multidimensional scaling, hierarchical cluster analysis, and correspondence analysis of words.
The electronic concordance of the Armenian literature- Armenian literature of concordances has a rich history, but one of its breakthrough stages was marked by the creation of a large Armenian electronic concordance available on the website. The electronic concordances are advantageously distinguished from both typographical, printed concordances and foreign-language electronic concordances. Those differences refer to the choice of authors, the structure, and the presentation of the headword.
{{DEFAULTSORT:Concordance (Publishing)
Information science