
A semantic lexicon is a digital
dictionary
A dictionary is a listing of lexemes from the lexicon of one or more specific languages, often arranged Alphabetical order, alphabetically (or by Semitic root, consonantal root for Semitic languages or radical-and-stroke sorting, radical an ...
of
word
A word is a basic element of language that carries semantics, meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consensus among linguist ...
s labeled with
semantic
Semantics is the study of linguistic Meaning (philosophy), meaning. It examines what meaning is, how words get their meaning, and how the meaning of a complex expression depends on its parts. Part of this process involves the distinction betwee ...
classes so associations can be drawn between words that have not previously been encountered. Semantic lexicons are built upon
semantic network
A semantic network, or frame network is a knowledge base that represents semantic relations between concepts in a network. This is often used as a form of knowledge representation. It is a directed or undirected graph consisting of vertices, ...
s, which represent the semantic relations between words. The difference between a semantic lexicon and a semantic network is that a semantic lexicon has definitions for each word, or a "gloss".
Structure
Semantic lexicons are made up of lexical entries. These entries are not orthographic, but semantic, eliminating issues of homonymy and polysemy. These lexical entries are interconnected with
semantic relations
Semantics is the study of linguistic meaning. It examines what meaning is, how words get their meaning, and how the meaning of a complex expression depends on its parts. Part of this process involves the distinction between sense and referenc ...
, such as
hyperonymy,
hyponymy
Hypernymy and hyponymy are the wikt:Wiktionary:Semantic relations, semantic relations between a generic term (''hypernym'') and a more specific term (''hyponym''). The hypernym is also called a ''supertype'', ''umbrella term'', or ''blanket term ...
,
meronymy
In linguistics, meronymy () is a semantic relation between a meronym denoting a part and a holonym denoting a whole. In simpler terms, a meronym is in a ''part-of'' relationship with its holonym. For example, ''finger'' is a meronym of ''hand, ...
, or
troponymy. Synonymous entries are grouped together in what the Princeton
WordNet
WordNet is a lexical database of semantic relations between words that links words into semantic relations including synonyms, hyponyms, and meronyms. The synonyms are grouped into ''synsets'' with short definitions and usage examples. It can thu ...
calls "
synsets
In metadata, a synonym ring or synset, is a group of data elements that are considered semantically equivalent for the purposes of information retrieval. These data elements are frequently found in different metadata registries. Although a group ...
"
Most semantic lexicons are made up of four different "sub-nets":
nouns, verbs, adjectives, and adverbs, though some researchers have taken steps to add an "artificial node" interconnecting the sub-nets.
Nouns
Nouns are ordered into a
taxonomy
image:Hierarchical clustering diagram.png, 280px, Generalized scheme of taxonomy
Taxonomy is a practice and science concerned with classification or categorization. Typically, there are two parts to it: the development of an underlying scheme o ...
, structured into a hierarchy where the broadest and most encompassing noun is located at the top, such as "thing", with the nouns becoming more and more specific the further they are from the top. The very top noun in a semantic lexicon is called a ''unique beginner''. The most specific nouns (those that do not have any subordinates), are ''terminal nodes''.
Semantic lexicons also distinguish between types, where a type of something has characteristics of a thing such as a ''Rhodesian Ridgeback'' being a type of dog, and instances, where something is an example of said thing, such as ''
Dave Grohl
David Eric Grohl (; born January 14, 1969) is an American musician. He founded the rock band Foo Fighters, of which he is the lead singer, guitarist, principal songwriter, and only consistent member. From 1990 to 1994, he was the drummer of th ...
'' is an instance of a ''musician''. Instances are always terminal nodes because they are solitary and don’t have other words or
ontological categories belonging to them.
Semantic lexicons also address
meronymy
In linguistics, meronymy () is a semantic relation between a meronym denoting a part and a holonym denoting a whole. In simpler terms, a meronym is in a ''part-of'' relationship with its holonym. For example, ''finger'' is a meronym of ''hand, ...
,
which is a “part-to-whole” relationship, such as keys are part of a laptop. The necessary attributes that define a specific entry are also necessarily present in that entry’s
hyponym
Hypernymy and hyponymy are the wikt:Wiktionary:Semantic relations, semantic relations between a generic term (''hypernym'') and a more specific term (''hyponym''). The hypernym is also called a ''supertype'', ''umbrella term'', or ''blanket term ...
. So, if a ''computer'' has ''keys'', and a ''laptop'' is a type of ''computer'', then a ''laptop'' must have ''keys''. However, there are many instances where this distinction can become vague. A good example of this is the item ''chair''. Most would define a chair as having legs and a seat (as in the part one sits on). However, there are some artistic or modern chairs that do not have legs at all. Beanbags also do not have legs, but few would argue that they aren't chairs. Questions like this are the core questions that drive research and work in the fields of
taxonomy
image:Hierarchical clustering diagram.png, 280px, Generalized scheme of taxonomy
Taxonomy is a practice and science concerned with classification or categorization. Typically, there are two parts to it: the development of an underlying scheme o ...
and
ontology
Ontology is the philosophical study of existence, being. It is traditionally understood as the subdiscipline of metaphysics focused on the most general features of reality. As one of the most fundamental concepts, being encompasses all of realit ...
.
Verbs
Verb synsets are arranged much like their noun counterparts: the more general and encompassing verbs are near the top of the hierarchy while
troponyms (verbs that describe a more specific way of doing something) are grouped beneath. Verb specificity moves along a
vector
Vector most often refers to:
* Euclidean vector, a quantity with a magnitude and a direction
* Disease vector, an agent that carries and transmits an infectious pathogen into another living organism
Vector may also refer to:
Mathematics a ...
, with the verbs becoming more and more specific in reference to a certain quality.
For example. The set "walk / run / sprint" becomes more specific in terms of the speed, and "dislike / hate / abhor" becomes more specific in terms of the intensity of the emotion.
The ontological groupings and separations of verbs is far more arguable than their noun counterparts. It is widely accepted that a ''dog'' is a type of ''animal'' and that a ''stool'' is a type of ''chair'', but it can be argued that ''abhor'' is on the same emotional plane as ''hate'' (that they are synonyms and not super/subordinates). It can also be argued that ''love'' and ''adore'' are synonyms, or that one is more specific than the other. Thus, the relations between verbs are not as agreed-upon as that of nouns.
Another attribute of verb synset relations is that there are also ordered into verb pairs. In these pairs, one verb necessarily
entails the other in the way that ''massacre'' entails ''kill'', and ''know'' entails ''believe''.
These verb pairs can be troponyms and their superordinates, as is the case in the first example, or they can be in completely different ontological categories, as in the case in the second example.
Adjectives
Adjective synset relations are very similar to verb synset relations. They are not quite as neatly hierarchical as the noun synset relations, and they have fewer tiers and more terminal nodes. However, there are generally less terminal nodes per ontological category in adjective synset relations than that of verbs. Adjectives in semantic lexicons are organized in word pairs as well, with the difference being that their word pairs are
antonyms
In lexical semantics, opposites are words lying in an inherently incompatible binary relationship. For example, something that is ''even'' entails that it is not ''odd''. It is referred to as a 'binary' relationship because there are two members i ...
instead of
entailments. More generic polar adjectives such as ''hot'' and ''cold'', or ''happy'' and ''sad'' are paired. Then other adjectives that are semantically similar are linked to each of these words. ''Hot'' is linked to ''warm'', ''heated'', ''sizzling'', and ''sweltering'', while ''cold'' is linked to ''cool'', ''chilly'', ''freezing'', and ''nippy''. These semantically similar adjectives are considered ''indirect antonyms''
to the opposite polar adjective (i.e. ''nippy'' is an indirect antonym to ''hot''). Adjectives that are derived from a verb or a noun are also directly linked to said verb or noun across sub-nets. For example, ''enjoyable'' is linked to the semantically similar adjectives ''agreeable'', and ''pleasant'', as well as to its origin verb, ''enjoy''.
Adverbs
There are very few adverbs accounted for in semantic lexicons. This is because most adverbs are taken directly from their adjective counterparts, in both meaning and form, and changed only
morphologically (i.e. ''happily'' is derived from ''happy'', and ''luckily'' is derived from ''lucky'', which is derived from ''luck''). The only adverbs that are accounted for specifically are ones without these connections, such as ''really'', ''mostly'', and ''hardly''.
Challenges facing semantic lexicons
The effects of the Princeton
WordNet
WordNet is a lexical database of semantic relations between words that links words into semantic relations including synonyms, hyponyms, and meronyms. The synonyms are grouped into ''synsets'' with short definitions and usage examples. It can thu ...
project extend far past English, though most research in the field revolves around the English language. Creating a semantic lexicon for other languages has proved to be very useful for
Natural Language Processing
Natural language processing (NLP) is a subfield of computer science and especially artificial intelligence. It is primarily concerned with providing computers with the ability to process data encoded in natural language and is thus closely related ...
applications. One of the main focuses of research in semantic lexicons is linking lexicons of different languages to aid in
machine translation
Machine translation is use of computational techniques to translate text or speech from one language to another, including the contextual, idiomatic and pragmatic nuances of both languages.
Early approaches were mostly rule-based or statisti ...
. The most common approach is to attempt to create a shared ontology that serves as a “middleman” of sorts between semantic lexicons of two different languages.
This is an extremely challenging and as-of-yet unsolved issue in the Machine Translation field. One issue arises from the fact that no two languages are word-for-word translations of each other. That is, every language has some sort of structural or syntactic difference from every other. In addition, languages often have words that don’t translate easily into other languages, and certainly not with an exact word-to-word match. Proposals have been made to create a set framework for wordnets. Research has shown that every known human language has some sort of concept resembling
synonym
A synonym is a word, morpheme, or phrase that means precisely or nearly the same as another word, morpheme, or phrase in a given language. For example, in the English language, the words ''begin'', ''start'', ''commence'', and ''initiate'' are a ...
y,
hyponymy
Hypernymy and hyponymy are the wikt:Wiktionary:Semantic relations, semantic relations between a generic term (''hypernym'') and a more specific term (''hyponym''). The hypernym is also called a ''supertype'', ''umbrella term'', or ''blanket term ...
,
meronymy
In linguistics, meronymy () is a semantic relation between a meronym denoting a part and a holonym denoting a whole. In simpler terms, a meronym is in a ''part-of'' relationship with its holonym. For example, ''finger'' is a meronym of ''hand, ...
, and
antonymy
In lexical semantics, opposites are words lying in an inherently incompatible binary relationship. For example, something that is ''even'' entails that it is not ''odd''. It is referred to as a 'binary' relationship because there are two members i ...
. However, every idea so far proposed has been met with criticism for using a pattern that works best for English and less for other languages.
Another obstacle in the field is that no solid guidelines exist for semantic lexicon framework and contents. Each lexicon project in each different language has had a slightly (or not so slightly) different approach to their wordnet. There is not even an agreed-upon definition of what a “word” is.
Orthographically, they are defined as a string of letters with spaces on either side, but semantically it becomes a very debated subject. For example, though it is not difficult to define ''dog'' or ''rod'' as words, but what about ''guard dog'' or ''lightning rod''? The latter two examples would be considered orthographically separate words, though semantically they make up one concept: one is a type of dog and one is a type of rod. In addition to these confusions, wordnets are also
idiosyncratic
An idiosyncrasy is a unique feature of something. The term is often used to express peculiarity.
Etymology
The term "idiosyncrasy" originates from Greek ', "a peculiar temperament, habit of body" (from ', "one's own", ', "with" and ', "blend ...
, in that they do not consistently label items. They are redundant, in that they often have several words assigned to each meaning (synsets). They are also open-ended, in that they often focus on and extend into
terminology
Terminology is a group of specialized words and respective meanings in a particular field, and also the study of such terms and their use; the latter meaning is also known as terminology science. A ''term'' is a word, Compound (linguistics), com ...
and domain-specific vocabulary.
Other names
* wordnet
* computational lexicon
List of semantic lexicons
*
WordNet
WordNet is a lexical database of semantic relations between words that links words into semantic relations including synonyms, hyponyms, and meronyms. The synonyms are grouped into ''synsets'' with short definitions and usage examples. It can thu ...
*
EuroWordNet
*
Multilingual Central Repository
*
Global Wordnet
*
MindNet
See also
*
Gellish
Gellish is an ontology language for data storage and communication, designed and developed by Andries van Renssen since mid-1990s. It started out as an engineering modeling language ("Generic Engineering Language", giving it the name, "Gellish") b ...
*
Lexicon
A lexicon (plural: lexicons, rarely lexica) is the vocabulary of a language or branch of knowledge (such as nautical or medical). In linguistics, a lexicon is a language's inventory of lexemes. The word ''lexicon'' derives from Greek word () ...
*
Semantic network
A semantic network, or frame network is a knowledge base that represents semantic relations between concepts in a network. This is often used as a form of knowledge representation. It is a directed or undirected graph consisting of vertices, ...
References
{{DEFAULTSORT:Semantic Lexicon
Lexis (linguistics)
Semantics