In
natural language processing
Natural language processing (NLP) is a subfield of computer science and especially artificial intelligence. It is primarily concerned with providing computers with the ability to process data encoded in natural language and is thus closely related ...
, textual entailment (TE), also known as natural language inference (NLI), is a directional relation between
text
Text may refer to:
Written word
* Text (literary theory)
In literary theory, a text is any object that can be "read", whether this object is a work of literature, a street sign, an arrangement of buildings on a city block, or styles of clothi ...
fragments. The relation holds whenever the truth of one text fragment follows from another text.
Definition
In the TE framework, the entailing and entailed texts are termed ''text'' (''t'') and ''hypothesis'' (''h''), respectively. Textual entailment is not the same as pure
logical entailment – it has a more relaxed definition: "''t'' entails ''h''" (''t'' ⇒ ''h'') if, typically, a human reading ''t'' would infer that ''h'' is most likely true. (Alternatively: ''t'' ⇒ ''h'' if and only if, typically, a human reading ''t'' would be justified in inferring the proposition expressed by ''h'' from the proposition expressed by ''t''.) The relation is directional because even if "''t'' entails ''h''", the reverse "''h'' entails ''t''" is much less certain.
[Dagan, I. and O. Glickman. 'Probabilistic textual entailment: Generic applied modeling of language variability'](_blank)
in: ''PASCAL Workshop on Learning Methods for Text Understanding and Mining'' (2004) Grenoble.[Tătar, D. e.a. ''Textual Entailment as a Directional Relation''](_blank)
/ref>
Determining whether this relationship holds is an informal task, one which sometimes overlaps with the formal tasks of formal semantics (satisfying a strict condition will usually imply satisfaction of a less strict conditioned); additionally, textual entailment partially subsumes word entailment.
Examples
Textual entailment can be illustrated with examples of three different relations:
An example of a positive TE (text entails hypothesis) is:
*text: ''If you help the needy, God will reward you''.
:hypothesis: ''Giving money to a poor man has good consequences''.
An example of a negative TE (text contradicts hypothesis) is:
*text: ''If you help the needy, God will reward you''.
:hypothesis: ''Giving money to a poor man has no consequences''.
An example of a non-TE (text does not entail nor contradict) is:
*text: ''If you help the needy, God will reward you''.
:hypothesis: ''Giving money to a poor man will make you a better person''.
Ambiguity of natural language
A characteristic of natural language is that there are many different ways to state what one wants to say: several meanings can be contained in a single text and the same meaning can be expressed by different texts. This variability of semantic expression can be seen as the dual problem of language ambiguity
Ambiguity is the type of meaning (linguistics), meaning in which a phrase, statement, or resolution is not explicitly defined, making for several interpretations; others describe it as a concept or statement that has no real reference. A com ...
. Together, they result in a many-to-many
Many-to-many communication occurs when information is shared between groups. Members of a group receive information from multiple senders.
Wikis are a type of many-to-many communication, where multiple editors collaborate to create content that is ...
mapping between language expressions and meanings. The task of paraphrasing
A paraphrase () or rephrase is the rendering of the same text in different words without losing the meaning of the text itself. More often than not, a paraphrased text can convey its meaning better than the original words. In other words, it is a ...
involves recognizing when two texts have the same meaning and creating a similar or shorter text that conveys almost the same information. Textual entailment is similar but weakens the relationship to be unidirectional. Mathematical solutions to establish textual entailment can be based on the directional property of this relation, by making a comparison between some directional similarities of the texts involved.
Approaches
Textual entailment measures natural language understanding as it asks for a semantic interpretation of the text, and due to its generality remains an active area of research. Many approaches and refinements of approaches have been considered, such as word embedding
In natural language processing, a word embedding is a representation of a word. The embedding is used in text analysis. Typically, the representation is a real-valued vector that encodes the meaning of the word in such a way that the words that ...
, logical models, graphical models, rule systems, contextual focusing, and machine learning. Practical or large-scale solutions avoid these complex methods and instead use only surface syntax or lexical relationships, but are correspondingly less accurate. , state-of-the-art systems are far from human performance; a study found humans to agree on the dataset 95.25% of the time. Algorithms from 2016 had not yet achieved 90%.
Applications
Many natural language processing applications, like question answering, information extraction, summarization, multi-document summarization, and evaluation of machine translation systems, need to recognize that a particular target meaning can be inferred from different text variants. Typically entailment is used as part of a larger system, for example in a prediction system to filter out trivial or obvious predictions. Textual entailment also has applications in adversarial stylometry, which has the objective of removing textual style without changing the overall meaning of communication.
Datasets
Some of available English NLI datasets include:
SNLI
ref name=Bowman2015>MultiNLIref name=Williams2018>
SciTailref name=Khot2018>
SICKref name=Marelli2014>
MedNLIref name=Romanov2018>
QA-NLIref name=Demszky2018>
In addition, there are several non-English NLI datasets, as follows:
XNLIref name=Conneau2018>
DACCORD, RTE3-FR, SICK-FRref name=Skandalis2024> for French
FarsTailref name="Amirkhani2021"> for Farsi
OCNLIref name=Hu2020> for Chinese
SICK-NLref name=Wijnholds2021> for Dutch
IndoNLIref name=Mahendra2021> for Indonesian
See also
*
Entailment (linguistics)
Linguistic entailments are entailments which arise in natural language. If a sentence ''A'' entails a sentence ''B'', sentence ''A'' cannot be true without ''B'' being true as well. For instance, the English sentence "Pat is a fluffy cat" entail ...
*
Inference engine
*
Semantic reasoner
*
Fuzzy logic
Fuzzy logic is a form of many-valued logic in which the truth value of variables may be any real number between 0 and 1. It is employed to handle the concept of partial truth, where the truth value may range between completely true and completely ...
References
Bibliography
*
External links
Textual Entailment Resource Pool
{{Natural Language Processing
Tasks of natural language processing
Logical consequence
Text mining
Natural language processing