Information integration (II) is the merging of information from heterogeneous sources with differing conceptual, contextual and typographical representations. It is used in
data mining and consolidation of data from unstructured or semi-structured resources. Typically, ''information integration'' refers to textual representations of knowledge but is sometimes applied to
rich-media content.
Information fusion, a related term, involves combining information into a new set of information with the aim of reducing redundancy and uncertainty.
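As an illustration of how combining sources can reduce uncertainty, the following minimal sketch fuses several estimates of the same quantity by inverse-variance weighting. This is only one textbook technique, chosen here as an example; it assumes the sources are independent and unbiased with known variances, and the function name `fuse` and the sample numbers are hypothetical.

```python
def fuse(estimates):
    """Fuse independent (value, variance) estimates by inverse-variance weighting.

    Assumption: every source is unbiased, independent of the others,
    and reports a known, strictly positive variance.
    """
    weights = [1.0 / variance for _, variance in estimates]
    total = sum(weights)
    value = sum(w * v for w, (v, _) in zip(weights, estimates)) / total
    # The fused variance is never larger than the smallest input variance.
    return value, 1.0 / total


if __name__ == "__main__":
    # Two hypothetical sources reporting the same quantity.
    value, variance = fuse([(10.0, 4.0), (12.0, 1.0)])
    print(value, variance)
```

Under these assumptions the fused estimate leans towards the more precise source, and its variance is lower than that of either source taken alone.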
Examples of technologies available to integrate information include deduplication and string metrics, which allow the detection of similar text in different data sources by fuzzy matching. A host of methods for these research areas are available, such as those presented by the International Society of Information Fusion. Other methods rely on causal estimates of the outcomes based on a model of the sources.
[P.K. Davis, D. Manheim, W.L. Perry, J. Hollywood (2015). In Proceedings of the 2015 Winter Simulation Conference (WSC '15). IEEE Press, Piscataway, NJ, USA, pp. 2586-2597.]
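The use of a string metric for fuzzy matching and deduplication, as described above, can be sketched as follows. This is a minimal illustration rather than a reference method: it uses the similarity ratio from Python's standard `difflib` module as the string metric, and the function names, the 0.85 threshold and the sample company names are assumptions made for the example.

```python
from difflib import SequenceMatcher


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial differences are ignored."""
    return " ".join(text.lower().split())


def similarity(a: str, b: str) -> float:
    """Return a similarity score in [0, 1] using difflib's ratio."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def link_records(source_a, source_b, threshold=0.85):
    """Pair each record in source_a with its best match in source_b, if close enough."""
    links = []
    for a in source_a:
        best = max(source_b, key=lambda b: similarity(a, b), default=None)
        if best is not None and similarity(a, best) >= threshold:
            links.append((a, best))
    return links


def deduplicate(records, threshold=0.85):
    """Keep the first occurrence from each group of near-duplicate records."""
    kept = []
    for record in records:
        if all(similarity(record, k) < threshold for k in kept):
            kept.append(record)
    return kept


if __name__ == "__main__":
    # Hypothetical records from two differently formatted sources.
    crm = ["Acme Corporation", "Globex Inc.", "Initech"]
    billing = ["ACME  Corporation", "Globex Inc", "Umbrella Corp"]
    print(link_records(crm, billing))
    print(deduplicate(crm + billing))
```

The threshold trades false matches against missed matches and would normally be tuned on the data at hand. The pairwise comparison is quadratic in the number of records, so larger data sets typically need a blocking or indexing step first.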
See also
* Data fusion (a subset of information integration)
* Sensor fusion
* Data integration
* Image fusion
* Synesthesia
Books
* Springer, Information Fusion in Data Mining (2003)
* H. B. Mitchell, Multi-sensor Data Fusion – An Introduction (2007), Springer-Verlag, Berlin
* S. Das, High-Level Data Fusion (2008), Artech House Publishers, Norwood, MA, ISBN 1596932813
* E. P. Blasch, E. Bosse, and D. A. Lambert, High-Level Information Fusion Management and System Design (2012), Artech House Publishers, Norwood, MA
References
External links
* Discriminant Correlation Analysis (DCA)
* Information Integration Using Logical View, LNCS 1997
* International Society of Information Fusion