The term deduplication refers generally to eliminating duplicate or redundant information.
*
Data deduplication
In computing, data deduplication is a technique for eliminating duplicate copies of repeating data. Successful implementation of the technique can improve storage utilization, which may in turn lower capital expenditure by reducing the overall amou ...
, in computer storage, refers to the elimination of redundant data
*
Record linkage
Record linkage (also known as data matching, data linkage, entity resolution, and many other terms) is the task of finding records in a data set that refer to the same entity across different data sources (e.g., data files, books, websites, and d ...
, in databases, refers to the task of finding entries that refer to the same entity in two or more files
{{Disambiguation