The SMART (System for the Mechanical Analysis and Retrieval of Text) Information Retrieval System is an
information retrieval
Information retrieval (IR) in computing and information science is the task of identifying and retrieving information system resources that are relevant to an Information needs, information need. The information need can be specified in the form ...
system developed at
Cornell University
Cornell University is a Private university, private Ivy League research university based in Ithaca, New York, United States. The university was co-founded by American philanthropist Ezra Cornell and historian and educator Andrew Dickson W ...
in the 1960s. Many important concepts in information retrieval were developed as part of research on the SMART system, including the
vector space model
Vector space model or term vector model is an algebraic model for representing text documents (or more generally, items) as vector space, vectors such that the distance between vectors represents the relevance between the documents. It is used in i ...
,
relevance feedback, and
Rocchio classification.
Gerard Salton led the group that developed SMART. Other contributors included
Mike Lesk
Michael E. Lesk (born 1945) is an American computer scientist.
Biography
In the 1960s, Michael Lesk worked for the SMART Information Retrieval System project, wrote much of its retrieval code and did many of the retrieval experiments, as well as ...
.
The SMART system also provides a set of corpora, queries and reference rankings, taken from different subjects, notably
*
ADI: publications from information science reviews
*
Computer science
Computer science is the study of computation, information, and automation. Computer science spans Theoretical computer science, theoretical disciplines (such as algorithms, theory of computation, and information theory) to Applied science, ...
*
Cranfield collection: publications from aeronautic reviews
*
Forensic science
Forensic science combines principles of law and science to investigate criminal activity. Through crime scene investigations and laboratory analysis, forensic scientists are able to link suspects to evidence. An example is determining the time and ...
: library science
*
MEDLARS
MEDLINE (Medical Literature Analysis and Retrieval System Online, or MEDLARS Online) is a bibliographic database of life sciences and biomedical information. It includes bibliographic information for articles from academic journals covering medic ...
collection: publications from medical reviews
*
Time magazine
''Time'' (stylized in all caps as ''TIME'') is an American news magazine based in New York City. It was published weekly for nearly a century. Starting in March 2020, it transitioned to every other week. It was first published in New York Cit ...
collection: archives of the generalist review ''Time'' in 1963
To the legacy of the SMART system belongs the so-called SMART triple notation, a mnemonic scheme for denoting
tf-idf weighting variants in the vector space model. The mnemonic for representing a combination of weights takes the form
ddd.qqq
, where the first three letters represents the term weighting of the collection document vector and the second three letters represents the term weighting for the query document vector. For example,
ltc.lnn
represents the
ltc
weighting applied to a collection document and the
lnn
weighting applied to a query document.
The following tables establish the SMART notation:
The gray letters in the first, fifth, and ninth columns are the scheme used by Salton and Buckley in their 1988 paper.
[Salton, G., & Buckley, C. (1988)]
Term-Weighting Approaches in Automatic Text Retrieval
''Inf. Process. Manage., 24'', 513-523. The bold letters in the second, sixth, and tenth columns are the scheme used in experiments reported thereafter.
References
External links
*
tp://ftp.cs.cornell.edu/pub/smart/ Software and test collections(FTP at
Cornell University
Cornell University is a Private university, private Ivy League research university based in Ithaca, New York, United States. The university was co-founded by American philanthropist Ezra Cornell and historian and educator Andrew Dickson W ...
)
Interactive SMART tutorial
Discontinued software
Search engine software
{{software-eng-stub