An image retrieval system is a computer system used for browsing, searching and retrieving images from a large
database
In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and a ...
of digital images. Most traditional and common methods of image retrieval utilize some method of adding
metadata
Metadata (or metainformation) is "data that provides information about other data", but not the content of the data itself, such as the text of a message or the image itself. There are many distinct types of metadata, including:
* Descriptive ...
such as
captioning,
keywords, title or descriptions to the images so that retrieval can be performed over the annotation words. Manual image annotation is time-consuming, laborious and expensive; to address this, there has been a large amount of research done on automatic image annotation. Additionally, the increase in social
web application
A web application (or web app) is application software that is created with web technologies and runs via a web browser. Web applications emerged during the late 1990s and allowed for the server to dynamically build a response to the request, ...
s and the
semantic web
The Semantic Web, sometimes known as Web 3.0, is an extension of the World Wide Web through standards set by the World Wide Web Consortium (W3C). The goal of the Semantic Web is to make Internet data machine-readable.
To enable the encoding o ...
have inspired the development of several web-based image annotation tools.
The first microcomputer-based image database retrieval system was developed at
MIT, in the 1990s, by Banireddy Prasaad,
Amar Gupta, Hoo-min Toong, and
Stuart Madnick.
A 2008 survey article documented progresses after 2007.
Search methods
Image search is a specialized data search used to find images. To search for images, a user may provide query terms such as keyword, image file/link, or click on some image, and the system will return images "similar" to the query. The similarity used for search criteria could be meta tags, color distribution in images, region/shape attributes, etc.
*
Image meta search - search of images based on associated metadata such as keywords, text, etc.
*
Content-based image retrieval (CBIR) – the application of
computer vision
Computer vision tasks include methods for image sensor, acquiring, Image processing, processing, Image analysis, analyzing, and understanding digital images, and extraction of high-dimensional data from the real world in order to produce numerical ...
to the image retrieval. CBIR aims at avoiding the use of textual descriptions and instead retrieves images based on similarities in their contents (textures, colors, shapes etc.) to a user-supplied query image or user-specified image features.
**
List of CBIR Engines - list of engines which search for images based image visual content such as color, texture, shape/object, etc.
:
*
Image collection exploration - search of images based on the use of novel exploration paradigms.
Data scope
It is crucial to understand the scope and nature of image data in order to determine the complexity of image search system design. The design is also largely influenced by factors such as the diversity of user-base and expected user traffic for a search system. Along this dimension, search data can be classified into the following categories:
* ''Archives'' - usually contain large volumes of structured or semi-structured homogeneous data pertaining to specific topics.
* ''Domain-Specific Collection'' - this is a homogeneous collection providing access to controlled users with very specific objectives. Examples of such a collection are biomedical and satellite image databases.
* ''Enterprise Collection'' - a heterogeneous collection of images that is accessible to users within an organization's intranet. Pictures may be stored in many different locations.
* ''Personal Collection'' - usually consists of a largely homogeneous collection and is generally small in size, accessible primarily to its owner, and usually stored on a local storage media.
* ''Web'' - World Wide Web images are accessible to everyone with an Internet connection. These image collections are semi-structured, non-homogeneous and massive in volume, and are usually stored in large disk arrays.
Evaluations
There are evaluation workshops for image retrieval systems aiming to investigate and improve the performance of such systems.
* ImageCLEF - a continuing track of the Cross Language Evaluation Forum that evaluates systems using both textual and pure-image retrieval methods.
* Content-based Access of Image and Video Libraries - a series of
IEEE
The Institute of Electrical and Electronics Engineers (IEEE) is an American 501(c)(3) organization, 501(c)(3) public charity professional organization for electrical engineering, electronics engineering, and other related disciplines.
The IEEE ...
workshops from 1998 to 2001.
See also
*
Automatic image annotation
*
Computer vision
Computer vision tasks include methods for image sensor, acquiring, Image processing, processing, Image analysis, analyzing, and understanding digital images, and extraction of high-dimensional data from the real world in order to produce numerical ...
*
Concept-based image indexing
*
Content-based image retrieval (CBIR)
*
Digital asset management
*
Digital image editing
*
Image organizer
*
Image processing
An image or picture is a visual representation. An image can be two-dimensional, such as a drawing, painting, or photograph, or three-dimensional, such as a carving or sculpture. Images may be displayed through other media, including a pr ...
*
Information retrieval
Information retrieval (IR) in computing and information science is the task of identifying and retrieving information system resources that are relevant to an Information needs, information need. The information need can be specified in the form ...
*
Object categorization from image search
*
Multimedia information retrieval
Multimedia information retrieval (MMIR or MIR) is a research discipline of computer science that aims at extracting semantic information from multimedia data sources.H Eidenberger. ''Fundamental Media Understanding'', atpress, 2011, p. 1. Data sour ...
*
VisualRank
*
Learning to rank
Learning to rank. Slides from Tie-Yan Liu's talk at World Wide Web Conference, WWW 2009 conference aravailable online or machine-learned ranking (MLR) is the application of machine learning, typically Supervised learning, supervised, Semi-supervi ...
References
External links
Image-Net.orgVGG Image Search Engine(VISE): a free and open source software for visual search of a large number of images using an image as a search query.
{{Search engines
Applications of computer vision