Faceted search augments lexical search with a faceted navigation system, allowing users to narrow results by applying filters based on a faceted classification of the items. It is a parametric search technique. A faceted classification system classifies each information element along multiple explicit dimensions, called facets, enabling the classifications to be accessed and ordered in multiple ways rather than in a single, predetermined, taxonomic order.
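As a minimal sketch of this narrowing step, assume a small in-memory collection of hypothetical items. The item data, the field names, and the `narrow` helper are illustrative assumptions and are not taken from any particular system. The sketch also assumes a common convention: selections are combined with AND across facets and with OR among the values chosen within one facet.

```python
from typing import Dict, List, Set

# Hypothetical catalog: each item is classified along several facets.
ITEMS: List[Dict[str, str]] = [
    {"title": "Intro to IR", "format": "book", "language": "English", "author": "Smith"},
    {"title": "Suchmaschinen", "format": "book", "language": "German", "author": "Meyer"},
    {"title": "Facets in Practice", "format": "article", "language": "English", "author": "Smith"},
    {"title": "Retrieval Basics", "format": "article", "language": "German", "author": "Lee"},
]


def narrow(items: List[Dict[str, str]], selections: Dict[str, Set[str]]) -> List[Dict[str, str]]:
    """Keep items that match every selected facet.

    Assumed convention: AND across facets, OR among the values
    selected within a single facet.
    """
    return [
        item
        for item in items
        if all(item.get(facet) in values for facet, values in selections.items())
    ]


# Apply two facet filters at once, then a single-facet filter.
english_books = narrow(ITEMS, {"format": {"book"}, "language": {"English"}})
by_smith = narrow(ITEMS, {"author": {"Smith"}})
```

Because the same items can be reached through `format`, `language`, or `author` in any order, no single taxonomic path is imposed on the user.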
Facets correspond to properties of the information elements. They are often derived by analysis of the text of an item using entity extraction techniques or from pre-existing fields in a database such as author, descriptor, language, and format. Thus, existing web pages, product descriptions or online collections of articles can be augmented with navigational facets.
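The following sketch illustrates the second route, deriving navigational facets from pre-existing fields. It assumes hypothetical records that already carry `author`, `language`, and `format` fields. The records and the `facet_counts` helper are assumptions made for illustration. It counts how many records carry each value, which is the kind of summary a faceted interface typically shows next to each filter option.

```python
from collections import Counter
from typing import Dict, List

# Hypothetical records with pre-existing database fields.
RECORDS: List[Dict[str, str]] = [
    {"author": "Smith", "language": "English", "format": "book"},
    {"author": "Smith", "language": "English", "format": "article"},
    {"author": "Lee", "language": "German", "format": "article"},
]


def facet_counts(records: List[Dict[str, str]], fields: List[str]) -> Dict[str, Counter]:
    """Count how many records carry each value of each facet field."""
    counts: Dict[str, Counter] = {field: Counter() for field in fields}
    for record in records:
        for field in fields:
            value = record.get(field)
            if value is not None:  # skip records that lack this field
                counts[field][value] += 1
    return counts


counts = facet_counts(RECORDS, ["author", "language", "format"])
```

Facets obtained by entity extraction from text could feed the same function once the extracted values are stored as fields on each record.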
Faceted search interfaces were first developed in the academic world by Ben Shneiderman, Steven Pollitt, Marti Hearst, and Gary Marchionini in the 1990s and 2000s.
The best known of these efforts was the Flamenco research project at the University of California, Berkeley, led by Marti Hearst. Concurrently, commercial faceted search systems were being developed, notably Endeca and Spotfire.
Within the academic community, faceted search has attracted interest primarily among library and information science researchers, and to some extent among computer science researchers specializing in information retrieval.
Mass market use
Faceted search has become a popular technique in commercial search applications, particularly for online retailers and libraries. An increasing number of enterprise search vendors provide software for implementing faceted search applications.
Online retail catalogs pioneered the earliest applications of faceted search, reflecting both the faceted nature of product data (most products have a type, brand, price, etc.) and the ready availability of the data in retailers' existing information systems. In the early 2000s retailers started using faceted search, in part due to published studies that evaluated user search experience on popular sites.
Among the 50 largest US-based online retailers, 40% had implemented faceted search (Smashing Magazine, "The Current State of E-Commerce Search", retrieved on 2014-08-27). Examples include the filtering options that appear in the left column on amazon.com or Google Shopping after a keyword search has been performed.
Libraries and information science
In 1933, the noted librarian Ranganathan proposed a faceted classification system for library materials, known as colon classification. In the pre-computer era, he did not succeed in replacing the pre-coordinated Dewey Decimal Classification system.
Modern online library catalogs, also known as online public access catalogs (OPACs), have increasingly adopted faceted search interfaces. Noted examples include the North Carolina State University library catalog (part of the Triangle Research Libraries Network) and the OCLC Open WorldCat system. The CiteSeerX project (citeseerx.ist.psu.edu, retrieved on 2013-07-21) at the Pennsylvania State University allows faceted search for academic documents and continues to expand into other facets such as table search.
See also
* Enterprise search
* Exploratory search
* Faceted classification
* Human–computer information retrieval
* Information extraction
* NoSQL
* Voxound