Wide Area Information Server (WAIS) is a
client–server text searching system that uses the
ANSI
The American National Standards Institute (ANSI ) is a private nonprofit organization that oversees the development of voluntary consensus standards for products, services, processes, systems, and personnel in the United States. The organiz ...
Standard
Z39.50 Information Retrieval Service Definition and
Protocol Specifications for Library Applications" (Z39.50:1988) to search index
database
In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and a ...
s on remote computers. It was developed in 1990 as a project of
Thinking Machines,
Apple Computer,
Dow Jones, and
KPMG Peat Marwick.
WAIS did not adhere to either the standard nor its
OSI framework (adopting instead
TCP/IP
The Internet protocol suite, commonly known as TCP/IP, is a framework for organizing the communication protocols used in the Internet and similar computer networks according to functional criteria. The foundational protocols in the suite are ...
) but created a unique protocol inspired by Z39.50:1988.
History
The WAIS protocol and servers were promoted by
Thinking Machines Corporation (TMC) of
Cambridge, Massachusetts
Cambridge ( ) is a city in Middlesex County, Massachusetts, United States. It is a suburb in the Greater Boston metropolitan area, located directly across the Charles River from Boston. The city's population as of the 2020 United States census, ...
. TMC-produced WAIS servers ran on their massively parallel CM-2 (
Connection Machine) and
SPARC-based CM-5 MP
supercomputer
A supercomputer is a type of computer with a high level of performance as compared to a general-purpose computer. The performance of a supercomputer is commonly measured in floating-point operations per second (FLOPS) instead of million instruc ...
s. WAIS clients were developed for various
operating system
An operating system (OS) is system software that manages computer hardware and software resources, and provides common daemon (computing), services for computer programs.
Time-sharing operating systems scheduler (computing), schedule tasks for ...
s and
windowing systems including
Microsoft Windows
Windows is a Product lining, product line of Proprietary software, proprietary graphical user interface, graphical operating systems developed and marketed by Microsoft. It is grouped into families and subfamilies that cater to particular sec ...
,
Macintosh
Mac is a brand of personal computers designed and marketed by Apple Inc., Apple since 1984. The name is short for Macintosh (its official name until 1999), a reference to the McIntosh (apple), McIntosh apple. The current product lineup inclu ...
,
NeXT
NeXT, Inc. (later NeXT Computer, Inc. and NeXT Software, Inc.) was an American technology company headquartered in Redwood City, California that specialized in computer workstations for higher education and business markets, and later develope ...
,
X,
GNU Emacs
GNU Emacs is a text editor and suite of free software tools. Its development began in 1984 by GNU Project founder Richard Stallman, based on the Emacs editor developed for Unix operating systems. GNU Emacs has been a central component of the GNU ...
, and character terminals. TMC released a free
open source software version of WAIS for
Unix
Unix (, ; trademarked as UNIX) is a family of multitasking, multi-user computer operating systems that derive from the original AT&T Unix, whose development started in 1969 at the Bell Labs research center by Ken Thompson, Dennis Ritchie, a ...
in 1991.
Inspired by the WAIS project on full-text databases and emerging
SGML
The Standard Generalized Markup Language (SGML; International Organization for Standardization, ISO 8879:1986) is a standard for defining generalized markup languages for documents. ISO 8879 Annex A.1 states that generalized markup is "based on t ...
projects, Z39.50 version 2 (Z39.50:1992) was released. Unlike its 1988 predecessor, it was a compatible superset of the international ISO 10162/10163 standard.
With the advent of Z39.50:1992, the termination of support for free WAIS by Thinking Machines and the establishment of WAIS Inc as a commercial venture, the U.S.
National Science Foundation
The U.S. National Science Foundation (NSF) is an Independent agencies of the United States government#Examples of independent agencies, independent agency of the Federal government of the United States, United States federal government that su ...
funded the
Clearinghouse for Networked Information Discovery and Retrieval (CNIDR) to promote Internet search and discovery systems, open source and standards. CNIDR created a new, free open-source WAIS. This was the first freeWAIS based on the wais-8-b5 codebase of TMC, with a wholly new software suite
Isite based upon Z39.50:1992 using
Isearch as its full-text
search engine
A search engine is a software system that provides hyperlinks to web pages, and other relevant information on World Wide Web, the Web in response to a user's web query, query. The user enters a query in a web browser or a mobile app, and the sea ...
.
Ulrich Pfeifer and Norbert Gövert of the computer science department of the
University of Dortmund extended the CNIDR freeWAIS code to become freeWAIS-sf with structured fields as its main improvement. Ulrich Pfeifer rewrote freeWAIS-sf in
Perl
Perl is a high-level, general-purpose, interpreted, dynamic programming language. Though Perl is not officially an acronym, there are various backronyms in use, including "Practical Extraction and Reporting Language".
Perl was developed ...
, becoming WAIT.
Inspired by WAIS' "Directory of Servers",
Eliot Christian of
USGS
The United States Geological Survey (USGS), founded as the Geological Survey, is an government agency, agency of the United States Department of the Interior, U.S. Department of the Interior whose work spans the disciplines of biology, geograp ...
envisioned GILS:
Government Information Locator Service. GILS (based upon Z39.50:1992 with some extensions) became a U.S. Federal mandate as part of the
Paperwork Reduction Act of 1995 ().
Directory of Servers
Thinking Machines Corp provided a service called the Directory of Servers. It was a WAIS server like any other information source except containing information about the other WAIS servers on the Internet. A WAIS server with TMC WAIS code creates a special record containing
metadata
Metadata (or metainformation) is "data that provides information about other data", but not the content of the data itself, such as the text of a message or the image itself. There are many distinct types of metadata, including:
* Descriptive ...
plus some common words describing its indexed content. The record is uploaded to the central server and indexed along with the records from other public servers. The directory can be searched to find servers that might have content relevant to a specific field of interest. This model of searching for (WAIS) servers to search became the model for
GILS and Peter Deutsch's
WHOIS++ distributed white pages directory.
People
Two of the developers of WAIS,
Brewster Kahle and Harry Morris, left Thinking Machines to found WAIS Inc in Menlo Park, California, with
Bruce Gilliat. WAIS Inc. was originally a joint project between Apple Computer, Peat Marwick, Dow Jones, and Thinking Machines. In 1992, the presidential campaign of
Ross Perot used the WAIS product as a campaign wide information system, connecting the field offices to the national office. Later,
Perot Systems adopted WAIS to better access the information in its corporate databases. Other early clients were the
Environmental Protection Agency,
Library of Congress
The Library of Congress (LOC) is a research library in Washington, D.C., serving as the library and research service for the United States Congress and the ''de facto'' national library of the United States. It also administers Copyright law o ...
, and the
Department of Energy and later the ''
Wall Street Journal
''The Wall Street Journal'' (''WSJ''), also referred to simply as the ''Journal,'' is an American newspaper based in New York City. The newspaper provides extensive coverage of news, especially business and finance. It operates on a subscriptio ...
'' and ''
Encyclopædia Britannica
The is a general knowledge, general-knowledge English-language encyclopaedia. It has been published by Encyclopædia Britannica, Inc. since 1768, although the company has changed ownership seven times. The 2010 version of the 15th edition, ...
''.
WAIS Inc was sold to
AOL in May 1995 for $15 million. Following the sale, Margaret St. Pierre left WAIS Inc to start Blue Angel Technologies. Her WAIS variant formed the basis of MetaStar. Georgios Papadopoulos left to found
Atypon. François Schiettecatte left
Human Genome Project at
Johns Hopkins Hospital and started FS-Consult and developed his own variant of WAIS which eventually became ScienceServer, which was later sold to
Elsevier Science. Kahle and Gilliat went on to found the
Internet Archive
The Internet Archive is an American 501(c)(3) organization, non-profit organization founded in 1996 by Brewster Kahle that runs a digital library website, archive.org. It provides free access to collections of digitized media including web ...
and
Alexa Internet.
WAIS and Gopher
Public WAIS is often used as a full-text
search engine
A search engine is a software system that provides hyperlinks to web pages, and other relevant information on World Wide Web, the Web in response to a user's web query, query. The user enters a query in a web browser or a mobile app, and the sea ...
for individual
Internet Gopher servers, supplementing the popular
Veronica system which only searches the menu titles of Gopher sites. WAIS and Gopher share the
World Wide Web
The World Wide Web (WWW or simply the Web) is an information system that enables Content (media), content sharing over the Internet through user-friendly ways meant to appeal to users beyond Information technology, IT specialists and hobbyis ...
's client–server architecture and a certain amount of its functionality. The WAIS protocol is influenced largely by the z39.50 protocol designed for networking library catalogs. It allows a text-based search, and retrieval following a search. Gopher provides a free text search mechanism, but principally uses menus. A menu is a list of titles, from which the user may pick one. While
Gopher Space is a web containing many loops, the menu system gives the user the impression of a tree.
[Berners-Lee, Tim. "The World-Wide Web". ''The New Media Reader''. The MIT Press.]
The Web's data model is similar to the gopher model, except that menus are generalized to hypertext documents. In both cases, simple file servers generate the menus or hypertext directly from the file structure of a server. The Web's hypertext model permits the author more freedom to communicate the options available to the reader, as it can include headings and various forms of list structure.
References
External links
Wide Area Information Servers (WAIS) launch lecture(Xerox PARC, 1991)
RFC1625: WAIS over Z39.50-1988M. St. Pierre, J. Fullton, K. Gamiel, J. Goldman, B. Kahle, J. Kunze, H. Morris, F. Schiettecatte, June 1994
RFC 4156: The wais URI Scheme P. Hoffman, August 2005
Clifford A. Lynch, ''D-Lib'' Magazine, April 1997
*Usage of WHOIS through the ''swais'' command:
{{DEFAULTSORT:Wide Area Information Server
Internet Standards
Unix network-related software
Internet protocols
Internet search engines
Gopher (protocol)