HOME

TheInfoList



OR:

Archie is a tool for indexing
FTP The File Transfer Protocol (FTP) is a standard communication protocol used for the transfer of computer files from a server to a client on a computer network. FTP is built on a client–server model architecture using separate control and dat ...
archives, allowing users to more easily identify specific files. It is considered the first
Internet The Internet (or internet) is the Global network, global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a internetworking, network of networks ...
search engine A search engine is a software system that provides hyperlinks to web pages, and other relevant information on World Wide Web, the Web in response to a user's web query, query. The user enters a query in a web browser or a mobile app, and the sea ...
. The original implementation was written in 1990 by Alan Emtage, then a postgraduate student at
McGill University McGill University (French: Université McGill) is an English-language public research university in Montreal, Quebec, Canada. Founded in 1821 by royal charter,Frost, Stanley Brice. ''McGill University, Vol. I. For the Advancement of Learning, ...
in
Montreal Montreal is the List of towns in Quebec, largest city in the Provinces and territories of Canada, province of Quebec, the List of the largest municipalities in Canada by population, second-largest in Canada, and the List of North American cit ...
,
Canada Canada is a country in North America. Its Provinces and territories of Canada, ten provinces and three territories extend from the Atlantic Ocean to the Pacific Ocean and northward into the Arctic Ocean, making it the world's List of coun ...
. Archie was superseded by other, more sophisticated search engines, including Jughead and Veronica, which were search engines for the
Gopher Pocket gophers, commonly referred to simply as gophers, are burrowing rodents of the family Geomyidae. The roughly 41 speciesSearch results for "Geomyidae" on thASM Mammal Diversity Database are all endemic to North and Central America. They ar ...
protocol. These were in turn superseded by directories like
Yahoo! Yahoo (, styled yahoo''!'' in its logo) is an American web portal that provides the search engine Yahoo Search and related services including My Yahoo, Yahoo Mail, Yahoo News, Yahoo Finance, Yahoo Sports, y!entertainment, yahoo!life, and its a ...
in 1995 and search engines like
Google Google LLC (, ) is an American multinational corporation and technology company focusing on online advertising, search engine technology, cloud computing, computer software, quantum computing, e-commerce, consumer electronics, and artificial ...
in 1998. Work on Archie ceased in the late 1990s. A legacy Archie server was maintained for historic purposes in Poland at Interdisciplinary Centre for Mathematical and Computational Modelling in the
University of Warsaw The University of Warsaw (, ) is a public university, public research university in Warsaw, Poland. Established on November 19, 1816, it is the largest institution of higher learning in the country, offering 37 different fields of study as well ...
until 2023. With assistance from the University of Warsaw, a new Archie server was created and opened for public access at The Serial Port, a web-based computer museum, on 11 May 2024.


Origin

Archie first appeared in 1986, while Emtage was the systems manager at the McGill University School of Computer Science. His predecessor had attempted to persuade the
institution An institution is a humanly devised structure of rules and norms that shape and constrain social behavior. All definitions of institutions generally entail that there is a level of persistence and continuity. Laws, rules, social conventions and ...
to connect to the
Internet The Internet (or internet) is the Global network, global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a internetworking, network of networks ...
, but due to the expensive cost — roughly $35,000 per year for a sluggish link to
Boston Boston is the capital and most populous city in the Commonwealth (U.S. state), Commonwealth of Massachusetts in the United States. The city serves as the cultural and Financial centre, financial center of New England, a region of the Northeas ...
— it had been challenging to persuade the appropriate parties that the
investment Investment is traditionally defined as the "commitment of resources into something expected to gain value over time". If an investment involves money, then it can be defined as a "commitment of money to receive more money later". From a broade ...
was worthwhile. The name derives from the word "archive" without the 'v'. Emtage has said that contrary to popular belief, there was no association with the
Archie Comics Archie Comic Publications, Inc. (often referred to simply as Archie Comics) is an American comic book publisher headquartered in the village of Pelham, New York. The company's many titles feature the fictional teenagers Archie Andrews, Jug ...
. Despite this, other early Internet search technologies such as Jughead and Veronica were named after characters from the comics. Anarchie, one of the earliest graphical FTP clients, was named for its ability to perform Archie searches.


Function

The earliest versions of Archie would simply search a list of public anonymous
File Transfer Protocol The File Transfer Protocol (FTP) is a standard communication protocol used for the transfer of computer files from a server to a client on a computer network. FTP is built on a client–server model architecture using separate control and d ...
(FTP) sites using the
Telnet Telnet (sometimes stylized TELNET) is a client-server application protocol that provides access to virtual terminals of remote systems on local area networks or the Internet. It is a protocol for bidirectional 8-bit communications. Its main ...
protocol and create index files available via FTP. To view the contents of a file, it had first to be downloaded. The indexes are updated on a regular basis (contacting each roughly once a month, so as not to waste too many resources of the remote servers) by requesting a listing. These listings were stored in local files to be searched using the
Unix Unix (, ; trademarked as UNIX) is a family of multitasking, multi-user computer operating systems that derive from the original AT&T Unix, whose development started in 1969 at the Bell Labs research center by Ken Thompson, Dennis Ritchie, a ...
command. The developers populated the engine's servers with databases of anonymous FTP host directories. This was used to find specific file titles since the list was plugged in to a searchable database of FTP sites. Archie did not recognize natural language requests nor index the content inside the files. Therefore, users had to know the title of the file they wanted. The ability to index the content inside the files was later introduced by
Gopher Pocket gophers, commonly referred to simply as gophers, are burrowing rodents of the family Geomyidae. The roughly 41 speciesSearch results for "Geomyidae" on thASM Mammal Diversity Database are all endemic to North and Central America. They ar ...
.


Development

Emtage and Heelan wrote a script allowing people to log in and search collected information using the
Telnet Telnet (sometimes stylized TELNET) is a client-server application protocol that provides access to virtual terminals of remote systems on local area networks or the Internet. It is a protocol for bidirectional 8-bit communications. Its main ...
protocol at the host "archie.mcgill.ca" 32.206.2.3 Later, more efficient front- and back-ends were developed, and the system spread from a local tool to a network-wide resource and a popular service available from multiple sites around the
Internet The Internet (or internet) is the Global network, global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a internetworking, network of networks ...
. The collected data would be exchanged between the neighbouring Archie servers. The servers could be accessed in multiple ways: using a local client (such as ''archie'' or ''xarchie'');
telnet Telnet (sometimes stylized TELNET) is a client-server application protocol that provides access to virtual terminals of remote systems on local area networks or the Internet. It is a protocol for bidirectional 8-bit communications. Its main ...
ting to a server directly; sending queries by
electronic mail Electronic mail (usually shortened to email; alternatively hyphenated e-mail) is a method of transmitting and receiving Digital media, digital messages using electronics, electronic devices over a computer network. It was conceived in the ...
; and later via a
World Wide Web The World Wide Web (WWW or simply the Web) is an information system that enables Content (media), content sharing over the Internet through user-friendly ways meant to appeal to users beyond Information technology, IT specialists and hobbyis ...
interface. At the peak of its popularity, the Archie search engine accounted for 50% of Montreal Internet traffic. In 1992, Emtage, along with J. Peter Deutsch and some financial help from McGill University, formed Bunyip Information Systems with a licensed commercial version of the Archie search engine used by millions of people worldwide. Heelan followed them into Bunyip soon after, where he together with Bibi Ali and Sandro Mazzucato significantly updated the Archie database and indexed web pages. Work on the search engine ceased in the late 1990s, and the company dissolved in 2003.


See also

* Alan Emtage * Jughead * Veronica * Wide area information server


References


Further reading

*Archie—A Darwinian Development Process. Peter Deutsch.
IEEE Internet Computing ''IEEE Internet Computing'' is a bimonthly peer-reviewed scientific journal published by the IEEE Computer Society. It covers all aspects of emerging and maturing Internet technologies. The editor-in-chief is Weisong Shi (University of Delaware). ...
, January/February 2000, 4(1):69-71. Part of Millennial Forecasts, . *P. Deutsch, A. Emtage, A. Marine
''How to Use Anonymous FTP''
(RFC1635, May 1994)


External links


Online instance of Archie


{{DEFAULTSORT:Archie Search Engine Internet Standards Unix Internet software Internet search engines History of the Internet