HOME

TheInfoList



OR:

WebCite is an intermittently available archive site, originally designed to digitally preserve scientific and educationally important material on the web by taking snapshots of Internet contents as they existed at the time when a blogger or a scholar cited or quoted from it. The preservation service enabled verifiability of claims supported by the cited sources even when the original web pages are being revised, removed, or disappear for other reasons, an effect known as
link rot Link rot (also called link death, link breaking, or reference rot) is the phenomenon of hyperlinks tending over time to cease to point to their originally targeted file, web page, or server due to that resource being relocated to a new address ...
. As of June 2023, the site no longer accepts new archive requests; old archive snapshots can still be viewed. The site is frequently offline with no explanation, and for lengthy periods of time. For example it was offline between October 29, 2021 and June 24, 2023 (1 year and 8 months) during which it reported "DB Connection failed". The site is owned and maintained by
Gunther Eysenbach Gunther Eysenbach is a German-Canadian researcher on healthcare, especially health policy, eHealth, and health informatics, consumer health informatics. Career Eysenbach was born on 22 March 1967 in West Berlin, West Germany. While a Medical sch ...
.


Service features

WebCite allowed for preservation of all types of web content, including
HTML Hypertext Markup Language (HTML) is the standard markup language for documents designed to be displayed in a web browser. It defines the content and structure of web content. It is often assisted by technologies such as Cascading Style Sheets ( ...
web pages,
PDF Portable document format (PDF), standardized as ISO 32000, is a file format developed by Adobe Inc., Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, computer hardware, ...
files, style sheets,
JavaScript JavaScript (), often abbreviated as JS, is a programming language and core technology of the World Wide Web, alongside HTML and CSS. Ninety-nine percent of websites use JavaScript on the client side for webpage behavior. Web browsers have ...
and
digital image A digital image is an image composed of picture elements, also known as pixels, each with '' finite'', '' discrete quantities'' of numeric representation for its intensity or gray level that is an output from its two-dimensional functions f ...
s. It also archived
metadata Metadata (or metainformation) is "data that provides information about other data", but not the content of the data itself, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive ...
about the collected resources such as access time, MIME type, and content length. WebCite was a non-profit
consortium A consortium () is an association of two or more individuals, companies, organizations, or governments (or any combination of these entities) with the objective of participating in a common activity or pooling their resources for achieving a ...
supported by publishers and editors, and it could be used by individuals without charge. It was one of the first services to offer on-demand archiving of pages, a feature later adopted by many other archiving services, such as archive.today and the
Wayback Machine The Wayback Machine is a digital archive of the World Wide Web founded by Internet Archive, an American nonprofit organization based in San Francisco, California. Launched for public access in 2001, the service allows users to go "back in ...
. It did not do web page crawling.


History

Conceived in 1997 by
Gunther Eysenbach Gunther Eysenbach is a German-Canadian researcher on healthcare, especially health policy, eHealth, and health informatics, consumer health informatics. Career Eysenbach was born on 22 March 1967 in West Berlin, West Germany. While a Medical sch ...
, WebCite was publicly described the following year when an article on Internet
quality control Quality control (QC) is a process by which entities review the quality of all factors involved in production. ISO 9000 defines quality control as "a part of quality management focused on fulfilling quality requirements". This approach plac ...
declared that such a service could also measure the
citation impact Citation impact or citation rate is a measure of how many times an academic journal article or book or author is cited by other articles, books or authors. Citation counts are interpreted as measures of the impact or influence of academic work a ...
of web pages. In the next year, a pilot service was set up at the address webcite.net. Although it seemed that the need for WebCite decreased when Google's ''short term'' copies of web pages began to be offered by Google Cache and the
Internet Archive The Internet Archive is an American 501(c)(3) organization, non-profit organization founded in 1996 by Brewster Kahle that runs a digital library website, archive.org. It provides free access to collections of digitized media including web ...
expanded their crawling (which started in 1996), WebCite was the only one allowing "on-demand" archiving by users. WebCite also offered interfaces to scholarly journals and publishers to automate the archiving of cited links. By 2008, over 200 journals had begun routinely using WebCite. WebCite was formerly a member of the International Internet Preservation Consortium. In response a 2012 message on Twitter relating to WebCite's former membership of the consortium, Eysenbach commented that "WebCite has no funding, and IIPC charges €4000 per year in annual membership fees." WebCite "feeds its content" to other
digital preservation In library science, library and archival science, digital preservation is a formal process to ensure that digital information of continuing value remains accessible and usable in the long term. It involves planning, resource allocation, and appli ...
projects, including the
Internet Archive The Internet Archive is an American 501(c)(3) organization, non-profit organization founded in 1996 by Brewster Kahle that runs a digital library website, archive.org. It provides free access to collections of digitized media including web ...
.
Lawrence Lessig Lester Lawrence "Larry" Lessig III (born June 3, 1961) is an American legal scholar and political activist. He is the Roy L. Furman Professor of Law at Harvard Law School and the former director of the Edmond J. Safra Center for Ethics at Harvar ...
, an American academic who writes extensively on copyright and technology, used WebCite in his ''amicus'' brief in the
Supreme Court of the United States The Supreme Court of the United States (SCOTUS) is the highest court in the federal judiciary of the United States. It has ultimate appellate jurisdiction over all Federal tribunals in the United States, U.S. federal court cases, and over Stat ...
case of '' MGM Studios, Inc. v. Grokster, Ltd.'' Sometime between July 9 and 17, 2019, WebCite stopped accepting new archiving requests. In a further outage, between about October 29, 2021 and June 24, 2023, no archived content was available, only the main page worked.


Fundraising

WebCite ran a fund-raising campaign using FundRazr from January 2013 with a target of $22,500, a sum which its operators stated was needed to maintain and modernize the service beyond the end of 2013. This includes relocating the service to Amazon EC2 cloud hosting and legal support. it remained undecided whether WebCite would continue as a non-profit or as a for-profit entity.


Business model

The term "WebCite" is a registered trademark. WebCite did not charge individual users, journal editors and publishers any fee to use their service. WebCite earned revenue from publishers who wanted to "have their publications analyzed and cited webreferences archived". Early support was from the
University of Toronto The University of Toronto (UToronto or U of T) is a public university, public research university whose main campus is located on the grounds that surround Queen's Park (Toronto), Queen's Park in Toronto, Ontario, Canada. It was founded by ...
.


Copyright issues

WebCite maintained the legal position that its archiving activities are allowed by the copyright doctrines of
fair use Fair use is a Legal doctrine, doctrine in United States law that permits limited use of copyrighted material without having to first acquire permission from the copyright holder. Fair use is one of the limitations to copyright intended to bal ...
and implied license. To support the fair use argument, WebCite noted that its archived copies are transformative, socially valuable for academic research, and not harmful to the market value of any copyrighted work. WebCite argued that caching and archiving web pages was not considered a copyright infringement when the archiver offers the copyright owner an opportunity to "opt-out" of the archive system, thus creating an implied license. To that end, WebCite would not archive in violation of Web site "do-not-cache" and "no-archive"
metadata Metadata (or metainformation) is "data that provides information about other data", but not the content of the data itself, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive ...
, as well as robot exclusion standards, the absence of which creates an " implied license" for web archive services to preserve the content. In a similar case involving Google's web caching activities, on January 19, 2006, the United States District Court for the District of Nevada agreed with that argument in the case of '' Field v. Google'' (CV-S-04-0413-RCJ-LRL), holding that fair use and an "implied license" meant that Google's caching of Web pages did not constitute copyright violation. The "implied license" referred to general Internet standards.


DMCA requests

According to their policy, after receiving legitimate
DMCA The Digital Millennium Copyright Act (DMCA) is a 1998 United States copyright law that implements two 1996 treaties of the World Intellectual Property Organization (WIPO). It criminalizes production and dissemination of technology, devices, or ...
requests from the copyright holders, WebCite would remove saved pages from public access, as the archived pages are still under the safe harbor of being citations. The pages were removed to a "dark archive" and in cases of legal controversies or evidence requests, there was pay-per-view access of "$200 (up to 5 snapshots) plus $100 for each further 10 snapshots" to the copyrighted content.


See also

* Archive.today * Perma.cc *
Wayback Machine The Wayback Machine is a digital archive of the World Wide Web founded by Internet Archive, an American nonprofit organization based in San Francisco, California. Launched for public access in 2001, the service allows users to go "back in ...


References


External links

* {{DEFAULTSORT:Webcite Internet properties established in 2004 Organizations established in 2003 Web archiving initiatives