HOME

TheInfoList



OR:

In programming, Libarc is a C++ library that accesses contents of GZIP compressed ARC files. These ARC files are generated by the
Internet Archive The Internet Archive is an American digital library with the stated mission of "universal access to all knowledge". It provides free public access to collections of digitized materials, including websites, software applications/games, music, ...
's Heritrix
web crawler A Web crawler, sometimes called a spider or spiderbot and often shortened to crawler, is an Internet bot that systematically browses the World Wide Web and that is typically operated by search engines for the purpose of Web indexing (''web spid ...
.


Overview

Libarc allows users to open and scan contents of GZIP compressed ARC Files. It also allows users to get an iterator that walks over the contents of said ARC files, member by member. Users are able to specify the media type in order to limit the types seen. This allows them to access information in the member’s URL record and response headers from
servers
and access to the member’s data in a single API call. Additionally to the API reference documentation there are two other sources: Programming with libarc - This describes the libarac API, and the license and copyright policies held by the Basis Technology Corp.


References

C++ libraries {{compu-library-stub