In
computer storage
Computer data storage is a technology consisting of computer components and recording media that are used to retain digital data. It is a core function and fundamental component of computers.
The central processing unit (CPU) of a comput ...
, a tape library is a physical area that holds
magnetic data tapes. In an earlier era, tape libraries were maintained by people known as
tape librarian
A computer operator is a role in IT which oversees the running of computer systems, ensuring that the machines, and computers are running properly. The job of a computer operator as defined by the United States Bureau of Labor Statistics is to ...
s and
computer operator
A computer operator is a role in IT which oversees the running of computer systems, ensuring that the machines, and computers are running properly. The job of a computer operator as defined by the United States Bureau of Labor Statistics is to ...
s and the proper operation of the library was crucial to the running of
batch processing
Computerized batch processing is a method of running software programs called jobs in batches automatically. While users are required to submit the jobs, no other interaction by the user is required to process the batch. Batches may automatically ...
jobs. Although tape libraries of this era were not automated, the use of
tape management system software could assist in running them.
Subsequently, tape libraries became physically automated, and as such are sometimes called a tape silo, tape robot, or tape jukebox. These are a storage devices that contain one or more
tape drive
A tape drive is a data storage device that reads and writes data on a magnetic tape. Magnetic tape data storage is typically used for offline, archival data storage. Tape media generally has a favorable unit cost and a long archival stability.
...
s, a number of slots to hold
tape cartridges, a
barcode
A barcode or bar code is a method of representing data in a visual, Machine-readable data, machine-readable form. Initially, barcodes represented data by varying the widths, spacings and sizes of parallel lines. These barcodes, now commonly refe ...
reader to identify tape cartridges, and an automated method for loading tapes (a robot). Such solutions are mostly used for
backups and for
digital archiving. Additionally, the area where tapes that are not currently in a silo are stored is also called a tape library. One of the earliest examples was the
IBM 3850
The IBM 3850 Mass Storage System was an online tape library used to hold large amounts of infrequently accessed data. It was one of the earliest examples of nearline storage.
History
Starting in the late-1960s IBM's lab in Boulder, Colorado ...
Mass Storage System (MSS), announced in 1974.
In either era, tape libraries can contain millions of tapes.
Manual era
Tapes and batch processing
In the
mainframe computer
A mainframe computer, informally called a mainframe or big iron, is a computer used primarily by large organizations for critical applications like bulk data processing for tasks such as censuses, industry and consumer statistics, enterpris ...
era, especially the
IBM mainframe
IBM mainframes are large computer systems produced by IBM since 1952. During the 1960s and 1970s, IBM dominated the large computer market. Current mainframe computers in IBM's line of business computers are developments of the basic design of t ...
, the most common format in use was the
9-track tape.
Some large application systems could require scores of different tapes as part of their batch job runs.
In the data processing applications of the era, the
master files for such things as employee payroll information, supplies and stores inventory, or customer accounts were typically kept on tape.
Batch jobs to update these master files would take the existing tape master file as input and write out a new tape master file as output.
In addition, the set of update transactions themselves might constitute a second input tape.
The master file output of one update job would then be the master file input to the next time the job is run, perhaps a day, a week, or a month later.
The tapes representing a few past iterations of a master file would typically be retained, in case a problem with the latest version were to be discovered and the job had to be rerun.
Role of tape libraries and librarians
Mainframe computer installations often had a separate room, the tape library, to house their racks and cabinets of tapes.
The typical workflow for running a batch job was to go into the library, pull certain tapes off the racks there and load them onto a rolling cart, move the cart into the computer area, mount the tapes onto tape drives for a production run, take the tapes off the drives when the run was over, move the cart back to the library, and put the tapes back on the library racks. Such tape libraries existed at most computer installations.
Even a modestly sized computer installation could have hundreds of tapes,
and library sizes of several thousand reels of tapes were commonplace.
And they could be much larger: by the mid-1970s, the
U.S. Census Bureau
The United States Census Bureau (USCB), officially the Bureau of the Census, is a principal agency of the U.S. Federal Statistical System, responsible for producing data about the American people and economy. The Census Bureau is part of the ...
and
NASA
The National Aeronautics and Space Administration (NASA ) is an independent agency of the US federal government responsible for the civil space program, aeronautics research, and space research.
NASA was established in 1958, succeedi ...
each had tape libraries with around one million tape reels in them.
The person in charge of all this was typically called the
tape librarian
A computer operator is a role in IT which oversees the running of computer systems, ensuring that the machines, and computers are running properly. The job of a computer operator as defined by the United States Bureau of Labor Statistics is to ...
.
In this era, there were no automated tape delivery and mounting systems, and so this action had to be done by
computer operator
A computer operator is a role in IT which oversees the running of computer systems, ensuring that the machines, and computers are running properly. The job of a computer operator as defined by the United States Bureau of Labor Statistics is to ...
s.
These people were the ones responsible for mounting tapes onto
tape drive
A tape drive is a data storage device that reads and writes data on a magnetic tape. Magnetic tape data storage is typically used for offline, archival data storage. Tape media generally has a favorable unit cost and a long archival stability.
...
s as part of running a job.
Even careful computer operators could sometimes mount the wrong tape as input to a job or present the reels of a multi-tape dataset out of order.
Overwriting a tape that was meant to be preserved was another potential mistake.
It was the tape librarian's responsibility to set up procedures for the handling of tapes to minimize the chances of errors taking place.
As one book of the era wrote, "keeping track of the whereabouts of the tapes is a formidable and responsible job."
Supporting software
Tape management systems of this era were software packages whose purpose was to help facilitate tape library operations and management. They kept track of data sets on tape, and produced reports indicating whether a data set should be retained on, or could be scratched from, a tape; they aided in the setup and running of scheduled production jobs, through such things as tape pull lists and pre-printed external gummed tape labels; and they kept track of the physical inventory of tape reels. The most popular of these packages was UCC-1 from
University Computing Company,
a product that was also known as the Tape Management System. It made several appearances on
Datapro Research Corporation's Software Honor Roll.
Another was Valu-Lib from
Value Computing, Inc.,
and a third was TLMS II from
Capex Corporation
Capex Corporation was an American computer software company in existence from 1969 through 1982 and based in Phoenix, Arizona. It made a variety of software products, mostly system utilities for the IBM mainframe platform, and was known for its O ...
.
As use of the mainframe continued on into the following century, tape library management, both manual and automatic, was one element of the offerings of the
Data Facility Storage Management Subsystem (MVS)
Data Facility Storage Management Subsystem (DFSMS) is a central component of IBM's flagship operating system z/OS. It includes access methods, utilities and program management functions.
''Data Facility Storage Management Subsystem'' is also a c ...
from IBM.
Automated era
Design
Physically automated tape library devices can store immense amounts of data, ranging from 20
terabyte
The byte is a unit of digital information that most commonly consists of eight bits. Historically, the byte was the number of bits used to encode a single character of text in a computer and for this reason it is the smallest addressable unit ...
s up to 2.1
exabyte
The byte is a unit of digital information that most commonly consists of eight bits. Historically, the byte was the number of bits used to encode a single character of text in a computer and for this reason it is the smallest addressable unit ...
s of data as of 2016. Such capacity is multiple thousand times that of a typical
hard drive
A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating platters coated with magne ...
and well in excess of what is capable with
network attached storage
Network-attached storage (NAS) is a file-level (as opposed to block-level storage) computer data storage server connected to a computer network providing data access to a heterogeneous group of clients. The term "NAS" can refer to both the techn ...
. Typical entry-level solutions cost around $10,000 USD, while high-end solutions can start at as much as $200,000 USD and cost well in excess of $1 million for a fully expanded and configured library.
For large data-storage, they are a cost-effective solution, with cost per gigabyte as low as 2 cents USD. The tradeoff for their larger capacity is their slower access time, which usually involves mechanical manipulation of tapes. Access to data in a library takes from several seconds to several minutes.
Because of their slow sequential access and huge capacity, tape libraries are primarily used for
backups and as the final stage of
digital archiving. A typical application of the latter would be an organization's extensive transaction record for legal or auditing purposes. Another example is
hierarchical storage management
Hierarchical storage management (HSM), also known as Tiered storage, is a data storage and Data management technique that automatically moves data between high-cost and low-cost storage media. HSM systems exist because high-speed storage devices, ...
(HSM), in which tape library is used to hold rarely used files from
file system
In computing, file system or filesystem (often abbreviated to fs) is a method and data structure that the operating system uses to control how data is stored and retrieved. Without a file system, data placed in a storage medium would be one lar ...
s.
Software support
There are several large-scale library-management packages available commercially. Open-source implementations include
AMANDA
Amanda is a Latin feminine gerundive (i.e. verbal adjective) name meaning, literally, “she who must (or is fit to) be loved”. Other translations, with similar meaning, could be "deserving to be loved," "worthy of love," or "loved very much b ...
,
Bacula
Bacula is an open-source, enterprise-level computer backup system for heterogeneous networks. It is designed to automate backup tasks that had often required intervention from a systems administrator or computer operator.
Bacula supports Li ...
, and the minimal mtx program.
Barcode labels
Tape libraries commonly have the capability of optically scanning
barcode
A barcode or bar code is a method of representing data in a visual, Machine-readable data, machine-readable form. Initially, barcodes represented data by varying the widths, spacings and sizes of parallel lines. These barcodes, now commonly refe ...
labels which are attached to each tape, allowing them to automatically maintain an inventory of which tapes are where within the library. Preprinted barcode labels are commercially available or custom labels may be generated using commercial or free software. The barcode label is frequently part of the
tape label Tape labels are identifiers given to volumes of magnetic tape.
There are two kinds of tape labels. The first is a label applied to the exterior of tape cartridge or reel. The second is data recorded on the tape itself.
Visual labels
Visual label ...
, information recorded at the beginning of the medium to uniquely identify the tape.
Autoloaders

Smaller tape libraries with only one drive are known as autoloaders. The term ''autoloader'' is also sometimes used synonymously with ''stacker'', a device in which the media are loaded necessarily in a sequential manner.
Other types of autoloaders may operate with
optical disc
In computing and optical disc recording technologies, an optical disc (OD) is a flat, usually circular disc that encodes binary data (bits) in the form of pits and lands on a special material, often aluminum, on one of its flat surfaces. ...
s (such as
compact disc
The compact disc (CD) is a digital optical disc data storage format that was co-developed by Philips and Sony to store and play digital audio recordings. In August 1982, the first compact disc was manufactured. It was then released in Octo ...
s or
DVD
The DVD (common abbreviation for Digital Video Disc or Digital Versatile Disc) is a digital optical disc data storage format. It was invented and developed in 1995 and first released on November 1, 1996, in Japan. The medium can store any kin ...
s) or
floppy disk
A floppy disk or floppy diskette (casually referred to as a floppy, or a diskette) is an obsolescent type of disk storage composed of a thin and flexible disk of a magnetic storage medium in a square or nearly square plastic enclosure lined ...
s.
See also
*
Optical jukebox
An optical jukebox is a robotic data storage device that can automatically load and unload optical discs, such as Compact Disc, DVD, Ultra Density Optical or Blu-ray and can provide terabytes (TB) or petabytes (PB) of tertiary storage. The devic ...
References
{{reflist
Library
A library is a collection of materials, books or media that are accessible for use and not just for display purposes. A library provides physical (hard copies) or digital access (soft copies) materials, and may be a physical location or a vi ...