Authority file
   HOME

TheInfoList



OR:

In
information science Information science (also known as information studies) is an academic field which is primarily concerned with analysis, collection, classification, manipulation, storage, retrieval, movement, dissemination, and protection of information. ...
, authority control is a process that organizes information, for example in
library catalog A library catalog (or library catalogue in British English) is a register of all bibliographic items found in a library or group of libraries, such as a network of libraries at several locations. A catalog for a group of libraries is also ...
s, by using a single, distinct spelling of a name (heading) or a numeric identifier for each topic. The word ''authority'' in ''authority control'' derives from the idea that the names of people, places, things, and concepts are ''authorized,'' i.e., they are established in one particular form. Note: root words for both ''author'' and ''authority'' are words such as ''auctor'' or ''autor'' and ''autorite'' from the 13th century. These one-of-a-kind headings or identifiers are applied consistently throughout catalogs which make use of the respective authority file, and are applied for other methods of organizing data such as linkages and cross references. Each controlled entry is described in an authority ''record'' in terms of its scope and usage, and this organization helps the library staff maintain the catalog and make it user-friendly for researchers.
Cataloger In library and information science, cataloging ( US) or cataloguing ( UK) is the process of creating metadata representing information resources, such as books, sound recordings, moving images, etc. Cataloging provides information such as auth ...
s assign each subject—such as author, topic, series, or corporation—a particular unique identifier or heading term which is then used consistently, uniquely, and unambiguously for all references to that same subject, which removes variations from different spellings,
transliteration Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus ''trans-'' + '' liter-'') in predictable ways, such as Greek → , Cyrillic → , Greek → the digraph , Armenian → or L ...
s,
pen name A pen name, also called a ''nom de plume'' or a literary double, is a pseudonym (or, in some cases, a variant form of a real name) adopted by an author and printed on the title page or by-line of their works in place of their real name. A pen na ...
s, or
alias Alias may refer to: * Pseudonym * Pen name * Nickname Arts and entertainment Film and television * ''Alias'' (2013 film), a 2013 Canadian documentary film * ''Alias'' (TV series), an American action thriller series 2001–2006 * ''Alias the J ...
es. The unique header can guide users to all relevant information including related or collocated subjects. Authority records can be combined into a database and called an authority file, and maintaining and updating these files as well as "logical linkages" to other files within them is the work of librarians and other information catalogers. Accordingly, authority control is an example of
controlled vocabulary Control may refer to: Basic meanings Economics and business * Control (management), an element of management * Control, an element of management accounting * Comptroller (or controller), a senior financial officer in an organization * Control ...
and of
bibliographic control In library and information science, cataloging ( US) or cataloguing ( UK) is the process of creating metadata representing information resources, such as books, sound recordings, moving images, etc. Cataloging provides information such as aut ...
. While in theory any piece of information is amenable to authority control such as personal and corporate names,
uniform title A uniform title in library cataloging is a distinctive title assigned to a work which either has no title or has appeared under more than one title. Establishing a uniform title is an aspect of authority control. The phrases conventional title and ...
s, series names, and subjects, library catalogers typically focus on author names and titles of works. Subject headings from the
Library of Congress The Library of Congress (LOC) is the research library that officially serves the United States Congress and is the ''de facto'' national library of the United States. It is the oldest federal cultural institution in the country. The libra ...
fulfill a function similar to authority records, although they are usually considered separately. As time passes, information changes, prompting needs for reorganization. According to one view, authority control is not about creating a perfect seamless system but rather it is an ongoing effort to keep up with these changes and try to bring "structure and order" to the task of helping users find information.


Benefits of authority control

* ''Better researching''. Authority control helps researchers understand a specific subject with less wasted effort. A well-designed digital catalog/database enables a researcher to query a few words of an entry to bring up the already established term or phrase, thus improving accuracy and saving time. * ''Makes searching more predictable''. It can be used in conjunction with keyword searching using "and" or "not" or "or" or other Boolean operators on a web browser. It increases chances that a given search will return relevant items. * ''Consistency of records''. * ''Organization and structure of information''. * ''Efficiency for catalogers''. The process of authority control is not only of great help to researchers searching for a particular subject to study, but it can help catalogers organize information as well. Catalogers can use authority records when trying to categorize new items, since they can see which records have already been cataloged and can therefore avoid unnecessary work. * ''Maximizes library resources''. * ''Easier to maintain the catalog''. It enables catalogers to detect and correct errors. In some instances, software programs support workers tasked with maintaining the catalog to do ongoing tasks such as automated clean-up. It helps creators and users of metadata. * ''Fewer errors''. It can help catch errors caused by typos or misspellings which can sometimes accumulate over time, sometimes known as ''quality drift''. For example, machines can catch misspellings such as "Elementary school "teachers" and "Pumpkins" which can then be corrected by library staff.


Examples


Diverse names describe the same subject

Sometimes within a catalog, there are diverse names or spellings for only one person or subject. This variation may cause researchers to overlook relevant information. Authority control is used by catalogers to
collocate In corpus linguistics, a collocation is a series of words or terms that co-occur more often than would be expected by chance. In phraseology, a collocation is a type of compositional phraseme, meaning that it can be understood from the words t ...
materials that logically belong together but that present themselves differently. Records are used to establish
uniform title A uniform title in library cataloging is a distinctive title assigned to a work which either has no title or has appeared under more than one title. Establishing a uniform title is an aspect of authority control. The phrases conventional title and ...
s that collocate all versions of a given work under one unique heading even when such versions are issued under different titles. With authority control, one unique preferred name represents all variations and will include different variations, spellings and misspellings, uppercase versus lowercase variants, differing dates, and so forth. For example, in Wikipedia, the first wife of
Charles III Charles III (Charles Philip Arthur George; born 14 November 1948) is King of the United Kingdom and the 14 other Commonwealth realms. He was the longest-serving heir apparent and Prince of Wales and, at age 73, became the oldest person ...
is described by an article
Diana, Princess of Wales Diana, Princess of Wales (born Diana Frances Spencer; 1 July 1961 – 31 August 1997) was a member of the British royal family. She was the first wife of King Charles III (then Prince of Wales) and mother of Princes William and Harry. Her ac ...
as well as numerous other descriptors, e.g.
Princess Diana Diana, Princess of Wales (born Diana Frances Spencer; 1 July 1961 – 31 August 1997) was a member of the British royal family. She was the first wife of King Charles III (then Prince of Wales) and mother of Princes William and Harry. Her ac ...
, but both ''Princess Diana'' and ''Diana, Princess of Wales'' describe the same person; an authority record would choose one title as the preferred one for consistency. In an online library catalog, various entries might look like the following: #Diana. (1) #Diana, Princess of Wales. (1) #Diana, Princess of Wales, 1961–1997. (13) #Diana, Princess of Wales 1961–1997. (1) #Diana, Princess of Wales, 1961–1997. (2) #DIANA, PRINCESS OF WALES, 1961–1997. (1) These terms describe the same person. Accordingly, authority control reduces these entries to one unique entry or officially authorized heading, sometimes termed an ''access point'': Diana, Princess of Wales, 1961–1997. Generally, there are different authority file headings and identifiers used by different libraries in different countries, possibly inviting confusion, but there are different approaches internationally to try to lessen the confusion. One international effort to prevent such confusion is the
Virtual International Authority File The Virtual International Authority File (VIAF) is an international authority file. It is a joint project of several national libraries and operated by the Online Computer Library Center (OCLC).  History Discussion about having a common ...
which is a collaborative attempt to provide a single heading for a particular subject. It is a way to standardize information from different authority files around the world such as the
Integrated Authority File The (translated as ''Integrated Authority File'', also known as the ''Universal Authority File'') or GND is an international authority file for the organisation of personal names, subject headings and corporate bodies from catalogues. It is u ...
(GND) maintained and used cooperatively by many libraries in German-speaking countries and the United States
Library of Congress The Library of Congress (LOC) is the research library that officially serves the United States Congress and is the ''de facto'' national library of the United States. It is the oldest federal cultural institution in the country. The libra ...
. The idea is to create a single worldwide virtual authority file. For example, the ID for
Princess Diana Diana, Princess of Wales (born Diana Frances Spencer; 1 July 1961 – 31 August 1997) was a member of the British royal family. She was the first wife of King Charles III (then Prince of Wales) and mother of Princes William and Harry. Her ac ...
in the GND is ''118525123'' (preferred name: ''Diana < Wales, Prinzessin>'') while the United States Library of Congress uses the term ''Diana, Princess of Wales, 1961–1997''; other authority files have other choices. The Virtual International Authority File choice for all of these variations is ''VIAF ID: 107032638'' — that is, a common number representing all of these variations.
Virtual International Authority File The Virtual International Authority File (VIAF) is an international authority file. It is a joint project of several national libraries and operated by the Online Computer Library Center (OCLC).  History Discussion about having a common ...
. Records for Princess Diana, Retrieved on 12 March 2013
The English Wikipedia prefers the term "Diana, Princess of Wales", but at the bottom of the article about her, there are links to various international cataloging efforts for reference purposes.


Same name describes two different subjects

Sometimes two different authors have been published under the same name. This can happen if there is a title which is identical to another title or to a collective uniform title. This, too, can cause confusion. Different authors can be distinguished correctly from each other by, for example, adding a middle initial to one of the names; in addition, other information can be added to one entry to clarify the subject, such as birth year, death year, range of active years such as 1918–1965 when the person flourished, or a brief descriptive epithet. When catalogers come across different subjects with similar or identical headings, they can disambiguate them using authority control.


Authority records and files

A customary way of enforcing authority control in a bibliographic catalog is to set up a separate index of authority records, which relates to and governs the headings used in the main catalog. This separate index is often referred to as an "authority file." It contains an indexable record of all decisions made by catalogers in a given library (or—as is increasingly the case—cataloging consortium), which catalogers consult when making, or revising, decisions about headings. As a result, the records contain documentation about sources used to establish a particular preferred heading, and may contain information discovered while researching the heading which may be useful. While authority files provide information about a particular subject, their primary function is not to provide information but to organize it. They contain enough information to establish that a given author or title is unique, but that is all; irrelevant but interesting information is generally excluded. Although practices vary internationally, authority records in the English-speaking world generally contain the following information: * ''Headings'' show the preferred title chosen as the official and authorized version. It is important that the heading be unique; if there is a conflict with an identical heading, then one of the two will have to be chosen: * ''Cross references'' are other forms of the name or title that might appear in the catalog and include: #''see'' references are forms of the name or title that describe the subject but which have been passed over or ''deprecated'' in favor of the authorized heading form #''see also'' references point to other forms of the name or title that are also authorized. These ''see also'' references generally point to earlier or later forms of a name or title. * ''Statement(s) of justification'' is a brief account made by the cataloger about particular information sources used to determine both authorized and deprecated forms. Sometimes this means citing the title and publication date of the source, the location of the name or title on that source, and the form in which it appears on that source. For example, the Irish writer
Brian O'Nolan Brian O'Nolan ( ga, Brian Ó Nualláin; 5 October 1911 – 1 April 1966), better known by his pen name Flann O'Brien, was an Irish civil service official, novelist, playwright and satirist, who is now considered a major figure in twentieth c ...
, who lived from 1911 to 1966, wrote under many
pen name A pen name, also called a ''nom de plume'' or a literary double, is a pseudonym (or, in some cases, a variant form of a real name) adopted by an author and printed on the title page or by-line of their works in place of their real name. A pen na ...
s such as Flann O'Brien and Myles na Gopaleen. Catalogers at the United States Library of Congress chose one form—"O'Brien, Flann, 1911–1966"—as the official heading. The example contains all three elements of a valid authority record: the first heading ''O'Brien, Flann, 1911–1966'' is the form of the name that the
Library of Congress The Library of Congress (LOC) is the research library that officially serves the United States Congress and is the ''de facto'' national library of the United States. It is the oldest federal cultural institution in the country. The libra ...
chose as authoritative. In theory, every record in the catalog that represents a work by this author should have this form of the name as its author heading. What follows immediately below the heading beginning with ''Na Gopaleen, Myles, 1911–1966'' are the ''see'' references. These forms of the author's name will appear in the catalog, but only as transcriptions and not as headings. If a user queries the catalog under one of these variant forms of the author's name, he or she would receive the response: "See O'Brien, Flann, 1911–1966." There is an additional spelling variant of the Gopaleen name: "Na gCopaleen, Myles, 1911–1966" has an extra ''C'' inserted because the author also employed the non-anglicized Irish spelling of his pen-name, in which the capitalized ''C'' shows the correct root word while the preceding ''g'' indicates its pronunciation in context. So if a library user comes across this spelling variant, he or she will be led to the same author regardless. ''See also'' references, which point from one authorized heading to another authorized heading, are exceedingly rare for personal name authority records, although they often appear in name authority records for corporate bodies. The final four entries in this record beginning with ''His At Swim-Two-Birds ... 1939.'' constitute the justification for this particular form of the name: it appeared in this form on the 1939 edition of the author's novel ''At Swim-Two-Birds'', whereas the author's other ''noms de plume'' appeared on later publications.


Access control

The act of choosing a single authorized heading to represent all forms of a name is quite often a difficult and complex task, considering that any given individual may have legally changed their name or used a variety of legal names in the course of their lifetime, as well as a variety of nicknames, pen names, stage names or other alternative names. It may be particularly difficult to choose a single authorized heading for individuals whose various names have controversial political or social connotations, when the choice of authorized heading may be seen as endorsement of the associated political or social ideology. An alternative to using authorized headings is the idea of ''
access control In the fields of physical security and information security, access control (AC) is the selective restriction of access to a place or other resource, while access management describes the process. The act of ''accessing'' may mean consuming ...
,'' where various forms of a name are related without the endorsement of one particular form.


Cooperative cataloging

Before the advent of digital
online public access catalog The online public access catalog (OPAC), now frequently synonymous with ''library catalog'', is an online database of materials held by a library or group of libraries. Online catalogs have largely replaced the analog card catalogs previously u ...
s and the Internet, creating and maintaining a library's authority files were generally carried out by individual cataloging departments within each library. Naturally, then, there was considerable difference in the authority files of the different libraries. For the early part of library history, it was generally accepted that, as long as a library's catalog was internally consistent, the differences between catalogs in different libraries did not matter greatly. As libraries became more attuned to the needs of researchers and began interacting more with other libraries, the value of standard cataloging practices came to be recognized. With the advent of automated database technologies, catalogers began to establish cooperative consortia, such as
OCLC OCLC, Inc., doing business as OCLC, See also: is an American nonprofit cooperative organization "that provides shared technology services, original research, and community programs for its membership and the library community at large". It wa ...
and
RLIN The Research Libraries Group (RLG) was a U.S.-based library consortium that existed from 1974 until its merger with the OCLC library consortium in 2006. RLG developed the Eureka interlibrary search engine, the RedLightGreen database of bibliograp ...
in the
United States The United States of America (U.S.A. or USA), commonly known as the United States (U.S. or US) or America, is a country Continental United States, primarily located in North America. It consists of 50 U.S. state, states, a Washington, D.C., ...
, in which cataloging departments from libraries all over the world contributed their records to, and took their records from, a shared database. This development prompted the need for national standards for authority work. In the United States, the primary organization for maintaining cataloging standards with respect to authority work operates under the aegis of the
Library of Congress The Library of Congress (LOC) is the research library that officially serves the United States Congress and is the ''de facto'' national library of the United States. It is the oldest federal cultural institution in the country. The libra ...
Program for Cooperative Cataloging The Program for Cooperative Cataloging is a collaborative cataloging program. The formation of the Program for Cooperative Cataloging was catalyzed by an article by librarians Dorothy Gregor and Carol Mandel titled "Cataloging Must Change!", publ ...
, and is known as the
Name Authority Cooperative Program The Program for Cooperative Cataloging is a collaborative cataloging program. The formation of the Program for Cooperative Cataloging was catalyzed by an article by librarians Dorothy Gregor and Carol Mandel titled "Cataloging Must Change!", publ ...
, or NACO Authority.


Standards

There are various standards using different acronyms. Standards for authority metadata: *
MARC standards MARC (machine-readable cataloging) standards are a set of digital formats for the description of items catalogued by libraries, such as books, DVDs, and digital resources. Computerized library catalogs and library management software need to str ...
for authority records in machine-readable format. *
Metadata Authority Description Schema Metadata Authority Description Schema (MADS) is an XML schema developed by the United States Library of Congress' Network Development and Standards Office that provides an authority element set to complement the Metadata Object Description Schema ...
(MADS), an XML schema for an authority element set that may be used to provide metadata about agents (people, organizations), events, and terms (topics, geographics, genres, etc.). *
Encoded Archival Context Encoded Archival Context – Corporate bodies, Persons and Families (EAC-CPF) is an XML standard for encoding information about the creators of archival materials – i.e., a corporate body, person or family -- including their relationships to (a) r ...
, an XML schema for authority records conforming to ''ISAAR''. Standards for object identification, controlled by an identification-authority: *
Legal personality Legal capacity is a quality denoting either the legal aptitude of a person to have rights and liabilities (in this sense also called transaction capacity), or altogether the personhood itself in regard to an entity other than a natural pers ...
identification systems (person-IDs) and authorities: ** ''ISAAR'' (CPF) – International Standard Archival Authority Record for Corporate Bodies, Persons, and Families. Published by the International Council on Archives ** ''ISNI'' – International Standard Name Identifier *** ''ORCID'' – Open Researcher and Contributor ID, a subset of the ''ISNI'', to uniquely identify scientific and other academic authors. *** ''DAI'' – Digital Author Identification, another subset of ''ISNI''. ** ''GRID'' – Global Research Identifier Database ** ''GND'' – Integrated Authority File (''Gemeinsame Normdatei''), authority file for personal names, corporate bodies and subject headings. ** KANTO – National Agent Data (''finaf''), authority file for persons and corporate bodies. ** ''LCCN'' – Library of Congress Control Number ** ''NDL'' – National Diet Library ** ''VIAF'' – Virtual International Authority File, an aggregation of authority files currently focused on personal and corporate names. ** ''WorldCat/identities'' * Bibliographic object identification systems and authorities: ** ''DOI'' – Digital object identifier ** urn:lex, for law-document identifiers, controlled by local law authorities. ** ISBN – International Standard Book Number ** ''ISSN'' – International Standard Serial Number * Other identification systems (for generic named-entities) and authorities: **
GeoNames GeoNames (or GeoNames.org) is a user editable geographical database available and accessible through various web services, under a Creative Commons attribution license. The project was founded in late 2005. The GeoNames dataset differs fro ...
** ''TGN'' - Getty Thesaurus of Geographic Names Standards for identified-object metadata (examples):
vCard vCard, also known as VCF (Virtual Contact File), is a file format standard for electronic business cards. vCards can be attached to e-mail messages, sent via Multimedia Messaging Service (MMS), on the World Wide Web, instant messaging, NFC ...
,
Dublin Core 220px, Logo image of DCMI, which formulates Dublin Core The Dublin Core, also known as the Dublin Core Metadata Element Set (DCMES), is a set of fifteen "core" elements (properties) for describing resources. This fifteen-element Dublin Core has ...
, etc.


See also

*
Knowledge Organization Systems Knowledge Organization Systems (KOS), concept system or concept scheme is a generic term used in knowledge organization for authority files, classification schemes, thesauri, topic maps, ontologies and similar works. Despite their differences in typ ...
*
Library classification A library classification is a system of organization of knowledge by which library resources are arranged and ordered systematically. Library classifications are a notational system that represents the order of topics in the classification and al ...
systems: **
Dewey Decimal Classification The Dewey Decimal Classification (DDC), colloquially known as the Dewey Decimal System, is a proprietary library classification system which allows new books to be added to a library in their appropriate location based on subject. Section 4.1 ...
**
Library of Congress Classification The Library of Congress Classification (LCC) is a system of library classification developed by the Library of Congress in the United States, which can be used for shelving books in a library. LCC is mainly used by large research and academic libra ...
*
Ontology (information science) In computer science and information science, an ontology encompasses a representation, formal naming, and definition of the categories, properties, and relations between the concepts, data, and entities that substantiate one, many, or all domains ...
* Proprietary services **
ResearcherID ResearcherID is an identifying system for scientific authors. The system was introduced in January 2008 by Thomson Reuters Corporation. This unique identifier aims at solving the problem of author identification and correct attribution of work ...
* Registration authority * Simple Knowledge Organization System (SKOS)


References

{{Authority control Library cataloging and classification Metadata Information Information science Library science terminology