Xiph.Org libVorbis I 20120203 (Omnipresent)
   HOME

TheInfoList



OR:

Vorbis is a free and open-source software project headed by the Xiph.Org Foundation. The project produces an
audio coding format An audio coding format (or sometimes audio compression format) is a content representation format for storage or transmission of digital audio (such as in digital television, digital radio and in audio and video files). Examples of audio coding ...
and software reference encoder/decoder (
codec A codec is a device or computer program that encodes or decodes a data stream or signal. ''Codec'' is a portmanteau of coder/decoder. In electronic communications, an endec is a device that acts as both an encoder and a decoder on a signal or ...
) for
lossy In information technology, lossy compression or irreversible compression is the class of data compression methods that uses inexact approximations and partial data discarding to represent the content. These techniques are used to reduce data size ...
audio compression. Vorbis is most commonly used in conjunction with the
Ogg Ogg is a free, open container format maintained by the Xiph.Org Foundation. The authors of the Ogg format state that it is unrestricted by software patents and is designed to provide for efficient streaming and manipulation of high-quality di ...
container format and it is therefore often referred to as Ogg Vorbis. Vorbis is a continuation of audio compression development started in 1993 by
Chris Montgomery Christopher "Monty" Montgomery (born June 6, 1972) is an American programmer. He is the original creator of the Ogg Free Software container format and the Vorbis audio codec and others, and the founder of The Xiph.Org Foundation, which promot ...
. Intensive development began following a September 1998 letter from the
Fraunhofer Society The Fraunhofer Society (german: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V., lit=Fraunhofer Society for the Advancement of Applied Research) is a German research organization with 76institutes spread throughout Germany ...
announcing plans to charge licensing fees for the
MP3 MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany, with support from other digital scientists in the United States and elsewhere. Origin ...
audio format. The Vorbis project started as part of the Xiphophorus company's Ogg project (also known as OggSquish multimedia project). Chris Montgomery began work on the project and was assisted by a growing number of other developers. They continued refining the
source code In computing, source code, or simply code, is any collection of code, with or without comments, written using a human-readable programming language, usually as plain text. The source code of a program is specially designed to facilitate the w ...
until the Vorbis file format was frozen for 1.0 in May 2000. Originally licensed as LGPL, in 2001 the Vorbis license was changed to the
BSD license BSD licenses are a family of permissive free software licenses, imposing minimal restrictions on the use and distribution of covered software. This is in contrast to copyleft licenses, which have share-alike requirements. The original BSD lice ...
to encourage adoption, with the endorsement of
Richard Stallman Richard Matthew Stallman (; born March 16, 1953), also known by his initials, rms, is an American free software movement activist and programmer. He campaigns for software to be distributed in such a manner that its users have the freedom to ...
. A stable version (1.0) of the reference software was released on July 19, 2002. The Xiph.Org Foundation maintains a
reference implementation In the software development process, a reference implementation (or, less frequently, sample implementation or model implementation) is a program that implements all requirements from a corresponding specification. The reference implementation o ...
, libvorbis. There are also some fine-tuned forks, most notably aoTuV, that offer better audio quality, particularly at low bitrates. aoTuV improvements were intended to be merged back into the reference codebase periodically, but that only happened once in libvorbis 1.1, due to the lack of upstream resources for reviewing further patches. Since February 2013, Xiph.Org has stated that the use of Vorbis should be deprecated in favor of the
Opus ''Opus'' (pl. ''opera'') is a Latin word meaning "work". Italian equivalents are ''opera'' (singular) and ''opere'' (pl.). Opus or OPUS may refer to: Arts and entertainment Music * Opus number, (abbr. Op.) specifying order of (usually) publicatio ...
codec, which is also a Xiph.Org Foundation project and also free and open-source. Compared to Vorbis, Opus can simultaneously achieve higher compression efficiency—per both Xiph.Org itself and third-party listening tests—and lower encode/decode latency (in most cases, low enough for real-time applications such as
internet telephony Voice over Internet Protocol (VoIP), also called IP telephony, is a method and group of technologies for the delivery of voice communications and multimedia sessions over Internet Protocol (IP) networks, such as the Internet. The terms Internet t ...
or live singing, rarely possible with Vorbis).


Name

Vorbis is named after a ''
Discworld ''Discworld'' is a comic fantasy"Humorous Fantasy" in David Pringle, ed., ''The Ultimate Encyclopedia of Fantasy'' (pp.31-33). London, Carlton,2006. book series written by the English author Terry Pratchett, set on the Discworld, a flat ...
'' character Exquisitor Vorbis in ''
Small Gods ''Small Gods'' is the thirteenth of Terry Pratchett's ''Discworld'' novels, published in 1992. It tells the origin of the god Om, and his relations with his prophet, the reformer Brutha. In the process, it satirises religious institutions, peop ...
'' by
Sir Terry Pratchett Sir Terence David John Pratchett (28 April 1948 – 12 March 2015) was an English humourist, satirist, and author of fantasy novels, especially comical works. He is best known for his ''Discworld'' series of 41 novels. Pratchett's first nove ...
. The
Ogg Ogg is a free, open container format maintained by the Xiph.Org Foundation. The authors of the Ogg format state that it is unrestricted by software patents and is designed to provide for efficient streaming and manipulation of high-quality di ...
format, however, is ''not'' named after
Nanny Ogg Gytha Ogg (usually called Nanny Ogg) is a character from Terry Pratchett's '' Discworld'' series. She is a witch and member of the Lancre coven. Personality The character of Nanny Ogg is based on the Mother stereotype of the Triple Goddess ...
, another Discworld character; the name is in fact derived from ''ogging'',
jargon Jargon is the specialized terminology associated with a particular field or area of activity. Jargon is normally employed in a particular communicative context and may not be well understood outside that context. The context is usually a partic ...
that arose in the computer game
Netrek ''Netrek'' is an Internet game for up to 16 players, written almost entirely in cross-platform open-source software. It combines features of multi-directional shooters and team-based real-time strategy games. Players attempt to disable or destroy ...
.


Usage

Vorbis faces competition from other audio formats, such as MP3. Though Vorbis is technically superior (addressing many of the limitations inherent to the MP3 design), MP3 has a far higher public profile. Because Vorbis does not have financial support from large organisations, support for the format is not as widespread, though programs such as Audacity can convert to more popular formats, and support in games has gradually improved. The Vorbis format has proven popular among supporters of
free software Free software or libre software is computer software distributed under terms that allow users to run the software for any purpose as well as to study, change, and distribute it and any adapted versions. Free software is a matter of liberty, no ...
. They argue that its higher fidelity and completely free nature, unencumbered by patents, make it a well-suited replacement for patented and restricted formats. Vorbis has different uses for consumer products. Many
video game Video games, also known as computer games, are electronic games that involves interaction with a user interface or input device such as a joystick, controller, keyboard, or motion sensing device to generate visual feedback. This fee ...
s store in-game audio as Vorbis, including '' Amnesia: The Dark Descent'', '' Grand Theft Auto: San Andreas'', '' Halo: Combat Evolved'', ''
Minecraft ''Minecraft'' is a sandbox game developed by Mojang Studios. The game was created by Markus "Notch" Persson in the Java (programming language), Java programming language. Following several early private testing versions, it was first made pub ...
'', and ''
World of Warcraft ''World of Warcraft'' (''WoW'') is a massively multiplayer online role-playing game (MMORPG) released in 2004 by Blizzard Entertainment. Set in the ''Warcraft'' fantasy universe, ''World of Warcraft'' takes place within the world of Azeroth ...
'', among others. Popular software players support Vorbis playback either natively or through an external plugin. A number of websites, including
Wikipedia Wikipedia is a multilingual free online encyclopedia written and maintained by a community of volunteers, known as Wikipedians, through open collaboration and using a wiki-based editing system. Wikipedia is the largest and most-read refer ...
, use it. Others include
Jamendo Jamendo is a Luxembourg-based music website and an open community of independent artists and music lovers. A subsidiary of Belgian company AudioValley, and Independent Management Entity (IME) since 2019. Originally, Jamendo was a music platform ...
and Mindawn, as well as several national radio stations like JazzRadio, Absolute Radio,
NPR National Public Radio (NPR, stylized in all lowercase) is an American privately and state funded nonprofit media organization headquartered in Washington, D.C., with its NPR West headquarters in Culver City, California. It differs from other ...
, Radio New Zealand and Deutschlandradio. The
Spotify Spotify (; ) is a proprietary Swedish audio streaming and media services provider founded on 23 April 2006 by Daniel Ek and Martin Lorentzon. It is one of the largest music streaming service providers, with over 456 million monthly active us ...
audio streaming service has used Vorbis in a basket of codecs for its audio streams, but now has also been using AAC profiles. Also, the French music site Qobuz offers its customers the possibility to download their purchased songs in Vorbis format, as does the American music site
Bandcamp Bandcamp is an American online audio distribution platform founded in 2007 by Oddpost co-founder Ethan Diamond and programmers Shawn Grunberger, Joe Holt and Neal Tucker, with headquarters in Oakland, California, US. On March 2, 2022, Bandcamp ...
.


Quality

Listening tests conducted through 2014 showed Vorbis performed significantly better than many other lossy audio formats in that it produced smaller files at equivalent or higher quality while retaining computational complexity comparable to other
MDCT The modified discrete cosine transform (MDCT) is a transform based on the type-IV discrete cosine transform (DCT-IV), with the additional property of being lapped: it is designed to be performed on consecutive blocks of a larger dataset, where ...
formats such as AAC and
Windows Media Audio Windows Media Audio (WMA) is a series of audio codecs and their corresponding audio coding formats developed by Microsoft. It is a proprietary technology that forms part of the Windows Media framework. WMA consists of four distinct codecs. The ...
. Listening tests have attempted to find the best-quality lossy audio codecs at particular
bitrate In telecommunications and computing, bit rate (bitrate or as a variable ''R'') is the number of bits that are conveyed or processed per unit of time. The bit rate is expressed in the unit bit per second (symbol: bit/s), often in conjunction ...
s. Some conclusions made by listening tests: * Low bitrate (less than 64 kbit/s): the most recent (2007) public multiformat test at 48 kbit/s showed that aoTuV Vorbis had a better quality than WMA and
LC-AAC Advanced Audio Coding (AAC) is an audio coding standard for lossy digital audio compression. Designed to be the successor of the MP3 format, AAC generally achieves higher sound quality than MP3 encoders at the same bit rate. AAC has been stand ...
, the same quality as WMA Professional, and a lower quality than HE-AAC. * Mid to low bitrates (less than 128 kbit/s down to 64 kbit/s): private tests in 2005 at 80 kbit/s and 96 kbit/s showed that aoTuV Vorbis had a better quality than other lossy audio formats (LC-AAC, HE-AAC,
MP3 MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany, with support from other digital scientists in the United States and elsewhere. Origin ...
, MPC, WMA). * High bitrates (greater than 128 kbit/s): most people do not hear significant differences. However, trained listeners can often hear significant differences between codecs at identical bitrates, and aoTuV Vorbis performed better than LC-AAC, MP3, and MPC. Due to the ever-evolving nature of audio codecs, the results of many of these tests have become outdated.


Listening tests

Listening tests are normally carried out as
ABX test An ABX test is a method of comparing two choices of sensory stimuli to identify detectable differences between them. A subject is presented with two known samples (sample , the first reference, and sample , the second reference) followed by one unkn ...
s, i.e., the listener has to identify an unknown sample X as being A or B, with A (the original) and B (the encoded version) available for reference. The outcome of a test must be statistically significant. This setup ensures that the listener is not biased by their expectations and that the outcome is very unlikely to be the result of chance. If sample X can be identified reliably, the listener can assign a score as a subjective judgment of the quality. Otherwise, the encoded version is considered to be ''transparent''. Below are links to several listening test results. ; 2005, July comparison: AAC vs MP3 vs Vorbis vs WMA at 80 kbit/s. States that Vorbis aoTuV beta 4 is the best encoder for either classical or various music in this bitrate, and that its quality is comparable to the
LAME Lame or LAME may refer to: Music * "Lame" (song) by Unwritten Law * ''Lame'' (album) by Iame People * Ibrahim Lame (born 1953), Nigerian educator and politician * Jennifer Lame (), American film editor * Quintín Lame (1880–1967), Colombian ...
ABR MP3 at 128 kbit/s. ; 2005, August comparison: AAC vs MP3 vs Vorbis vs WMA at 96 kbit/s. States that Vorbis aoTuV beta 4 and AAC are tied as the best encoders for classical music in this bitrate, while aoTuV beta 4 is the best encoder for pop music, even better than LAME at 128 kbit/s. ; 2005, August comparison: MPC vs Vorbis vs MP3 vs AAC at 180 kbit/s. An
audiophile An audiophile is a person who is enthusiastic about high-fidelity sound reproduction. An audiophile seeks to reproduce the sound of a piece of recorded music or a live musical performance, typically inside closed headphones, In-ear monitors, open ...
listening test, which states that, for classical music, Vorbis aoTuV beta 4 has 93% percent probability of being the best encoder, tied with MPC. MPC is tied with both Vorbis, in the first place, and LAME in the second. ; 2011, April comparison by Hydrogenaudio: Vorbis vs HE-AAC vs
Opus ''Opus'' (pl. ''opera'') is a Latin word meaning "work". Italian equivalents are ''opera'' (singular) and ''opere'' (pl.). Opus or OPUS may refer to: Arts and entertainment Music * Opus number, (abbr. Op.) specifying order of (usually) publicatio ...
at 64 kbit/s. Vorbis was on average between the LC-AAC low anchor and Nero HE-AAC, while the upcoming
Opus ''Opus'' (pl. ''opera'') is a Latin word meaning "work". Italian equivalents are ''opera'' (singular) and ''opere'' (pl.). Opus or OPUS may refer to: Arts and entertainment Music * Opus number, (abbr. Op.) specifying order of (usually) publicatio ...
(by
Xiph Xiph.Org Foundation is a nonprofit organization that produces free multimedia formats and software tools. It focuses on the Ogg family of formats, the most successful of which has been Vorbis, an open and freely licensed audio format and codec ...
) was best.


Characteristic artifacts

As with most modern formats, the most consistently cited problem with Vorbis is
pre-echo In audio signal processing, pre-echo, sometimes called a '' forward echo'', (not to be confused with reverse echo) is a digital audio compression artifact where a sound is heard before it occurs (hence the name). It is most noticeable in impulsiv ...
, a faint copy of a sharp attack that occurs just before the actual sound (for example, when reproducing the sound of
castanets Castanets, also known as ''clackers'' or ''palillos'', are a percussion instrument (idiophone), used in Spanish, Kalo, Moorish, Ottoman, Italian, Sephardic, Swiss, and Portuguese music. In ancient Greece and ancient Rome there was a simil ...
). When the bitrate is too low to encode the audio without perceptible loss, Vorbis exhibits an analog noise-like failure mode, which can be described as reverberations in a room or amphitheater. Vorbis's behavior is due to the
noise floor In signal theory, the noise floor is the measure of the signal created from the sum of all the noise sources and unwanted signals within a measurement system, where noise is defined as any signal other than the one being monitored. In radio com ...
approach to encoding; see .


Technical details

Vorbis is intended for sample rates from 8 kHz telephony to 192 kHz digital masters and a range of channel representations (monaural, polyphonic, stereo, quadraphonic, 5.1, ambisonic, or up to 255 discrete channels). Given 44.1 kHz (standard CD audio
sampling frequency In signal processing, sampling is the reduction of a continuous-time signal to a discrete-time signal. A common example is the conversion of a sound wave to a sequence of "samples". A sample is a value of the signal at a point in time and/or s ...
) stereo input, the encoder will produce output from roughly 45 to 500 kbit/s (32 to 500 kbit/s for aoTuV tunings) depending on the specified quality setting. Quality setting goes from −0.1 to 1.0 for the Xiph library and −0.2 to 1.0 for aoTuV. Encoding front-ends map these values to an integer-based quality setting that goes from −1 to 10 for the Xiph library and −2 to 10 for aoTuV. Files encoded with a given quality setting should have the same quality of sound in all versions of the encoder, but newer versions should be able to achieve that quality with a lower bitrate. The bit rates mentioned above are only approximate; Vorbis is inherently variable-bitrate (VBR), so bitrate may vary considerably from sample to sample. (It is a free-form variable-bitrate codec and packets have no minimum size, maximum size, or fixed/expected size.) Vorbis aims to be more efficient than MP3, with '' data compression transparency'' being available at lower bitrates.


Outline of coder algorithm

Vorbis I is a forward-adaptive monolithic transform codec based on the modified discrete cosine transform (MDCT). Vorbis uses the modified discrete cosine transform for converting sound data from the
time domain Time domain refers to the analysis of mathematical functions, physical signals or time series of economic or environmental data, with respect to time. In the time domain, the signal or function's value is known for all real numbers, for the c ...
to the
frequency domain In physics, electronics, control systems engineering, and statistics, the frequency domain refers to the analysis of mathematical functions or signals with respect to frequency, rather than time. Put simply, a time-domain graph shows how a s ...
. The resulting frequency-domain data is broken into
noise floor In signal theory, the noise floor is the measure of the signal created from the sum of all the noise sources and unwanted signals within a measurement system, where noise is defined as any signal other than the one being monitored. In radio com ...
and residue components, and then quantized and entropy coded using a
codebook A codebook is a type of document used for gathering and storing cryptography codes. Originally codebooks were often literally , but today codebook is a byword for the complete record of a series of codes, regardless of physical format. Crypto ...
-based
vector quantization Vector quantization (VQ) is a classical quantization technique from signal processing that allows the modeling of probability density functions by the distribution of prototype vectors. It was originally used for data compression. It works by di ...
algorithm. The decompression algorithm reverses these stages. The noise-floor approach gives Vorbis its characteristic analog noise-like failure mode when the bitrate is too low to encode the audio without perceptible loss. The sound of compression artifacts at low bitrates can be perhaps described as reverberations in an amphitheater or a room.


Tuned versions

Various tuned versions of the encoder (Garf, aoTuV or MegaMix) attempt to provide better sound at a specified quality setting, usually by dealing with certain problematic waveforms by temporarily increasing the bitrate. Most of the tuned versions of Vorbis attempt to correct the
pre-echo In audio signal processing, pre-echo, sometimes called a '' forward echo'', (not to be confused with reverse echo) is a digital audio compression artifact where a sound is heard before it occurs (hence the name). It is most noticeable in impulsiv ...
problem and to increase the sound quality of lower quality settings (-q-2 through -q4). Some tuning suggestions created by the Vorbis user community (especially the aoTuV beta 2 tunings) have been incorporated into the 1.1.0 release.


Bitrate peeling

The Vorbis ''format'' supports bitrate peeling for reducing the bitrate of already encoded files without re-encoding, and several experimental implementations exist.Experimental Ogg vorbis Bitrate Peeler, Bitrate reduction of ogg vorbis
/ref> However, the quality is "unusable" and markedly inferior to decoding and re-encoding. The reason that peeling programs degrade quality so severely is that the format does not specify the relative importance of signal data, and existing encoders do not order the data in order of importance. Hence peelers cannot easily determine whether data is important or not.


Container formats

Vorbis streams can be encapsulated in other media container formats besides Ogg. A commonly used alternative is
Matroska Matroska is a project to create a container format that can hold an unlimited number of video, audio, picture, or subtitle tracks in one file. The Matroska Multimedia Container is similar in concept to other containers like AVI, MP4, or Advanc ...
. It is also used in
WebM WebM is an audiovisual media file format. It is primarily intended to offer a royalty-free alternative to use in the HTML5 video and the HTML5 audio elements. It has a sister project, WebP, for images. The development of the format is sponsored ...
, a container format based on a subset of Matroska. Vorbis streams can also be encapsulated in an RTP payload format.


Metadata

Vorbis metadata, called
Vorbis comment A Vorbis comment is a metadata container used in the Vorbis, FLAC, Theora, Speex and Opus file formats. It allows information such as the title, artist, album, track number or other information about the file to be added to the file itself. Howeve ...
s, supports metadata tags similar to those implemented in the
ID3 ID3 is a metadata container most often used in conjunction with the MP3 audio file format. It allows information such as the title, artist, album, track number, and other information about the file to be stored in the file itself. There are tw ...
standard for MP3. The metadata is stored in a vector of byte strings of arbitrary length and size. The size of the vector and the size of each string in bytes is limited to 232 − 1 (about 4.3
billion Billion is a word for a large number, and it has two distinct definitions: *1,000,000,000, i.e. one thousand million, or (ten to the ninth power), as defined on the short scale. This is its only current meaning in English. * 1,000,000,000,000, i. ...
, or any positive integer that can be expressed in 32 bits). This vector is stored in the second header packet that begins a Vorbis bitstream. The strings are assumed to be encoded as
UTF-8 UTF-8 is a variable-length character encoding used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode'' (or ''Universal Coded Character Set'') ''Transformation Format 8-bit''. UTF-8 is capable of ...
. Music tags are typically implemented as strings of the form " AG ALUE, for instance, "ARTIST=The John Smith Band". The tag names are case-insensitive, thus typing "ARTIST=The John Smith Band" would be the same as "artist=The John Smith Band". Like the current version of ID3, users and encoding software are free to use whichever tags are appropriate for the content. For example, an encoder could use localized tag labels, live music tracks might contain a "Venue=" tag or files could have multiple genre definitions. Most applications also support common de facto standards such as disc number and
ReplayGain ReplayGain is a proposed technical standard published by David Robinson in 2001 to measure and normalize the perceived loudness of audio in computer audio formats such as MP3 and Ogg Vorbis. It allows media players to normalize loudness for indi ...
information.


Licensing

Knowledge of Vorbis' specifications is in the
public domain The public domain (PD) consists of all the creative work to which no exclusive intellectual property rights apply. Those rights may have expired, been forfeited, expressly waived, or may be inapplicable. Because those rights have expired, ...
. Concerning the specification itself, the Xiph.Org Foundation reserves the right to set the Vorbis specification and certify compliance. Its libraries are released under the revised 3-clause
BSD license BSD licenses are a family of permissive free software licenses, imposing minimal restrictions on the use and distribution of covered software. This is in contrast to copyleft licenses, which have share-alike requirements. The original BSD lice ...
and its tools are released under the
GNU General Public License The GNU General Public License (GNU GPL or simply GPL) is a series of widely used free software licenses that guarantee end users the four freedoms to run, study, share, and modify the software. The license was the first copyleft for general ...
. The libraries were originally released under the GNU Lesser General Public Licence, but a BSD license was later chosen with the endorsement of
Richard Stallman Richard Matthew Stallman (; born March 16, 1953), also known by his initials, rms, is an American free software movement activist and programmer. He campaigns for software to be distributed in such a manner that its users have the freedom to ...
. The Xiph.Org Foundation states that Vorbis, like all its developments, is completely free from the licensing or
patent A patent is a type of intellectual property that gives its owner the legal right to exclude others from making, using, or selling an invention for a limited period of time in exchange for publishing an enabling disclosure of the invention."A ...
issues raised by
proprietary {{Short pages monitor