A video coding format (or sometimes video compression format) is a content representation format for digital video content, such as in a data file or bitstream. It typically uses a standardized video compression algorithm, most commonly based on discrete cosine transform (DCT) coding and motion compensation. A computer software or hardware component that compresses or decompresses a specific video coding format is a video codec.
Some video coding formats are documented by a detailed technical specification document known as a video coding specification. Some such specifications are written and approved by standardization organizations as technical standards, and are thus known as video coding standards. There are ''de facto'' standards and formal standards.
Video content encoded using a particular video coding format is normally bundled with an audio stream (encoded using an audio coding format) inside a multimedia container format such as AVI, MP4, FLV, RealMedia, or Matroska. As such, the user normally does not have an H.264 file, but instead has a video file, which is an MP4 container of H.264-encoded video, normally alongside AAC-encoded audio. Multimedia container formats can contain one of several different video coding formats; for example, the MP4 container format can contain video coding formats such as MPEG-2 Part 2 or H.264. Another example is the initial specification for the file type WebM, which specifies not only the container format (Matroska) but also exactly which video (VP8) and audio (Vorbis) compression formats are inside the Matroska container, even though Matroska is capable of containing VP9 video. Opus audio support was later added to the WebM specification.
Distinction between ''format'' and ''codec''
A ''format'' is the layout plan for data produced or consumed by a ''codec''.
Although video coding formats such as H.264 are sometimes referred to as ''codecs'', there is a clear conceptual difference between a specification and its implementations. Video coding formats are described in specifications, and software, firmware, or hardware to encode/decode data in a given video coding format from/to uncompressed video are implementations of those specifications. As an analogy, the video coding format H.264 (specification) is to the codec OpenH264 (specific implementation) what the C Programming Language (specification) is to the compiler GCC (specific implementation). Note that for each specification (e.g., H.264), there can be many codecs implementing that specification (e.g., x264, OpenH264, and other H.264/MPEG-4 AVC products and implementations).
This distinction is not consistently reflected terminologically in the literature. The H.264 specification calls H.261, H.262, H.263, and H.264 ''video coding standards'' and does not contain the word ''codec''. The Alliance for Open Media clearly distinguishes between the AV1 video coding format and the accompanying codec they are developing, but calls the video coding format itself a ''video codec specification''. The VP9 specification calls the video coding format VP9 itself a ''codec''.
As an example of conflation, Chromium's and Mozilla's pages listing their supported video formats both call video coding formats such as H.264 ''codecs''. As another example, in Cisco's announcement of a free-as-in-beer video codec, the press release refers to the H.264 video coding format as a ''codec'' ("choice of a common video codec"), and then also calls Cisco's implementation of an H.264 encoder/decoder a ''codec'' shortly thereafter ("open-source our H.264 codec").
A video coding format does not dictate all algorithms used by a codec implementing the format. For example, a large part of how video compression typically works is by finding similarities between video frames (block-matching) and then achieving compression by copying previously coded similar subimages (such as macroblocks) and adding small differences when necessary. Finding optimal combinations of such predictors and differences is an NP-hard problem, meaning that finding an optimal solution is practically impossible. The video coding format must support such compression across frames in the bitstream format. However, by not needlessly mandating specific algorithms for finding such block-matches and for other encoding steps, the specification leaves the codecs that implement it some freedom to optimize and innovate in their choice of algorithms. For example, section 0.5 of the H.264 specification says that encoding algorithms are not part of the specification.
Free choice of algorithm also allows different space–time complexity trade-offs for the same video coding format, so a live feed can use a fast but space-inefficient algorithm, and a one-time DVD encoding for later mass production can trade long encoding time for space-efficient encoding.
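The block-matching step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method of any particular codec: frames are plain lists of lists of luma samples, the cost measure is the sum of absolute differences (SAD), and the names `sad`, `get_block`, and `full_search` are hypothetical. The `radius` parameter shows the trade-off mentioned above: a larger search window can find better matches but costs more encoding time.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized blocks."""
    return sum(
        abs(a - b)
        for row_a, row_b in zip(block_a, block_b)
        for a, b in zip(row_a, row_b)
    )


def get_block(frame, top, left, size):
    """Extract a size x size block whose top-left corner is (top, left)."""
    return [row[left:left + size] for row in frame[top:top + size]]


def full_search(ref, cur, top, left, size, radius):
    """Exhaustively search `ref` for the best match to a block of `cur`.

    Returns (cost, dy, dx), where (dy, dx) is the motion vector pointing
    from the block position in `cur` to the best match in `ref`.
    Candidates that fall outside the reference frame are skipped.
    """
    target = get_block(cur, top, left, size)
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            y, x = top + dy, left + dx
            if y < 0 or x < 0 or y + size > height or x + size > width:
                continue
            cost = sad(get_block(ref, y, x, size), target)
            if best is None or cost < best[0]:
                best = (cost, dy, dx)
    return best


# Toy frames: `cur` is `ref` with its content moved down 1 and right 2.
ref = [[y * 16 + x for x in range(8)] for y in range(8)]
cur = [
    [ref[y - 1][x - 2] if y >= 1 and x >= 2 else 0 for x in range(8)]
    for y in range(8)
]
match = full_search(ref, cur, top=3, left=3, size=2, radius=2)
```

A real encoder would typically replace the exhaustive loop with a faster heuristic search; the bitstream only records the chosen motion vector and residual, which is why the format can leave the search strategy open.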
History
The concept of analog video compression dates back to 1929, when R.D. Kell in Britain proposed the concept of transmitting only the portions of the scene that changed from frame to frame. The concept of digital video compression dates back to 1952, when Bell Labs researchers B.M. Oliver and C.W. Harrison proposed the use of differential pulse-code modulation (DPCM) in video coding. In 1959, the concept of inter-frame motion compensation was proposed by NHK researchers Y. Taki, M. Hatori and S. Tanaka, who proposed predictive inter-frame video coding in the temporal dimension.
In 1967, University of London researchers A.H. Robinson and C. Cherry proposed run-length encoding (RLE), a lossless compression scheme, to reduce the transmission bandwidth of analog television signals.
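As a rough illustration of the run-length idea (a generic digital sketch, not a reconstruction of Robinson and Cherry's analog scheme), the following Python stores each run of identical samples as a value and a count. The function names `rle_encode` and `rle_decode` are hypothetical.

```python
def rle_encode(samples):
    """Collapse consecutive repeats into (value, run_length) pairs."""
    runs = []
    for value in samples:
        if runs and runs[-1][0] == value:
            runs[-1][1] += 1          # extend the current run
        else:
            runs.append([value, 1])   # start a new run
    return [(value, count) for value, count in runs]


def rle_decode(runs):
    """Expand (value, run_length) pairs back into the original samples."""
    samples = []
    for value, count in runs:
        samples.extend([value] * count)
    return samples


scanline = [0, 0, 0, 255, 255, 0]
encoded = rle_encode(scanline)
restored = rle_decode(encoded)
```

Because decoding reproduces the input exactly, the scheme is lossless; it only saves space when the data contains long runs.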
The earliest digital video coding algorithms were either for uncompressed video or used lossless compression, both methods inefficient and impractical for digital video coding.
Digital video was introduced in the 1970s, initially using uncompressed pulse-code modulation (PCM), requiring high bitrates of around 45–200 Mbit/s for standard-definition (SD) video, which was up to 2,000 times greater than the telecommunication bandwidth (up to 100 kbit/s) available until the 1990s. Similarly, uncompressed high-definition (HD) 1080p video requires bitrates exceeding 1 Gbit/s, significantly greater than the bandwidth available in the 2000s.
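The order of magnitude of these figures can be checked with simple arithmetic. The sketch below makes several assumptions that are not stated in the text: an SD frame of 720 × 480 pixels at 16 bits per pixel (8-bit 4:2:2 sampling), an HD frame of 1920 × 1080 pixels at 24 bits per pixel, and 30 frames per second for both. The function name `uncompressed_bitrate` is hypothetical.

```python
def uncompressed_bitrate(width, height, bits_per_pixel, frames_per_second):
    """Raw video bitrate in bits per second, with no compression at all."""
    return width * height * bits_per_pixel * frames_per_second


# Assumed SD parameters: 720x480, 16 bits/pixel, 30 frames/s.
sd_bps = uncompressed_bitrate(720, 480, 16, 30)      # 165,888,000 bit/s

# Assumed HD parameters: 1920x1080, 24 bits/pixel, 30 frames/s.
hd_bps = uncompressed_bitrate(1920, 1080, 24, 30)    # 1,492,992,000 bit/s

# Ratio of the SD bitrate to a 100 kbit/s telecommunication link.
sd_vs_link = sd_bps / 100_000
```

Under these assumptions the SD stream needs about 166 Mbit/s, roughly 1,659 times a 100 kbit/s link, and the HD stream needs about 1.49 Gbit/s, consistent with the ranges quoted above.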
Motion-compensated DCT
Practical video compression emerged with the development of motion-compensated DCT (MC DCT) coding, also called block motion compensation (BMC) or DCT motion compensation. This is a hybrid coding algorithm, which combines two key data compression techniques: discrete cosine transform (DCT) coding in the spatial dimension, and predictive motion compensation in the temporal dimension.
DCT coding is a lossy block compression transform coding technique that was first proposed by Nasir Ahmed, who initially intended it for image compression, while he was working at Kansas State University in 1972. It was then developed into a practical image compression algorithm by Ahmed with T. Natarajan and K. R. Rao at the University of Texas in 1973, and was published in 1974.
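To make the block transform concrete, here is a minimal pure-Python sketch of a two-dimensional DCT applied to one block, followed by uniform quantization, which is the step where information is discarded. The orthonormal DCT-II scaling, the 8 × 8 block size, the quantization step of 16, and the names `dct_1d`, `dct_2d`, and `quantize` are all assumptions chosen for illustration; real formats define their own transforms and quantization rules.

```python
import math


def dct_1d(vec):
    """Orthonormal DCT-II of a 1-D sequence."""
    n = len(vec)
    out = []
    for k in range(n):
        scale = math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n)
        total = sum(
            v * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
            for i, v in enumerate(vec)
        )
        out.append(scale * total)
    return out


def dct_2d(block):
    """Separable 2-D DCT: transform every row, then every column."""
    rows = [dct_1d(row) for row in block]
    cols = [dct_1d(col) for col in zip(*rows)]
    return [list(row) for row in zip(*cols)]   # transpose back


def quantize(coeffs, step):
    """Uniform quantization; rounding here is what makes the coding lossy."""
    return [[round(c / step) for c in row] for row in coeffs]


# A flat 8x8 block: all of its energy ends up in the single DC coefficient.
flat = [[100] * 8 for _ in range(8)]
coeffs = dct_2d(flat)
levels = quantize(coeffs, step=16)
```

For smooth image regions most coefficients quantize to zero, so only a few values need to be stored per block.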
The other key development was motion-compensated hybrid coding. In 1974, Ali Habibi at the University of Southern California introduced hybrid coding, which combines predictive coding with transform coding. He examined several transform coding techniques, including the DCT, Hadamard transform, Fourier transform, slant transform, and Karhunen-Loeve transform. However, his algorithm was initially limited to intra-frame coding in the spatial dimension. In 1975, John A. Roese and Guner S. Robinson extended Habibi's hybrid coding algorithm to the temporal dimension, using transform coding in the spatial dimension and predictive coding in the temporal dimension, developing inter-frame motion-compensated hybrid coding. For the spatial transform coding, they experimented with different transforms, including the DCT and the fast Fourier transform (FFT), developing inter-frame hybrid coders for them, and found that the DCT is the most efficient due to its reduced complexity, capable of compressing image data down to 0.25 bits per pixel for a videotelephone scene with image quality comparable to a typical intra-frame coder requiring 2 bits per pixel.
The DCT was applied to video encoding by Wen-Hsiung Chen, who developed a fast DCT algorithm with C.H. Smith and S.C. Fralick in 1977, and founded Compression Labs to commercialize DCT technology. In 1979, Anil K. Jain and Jaswant R. Jain further developed motion-compensated DCT video compression. This led to Chen developing a practical video compression algorithm, called motion-compensated DCT or adaptive scene coding, in 1981. Motion-compensated DCT later became the standard coding technique for video compression from the late 1980s onwards.
Video coding standards
The first digital video coding standard was H.120, developed by the CCITT (now ITU-T) in 1984. H.120 was not usable in practice, as its performance was too poor. H.120 used motion-compensated DPCM coding, a lossless compression algorithm that was inefficient for video coding.
During the late 1980s, a number of companies began experimenting with discrete cosine transform (DCT) coding, a much more efficient form of compression for video coding. The CCITT received 14 proposals for DCT-based video compression formats, in contrast to a single proposal based on vector quantization (VQ) compression. The H.261 standard was developed based on motion-compensated DCT compression. H.261 was the first practical video coding standard, and uses patents licensed from a number of companies, including Hitachi, PictureTel, NTT, BT, and Toshiba, among others. Since H.261, motion-compensated DCT compression has been adopted by all the major video coding standards (including the H.26x and MPEG formats) that followed.
MPEG-1, developed by the Moving Picture Experts Group (MPEG), followed in 1991, and it was designed to compress VHS-quality video. It was succeeded in 1994 by MPEG-2/H.262, which was developed with patents licensed from a number of companies, primarily Sony, Thomson and Mitsubishi Electric. MPEG-2 became the standard video format for DVD and SD digital television. Its motion-compensated DCT algorithm was able to achieve a compression ratio of up to 100:1, enabling the development of digital media technologies such as video on demand (VOD) and high-definition television (HDTV). In 1999, it was followed by MPEG-4/H.263, which was a major leap forward for video compression technology. It uses patents licensed from a number of companies, primarily Mitsubishi, Hitachi and Panasonic.
The most widely used video coding format is H.264/MPEG-4 AVC. It was developed in 2003, and uses patents licensed from a number of organizations, primarily Panasonic, Godo Kaisha IP Bridge and LG Electronics. In contrast to the standard DCT used by its predecessors, AVC uses the integer DCT. H.264 is one of the video encoding standards for Blu-ray Discs; all Blu-ray Disc players must be able to decode H.264. It is also widely used by streaming internet sources, such as videos from YouTube, Netflix, Vimeo, and the iTunes Store, web software such as the Adobe Flash Player and Microsoft Silverlight, and also various HDTV broadcasts over terrestrial (ATSC standards, ISDB-T, DVB-T or DVB-T2), cable (DVB-C), and satellite (DVB-S2).
A main problem for many video coding formats has been patents, which make the formats expensive to use or expose users to the risk of a patent lawsuit due to submarine patents. The motivation behind many recently designed video coding formats such as Theora, VP8, and VP9 has been to create a (libre) video coding standard covered only by royalty-free patents. Patent status has also been a major point of contention for the choice of which video formats the mainstream web browsers will support inside the HTML video tag.
The current-generation video coding format is HEVC (H.265), introduced in 2013. AVC uses the integer DCT with 4x4 and 8x8 block sizes, and HEVC uses integer DCT and DST transforms with varied block sizes between 4x4 and 32x32. HEVC is heavily patented, mostly by Samsung Electronics, GE, NTT, and JVCKenwood. It is challenged by the AV1 format, intended for free license. AVC is by far the most commonly used format for the recording, compression, and distribution of video content, used by 91% of video developers, followed by HEVC which is used by 43% of developers.
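The integer transform mentioned above can be illustrated with a minimal sketch. It assumes the commonly cited 4x4 forward core matrix of H.264/AVC and leaves out the scaling and quantization that a real encoder applies afterwards; the function names are illustrative and not taken from any codec library.

```python
# 4x4 forward core transform matrix commonly cited for H.264/AVC.
# All entries are small integers, so the transform needs no floating point.
CF = [
    [1,  1,  1,  1],
    [2,  1, -1, -2],
    [1, -1, -1,  1],
    [1, -2,  2, -1],
]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(row) for row in zip(*m)]


def forward_core_transform(block):
    """Compute CF * block * CF^T for a 4x4 block (scaling step omitted)."""
    return matmul(matmul(CF, block), transpose(CF))


# A perfectly flat block puts all of its energy into the top-left (DC) coefficient.
flat_block = [[10] * 4 for _ in range(4)]
coeffs = forward_core_transform(flat_block)
print(coeffs)
```

Because every intermediate value is an integer, an encoder and a decoder built this way cannot drift apart through rounding differences, which is the usual motivation given for replacing the floating-point DCT.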
List of video coding standards
Lossless, lossy, and uncompressed
Consumer video is generally compressed using lossy video codecs, since that results in significantly smaller files than lossless compression. Some video coding formats are designed explicitly for either lossy or lossless compression, and some video coding formats such as Dirac and H.264 support both.
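The distinction can be shown with a toy sketch. It uses Python's general-purpose `zlib` module rather than a video codec, and the quantization step `STEP` is an arbitrary value chosen for illustration: the lossless path returns the input exactly, while the lossy path discards detail before compressing and can only return an approximation.

```python
import zlib

# Toy "pixel" samples; real video frames would be far larger.
samples = bytes([12, 13, 13, 14, 200, 201, 203, 202] * 32)

# Lossless: decompression reproduces the original bytes exactly.
packed = zlib.compress(samples)
restored = zlib.decompress(packed)

# Lossy (toy): round every sample down to a multiple of STEP, then compress.
STEP = 8  # assumed quantization step, for illustration only
quantized = bytes((s // STEP) * STEP for s in samples)
lossy_packed = zlib.compress(quantized)
lossy_restored = zlib.decompress(lossy_packed)

# The rounding error is bounded by STEP but cannot be undone.
max_error = max(abs(a - b) for a, b in zip(samples, lossy_restored))
print(restored == samples, lossy_restored == samples, max_error)
```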
Uncompressed video formats, such as ''Clean HDMI'', are a form of lossless video used in some circumstances, such as when sending video to a display over an HDMI connection. Some high-end cameras can also capture video directly in this format.
Intra-frame
Interframe compression complicates editing of an encoded video sequence.
One subclass of relatively simple video coding formats is the intra-frame video formats, such as DV, in which each frame of the video stream is compressed independently without referring to other frames in the stream, and no attempt is made to take advantage of correlations between successive pictures over time for better compression. One example is Motion JPEG, which is simply a sequence of individually JPEG-compressed images. This approach is quick and simple, at the expense of the encoded video being much larger than with a video coding format supporting inter-frame coding.
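The size penalty of compressing every frame independently can be seen in a toy sketch. It assumes frames are plain byte strings, uses `zlib` as a stand-in for a real image coder, and models inter-frame coding as a simple byte-wise difference from the previous frame; none of this reflects how DV or Motion JPEG work internally.

```python
import random
import zlib

rng = random.Random(0)
WIDTH = 4096  # bytes per toy "frame"

# Eight noisy frames; each differs from the previous one in only four bytes.
frames = [bytes(rng.randrange(256) for _ in range(WIDTH))]
for _ in range(7):
    nxt = bytearray(frames[-1])
    for _ in range(4):
        nxt[rng.randrange(WIDTH)] = rng.randrange(256)
    frames.append(bytes(nxt))

# Intra-frame: every frame is compressed on its own.
intra = [zlib.compress(f) for f in frames]


def diff(cur, prev):
    """Byte-wise difference (mod 256) between a frame and its predecessor."""
    return bytes((c - p) % 256 for c, p in zip(cur, prev))


# Inter-frame (toy): the first frame is stored whole, later frames as differences.
inter = [zlib.compress(frames[0])] + [
    zlib.compress(diff(cur, prev)) for prev, cur in zip(frames, frames[1:])
]

intra_size = sum(len(c) for c in intra)
inter_size = sum(len(c) for c in inter)
print(intra_size, inter_size)
```

The difference frames are almost entirely zeros, so they compress far better than the full frames, which is the correlation between successive pictures that intra-frame formats leave unused.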
Because interframe compression copies data from one frame to another, if the original frame is simply cut out (or lost in transmission), the following frames cannot be reconstructed properly. Making cuts in intraframe-compressed video during video editing is almost as easy as editing uncompressed video: one finds the beginning and ending of each frame, and simply copies bit-for-bit each frame that one wants to keep, and discards the frames one does not want. Another difference between intraframe and interframe compression is that, with intraframe systems, each frame uses a similar amount of data. In most interframe systems, certain frames (such as I-frames in MPEG-2) are not allowed to copy data from other frames, so they require much more data than other frames nearby.
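The editing problem described above can be sketched with a toy stream format. The `encode` and `decode` functions and the keyframe interval `gop` are hypothetical illustrations, not the structure of MPEG-2 or any real format: keyframes ("I") carry a full frame and the others ("P") carry only the difference from the preceding frame.

```python
def encode(frames, gop=4):
    """Every `gop`-th frame is a keyframe ("I"); the rest store deltas ("P")."""
    stream = []
    for i, frame in enumerate(frames):
        if i % gop == 0:
            stream.append(("I", list(frame)))
        else:
            prev = frames[i - 1]
            stream.append(("P", [c - p for c, p in zip(frame, prev)]))
    return stream


def decode(stream):
    """Rebuild frames; a P-frame is added onto the previously decoded frame."""
    out, prev = [], None
    for kind, data in stream:
        if kind == "I":
            cur = list(data)
        elif prev is None:
            raise ValueError("P-frame has no reference frame to copy from")
        else:
            cur = [p + d for p, d in zip(prev, data)]
        out.append(cur)
        prev = cur
    return out


frames = [[i, i * 2, i * 3] for i in range(8)]
stream = encode(frames)
assert decode(stream) == frames

# Cutting at a keyframe boundary is safe: the remainder decodes on its own.
tail = decode(stream[4:])

# Cutting away the first keyframe leaves P-frames with nothing to refer to.
try:
    decode(stream[1:])
    cut_error = None
except ValueError as exc:
    cut_error = str(exc)

# Dropping a frame mid-stream silently corrupts the frames that follow,
# until the next keyframe resets the decoder.
damaged = decode(stream[:1] + stream[2:])
print(tail == frames[4:], cut_error, damaged[1], frames[2])
```

An editor for such a stream has to re-encode around each cut point, or restrict cuts to keyframes, which is where the extra computing cost comes from.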
It is possible to build a computer-based video editor that spots problems caused when I-frames are edited out while other frames need them. This has allowed newer formats like HDV to be used for editing. However, this process demands far more computing power than editing intraframe-compressed video with the same picture quality. This compression is also not very effective for any audio format.
Profiles and levels
A video coding format can define optional restrictions to encoded video, called profiles and levels. It is possible to have a decoder which only supports decoding a subset of profiles and levels of a given video format, for example to make the decoder program/hardware smaller, simpler, or faster.
A ''profile'' restricts which encoding techniques are allowed. For example, the H.264 format includes the profiles ''baseline'', ''main'' and ''high'' (and others). While P-slices (which can be predicted based on preceding slices) are supported in all profiles, B-slices (which can be predicted based on both preceding and following slices) are supported in the ''main'' and ''high'' profiles but not in ''baseline''.
A ''level'' is a restriction on parameters such as maximum resolution and data rates.
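How a decoder might use these restrictions can be sketched as follows. The slice-type table covers only the three H.264 profiles and the P/B-slice rule stated above; the `Decoder` class, the helper functions, and the representation of a level as a `(major, minor)` tuple are assumptions made for illustration and do not reproduce the limits defined in the H.264 specification.

```python
from dataclasses import dataclass

# Slice types permitted per profile, limited to what the text above states.
PROFILE_SLICE_TYPES = {
    "baseline": {"I", "P"},
    "main": {"I", "P", "B"},
    "high": {"I", "P", "B"},
}


@dataclass(frozen=True)
class Decoder:
    """Hypothetical decoder description: supported profiles and highest level."""
    profiles: frozenset
    max_level: tuple  # e.g. (3, 1) stands for level 3.1


def stream_conforms(profile, slice_types):
    """Check that a stream uses only slice types its declared profile allows."""
    return set(slice_types) <= PROFILE_SLICE_TYPES[profile]


def can_decode(decoder, profile, level):
    """A decoder handles a stream if it supports the profile and the level."""
    return profile in decoder.profiles and level <= decoder.max_level


# A small decoder that implements only the baseline profile up to level 3.0.
small = Decoder(profiles=frozenset({"baseline"}), max_level=(3, 0))

print(can_decode(small, "baseline", (3, 0)))         # supported profile and level
print(can_decode(small, "main", (3, 0)))             # profile not implemented
print(can_decode(small, "baseline", (4, 0)))         # level too high
print(stream_conforms("baseline", {"I", "P", "B"}))  # B-slices not in baseline
```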
See also
* Comparison of video container formats
* Display resolution
* List of video compression formats
* Video file format