HOME

TheInfoList



OR:

MPEG-4 Part 2, MPEG-4 Visual (formally
ISO The International Organization for Standardization (ISO ; ; ) is an independent, non-governmental, international standard development organization composed of representatives from the national standards organizations of member countries. Me ...
/ IEC 14496-2) is a video encoding specification designed by the
Moving Picture Experts Group The Moving Picture Experts Group (MPEG) is an alliance of working groups established jointly by ISO and IEC that sets standards for media coding, including compression coding of audio, video, graphics, and genomic data; and transmission and ...
(MPEG). It belongs to the MPEG-4 ISO/IEC family of encoders. It uses block-wise
motion compensation Motion compensation in computing is an algorithmic technique used to predict a frame in a video given the previous and/or future frames by accounting for motion of the camera and/or objects in the video. It is employed in the encoding of video ...
and a
discrete cosine transform A discrete cosine transform (DCT) expresses a finite sequence of data points in terms of a sum of cosine functions oscillating at different frequency, frequencies. The DCT, first proposed by Nasir Ahmed (engineer), Nasir Ahmed in 1972, is a widely ...
(DCT), similar to previous encoders such as
MPEG-1 Part 2 MPEG-1 is a standard for lossy compression of video and audio. It is designed to compress VHS-quality raw digital video and CD audio down to about 1.5 Mbit/s (26:1 and 6:1 compression ratios respectively) without excessive quality loss, m ...
and H.262/MPEG-2 Part 2. Examples of popular implementations of the encoder specifications include
DivX DIVX (Digital Video Express) is a discontinued digital video format. Created in part by Circuit City, it was an unsuccessful attempt to create an alternative to video rental in the United States. The format's poor reception from consumers resu ...
, Xvid and
Nero Digital Nero Digital is a brand name applied to a suite of MPEG-4-compatible video and audio compression codecs developed by Nero AG of Germany and Ateme of France. The audio codecs are integrated into the ''Nero Digital Audio+'' audio encoding tool for M ...
. MPEG-4 Part 2 is H.263 compatible in the sense that a basic H.263 bitstream is correctly decoded by an MPEG-4 Video decoder. (MPEG-4 Video decoder is natively capable of decoding a basic form of H.263.) In MPEG-4 Visual, there are two types of video object layers: the video object layer that provides full MPEG-4 functionality, and a reduced functionality video object layer, the video object layer with short headers (which provides bitstream compatibility with base-line H.263). MPEG-4 Part 2 is partially based on ITU-T H.263. The first MPEG-4 Video Verification Model (simulation and test model) used ITU-T H.263 coding tools together with shape coding.


History

The MPEG-4 Visual format was developed by the
Moving Picture Experts Group The Moving Picture Experts Group (MPEG) is an alliance of working groups established jointly by ISO and IEC that sets standards for media coding, including compression coding of audio, video, graphics, and genomic data; and transmission and ...
(MPEG) committee. The specification was authored by
Iranian Iranian () may refer to: * Something of, from, or related to Iran ** Iranian diaspora, Iranians living outside Iran ** Iranian architecture, architecture of Iran and parts of the rest of West Asia ** Iranian cuisine, cooking traditions and practic ...
-
Swiss Swiss most commonly refers to: * the adjectival form of Switzerland * Swiss people Swiss may also refer to: Places * Swiss, Missouri * Swiss, North Carolina * Swiss, West Virginia * Swiss, Wisconsin Other uses * Swiss Café, an old café located ...
engineer Touradj Ebrahimi (later the president of
JPEG JPEG ( , short for Joint Photographic Experts Group and sometimes retroactively referred to as JPEG 1) is a commonly used method of lossy compression for digital images, particularly for those images produced by digital photography. The degr ...
) and Dutch engineer Caspar Horne. The standard was developed using patents from over a dozen organizations, listed by
MPEG LA MPEG LA was an American company based in Denver, Colorado that licensed patent pools covering essential patents required for use of the MPEG-2, MPEG-4, IEEE 1394, VC-1, ATSC, MVC, MPEG-2 Systems, AVC/H.264 and HEVC standards. Via Licensin ...
in a
patent pool In patent law, a patent pool is a consortium of two or more companies agreeing to cross-license patents relating to a particular technology. The creation of a patent pool can save patentees and licensees time and money, and, in case of blocking pa ...
. The majority of patents used for the MPEG-4 Visual format were from three Japanese companies:
Mitsubishi Electric is a Japanese Multinational corporation, multinational electronics and electrical equipment manufacturing company headquartered in Tokyo, Japan. The company was established in 1921 as a spin-off from the electrical machinery manufacturing d ...
(255 patents),
Hitachi () is a Japanese Multinational corporation, multinational Conglomerate (company), conglomerate founded in 1910 and headquartered in Chiyoda, Tokyo. The company is active in various industries, including digital systems, power and renewable ener ...
(206 patents), and
Panasonic is a Japanese multinational electronics manufacturer, headquartered in Kadoma, Osaka, Kadoma, Japan. It was founded in 1918 as in Fukushima-ku, Osaka, Fukushima by Kōnosuke Matsushita. The company was incorporated in 1935 and renamed and c ...
(200 patents). See '' Patent holders'' below for a full list of patent holders.


Editions


Profiles

To address various applications ranging from low-quality, low-resolution surveillance cameras to high definition TV broadcasting and DVDs, many video standards group features into profiles and levels. MPEG-4 Part 2 contains approximately 21 profiles. The most commonly deployed profiles are the Advanced Simple Profile (ASP or AS Profile) and the Simple Profile (SP), which is a subset of the ASP. Other profiles include the Advanced Coding Efficiency Profile (ACEP or ACE Profile), the Advanced Real Time Simple Profile (ARTSP or ARTS Profile), the Core Profile (CP), the Main Profile (MP), and the Simple Studio Profile (SStP or SSt Profile). Most of the video compression schemes standardize the
bitstream A bitstream (or bit stream), also known as binary sequence, is a sequence of bits. A bytestream is a sequence of bytes. Typically, each byte is an 8-bit quantity, and so the term octet stream is sometimes used interchangeably. An octet may ...
(and thus the decoder) leaving the encoder design to the individual implementations. Therefore, implementations for a particular profile (such as DivX or Nero Digital which are implementations of Advanced Simple Profile and Xvid that implements both profiles) are all technically identical on the decoder side. A point of comparison would be that an MP3 file can be played in any MP3 player, whether it was created through iTunes, Windows Media Player,
LAME LAME is a software encoder that converts digital audio into the MP3 audio coding format. LAME is a free software project that was first released in 1998 and has incorporated many improvements since then, including an improved psychoacoustic ...
, or the common Fraunhofer encoder.


Simple Profile (SP)

The Simple Profile is mostly aimed for use in situations where low bit rate and low resolution are mandated by other conditions of the applications, like network bandwidth, device size etc. Examples are
mobile phone A mobile phone or cell phone is a portable telephone that allows users to make and receive calls over a radio frequency link while moving within a designated telephone service area, unlike fixed-location phones ( landline phones). This rad ...
s, some low end
video conferencing Videotelephony (also known as videoconferencing or video calling) is the use of audio signal, audio and video for simultaneous two-way communication. Today, videotelephony is widespread. There are many terms to refer to videotelephony. ''Vide ...
systems, electronic surveillance systems etc.


Levels


Advanced Simple Profile (ASP)

The Advanced Simple Profile was not included in the original standard. Its notable technical features relative to the Simple Profile, which is roughly similar to H.263, include: * Support for "MPEG"-style quantization * Support for interlaced video * Support for B pictures (a.k.a. ''
B-frame In the field of video compression, a video frame is compressed using different algorithms with different advantages and disadvantages, centered mainly around amount of data compression. These different algorithms for video frames are called pict ...
s'') * Quarter Pixel
motion compensation Motion compensation in computing is an algorithmic technique used to predict a frame in a video given the previous and/or future frames by accounting for motion of the camera and/or objects in the video. It is employed in the encoding of video ...
( Qpel) *
Global motion compensation {{refimprove, date=September 2008 ''Global motion compensation'' ''(GMC)'' is a motion compensation technique used in video compression to reduce the bitrate required to encode video. It is most commonly used in MPEG-4 ASP, such as with the D ...
(GMC) The MPEG quantization and interlace support are designed in basically similar ways to the way it is found in
MPEG-2 MPEG-2 (a.k.a. H.222/H.262 as was defined by the ITU) is a standard for "the generic coding of moving pictures and associated audio information". It describes a combination of lossy video compression and lossy audio data compression methods ...
Part 2. The B picture support is designed in a basically similar way to the way it is found in MPEG-2 Part 2 and H.263v2. The quarter-pixel motion compensation feature of ASP was innovative, and was later also included (in somewhat different forms) in later designs such as
MPEG-4 Part 10 Advanced Video Coding (AVC), also referred to as H.264 or MPEG-4 Part 10, is a video compression standard based on block-oriented, motion-compensated coding. It is by far the most commonly used format for the recording, compression, and d ...
,
HEVC High Efficiency Video Coding (HEVC), also known as H.265 and MPEG-H Part 2, is a video compression standard designed as part of the MPEG-H project as a successor to the widely used Advanced Video Coding (AVC, H.264, or MPEG-4 Part 10). In co ...
,
VC-1 SMPTE 421, informally known as VC-1, is a video coding format. Most of it was initially developed as Microsoft's proprietary video format Windows Media Video 9 in 2003. With some enhancements including the development of a new Advanced Profile, ...
and VVC. Some implementations of MPEG-4 Part 2 omit support for this feature, because it has a significantly harmful effect on the speed of software decoders and it is not always beneficial for quality. The global motion compensation feature is not actually supported in most implementations although the standard officially requires decoders to support it. Most encoders do not support it either, and some experts say that it does not ordinarily provide any benefit in compression. When used, ASP's global motion compensation has a large unfavorable impact on speed and adds considerable complexity to the implementation.


Levels


Simple Studio Profile (SStP)

The Simple Studio Profile has six levels, ranging from
SDTV Standard-definition television (SDTV; also standard definition or SD) is a television system that uses a resolution that is not considered to be either high or enhanced definition. ''Standard'' refers to offering a similar resolution to the ...
to
4K resolution 4K resolution refers to a horizontal display resolution of approximately 4,000 pixels. Digital television and digital cinematography commonly use several different 4K resolutions. In television and consumer media, 38402160 (4K UHD) with a 16:9 asp ...
. SStP allows for up to 12-bit bit depth and up to 4:4:4
chroma subsampling Chroma subsampling is the practice of encoding images by implementing less resolution for Chrominance, chroma information than for luma (video), luma information, taking advantage of the human visual system's lower acuity for color differences t ...
, using intra-frame coding only. SStP is used by HDCAM SR.


Levels


Patent holders

MPEG-4 Part 2
patent A patent is a type of intellectual property that gives its owner the legal right to exclude others from making, using, or selling an invention for a limited period of time in exchange for publishing an sufficiency of disclosure, enabling discl ...
s have expired worldwide, the last one on January 28, 2024. The following organizations held patents for MPEG-4 Visual technology, as listed in the
patent pool In patent law, a patent pool is a consortium of two or more companies agreeing to cross-license patents relating to a particular technology. The creation of a patent pool can save patentees and licensees time and money, and, in case of blocking pa ...
administered by
MPEG LA MPEG LA was an American company based in Denver, Colorado that licensed patent pools covering essential patents required for use of the MPEG-2, MPEG-4, IEEE 1394, VC-1, ATSC, MVC, MPEG-2 Systems, AVC/H.264 and HEVC standards. Via Licensin ...
.


Criticisms

MPEG-4 Part 2 has drawn some industry criticism.
FFmpeg FFmpeg is a free and open-source software project consisting of a suite of libraries and programs for handling video, audio, and other multimedia files and streams. At its core is the command-line ffmpeg tool itself, designed for processing vide ...
's maintainer Michael Niedermayer has criticised MPEG-4 for lacking an in-loop deblocking filter, GMC being too computationally intensive, and OBMC being defined but not allowed in any profiles among other things.
Microsoft Microsoft Corporation is an American multinational corporation and technology company, technology conglomerate headquartered in Redmond, Washington. Founded in 1975, the company became influential in the History of personal computers#The ear ...
's Ben Waggoner states "Microsoft (well before my time) went down the codec standard route before with MPEG-4 part 2, which turns out to be a profound disappointment across the industry - it didn't offer that much of a compression advantage over MPEG-2, and the protracted license agreement discussions scared off a lot of adoption. I was involved in many digital media projects that wouldn't even touch MPEG-4 in the late 1990s to early 2000s because there was going to be a 'content fee' that hadn't been fully defined yet."VC-1 and H264 - Page 2 - Doom9's Forum
/ref>


Popular software implementations

*
3ivx 3ivx ( ) was an MPEG-4 compliant video codec suite, created by 3ivx Technologies, based in Sydney, Australia. 3ivx video codecs were released from 2001 to 2012, with releases of related technologies continuing until 2015. 3ivx provided plugins to ...
*
DivX DIVX (Digital Video Express) is a discontinued digital video format. Created in part by Circuit City, it was an unsuccessful attempt to create an alternative to video rental in the United States. The format's poor reception from consumers resu ...
* HDX4 * libavcodec *
Nero Digital Nero Digital is a brand name applied to a suite of MPEG-4-compatible video and audio compression codecs developed by Nero AG of Germany and Ateme of France. The audio codecs are integrated into the ''Nero Digital Audio+'' audio encoding tool for M ...
*
QuickTime QuickTime (or QuickTime Player) is an extensible multimedia architecture created by Apple, which supports playing, streaming, encoding, and transcoding a variety of digital media formats. The term ''QuickTime'' also refers to the QuickTime Pla ...
* Xvid


See also

*
High Efficiency Video Coding High Efficiency Video Coding (HEVC), also known as H.265 and MPEG-H Part 2, is a video compression standard designed as part of the MPEG-H project as a successor to the widely used Advanced Video Coding (AVC, H.264, or MPEG-4 Part 10). In compa ...
*
Advanced Video Coding Advanced Video Coding (AVC), also referred to as H.264 or MPEG-4 Part 10, is a video compression standard based on block-oriented, motion-compensated coding. It is by far the most commonly used format for the recording, compression, and d ...
*
Quantization (image processing) Quantization, involved in image processing, is a lossy compression technique achieved by compressing a range of values to a single quantum (discrete) value. When the number of discrete symbols in a given stream is reduced, the stream becomes more ...
*
FourCC A FourCC ("four-character code") is a sequence of four bytes (typically ASCII) used to uniquely identify data formats. It originated from the OSType or ResType metadata system used in classic Mac OS and was adopted for the Amiga/Electronic Arts ...
*
MP3 MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany under the lead of Karlheinz Brandenburg. It was designed to greatly reduce the amount ...
*
ISO/IEC JTC 1/SC 29 ISO/IEC JTC 1/SC 29, entitled ''Coding of audio, picture, multimedia and hypermedia information'', is a standardization subcommittee of the Joint Technical Committee ISO/IEC JTC 1 of the International Organization for Standardization (ISO) and the I ...


Notes


External links


MPEG-4 Part 2: Visual
*
Official MPEG web site

MPEG-4 Visual Patent List (MPEG LA)
{{MPEG MPEG-4 Open standards covered by patents Videotelephony 1999 software 20th-century inventions