Unified Speech and Audio Coding (USAC) is an
audio compression format and
codec
A codec is a computer hardware or software component that encodes or decodes a data stream or signal. ''Codec'' is a portmanteau of coder/decoder.
In electronic communications, an endec is a device that acts as both an encoder and a decoder o ...
for both music and speech or any mix of speech and audio using very low bit rates between 12 and 64 kbit/s.
It was developed by
Moving Picture Experts Group
The Moving Picture Experts Group (MPEG) is an alliance of working groups established jointly by ISO and IEC that sets standards for media coding, including compression coding of audio, video, graphics, and genomic data; and transmission and ...
(MPEG) and was published as an international standard
ISO
The International Organization for Standardization (ISO ; ; ) is an independent, non-governmental, international standard development organization composed of representatives from the national standards organizations of member countries.
Me ...
/
IEC
The International Electrotechnical Commission (IEC; ) is an international standards organization that prepares and publishes international standards for all electrical, electronic and related technologies. IEC standards cover a vast range of ...
23003-3 (a.k.a.
MPEG-D Part 3)
and also as an
MPEG-4 Audio Object Type in ISO/IEC 14496-3:2009/Amd 3 in 2012.
It uses time-domain linear prediction and residual coding tools (
ACELP-like techniques) for speech signal segments and transform coding tools (
MDCT-based techniques) for music signal segments and it is able to switch between the tool sets dynamically in a signal-responsive manner. It is being developed with the aim of a single, unified coder with performance that equals or surpasses that of dedicated speech coders and dedicated music coders over a broad range of bitrates. Enhanced variations of the MPEG-4
Spectral Band Replication (SBR) and MPEG-D
MPEG Surround parametric coding tools are integrated into the USAC codec.
Extended HE-AAC
The MPEG-D USAC standard (ISO/IEC 23003-3) defines the Extended High Efficiency AAC profile, which contains all of the tools of the
HE-AAC v2
High-Efficiency Advanced Audio Coding (HE-AAC) is an audio coding format for lossy data compression of digital audio as part of the MPEG-4 standards. It is an extension of Low Complexity Advanced Audio Coding, AAC (AAC-LC) optimized for low- ...
profile plus the mono/stereo capabilities of the Baseline USAC profile. As a result, a decoder built according to the Extended High Efficiency AAC profile is able to also decode the bit streams created for the previous AAC family profiles. The Extended High Efficiency AAC profile was designed for applications relying on a consistent performance at low data rates while being able to decode all existing
AAC-LC,
HE-AAC
High-Efficiency Advanced Audio Coding (HE-AAC) is an audio coding format for lossy data compression of digital audio as part of the MPEG-4 standards. It is an extension of Low Complexity AAC (AAC-LC) optimized for low- bitrate applicati ...
and
HE-AACv2 content.
xHE-AAC
Fraunhofer has defined the xHE-AAC codec as the combination of the Extended High Efficiency AAC profile and appropriate parts of the MPEG-D DRC Loudness Control Profile or Dynamic Range Control Profile.
xHE-AAC extends the operating range of the codec from 12 to 300 kbit/s for stereo signals and allows seamless switching between bitrates over this range for adaptive bitrate delivery (using standards such as
MPEG-DASH
Dynamic Adaptive Streaming over HTTP (DASH), also known as MPEG-DASH, is an adaptive bitrate streaming technique that enables high quality streaming of media content over the Internet delivered from conventional HTTP web servers. Similar to Appl ...
or
HLS for example). xHE-AAC also includes MPEG-D DRC mandatory loudness control to playback content at a consistent volume and offers new dynamic range control profiles for listening in noisy situations.
While xHE-AAC decoders will be able to decode the bit streams created for the previous AAC family profiles, xHE-AAC encoders are typically intended for encoding of MPEG-D USAC audio object type (AOT 42) with MPEG-D DRC loudness metadata, though some may support encoding legacy AAC object types.
xHE-AAC is a mandatory audio codec in the
Digital Radio Mondiale
Digital Radio Mondiale (DRM; ''mondiale'' being Italian and French for "worldwide") is a set of digital audio broadcasting technologies designed to work over the bands currently used for analogue radio broadcasting including AM broadcasting—p ...
standard and is a trademark of Fraunhofer.
In April 2016, Via Licensing announced the launch of a xHE-AAC patent pool licensing program for 2016. In 2018, xHE-AAC was included in Via Licensing's AAC patent pool at no additional cost.
In January 2021, Fraunhofer announced a test service and trademark program for xHE-AAC and announced that the codec is being used by Netflix. Netflix reported that users switched from speakers to headphones 16% less often (due to poor sound quality or inadequate volume) on high dynamic range content when using xHE-AAC instead of HE-AAC. Netflix also explained that xHE-AAC allowed them to begin streaming with adaptive audio bitrates to Android devices. Fraunhofer also announced xHE-AAC licenses to MainConcept, Poikosoft, and LG. xHE-AAC is supported by the Bento4 DASH/HLS packager. In January 2022, MainConcept established
web encoding serviceto test xHE-AAC. In October 2022, xHE-AAC decoding was added to Windows 11 and Xbox devices.
Compatibility
xHE-AAC is supported in Android since
Android Pie and in iOS since
iOS 13
iOS 13 is the thirteenth major release of the iOS mobile operating system developed by Apple for the iPhone, iPod Touch and HomePod. The successor to iOS 12, it was announced at the company's Worldwide Developers Conference (WWDC) on June ...
. It has been announced that it will be added to
watchOS 7 and has been licensed to
Microsoft
Microsoft Corporation is an American multinational corporation and technology company, technology conglomerate headquartered in Redmond, Washington. Founded in 1975, the company became influential in the History of personal computers#The ear ...
. Playing xHE-AAC audio files is supported in
foobar2000
foobar2000 (often abbreviated as fb2k or f2k) is a freeware audio player for Microsoft Windows, iOS, Android, macOS, and formerly Windows Phone, developed by Peter Pawłowski. It has a modular design, which provides user flexibility in c ...
with the use of an add-on AAC decoder, and natively in 2.25 Release Preview. In October 2022, Windows 11 added support for xHE-AAC in the
22H2 update.
See also
*
Opus (codec) – a royalty free alternative, low latency codec for a similar usage
References
External links
Fraunhofer xHE-AAC WebsiteFraunhofer AAC Audio Playback Test SitexHE-AAC encoder for Windows 7/8/10/11Fraunhofer xHE-AAC Codec Test ServiceNetflix Tech Blog: Optimizing the Aural Experience on Android Devices with xHE-AAC
{{MPEG
Audio codecs
MPEG
Open standards covered by patents