G.729ab
   HOME

TheInfoList



OR:

G.729 is a
royalty-free Royalty-free (RF) material subject to copyright or other intellectual property rights may be used without the need to pay royalties or license fees for each use, per each copy or volume sold or some time period of use or sales. Computer standards ...
narrow-band
vocoder A vocoder (, a portmanteau of ''vo''ice and en''coder'') is a category of speech coding that analyzes and synthesizes the human voice signal for audio data compression, multiplexing, voice encryption or voice transformation. The vocoder wa ...
-based
audio data compression In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compression ...
algorithm using a frame length of 10 milliseconds. It is officially described as ''Coding of speech at 8 kbit/s using
code-excited linear prediction Code-excited linear prediction (CELP) is a linear predictive speech coding algorithm originally proposed by Manfred R. Schroeder and Bishnu S. Atal in 1985. At the time, it provided significantly better quality than existing low bit-rate algori ...
''
speech coding Speech coding is an application of data compression to digital audio signals containing speech. Speech coding uses speech-specific parameter estimation using audio signal processing techniques to model the speech signal, combined with generic da ...
(CS-ACELP), and was introduced in 1996. The wide-band extension of G.729 is called
G.729.1 G.729.1 is an 8-32 kbit/s embedded speech and audio codec providing bitstream interoperability with G.729, G.729 Annex A and G.729 Annex B. Its official name is ''G.729-based embedded variable bit rate codec: An 8-32 kbit/s scalable wi ...
, which equals G.729 Annex J. Because of its low bandwidth requirements, G.729 is mostly used in
voice over Internet Protocol Voice over Internet Protocol (VoIP), also known as IP telephony, is a set of technologies used primarily for voice communication sessions over Internet Protocol (IP) networks, such as the Internet. VoIP enables Voice call, voice calls to be tran ...
(VoIP) applications when bandwidth must be conserved. Standard G.729 operates at a
bit rate In telecommunications and computing, bit rate (bitrate or as a variable ''R'') is the number of bits that are conveyed or processed per unit of time. The bit rate is expressed in the unit bit per second (symbol: bit/s), often in conjunction ...
of 8 kbit/s, but extensions provide rates of 6.4 kbit/s (Annex D, F, H, I, C+) and 11.8 kbit/s (Annex E, G, H, I, C+) for worse and better speech quality, respectively. G.729 has been extended with various features, commonly designated as G.729a and G.729b: * G.729: This is the original codec using a high-complexity algorithm. * G.729A or Annex A: This version has a medium complexity, and is compatible with G.729. It provides a slightly lower voice quality. * G.729B or Annex B: This version extends G.729 with
silence suppression The term silence suppression is used in telephony to describe the process of not transmitting information over the network when one of the parties involved in a telephone call is not speaking, thereby reducing bandwidth usage. Voice is carried ove ...
, and is not compatible with the previous versions. * G.729AB: This version extends G.729A with silence suppression, and is only compatible with G.729B. *
G.729.1 G.729.1 is an 8-32 kbit/s embedded speech and audio codec providing bitstream interoperability with G.729, G.729 Annex A and G.729 Annex B. Its official name is ''G.729-based embedded variable bit rate codec: An 8-32 kbit/s scalable wi ...
or Annex J: This version extends G.729A and B with scalable variable encoding using hierarchical enhancement layers. It provides support for
wideband In communications, a system is wideband when the message bandwidth significantly exceeds the coherence bandwidth of the channel. Some communication links have such a high data rate that they are forced to use a wide bandwidth; other links ma ...
speech and audio, using
modified discrete cosine transform The modified discrete cosine transform (MDCT) is a transform based on the type-IV discrete cosine transform (DCT-IV), with the additional property of being lapped: it is designed to be performed on consecutive blocks of a larger dataset, where s ...
(MDCT) coding.
Dual-tone multi-frequency signaling Dual-tone multi-frequency (DTMF) signaling is a telecommunication signaling system using the voice-frequency band over telephone lines between telephone equipment and other communications devices and Automatic telephone exchange, switching center ...
(DTMF),
fax Fax (short for facsimile), sometimes called telecopying or telefax (short for telefacsimile), is the telephonic transmission of scanned printed material (both text and images), normally to a telephone number connected to a printer or other out ...
transmissions, and high-quality audio cannot be transported reliably with this
codec A codec is a computer hardware or software component that encodes or decodes a data stream or signal. ''Codec'' is a portmanteau of coder/decoder. In electronic communications, an endec is a device that acts as both an encoder and a decoder o ...
. DTMF requires the use of the named telephony events in the RTP payload for DTMF digits, telephony tones, and telephony signals as specified in RFC 4733.


G.729 annexes


G.729 Annex A

G.729a is a compatible extension of G.729, but requires less computational power. This lower complexity, however, bears the cost of marginally reduced speech quality. G.729a was developed by a consortium of organizations:
France Télécom Orange S.A. (; formerly , stylised as france telecom) is a French multinational corporation, multinational telecommunications corporation founded in 1988 and headquartered in Issy-les-Moulineaux, near Paris. ''Orange'' has been the corporation' ...
,
Mitsubishi Electric Corporation is a Japanese multinational electronics and electrical equipment manufacturing company headquartered in Tokyo, Japan. The company was established in 1921 as a spin-off from the electrical machinery manufacturing division of Mitsubishi Ship ...
, Nippon Telegraph and Telephone Corporation (NTT). The features of G.729a are: * Sampling frequency 8 kHz/16-bit (80 samples for 10 ms frames) * Fixed bit rate (8 kbit/s 10 ms frames) * Fixed frame size (10 bytes (80 bits) for 10 ms frame) * Algorithmic delay is 15 ms per frame, with 5 ms look-ahead delay * G.729a is a hybrid speech coder which uses
Algebraic Code Excited Linear Prediction Algebraic code-excited linear prediction (ACELP) is a speech coding algorithm in which a limited set of pulses is distributed as excitation to a linear prediction filter. It is a linear predictive coding (LPC) algorithm that is based on the code- ...
(ACELP) * The complexity of the algorithm is rated at 15, using a relative scale where
G.711 G.711 is a narrowband audio codec originally designed for use in telephony that provides toll-quality audio at 64 kbit/s. It is an ITU-T standard (Recommendation) for audio encoding, titled Pulse code modulation (PCM) of voice frequencies ...
is 1 and G.723.1 is 25. *
PSQM Perceptual Speech Quality Measure (PSQM) is a computational and modeling algorithm defined in Recommendation ITU-T P.861 that objectively evaluates and quantifies voice quality of voice-band (300 – 3400 Hz) speech codecs. It may be used t ...
testing under ideal conditions yields
mean opinion score Mean opinion score (MOS) is a measure used in the domain of Quality of Experience and telecommunications engineering, representing overall quality of a stimulus or system. It is the arithmetic mean over all individual "values on a predefined scale ...
s of 4.04 for G.729a, compared to 4.45 for
G.711 G.711 is a narrowband audio codec originally designed for use in telephony that provides toll-quality audio at 64 kbit/s. It is an ITU-T standard (Recommendation) for audio encoding, titled Pulse code modulation (PCM) of voice frequencies ...
(μ-law) * PSQM testing under network stress yields mean opinion scores of 3.51 for G.729a, compared to 4.13 for
G.711 G.711 is a narrowband audio codec originally designed for use in telephony that provides toll-quality audio at 64 kbit/s. It is an ITU-T standard (Recommendation) for audio encoding, titled Pulse code modulation (PCM) of voice frequencies ...
(μ-law) Some VoIP phones incorrectly use the description "G729a/8000" in SDP (e.g. this affects some Cisco and Linksys phones). This is incorrect as G729a is an alternative method of encoding the audio, but still generates data decodable by either G729 or G729a - i.e. there is no difference in terms of codec negotiation. Since the SDP RFC allows static payload types to be overridden by the textual rtpmap description this can cause problems calling from these phones to endpoints adhering to the RFC unless the codec is renamed in their settings since they will not recognise 'G729a' as 'G729' without a specific workaround in place for the bug.


G.729 Annex B

G.729 has been extended in Annex B (G.729b) which provides a
silence compression Silence compression is an audio processing technique used to effectively encode silent intervals, reducing the amount of storage or bandwidth needed to transmit audio recordings. Overview Silence can be defined as audio segments with negligible ...
method that enables a
voice activity detection Voice activity detection (VAD), also known as speech activity detection or speech detection, is the detection of the presence or absence of human speech, used in speech processing. The main uses of VAD are in speaker diarization, speech coding an ...
(VAD) module. It is used to detect voice activity in the signal. It also includes a
discontinuous transmission Discontinuous transmission (DTX) is a means by which a mobile telephone is temporarily shut off or muted while the phone lacks a voice input. Misconception A common misconception is that DTX improves capacity by freeing up TDMA time slots for use ...
(DTX) module which decides on updating the
background noise Background noise or ambient noise is any sound other than the sound being monitored (primary sound). Background noise is a form of noise pollution or interference. Background noise is an important concept in setting noise levels. Background no ...
parameters for non speech (noisy frames). It uses 2-byte Silence Insertion Descriptor (SID)
frames A frame is often a structural system that supports other components of a physical construction and/or steel frame that limits the construction's extent. Frame and FRAME may also refer to: Physical objects In building construction *Framing (con ...
transmitted to initiate
comfort noise Comfort noise (or comfort tone) is synthetic background noise used in radio and wireless communications to fill the artificial silence in a transmission resulting from voice activity detection or from the audio clarity of modern digital lines. ...
generation (CNG). If transmission is stopped, and the link goes quiet because of no speech, the receiving side might assume that the link has been cut. By inserting comfort noise, analog hiss is simulated digitally during silence to assure the receiver that the link is active and operational.


G.729 Annex J (G.729.1)

G.729 Annex J, maintained by
G.729.1 G.729.1 is an 8-32 kbit/s embedded speech and audio codec providing bitstream interoperability with G.729, G.729 Annex A and G.729 Annex B. Its official name is ''G.729-based embedded variable bit rate codec: An 8-32 kbit/s scalable wi ...
, provides support for
wideband In communications, a system is wideband when the message bandwidth significantly exceeds the coherence bandwidth of the channel. Some communication links have such a high data rate that they are forced to use a wide bandwidth; other links ma ...
speech and audio. Introduced in 2006, it defines variable bit-rate wideband enhancement using up to 12 hierarchical layers. The core layer is an 8 kbit/s G.729 bitstream, the second layer is a 4 kbit/s
narrowband Narrowband signals are signals that occupy a narrow range of frequencies or that have a small fractional bandwidth. In the audio spectrum, ''narrowband sounds'' are sounds that occupy a narrow range of frequencies. In telephony, narrowband is ...
enhancement layer, and the third 2 kibt/s layer is a bandwidth enhancement layer. Further layers provide wideband enhancement in 2 kbit/s steps. The G.729.1 uses three-stage coding: embedded
code-excited linear prediction Code-excited linear prediction (CELP) is a linear predictive speech coding algorithm originally proposed by Manfred R. Schroeder and Bishnu S. Atal in 1985. At the time, it provided significantly better quality than existing low bit-rate algori ...
(CELP) coding of the lower band, parametric coding of the higher band by Time-Domain Bandwidth Extension (TDBWE), and enhancement of the full band by a predictive
transform coding Transform coding is a type of data compression for "natural" data like audio signals or photographic images. The transformation is typically lossless (perfectly reversible) on its own but is used to enable better (more targeted) quantization, whi ...
algorithm called
time-domain aliasing cancellation The modified discrete cosine transform (MDCT) is a transform based on the type-IV discrete cosine transform (DCT-IV), with the additional property of being lapped: it is designed to be performed on consecutive blocks of a larger dataset, where sub ...
(TDAC), also known as
modified discrete cosine transform The modified discrete cosine transform (MDCT) is a transform based on the type-IV discrete cosine transform (DCT-IV), with the additional property of being lapped: it is designed to be performed on consecutive blocks of a larger dataset, where s ...
(MDCT) coding. Bit rate and the obtained quality are adjustable by simple bitstream truncation.


Licensing

As of January 1, 2017, the patent terms of most licensed
patents A patent is a type of intellectual property that gives its owner the legal right to exclude others from making, using, or selling an invention for a limited period of time in exchange for publishing an sufficiency of disclosure, enabling discl ...
under the G.729 Consortium have expired, the remaining unexpired patents are usable on a
royalty-free Royalty-free (RF) material subject to copyright or other intellectual property rights may be used without the need to pay royalties or license fees for each use, per each copy or volume sold or some time period of use or sales. Computer standards ...
basis. G.729 includes patents from several companies which were until the expiry licensed by Sipro Lab Telecom, the authorized Intellectual Property Licensing Administrator for G.729 technology and
patent pool In patent law, a patent pool is a consortium of two or more companies agreeing to cross-license patents relating to a particular technology. The creation of a patent pool can save patentees and licensees time and money, and, in case of blocking pa ...
.


Past patent litigation

AIM IP LLC, a
California California () is a U.S. state, state in the Western United States that lies on the West Coast of the United States, Pacific Coast. It borders Oregon to the north, Nevada and Arizona to the east, and shares Mexico–United States border, an ...
Limited Liability Company A limited liability company (LLC) is the United States-specific form of a private limited company. It is a business structure that can combine the pass-through taxation of a partnership or sole proprietorship with the limited liability of ...
based in
Mission Viejo Mission Viejo ( ; language change, corruption of ; ) is a Commuter town, commuter city in the Saddleback Valley in Orange County, California, United States. Mission Viejo is considered one of the largest Planned community, master-planned commu ...
, CA filed 17
patent infringement A patent is a type of intellectual property that gives its owner the legal right to exclude others from making, using, or selling an invention for a limited period of time in exchange for publishing an enabling disclosure of the invention."A ...
lawsuits in the Central District Courts of California accusing 22 different companies, including,
Cisco Systems Cisco Systems, Inc. (using the trademark Cisco) is an American multinational corporation, multinational digital communications technology conglomerate (company), conglomerate corporation headquartered in San Jose, California. Cisco develops, m ...
,
Polycom Poly Inc., formerly Polycom, is an American multinational corporation that develops video, voice and content collaboration and communication technology. Poly is a subsidiary of HP Inc. Polycom was co-founded in 1990 by Brian L Hinman and Jeff ...
and others of infringing U.S. Patent No. 5,920,853. The '853 patent was filed at the
United States Patent and Trademark Office The United States Patent and Trademark Office (USPTO) is an List of federal agencies in the United States, agency in the United States Department of Commerce, U.S. Department of Commerce that serves as the national patent office and trademark ...
in 1996 by
Rockwell International Rockwell International was a major American manufacturing conglomerate (company), conglomerate. It was involved in aircraft, the space industry, defense and commercial electronics, components in the automotive industry, printing presses, avioni ...
. The inventors listed on the '853 patent are Benyassine Adil, Su Huan-Yu and Shlomot Eyal. In 2000, the '853 patent was assigned by
Rockwell International Rockwell International was a major American manufacturing conglomerate (company), conglomerate. It was involved in aircraft, the space industry, defense and commercial electronics, components in the automotive industry, printing presses, avioni ...
to
Conexant Systems Conexant Systems, Inc. was an American-based software developer and fabless manufacturing, fabless semiconductor company that developed technology for voice and audio processing, imaging and modems. The company began as a division of Rockwell In ...
, an American-based software developer and fabless semiconductor company, which began as a division of Rockwell before being spun-off as its own
public company A public company is a company whose ownership is organized via shares of share capital, stock which are intended to be freely traded on a stock exchange or in over-the-counter (finance), over-the-counter markets. A public (publicly traded) co ...
. In 2010, the '853 patent was sold by
Conexant Systems Conexant Systems, Inc. was an American-based software developer and fabless manufacturing, fabless semiconductor company that developed technology for voice and audio processing, imaging and modems. The company began as a division of Rockwell In ...
to AIM IP LLC, a
California California () is a U.S. state, state in the Western United States that lies on the West Coast of the United States, Pacific Coast. It borders Oregon to the north, Nevada and Arizona to the east, and shares Mexico–United States border, an ...
Limited Liability Company A limited liability company (LLC) is the United States-specific form of a private limited company. It is a business structure that can combine the pass-through taxation of a partnership or sole proprietorship with the limited liability of ...
based in
Mission Viejo Mission Viejo ( ; language change, corruption of ; ) is a Commuter town, commuter city in the Saddleback Valley in Orange County, California, United States. Mission Viejo is considered one of the largest Planned community, master-planned commu ...
. The '853 patent contains
patent claim In a patent or patent application, the claims define in technical terms the extent, i.e. the scope, of the protection conferred by a patent, or the protection sought in a patent application. The claims particularly point out the subject matter whi ...
s which cover lookup tables used in G.729. The patent has since expired and is no longer in force due to its
patent term The term of a patent is the maximum time during which it can be maintained in force. It is usually expressed in a number of years either starting from the filing date of the patent application or from the date of grant of the patent. In most patent ...
expiring.


RTP payload type

G.729 is assigned the static payload type 18 for RTP by
IANA The Internet Assigned Numbers Authority (IANA) is a standards organization that oversees global IP address allocation, autonomous system number allocation, root zone management in the Domain Name System (DNS), media types, and other Internet P ...
. The rtpmap parameter description for this payload type is "G729/8000". Both G.729a and G.729b use the same rtpmap description as G.729. G.729a and G.729b are indicated using annexb=no or annexb=yes, respectively. G.729 Annex B (G.729b) is the default in absence of parameter ''annexb'' in the
Session Description Protocol The Session Description Protocol (SDP) is a format for describing multimedia communication sessions for the purposes of announcement and invitation. Its predominant use is in support of streaming media applications, such as voice over IP (VoIP) ...
.


See also

*
List of codecs The following is a list of compression formats and related codecs. Audio compression formats Non-compression * Linear pulse-code modulation (LPCM, generally only described as PCM) is the format for uncompressed audio in media files and it is ...
*
Comparison of audio coding formats The following tables compare general and technical information for a variety of audio coding formats. For listening tests comparing the perceived audio quality of audio formats and codecs, see the article Codec listening test. General informati ...
*
RTP audio video profile The Real-time Transport Protocol (RTP) specifies a general-purpose data format and network protocol for transmitting digital media streams on Internet Protocol (IP) networks. The details of media encoding, such as signal sampling rate, frame size a ...


References


External links


ITU-T Recommendation G.729
- technical specification.
G.729 Error Recovery for Internet Telephony



Sipro Lab Telecom (administers the patent pools for G.723.1 and G.729)

Voiceage's G.729 implementation
{{Compression formats Audio codecs Speech codecs ITU-T recommendations ITU-T G Series Recommendations