G.729d
   HOME

TheInfoList



OR:

G.729 is a royalty-free narrow-band
vocoder A vocoder (, a portmanteau of ''voice'' and ''encoder'') is a category of speech coding that analyzes and synthesizes the human voice signal for audio data compression, multiplexing, voice encryption or voice transformation. The vocoder was ...
-based
audio data compression In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compressi ...
algorithm using a frame length of 10 milliseconds. It is officially described as ''Coding of speech at 8 kbit/s using
code-excited linear prediction Code-excited linear prediction (CELP) is a linear predictive speech coding algorithm originally proposed by Manfred R. Schroeder and Bishnu S. Atal in 1985. At the time, it provided significantly better quality than existing low bit-rate algori ...
''
speech coding Speech coding is an application of data compression of digital audio signals containing speech. Speech coding uses speech-specific parameter estimation using audio signal processing techniques to model the speech signal, combined with generic d ...
(CS-ACELP), and was introduced in 1996. The wide-band extension of G.729 is called
G.729.1 G.729.1 is an 8-32 kbit/s embedded speech and audio codec providing bitstream interoperability with G.729, G.729 Annex A and G.729 Annex B. Its official name is ''G.729-based embedded variable bit rate codec: An 8-32 kbit/s scalable wideband cod ...
, which equals G.729 Annex J. Because of its low bandwidth requirements, G.729 is mostly used in
voice over Internet Protocol Voice over Internet Protocol (VoIP), also called IP telephony, is a method and group of technologies for the delivery of voice communications and multimedia sessions over Internet Protocol (IP) networks, such as the Internet. The terms Internet t ...
(VoIP) applications when bandwidth must be conserved. Standard G.729 operates at a bit rate of 8 kbit/s, but extensions provide rates of 6.4 kbit/s (Annex D, F, H, I, C+) and 11.8 kbit/s (Annex E, G, H, I, C+) for worse and better speech quality, respectively. G.729 has been extended with various features, commonly designated as G.729a and G.729b: * G.729: This is the original codec using a high-complexity algorithm. * G.729A or Annex A: This version has a medium complexity, and is compatible with G.729. It provides a slightly lower voice quality. * G.729B or Annex B: This version extends G.729 with
silence suppression The term silence suppression is used in telephony to describe the process of not transmitting information over the network when one of the parties involved in a telephone call is not speaking, thereby reducing bandwidth usage. Voice is carried over ...
, and is not compatible with the previous versions. * G.729AB: This version extends G.729A with silence suppression, and is only compatible with G.729B. *
G.729.1 G.729.1 is an 8-32 kbit/s embedded speech and audio codec providing bitstream interoperability with G.729, G.729 Annex A and G.729 Annex B. Its official name is ''G.729-based embedded variable bit rate codec: An 8-32 kbit/s scalable wideband cod ...
or Annex J: This version extends G.729A and B with scalable variable encoding using hierarchical enhancement layers. It provides support for
wideband In communications, a system is wideband when the message bandwidth significantly exceeds the coherence bandwidth of the channel. Some communication links have such a high data rate that they are forced to use a wide bandwidth; other links ma ...
speech and audio, using modified discrete cosine transform (MDCT) coding. Dual-tone multi-frequency signaling (DTMF),
fax Fax (short for facsimile), sometimes called telecopying or telefax (the latter short for telefacsimile), is the telephonic transmission of scanned printed material (both text and images), normally to a telephone number connected to a printer o ...
transmissions, and high-quality audio cannot be transported reliably with this
codec A codec is a device or computer program that encodes or decodes a data stream or signal. ''Codec'' is a portmanteau of coder/decoder. In electronic communications, an endec is a device that acts as both an encoder and a decoder on a signal or ...
. DTMF requires the use of the named telephony events in the RTP payload for DTMF digits, telephony tones, and telephony signals as specified in RFC 4733.


G.729 annexes


G.729 Annex A

G.729a is a compatible extension of G.729, but requires less computational power. This lower complexity, however, bears the cost of marginally reduced speech quality. G.729a was developed by a consortium of organizations: France Télécom,
Mitsubishi Electric Corporation , established on 15 January 1921, is a Japanese multinational electronics and electrical equipment manufacturing company headquartered in Tokyo, Japan. It is one of the core companies of Mitsubishi. The products from MELCO include elevators an ...
, Nippon Telegraph and Telephone Corporation (NTT). The features of G.729a are: * Sampling frequency 8 kHz/16-bit (80 samples for 10 ms frames) * Fixed bit rate (8 kbit/s 10 ms frames) * Fixed frame size (10 bytes (80 bits) for 10 ms frame) * Algorithmic delay is 15 ms per frame, with 5 ms look-ahead delay * G.729a is a hybrid speech coder which uses
Algebraic Code Excited Linear Prediction Algebraic code-excited linear prediction (ACELP) is a speech coding algorithm in which a limited set of pulses is distributed as excitation to a linear prediction filter. It is a linear predictive coding (LPC) algorithm that is based on the cod ...
(ACELP) * The complexity of the algorithm is rated at 15, using a relative scale where
G.711 G.711 is a narrowband audio codec originally designed for use in telephony that provides toll-quality audio at 64 kbit/s. G.711 passes audio signals in the range of 300–3400 Hz and samples them at the rate of 8,000 samples per second ...
is 1 and
G.723.1 G.723.1 is an audio codec for voice that compresses voice audio in frames. An algorithmic look-ahead of duration means that total algorithmic delay is . Its official name is ''Dual rate speech coder for multimedia communications transmitting at ...
is 25. *
PSQM Perceptual Speech Quality Measure (PSQM) is a computational and modeling algorithm defined in Recommendation ITU-T P.861 that objectively evaluates and quantifies voice quality of voice-band (300 – 3400 Hz) speech codecs. It may be used to ...
testing under ideal conditions yields
mean opinion score Mean opinion score (MOS) is a measure used in the domain of Quality of Experience and telecommunications engineering, representing overall quality of a stimulus or system. It is the arithmetic mean over all individual "values on a predefined scale t ...
s of 4.04 for G.729a, compared to 4.45 for
G.711 G.711 is a narrowband audio codec originally designed for use in telephony that provides toll-quality audio at 64 kbit/s. G.711 passes audio signals in the range of 300–3400 Hz and samples them at the rate of 8,000 samples per second ...
(μ-law) * PSQM testing under network stress yields mean opinion scores of 3.51 for G.729a, compared to 4.13 for
G.711 G.711 is a narrowband audio codec originally designed for use in telephony that provides toll-quality audio at 64 kbit/s. G.711 passes audio signals in the range of 300–3400 Hz and samples them at the rate of 8,000 samples per second ...
(μ-law) Some VoIP phones incorrectly use the description "G729a/8000" in SDP (e.g. this affects some Cisco and Linksys phones). This is incorrect as G729a is an alternative method of encoding the audio, but still generates data decodable by either G729 or G729a - i.e. there is no difference in terms of codec negotiation. Since the SDP RFC allows static payload types to be overridden by the textual rtpmap description this can cause problems calling from these phones to endpoints adhering to the RFC unless the codec is renamed in their settings since they will not recognise 'G729a' as 'G729' without a specific workaround in place for the bug.


G.729 Annex B

G.729 has been extended in Annex B (G.729b) which provides a
silence Silence is the absence of ambient audible sound, the emission of sounds of such low intensity that they do not draw attention to themselves, or the state of having ceased to produce sounds; this latter sense can be extended to apply to the c ...
compression method that enables a
voice activity detection Voice activity detection (VAD), also known as speech activity detection or speech detection, is the detection of the presence or absence of human speech, used in speech processing. The main uses of VAD are in speech coding and speech recognition. I ...
(VAD) module. It is used to detect voice activity in the signal. It also includes a
discontinuous transmission Discontinuous transmission (DTX) is a means by which a mobile telephone is temporarily shut off or muted while the phone lacks a voice input. Misconception A common misconception is that DTX improves capacity by freeing up TDMA time slots for us ...
(DTX) module which decides on updating the
background noise Background noise or ambient noise is any sound other than the sound being monitored (primary sound). Background noise is a form of noise pollution or interference. Background noise is an important concept in setting noise levels. Background n ...
parameters for non speech (noisy frames). It uses 2-byte Silence Insertion Descriptor (SID) frames transmitted to initiate
comfort noise Comfort noise (or comfort tone) is synthetic background noise used in radio and wireless communications to fill the artificial silence in a transmission resulting from voice activity detection or from the audio clarity of modern digital lines. ...
generation (CNG). If transmission is stopped, and the link goes quiet because of no speech, the receiving side might assume that the link has been cut. By inserting comfort noise, analog hiss is simulated digitally during silence to assure the receiver that the link is active and operational.


G.729 Annex J (G.729.1)

G.729 Annex J, maintained by
G.729.1 G.729.1 is an 8-32 kbit/s embedded speech and audio codec providing bitstream interoperability with G.729, G.729 Annex A and G.729 Annex B. Its official name is ''G.729-based embedded variable bit rate codec: An 8-32 kbit/s scalable wideband cod ...
, provides support for
wideband In communications, a system is wideband when the message bandwidth significantly exceeds the coherence bandwidth of the channel. Some communication links have such a high data rate that they are forced to use a wide bandwidth; other links ma ...
speech and audio. Introduced in 2006, it defines variable bit-rate wideband enhancement using up to 12 hierarchical layers. The core layer is an 8 kbit/s G.729 bitstream, the second layer is a 4 kbit/s
narrowband Narrowband signals are signals that occupy a narrow range of frequencies or that have a small fractional bandwidth. In the audio spectrum, narrowband sounds are sounds that occupy a narrow range of frequencies. In telephony, narrowband is usua ...
enhancement layer, and the third 2 kibt/s layer is a bandwidth enhancement layer. Further layers provide wideband enhancement in 2 kbit/s steps. The G.729.1 uses three-stage coding: embedded
code-excited linear prediction Code-excited linear prediction (CELP) is a linear predictive speech coding algorithm originally proposed by Manfred R. Schroeder and Bishnu S. Atal in 1985. At the time, it provided significantly better quality than existing low bit-rate algori ...
(CELP) coding of the lower band, parametric coding of the higher band by Time-Domain Bandwidth Extension (TDBWE), and enhancement of the full band by a predictive
transform coding Transform coding is a type of data compression for "natural" data like audio signals or photographic images. The transformation is typically lossless (perfectly reversible) on its own but is used to enable better (more targeted) quantization, ...
algorithm called
time-domain aliasing cancellation The modified discrete cosine transform (MDCT) is a transform based on the type-IV discrete cosine transform (DCT-IV), with the additional property of being lapped: it is designed to be performed on consecutive blocks of a larger dataset, where su ...
(TDAC), also known as modified discrete cosine transform (MDCT) coding. Bit rate and the obtained quality are adjustable by simple bitstream truncation.


Licensing

As of January 1, 2017, the patent terms of most licensed patents under the G.729 Consortium have expired, the remaining unexpired patents are usable on a royalty-free basis. G.729 includes patents from several companies which were until the expiry licensed by Sipro Lab Telecom, the authorized Intellectual Property Licensing Administrator for G.729 technology and
patent pool In patent law, a patent pool is a consortium of at least two companies agreeing to cross-license patents relating to a particular technology. The creation of a patent pool can save patentees and licensees time and money, and, in case of blocking ...
.


Past patent litigation

AIM IP LLC, a
California California is a state in the Western United States, located along the Pacific Coast. With nearly 39.2million residents across a total area of approximately , it is the most populous U.S. state and the 3rd largest by area. It is also the m ...
Limited Liability Company A limited liability company (LLC for short) is the US-specific form of a private limited company. It is a business structure that can combine the pass-through taxation of a partnership or sole proprietorship with the limited liability of ...
based in
Mission Viejo Mission Viejo ( ; corruption of ''Misión Vieja'', Spanish for "Old Mission") is a commuter city in the Saddleback Valley in Orange County, California, United States. Mission Viejo is considered one of the largest master-planned communities eve ...
, CA filed 17 patent infringement lawsuits in the Central District Courts of California accusing 22 different companies, including,
Cisco Systems Cisco Systems, Inc., commonly known as Cisco, is an American-based multinational digital communications technology conglomerate corporation headquartered in San Jose, California. Cisco develops, manufactures, and sells networking hardware, ...
,
Polycom Poly, formerly Polycom, a part of HP Inc., is an American multinational corporation that develops video, voice and content collaboration and communication technology. Polycom was co-founded in 1990 by Brian L Hinman and Jeffrey Rodman. In 2018 ...
and others of infringing U.S. Patent No. 5,920,853. The '853 patent was filed at the
United States Patent and Trademark Office The United States Patent and Trademark Office (USPTO) is an agency in the U.S. Department of Commerce that serves as the national patent office and trademark registration authority for the United States. The USPTO's headquarters are in Alex ...
in 1996 by
Rockwell International Rockwell International was a major American manufacturing conglomerate involved in aircraft, the space industry, defense and commercial electronics, components in the automotive industry, printing presses, avionics and industrial products. R ...
. The inventors listed on the '853 patent are Benyassine Adil, Su Huan-Yu and Shlomot Eyal. In 2000, the '853 patent was assigned by
Rockwell International Rockwell International was a major American manufacturing conglomerate involved in aircraft, the space industry, defense and commercial electronics, components in the automotive industry, printing presses, avionics and industrial products. R ...
to
Conexant Systems Conexant Systems, Inc. was an American-based software developer and fabless semiconductor company that developed technology for voice and audio processing, imaging and modems. The company began as a division of Rockwell International, before be ...
, an American-based software developer and fabless semiconductor company, which began as a division of Rockwell before being spun-off as its own
public company A public company is a company whose ownership is organized via shares of stock which are intended to be freely traded on a stock exchange or in over-the-counter markets. A public (publicly traded) company can be listed on a stock exchange ( ...
. In 2010, the '853 patent was sold by
Conexant Systems Conexant Systems, Inc. was an American-based software developer and fabless semiconductor company that developed technology for voice and audio processing, imaging and modems. The company began as a division of Rockwell International, before be ...
to AIM IP LLC, a
California California is a state in the Western United States, located along the Pacific Coast. With nearly 39.2million residents across a total area of approximately , it is the most populous U.S. state and the 3rd largest by area. It is also the m ...
Limited Liability Company A limited liability company (LLC for short) is the US-specific form of a private limited company. It is a business structure that can combine the pass-through taxation of a partnership or sole proprietorship with the limited liability of ...
based in
Mission Viejo Mission Viejo ( ; corruption of ''Misión Vieja'', Spanish for "Old Mission") is a commuter city in the Saddleback Valley in Orange County, California, United States. Mission Viejo is considered one of the largest master-planned communities eve ...
. The '853 patent contains
patent claim In a patent or patent application, the claims define, in technical terms, the extent, i.e. the scope, of the protection conferred by a patent, or the protection sought in a patent application. In other words, the purpose of the claims is to define ...
s which cover lookup tables used in G.729. The patent has since expired and is no longer in force due to its patent term expiring.


RTP payload type

G.729 is assigned the static payload type 18 for RTP by IANA. The rtpmap parameter description for this payload type is "G729/8000". Both G.729a and G.729b use the same rtpmap description as G.729. G.729a and G.729b are indicated using annexb=no or annexb=yes, respectively. G.729 Annex B (G.729b) is the default in absence of parameter ''annexb'' in the
Session Description Protocol The Session Description Protocol (SDP) is a format for describing multimedia communication sessions for the purposes of announcement and invitation. Its predominant use is in support of streaming media applications, such as voice over IP (VoIP) ...
.


See also

* List of codecs *
Comparison of audio coding formats The following tables compare general and technical information for a variety of audio coding formats. For listening tests comparing the perceived audio quality of audio formats and codecs, see the article Codec listening test. General informati ...
*
RTP audio video profile The Real-time Transport Protocol (RTP) specifies a general-purpose data format and network protocol for transmitting digital media streams on Internet Protocol (IP) networks. The details of media encoding, such as signal sampling rate, frame size an ...


References


External links


ITU-T Recommendation G.729
- technical specification.
G.729 Error Recovery for Internet Telephony



Sipro Lab Telecom (administers the patent pools for G.723.1 and G.729)

Voiceage's G.729 implementation
{{Compression formats Audio codecs Speech codecs ITU-T recommendations ITU-T G Series Recommendations