Selectable Mode Vocoder
   HOME

TheInfoList



OR:

Selectable Mode Vocoder (SMV) is
variable bitrate Variable bitrate (VBR) is a term used in telecommunications and computing that relates to the bitrate used in sound or video encoding. As opposed to constant bitrate (CBR), VBR files vary the amount of output data per time segment. VBR allows a ...
speech coding Speech coding is an application of data compression of digital audio signals containing speech. Speech coding uses speech-specific parameter estimation using audio signal processing techniques to model the speech signal, combined with generic d ...
standard used in
CDMA2000 CDMA2000 (also known as C2K or IMT Multi‑Carrier (IMT‑MC)) is a family of 3G mobile technology standards for sending voice, data, and Signaling (telecommunication), signaling data between mobile phones and cell sites. It is developed by 3GP ...
networks. SMV provides multiple modes of operation that are selected based on input speech characteristics. The SMV for
Wideband In communications, a system is wideband when the message bandwidth significantly exceeds the coherence bandwidth of the channel. Some communication links have such a high data rate that they are forced to use a wide bandwidth; other links ma ...
CDMA Code-division multiple access (CDMA) is a channel access method used by various radio communication technologies. CDMA is an example of multiple access, where several transmitters can send information simultaneously over a single communicatio ...
is based on 4 codecs: full rate at 8.5 kbit/s, half rate at 4 kbit/s, quarter rate at 2 kbit/s, and eighth rate at 800 bit/s. The full rate and half rate are based on the
CELP Code-excited linear prediction (CELP) is a linear predictive speech coding algorithm originally proposed by Manfred R. Schroeder and Bishnu S. Atal in 1985. At the time, it provided significantly better quality than existing low bit-rate algori ...
algorithm In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific problems or to perform a computation. Algorithms are used as specifications for performing ...
that is based on a combined closed-loop-open-loop-analysis (COLA). In SMV the signal frames are first classified as: * Silence/Background noise * Non-stationary unvoiced * Stationary unvoiced * Onset * Non-stationary voiced * Stationary voiced The algorithm includes
voice activity detection Voice activity detection (VAD), also known as speech activity detection or speech detection, is the detection of the presence or absence of human speech, used in speech processing. The main uses of VAD are in speech coding and speech recognition. I ...
(VAD) followed by an elaborate frame classification scheme. Silence/background noise and stationary unvoiced frames are represented by
spectrum A spectrum (plural ''spectra'' or ''spectrums'') is a condition that is not limited to a specific set of values but can vary, without gaps, across a continuum. The word was first used scientifically in optics to describe the rainbow of colors ...
- modulated noise and coded at 1/4 or 1/8 rate. The SMV uses 4 subframes for full rate and two/three subframes for half rate. The stochastic (fixed) codebook structure is also elaborate and uses sub-codebooks each tuned for a particular type of speech. The sub-codebooks have different degrees of pulse sparseness (more sparse for noise like excitation). SMV scores a high of 3.6
MOS MOS or Mos may refer to: Technology * MOSFET (metal–oxide–semiconductor field-effect transistor), also known as the MOS transistor * Mathematical Optimization Society * Model output statistics, a weather-forecasting technique * MOS (fil ...
at full rate with clean speech. The coder works on a frame of 160 speech samples (20 ms) and requires a look ahead of 80 samples (10 ms) if noise-suppression option B is used. An additional 24 samples of look ahead is required if noise-suppression option A is used. So the algorithmic delay for the coder is 30 ms with noise-suppression option B and 33 ms with noise-suppression option A. The next evolution of CDMA speech codecs is
VMR-WB Variable-Rate Multimode Wideband (VMR-WB) is a source-controlled variable-rate multimode codec designed for robust encoding/decoding of wideband/narrowband speech. The operation of VMR-WB is controlled by speech signal characteristics (i.e., source ...
which provides much higher speech quality with
wideband In communications, a system is wideband when the message bandwidth significantly exceeds the coherence bandwidth of the channel. Some communication links have such a high data rate that they are forced to use a wide bandwidth; other links ma ...
while fitting to the same networks. SMV can be also used in 3GPP2 container file format –
3G2 3GP (3GPP file format) is a multimedia container format defined by the Third Generation Partnership Project (3GPP) for 3G UMTS multimedia services. It is used on 3G mobile phones but can also be played on some 2G and 4G phones. 3G2 (3GPP2 ...
.


References


External links

* - RTP Payload Format for Enhanced Variable Rate Codecs (EVRC) and Selectable Mode Vocoders (SMV) {{Compression formats Speech codecs 3rd Generation Partnership Project 2 standards Mobile telecommunications standards Code division multiple access