Continuously variable slope delta modulation (CVSD or CVSDM) is a voice coding method. It is a delta modulation with variable step size (i.e., special case of adaptive delta modulation), first proposed by Greefkes and Riemens in 1970. CVSD encodes at 1 bit per sample, so that audio sampled at 16 kHz is encoded at 16 kbit/s. The encoder maintains a reference sample and a step size. Each input sample is compared to the reference sample. If the input sample is larger, the encoder emits a ''1'' bit and adds the step size to the reference sample. If the input sample is smaller, the encoder emits a ''0'' bit and subtracts the step size from the reference sample. The encoder also keeps the previous ''N'' bits of output (''N'' = 3 or ''N'' = 4 are very common) to determine adjustments to the step size; if the previous ''N'' bits are all 1s or 0s, the step size is increased. Otherwise, the step size is decreased (usually in an exponential manner, with

\tau

being in the range of 5 ms). The step size is adjusted for every input sample processed. To allow for bit errors to fade out and to allow (re)synchronization to an ongoing bitstream, the output register (which keeps the reference sample) is normally realized as a leaky integrator with a time constant (

\tau

) of about 1 ms. The decoder reverses this process, starting with the reference sample, and adding or subtracting the step size according to the bit stream. The sequence of adjusted reference samples are the reconstructed waveform, and the step size is adjusted according to the same all-1s-or-0s logic as in the encoder. Adaptation of step size allows one to avoid slope overload (step of quantization increases when the signal rapidly changes) and decreases granular noise when the signal is constant (decrease of step of quantisation). CVSD is sometimes called a compromise between simplicity, low bitrate, and quality. Common bitrates are 9.6–128 kbit/s. Like other delta-modulation techniques, the output of the decoder does not exactly match the original input to the encoder.

Applications

12 kbit/s CVSD is used by

Motorola Motorola, Inc. () was an American multinational telecommunications company based in Schaumburg, Illinois. It was founded by brothers Paul and Joseph Galvin in 1928 and had been named Motorola since 1947. Many of Motorola's products had been ...

's SECURENET line of digitally encrypted two-way radio products. 16 and 32 kbit/s CVSD were used by military TRI-TAC digital telephones (DNVT, DSVT) for use in deployed areas to provide voice recognition quality audio. 16 kbit/s rates were typically used by

US Army The United States Army (USA) is the primary land service branch of the United States Department of Defense. It is designated as the Army of the United States in the United States Constitution.Article II, section 2, clause 1 of the United Stat ...

forces to conserve bandwidth over tactical links. 32 kbit/s rates were typically used by

US Air Force The United States Air Force (USAF) is the Air force, air service branch of the United States Department of Defense. It is one of the six United States Armed Forces and one of the eight uniformed services of the United States. Tracing its ori ...

forces for improved voice quality. 64 kbit/s CVSD is one of the options to encode voice signals in telephony-related

Bluetooth Bluetooth is a short-range wireless technology standard that is used for exchanging data between fixed and mobile devices over short distances and building personal area networks (PANs). In the most widely used mode, transmission power is li ...

service profiles; e.g., between mobile phones and wireless headsets. The other options are PCM with logarithmic a-law or μ-law quantization, as well as mSBC codec featuring 16 kHz sample rate and best quality. Numerous arcade games, such as ''

Sinistar ''Sinistar'' is a 1983 multidirectional shooter arcade video game developed and manufactured by Williams Electronics. It was created by Sam Dicker, Jack Haeger, Noah Falstein, RJ Mical, Python Anghelo, and Richard Witt. Players control a sp ...

'' and '' Smash TV'', and pinball machines, such as ''

Gorgar ''Gorgar'' is a 1979 pinball machine designed by Barry Oursler and released by WMS Industries, Williams Electronics. It was the first speech synthesis, speech-synthesized ("talking") pinball machine, containing a vocabulary of seven words ("Gorg ...

'' or ''

Space Shuttle The Space Shuttle is a retired, partially reusable launch system, reusable low Earth orbital spacecraft system operated from 1981 to 2011 by the U.S. National Aeronautics and Space Administration (NASA) as part of the Space Shuttle program. ...

'', play pre-recorded speech through an HC-55516 CVSD decoder. The Victor 9000 computer used a MC6852-SSDA chip and a

CODEC A codec is a computer hardware or software component that encodes or decodes a data stream or signal. ''Codec'' is a portmanteau of coder/decoder. In electronic communications, an endec is a device that acts as both an encoder and a decoder o ...

supporting voice coding with CVSD modulation with a sampling rate of about 16KHz.

References

* J. A. Greefkes and K. Riemens, "Code Modulation with Digitally Controlled Companding for Speech Transmission," Philips Tech. Rev., pp. 335–353, 1970. * N.S. Jayant, "Digital coding of speech waveforms: PCM, DPCM, and DM quantizers," Proc. IEEE, vol. 62, no. 5, pp. 61 1-632, May 1974. * R. Steele, ''Delta Modulation Systems'', Pentech Press, London, England, 1975. * N. S. Jayant and P. Noll, ''Digital Coding of Waveforms: Principles and Applications to Speech and Video'', Prentice-Hall, Englewood Cliffs, N. J., 1984.
A description of the algorithm, plus speech samples
* Specification of the Bluetooth System 2.0 + EDR, Core System Package, Part B "Baseband Specification", Section 9 "Audio", November 2004 {{reflist Audio codecs Speech codecs

Applications

See also

References