Continuously variable slope delta modulation (CVSD or CVSDM) is a
voice coding method. It is a
delta modulation with variable step size (i.e., special case of
adaptive delta modulation), first proposed by Greefkes and Riemens in 1970.
CVSD encodes at 1 bit per sample, so that audio sampled at 16 kHz is encoded at 16 kbit/s.
The encoder maintains a reference sample and a step size. Each input sample is compared to the reference sample. If the input sample is larger, the encoder emits a ''1'' bit and adds the step size to the reference sample. If the input sample is smaller, the encoder emits a ''0'' bit and subtracts the step size from the reference sample. The encoder also keeps the previous ''N'' bits of output (''N'' = 3 or ''N'' = 4 are very common) to determine adjustments to the step size; if the previous ''N'' bits are all 1s or 0s, the step size is increased. Otherwise, the step size is decreased (usually in an exponential manner, with
being in the range of 5 ms). The step size is adjusted for every input sample processed.
To allow for bit errors to fade out and to allow (re)synchronization to an ongoing bitstream, the output register (which keeps the reference sample) is normally realized as a
leaky integrator
In mathematics
Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in mod ...
with a time constant (
) of about 1 ms.
The decoder reverses this process, starting with the reference sample, and adding or subtracting the step size according to the bit stream. The sequence of adjusted reference samples are the reconstructed waveform, and the step size is adjusted according to the same all-1s-or-0s logic as in the encoder.
Adaptation of step size allows one to avoid slope overload (step of
quantization increases when the signal rapidly changes) and decreases
granular noise when the signal is constant (decrease of step of quantisation).
CVSD is sometimes called a compromise between simplicity, low bitrate, and quality. Common bitrates are 9.6–128 kbit/s.
Like other delta-modulation techniques, the output of the decoder
does not exactly match the original input to the encoder.
Applications
12 kbit/s CVSD is used by
Motorola
Motorola, Inc. () was an American multinational telecommunications company based in Schaumburg, Illinois, United States. After having lost $4.3 billion from 2007 to 2009, the company split into two independent public companies, Motorola ...
's SECURENET line of digitally encrypted two-way radio products.
16 and 32 kbit/s CVSD were used by military
TRI-TAC digital telephones (DNVT, DSVT) for use in deployed areas to provide voice recognition quality audio. 16 kbit/s rates were typically used by
US Army
The United States Army (USA) is the land service branch of the United States Armed Forces. It is one of the eight U.S. uniformed services, and is designated as the Army of the United States in the U.S. Constitution.Article II, section 2, c ...
forces to conserve
bandwidth over tactical links. 32 kbit/s rates were typically used by
US Air Force
The United States Air Force (USAF) is the Aerial warfare, air military branch, service branch of the United States Armed Forces, and is one of the eight uniformed services of the United States. Originally created on 1 August 1907, as a part ...
forces for improved voice quality.
64 kbit/s CVSD is one of the options to encode voice signals in telephony-related
Bluetooth
Bluetooth is a short-range wireless technology standard that is used for exchanging data between fixed and mobile devices over short distances and building personal area networks (PANs). In the most widely used mode, transmission power is limit ...
service profiles; e.g., between mobile phones and wireless headsets. The other options are
PCM
Pulse-code modulation (PCM) is a method used to digitally represent sampled analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the amp ...
with logarithmic
a-law
An A-law algorithm is a standard companding algorithm, used in European 8-bit PCM digital communications systems to optimize, i.e. modify, the dynamic range of an analog signal for digitizing. It is one of two versions of the G.711 standard ...
or
μ-law quantization, as well as mSBC codec featuring 16 kHz sample rate and best quality.
Numerous arcade games, such as ''
Sinistar'' and ''
Smash TV
''Smash TV'' is a 1990 arcade video game created by Eugene Jarvis and Mark Turmell for Williams Electronics Games. It is a dual-stick shooter (one for moving and the other for firing) in the same vein as 1982's ''Robotron: 2084'' (co-created b ...
'', and pinball machines, such as ''
Gorgar'' or ''
Space Shuttle
The Space Shuttle is a retired, partially reusable low Earth orbital spacecraft system operated from 1981 to 2011 by the U.S. National Aeronautics and Space Administration (NASA) as part of the Space Shuttle program. Its official program na ...
'', play pre-recorded speech through an HC-55516 CVSD decoder.
Williams/Midway Y-Unit games
/ref>
SBS application 24 kbit/s delta modulation
Delta modulation was used by Satellite Business Systems or SBS for its voice ports to provide long-distance phone service to large domestic corporations with a significant inter-corporation communications need (such as IBM). This system was in service throughout the 1980s. The voice ports used ''digitally implemented 24 kbit/s delta modulation'' with voice activity compression (VAC) and echo suppressors to control the half second echo path through the satellite. Listening tests were conducted to verify that the ''24 kbit/s Delta Modulator'' achieved "full voice quality" with no discernible degradation as compared to a high quality phone line or the standard 64 kbit/s μ-law companded PCM. This provided an 8:3 improvement in satellite channel capacity. IBM developed the Satellite Communications Controller and the voice port functions.
The original proposal in 1974 used a state-of-the-art 24 kbit/s Delta Modulator with a single integrator and a Shindler compander modified for gain error recovery. This proved to have less than full phone line speech quality. In 1977, one engineer with two assistants in the IBM Research Triangle Park
Research Triangle Park (RTP) is the largest research park in the United States, occupying in North Carolina and hosting more than 300 companies and 65,000 workers.
The facility is named for its location relative to the three surrounding citie ...
, NC laboratory was assigned to improve the quality.
The final implementation replaced the integrator with a predictor implemented with a two-pole complex-pair low-pass filter designed to approximate the long-term average speech spectrum. The theory was that ideally the integrator should be a predictor designed to match the signal spectrum. A nearly perfect Shindler compander replaced the modified version. It was found the modified compander resulted in a less than perfect step size at most signal levels and the fast gain error recovery increased the noise as determined by actual listening tests as compared to simple signal to noise measurements. The final compander achieved a very mild gain error recovery due to the natural truncation rounding error caused by 12-bit arithmetic.
The complete function of delta modulation, VAC, and echo control for 6 ports was implemented in a single digital integrated circuit chip with 12-bit arithmetic. A single DAC was shared by all 6 ports providing voltage compare functions for the modulators and feeding sample and hold circuits for the demodulator outputs. A single card held the chip, DAC, and all the analog circuits for the phone line interface including transformers.
See also
* Pulse-width modulation
Pulse-width modulation (PWM), or pulse-duration modulation (PDM), is a method of reducing the average power delivered by an electrical signal, by effectively chopping it up into discrete parts. The average value of voltage (and current) fed ...
* Delta-sigma modulation
Delta-sigma (ΔΣ; or sigma-delta, ΣΔ) modulation is a method for encoding analog signals into digital signals as found in an analog-to-digital converter (ADC). It is also used to convert high bit-count, low-frequency digital signals into l ...
References
* J. A. Greefkes and K. Riemens, "Code Modulation with Digitally Controlled Companding for Speech Transmission," Philips Tech. Rev., pp. 335-353, 1970.
* N.S. Jayant, "Digital coding of speech waveforms: PCM, DPCM, and DM quantizers," Proc. IEEE, vol. 62, no. 5, pp. 61 1-632, May 1974.
* R. Steele, ''Delta Modulation Systems'', Pentech Press, London, England, 1975.
* N. S. Jayant and P. Noll, ''Digital Coding of Waveforms: Principles and Applications to Speech and Video'', Prentice-Hall, Englewood Cliffs, N. J., 1984.
A description of the algorithm, plus speech samples
* Specification of the Bluetooth System 2.0 + EDR, Core System Package, Part B "Baseband Specification", Section 9 "Audio", November 2004
{{reflist
Audio codecs
Speech codecs