HOME

TheInfoList



OR:

A vocoder (, a
portmanteau In linguistics, a blend—also known as a blend word, lexical blend, or portmanteau—is a word formed by combining the meanings, and parts of the sounds, of two or more words together.
of ''vo''ice and en''coder'') is a
category Category, plural categories, may refer to: General uses *Classification, the general act of allocating things to classes/categories Philosophy * Category of being * ''Categories'' (Aristotle) * Category (Kant) * Categories (Peirce) * Category ( ...
of
speech coding Speech coding is an application of data compression to digital audio signals containing speech. Speech coding uses speech-specific parameter estimation using audio signal processing techniques to model the speech signal, combined with generic da ...
that analyzes and synthesizes the human voice signal for audio data compression, multiplexing, voice encryption or voice transformation. The vocoder was invented in 1938 by Homer Dudley at
Bell Labs Nokia Bell Labs, commonly referred to as ''Bell Labs'', is an American industrial research and development company owned by Finnish technology company Nokia. With headquarters located in Murray Hill, New Jersey, Murray Hill, New Jersey, the compa ...
as a means of synthesizing human speech. This work was developed into the channel vocoder which was used as a voice
codec A codec is a computer hardware or software component that encodes or decodes a data stream or signal. ''Codec'' is a portmanteau of coder/decoder. In electronic communications, an endec is a device that acts as both an encoder and a decoder o ...
for
telecommunications Telecommunication, often used in its plural form or abbreviated as telecom, is the transmission of information over a distance using electronic means, typically through cables, radio waves, or other communication technologies. These means of ...
for speech coding to conserve bandwidth in transmission. By encrypting the control signals, voice transmission can be secured against interception. Its primary use in this fashion is for secure radio communication. The advantage of this method of encryption is that none of the original signal is sent, only envelopes of the bandpass filters. The receiving unit needs to be set up in the same filter configuration to re-synthesize a version of the original signal spectrum. The vocoder has also been used extensively as an
electronic musical instrument An electronic musical instrument or electrophone is a musical instrument that produces sound using electronics, electronic circuitry. Such an instrument sounds by outputting an electrical, electronic or digital audio signal that ultimately is ...
. The decoder portion of the vocoder, called a voder, can be used independently for speech synthesis.


Theory

The
human voice The human voice consists of sound Voice production, made by a human being using the vocal tract, including Speech, talking, singing, Laughter, laughing, crying, screaming, shouting, humming or yelling. The human voice frequency is specifically ...
consists of sounds generated by the periodic opening and closing of the
glottis The glottis (: glottises or glottides) is the opening between the vocal folds (the rima glottidis). The glottis is crucial in producing sound from the vocal folds. Etymology From Ancient Greek ''γλωττίς'' (glōttís), derived from ''γ ...
by the
vocal cords In humans, the vocal cords, also known as vocal folds, are folds of throat tissues that are key in creating sounds through Speech, vocalization. The length of the vocal cords affects the pitch of voice, similar to a violin string. Open when brea ...
, which produces an acoustic waveform with many
harmonic In physics, acoustics, and telecommunications, a harmonic is a sinusoidal wave with a frequency that is a positive integer multiple of the ''fundamental frequency'' of a periodic signal. The fundamental frequency is also called the ''1st har ...
s. This initial sound is then filtered by movements in the nose, mouth and throat (a complicated resonant piping system known as the
vocal tract The vocal tract is the cavity in human bodies and in animals where the sound produced at the sound source (larynx in mammals; syrinx in birds) is filtered. In birds, it consists of the trachea, the syrinx, the oral cavity, the upper part of t ...
) to produce fluctuations in harmonic content (
formant In speech science and phonetics, a formant is the broad spectral maximum that results from an acoustic resonance of the human vocal tract. In acoustics, a formant is usually defined as a broad peak, or local maximum, in the spectrum. For harmo ...
s) in a controlled way, creating the wide variety of sounds used in speech. There is another set of sounds, known as the unvoiced and
plosive In phonetics, a plosive, also known as an occlusive or simply a stop, is a pulmonic consonant in which the vocal tract is blocked so that all airflow ceases. The occlusion may be made with the tongue tip or blade (, ), tongue body (, ), lip ...
sounds, which are created or modified by a variety of sound generating disruptions of airflow occurring in the
vocal tract The vocal tract is the cavity in human bodies and in animals where the sound produced at the sound source (larynx in mammals; syrinx in birds) is filtered. In birds, it consists of the trachea, the syrinx, the oral cavity, the upper part of t ...
. The vocoder analyzes speech by measuring how its spectral energy distribution characteristics fluctuate across time. This analysis results in a set of temporally parallel
envelope An envelope is a common packaging item, usually made of thin, flat material. It is designed to contain a flat object, such as a letter (message), letter or Greeting card, card. Traditional envelopes are made from sheets of paper cut to one o ...
signals, each representing the individual
frequency band Spectral bands are regions of a given spectrum, having a specific range of wavelengths or frequencies. Most often, it refers to electromagnetic bands, regions of the electromagnetic spectrum. More generally, spectral bands may also be means in ...
amplitudes of the user's speech. Put another way, the voice signal is divided into a number of frequency bands (the larger this number, the more accurate the analysis) and the level of signal present at each frequency band, occurring simultaneously, measured by an envelope follower, represents the spectral energy distribution across time. This set of envelope amplitude signals is called the "modulator". To recreate speech, the vocoder reverses the analysis process, variably filtering an initial broadband noise (referred to alternately as the "source" or "carrier"), by passing it through a set of band-pass filters, whose individual envelope amplitude levels are controlled, in real time, by the set of analyzed envelope amplitude signals from the modulator. The digital encoding process involves a periodic analysis of each of the modulator's multiband set of filter envelope amplitudes. This analysis results in a set of digital pulse code modulation stream readings. Then the pulse code modulation stream outputs of each band are transmitted to a decoder. The decoder applies the pulse code modulations as control signals to corresponding amplifiers of the output filter channels. Information about the
fundamental frequency The fundamental frequency, often referred to simply as the ''fundamental'' (abbreviated as 0 or 1 ), is defined as the lowest frequency of a Periodic signal, periodic waveform. In music, the fundamental is the musical pitch (music), pitch of a n ...
of the initial voice signal (as distinct from its spectral characteristic) is discarded; it was not important to preserve this for the vocoder's original use as an encryption aid. It is this dehumanizing aspect of the vocoding process that has made it useful in creating special voice effects in popular music and audio entertainment. Instead of a point-by-point recreation of the waveform, the vocoder process sends only the parameters of the vocal model over the communication link. Since the parameters change slowly compared to the original speech waveform, the bandwidth required to transmit speech can be reduced. This allows more speech channels to utilize a given
communication channel A communication channel refers either to a physical transmission medium such as a wire, or to a logical connection over a multiplexed medium such as a radio channel in telecommunications and computer networking. A channel is used for infor ...
, such as a radio channel or a submarine cable. Analog vocoders typically analyze an incoming signal by splitting the signal into multiple tuned frequency bands or ranges. To reconstruct the signal, a
carrier signal In telecommunications, a carrier wave, carrier signal, or just carrier, is a periodic waveform (usually sinusoidal) that conveys information through a process called ''modulation''. One or more of the wave's properties, such as amplitude or frequ ...
is sent through a series of these tuned band-pass filters. In the example of a typical robot voice the carrier is noise or a sawtooth waveform. There are usually between 8 and 20 bands. The amplitude of the modulator for each of the individual analysis bands generates a voltage that is used to control amplifiers for each of the corresponding carrier bands. The result is that frequency components of the modulating signal are mapped onto the carrier signal as discrete amplitude changes in each of the frequency bands. Often there is an unvoiced band or sibilance channel. This is for frequencies that are outside the analysis bands for typical speech but are still important in speech. Examples are words that start with the letters ''s'', ''f'', ''ch'' or any other sibilant sound. Using this band produces recognizable speech, although somewhat mechanical sounding. Vocoders often include a second system for generating unvoiced sounds, using a
noise generator A noise generator is a circuit that produces electrical noise (i.e., a random signal). Noise generators are used to test signals for measuring noise figure, frequency response, and other parameters. Noise generators are also used for Random numbe ...
instead of the fundamental frequency. This is mixed with the carrier output to increase clarity. In the channel vocoder algorithm, among the two components of an analytic signal, considering only the
amplitude The amplitude of a periodic variable is a measure of its change in a single period (such as time or spatial period). The amplitude of a non-periodic signal is its magnitude compared with a reference value. There are various definitions of am ...
component and simply ignoring the phase component tends to result in an unclear voice; on methods for rectifying this, see phase vocoder.


History

The development of a vocoder was started in 1928 by
Bell Labs Nokia Bell Labs, commonly referred to as ''Bell Labs'', is an American industrial research and development company owned by Finnish technology company Nokia. With headquarters located in Murray Hill, New Jersey, Murray Hill, New Jersey, the compa ...
engineer Homer Dudley, who was granted patents for it on March 21, 1939, (filed October 30, 1935) and Nov 16, 1937. To demonstrate the
speech synthesis Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal langua ...
ability of its decoder section, the voder (voice operating demonstrator) was introduced to the public at the AT&T building at the 1939–1940 New York World's Fair. The voder consisted of an
electronic oscillator An electronic oscillator is an electronic circuit that produces a periodic, oscillating or alternating current (AC) signal, usually a sine wave, square wave or a triangle wave, powered by a direct current (DC) source. Oscillators are found ...
a sound source of pitched tone and
noise generator A noise generator is a circuit that produces electrical noise (i.e., a random signal). Noise generators are used to test signals for measuring noise figure, frequency response, and other parameters. Noise generators are also used for Random numbe ...
for
hiss Hiss or Hissing may refer to: * Hiss (electromagnetic), a wave generated in the plasma of the Earth's ionosphere or magnetosphere * Hiss (surname) * ''Hissing'' (manhwa), a Korean manhwa series by Kang EunYoung * Noise (electronics) or electro ...
, a 10-band resonator filters with
variable-gain amplifier A variable-gain (VGA) or voltage-controlled amplifier (VCA) is an electronic amplifier that varies its gain depending on a control voltage (often abbreviated CV). VCAs have many applications, including audio level compression, synthesizers and ...
s as a
vocal tract The vocal tract is the cavity in human bodies and in animals where the sound produced at the sound source (larynx in mammals; syrinx in birds) is filtered. In birds, it consists of the trachea, the syrinx, the oral cavity, the upper part of t ...
, and the manual controllers including a set of pressure-sensitive keys for filter control, and a foot pedal for pitch control of tone. Based on (Se
schematic diagram of the Voder synthesizer
)
The filters controlled by keys convert the tone and the hiss into
vowel A vowel is a speech sound pronounced without any stricture in the vocal tract, forming the nucleus of a syllable. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness a ...
s,
consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract, except for the h sound, which is pronounced without any stricture in the vocal tract. Examples are and pronou ...
s, and
inflection In linguistic Morphology (linguistics), morphology, inflection (less commonly, inflexion) is a process of word formation in which a word is modified to express different grammatical category, grammatical categories such as grammatical tense, ...
s. This was a complex machine to operate, but a skilled operator could produce recognizable speech. A demonstration of the voder (not the vocoder). Dudley's vocoder was used in the
SIGSALY SIGSALY (also known as the X System, Project X, Ciphony I, and the Green Hornet) was a secure voice, secure speech system used in World War II for the highest-level Allies of World War II, Allied communications. It pioneered a number of digital co ...
system, which was built by Bell Labs engineers in 1943. SIGSALY was used for encrypted voice communications during
World War II World War II or the Second World War (1 September 1939 – 2 September 1945) was a World war, global conflict between two coalitions: the Allies of World War II, Allies and the Axis powers. World War II by country, Nearly all of the wo ...
. The KO-6 voice coder was released in 1949 in limited quantities; it was a close approximation to the SIGSALY at . In 1953, KY-9 THESEUS voice coder used solid-state logic to reduce the weight to from SIGSALY's , and in 1961 the HY-2 voice coder, a 16-channel system, weighed and was the last implementation of a channel vocoder in a secure speech system. Later work in this field has since used digital
speech coding Speech coding is an application of data compression to digital audio signals containing speech. Speech coding uses speech-specific parameter estimation using audio signal processing techniques to model the speech signal, combined with generic da ...
. The most widely used speech coding technique is
linear predictive coding Linear predictive coding (LPC) is a method used mostly in audio signal processing and speech processing for representing the spectral envelope of a digital signal of speech in compressed form, using the information of a linear predictive model ...
(LPC). Another speech coding technique, adaptive differential pulse-code modulation (ADPCM), was developed by P. Cummiskey, Nikil S. Jayant and James L. Flanagan at Bell Labs in 1973.


Applications

* Terminal equipment for systems based on digital mobile radio (DMR). * Digital voice scrambling and encryption *
Cochlear implant A cochlear implant (CI) is a surgically implanted Neuroprosthetics, neuroprosthesis that provides a person who has moderate-to-profound sensorineural hearing loss with sound perception. With the help of therapy, cochlear implants may allow for imp ...
s: noise and tone vocoding is used to simulate the effects of cochlear implants. * Musical and other artistic effects


Modern implementations

Even with the need to record several frequencies, and additional unvoiced sounds, the compression of vocoder systems is impressive. Standard speech-recording systems capture frequencies from about 500 to 3,400 Hz, where most of the frequencies used in speech lie, typically using a sampling rate of 8 kHz (slightly greater than the Nyquist rate). The sampling resolution is typically 8 or more bits per sample resolution, for a data rate in the range of , but a good vocoder can provide a reasonably good simulation of voice with as little as of data. ''Toll quality'' voice coders, such as ITU G.729, are used in many telephone networks. G.729 in particular has a final data rate of with superb voice quality. G.723 achieves slightly worse quality at data rates of 5.3 and . Many voice vocoder systems use lower data rates, but below voice quality begins to drop rapidly. Several vocoder systems are used in NSA encryption systems: * LPC-10, FIPS Pub 137, , which uses
linear predictive coding Linear predictive coding (LPC) is a method used mostly in audio signal processing and speech processing for representing the spectral envelope of a digital signal of speech in compressed form, using the information of a linear predictive model ...
* Code-excited linear prediction (CELP), 2400 and , Federal Standard 1016, used in STU-III *
Continuously variable slope delta modulation Continuously variable slope delta modulation (CVSD or CVSDM) is a voice coding method. It is a delta modulation with variable step size (i.e., special case of adaptive delta modulation), first proposed by Greefkes and Riemens in 1970. CVSD enco ...
(CVSD), , used in wide band encryptors such as the KY-57. * Mixed-excitation linear prediction (MELP), MIL STD 3005, , used in the Future Narrowband Digital Terminal FNBDT, NSA's 21st century secure telephone. * Adaptive Differential Pulse Code Modulation (ADPCM), former
ITU-T The International Telecommunication Union Telecommunication Standardization Sector (ITU-T) is one of the three Sectors (branches) of the International Telecommunication Union (ITU). It is responsible for coordinating Standardization, standards fo ...
G.721, used in STE secure telephone Modern vocoders that are used in communication equipment and in voice storage devices today are based on the following algorithms: * Algebraic code-excited linear prediction (ACELP 4.7–24 kbit/s) * Mixed-excitation linear prediction (MELPe 2400, 1200 and ) * Multi-band excitation (AMBE  – ) * Sinusoidal-Pulsed Representation (SPR  – ) * Robust Advanced Low-complexity Waveform Interpolation (RALCWI 2050, 2400 and ) * Tri-Wave Excited Linear Prediction (TWELP 300–9600 bit/s) * Noise Robust Vocoder (NRV 300 and ) Vocoders are also currently used in
psychophysics Psychophysics is the field of psychology which quantitatively investigates the relationship between physical stimulus (physiology), stimuli and the sensation (psychology), sensations and perceptions they produce. Psychophysics has been described ...
,
linguistics Linguistics is the scientific study of language. The areas of linguistic analysis are syntax (rules governing the structure of sentences), semantics (meaning), Morphology (linguistics), morphology (structure of words), phonetics (speech sounds ...
,
computational neuroscience Computational neuroscience (also known as theoretical neuroscience or mathematical neuroscience) is a branch of  neuroscience which employs mathematics, computer science, theoretical analysis and abstractions of the brain to understand th ...
and
cochlear implant A cochlear implant (CI) is a surgically implanted Neuroprosthetics, neuroprosthesis that provides a person who has moderate-to-profound sensorineural hearing loss with sound perception. With the help of therapy, cochlear implants may allow for imp ...
research.


Linear prediction-based

Since the late 1970s, most non-musical vocoders have been implemented using linear prediction, whereby the target signal's spectral envelope (formant) is estimated by an all-pole IIR filter. In linear prediction coding, the all-pole filter replaces the bandpass filter bank of its predecessor and is used at the encoder to ''whiten'' the signal (i.e., flatten the spectrum) and again at the decoder to re-apply the spectral shape of the target speech signal. One advantage of this type of filtering is that the location of the linear predictor's spectral peaks is entirely determined by the target signal, and can be as precise as allowed by the time period to be filtered. This is in contrast with vocoders realized using fixed-width filter banks, where the location of spectral peaks is constrained by the available fixed frequency bands. LP filtering also has disadvantages in that signals with a large number of constituent frequencies may exceed the number of frequencies that can be represented by the linear prediction filter. This restriction is the primary reason that LP coding is almost always used in tandem with other methods in high-compression voice coders.


Waveform-interpolative

Waveform-interpolative (WI) vocoder was developed at
AT&T Bell Laboratories Nokia Bell Labs, commonly referred to as ''Bell Labs'', is an American industrial research and development company owned by Finnish technology company Nokia. With headquarters located in Murray Hill, New Jersey, Murray Hill, New Jersey, the compa ...
around 1995 by W.B. Kleijn, and subsequently, a low- complexity version was developed by AT&T for the DoD secure vocoder competition. Notable enhancements to the WI coder were made at the
University of California, Santa Barbara The University of California, Santa Barbara (UC Santa Barbara or UCSB) is a Public university, public Land-grant university, land-grant research university in Santa Barbara County, California, United States. Tracing its roots back to 1891 as an ...
. AT&T holds the core patents related to WI and other institutes hold additional patents.


Artistic effects


Uses in music

For
music Music is the arrangement of sound to create some combination of Musical form, form, harmony, melody, rhythm, or otherwise Musical expression, expressive content. Music is generally agreed to be a cultural universal that is present in all hum ...
al applications, a source of musical sounds is used as the carrier, instead of extracting the fundamental frequency. For instance, one could use the sound of a
synthesizer A synthesizer (also synthesiser or synth) is an electronic musical instrument that generates audio signals. Synthesizers typically create sounds by generating waveforms through methods including subtractive synthesis, additive synthesis a ...
as the input to the filter bank, a technique that became popular in the 1970s.


History

Werner Meyer-Eppler, a German scientist with a special interest in electronic voice synthesis, published a thesis in 1948 on
electronic music Electronic music broadly is a group of music genres that employ electronic musical instruments, circuitry-based music technology and software, or general-purpose electronics (such as personal computers) in its creation. It includes both music ...
and
speech synthesis Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal langua ...
from the viewpoint of
sound synthesis A synthesizer (also synthesiser or synth) is an electronic musical instrument that generates audio signals. Synthesizers typically create sounds by generating waveforms through methods including subtractive synthesis, additive synthesis an ...
. Later he was instrumental in the founding of the Studio for Electronic Music of WDR in Cologne, in 1951. One of the first attempts to use a vocoder in creating music was the ''Siemens Synthesizer'' at the Siemens Studio for Electronic Music, developed between 1956 and 1959. (See also excerpt of pp
157

160
from th
3rd edition in 2008
())
  Details of the Siemens Electronic Music Studio, exhibited at the
Deutsches Museum The Deutsches Museum (''German Museum'', officially (English: ''German Museum of Masterpieces of Science and Technology'')) in Munich, Germany, is the world's largest museum of science museum, science and technology museum, technology, with a ...
.
In 1968,
Robert Moog Robert Arthur Moog ( ; May 23, 1934 – August 21, 2005) was an American engineer and electronic music pioneer. He was the founder of the synthesizer manufacturer Moog Music and the inventor of the first commercial synthesizer, the Moog synthe ...
developed one of the first solid-state musical vocoders for the electronic music studio of the
University at Buffalo The State University of New York at Buffalo (commonly referred to as UB, University at Buffalo, and sometimes SUNY Buffalo) is a public university, public research university in Buffalo, New York, Buffalo and Amherst, New York, United States. ...
. In 1968, Bruce Haack built a prototype vocoder, named ''Farad'' after
Michael Faraday Michael Faraday (; 22 September 1791 – 25 August 1867) was an English chemist and physicist who contributed to the study of electrochemistry and electromagnetism. His main discoveries include the principles underlying electromagnetic inducti ...
. It was first featured on "The Electronic Record For Children" released in 1969 and then on his rock album '' The Electric Lucifer'' released in 1970.   A sample of earlier Vocoder. Vocoder effects have been used by musicians in both electronic music and as a special effect along with more traditional instruments. In 1969,
Sly and the Family Stone Sly and the Family Stone was an American band formed in San Francisco, San Francisco, California, in 1966 and active until 1983. Their work, which blended elements of funk, soul music, soul, psychedelic rock, gospel music, gospel, and R&B, becam ...
used it in "Sex Machine", a song on the album '' Stand!''. Other artists who have made vocoders an essential part of their music, overall or during an extended phase. Examples include the German
synthpop Synth-pop (short for synthesizer pop; also called techno-pop; ) is a music genre that first became prominent in the late 1970s and features the synthesizer as the dominant musical instrument. It was prefigured in the 1960s and early 1970s ...
group
Kraftwerk Kraftwerk (, ) is a Germany, German Electronic music, electronic band formed in Düsseldorf in 1970 by Ralf Hütter and Florian Schneider. Widely considered innovators and pioneers of electronic music, Kraftwerk was among the first successful a ...
, the Japanese new wave group Polysics,
Stevie Wonder Stevland Hardaway Morris (; Judkins; born May 13, 1950), known professionally as Stevie Wonder, is an American and Ghanaian singer-songwriter, musician, and record producer. He is regarded as one of the most influential musicians of the 20th c ...
(" Send One Your Love", " A Seed's a Star") and jazz/fusion keyboardist
Herbie Hancock Herbert Jeffrey Hancock (born April 12, 1940) is an American jazz musician, bandleader, and composer. He started his career with trumpeter Donald Byrd's group. Hancock soon joined the Miles Davis Quintet, where he helped to redefine the role of ...
during his late 1970s period. In 1982
Neil Young Neil Percival Young (born November 12, 1945) is a Canadian and American singer-songwriter. After embarking on a music career in Winnipeg in the 1960s, Young moved to Los Angeles, forming the folk rock group Buffalo Springfield. Since the begi ...
used a Sennheiser Vocoder VSM201 on six of the nine tracks on ''
Trans Trans- is a Latin prefix meaning "across", "beyond", or "on the other side of". Used alone, trans may refer to: Sociology * Trans, a sociological term which may refer to: ** Transgender, people who identify themselves with a gender that di ...
''. The chorus and bridge of
Michael Jackson Michael Joseph Jackson (August 29, 1958 – June 25, 2009) was an American singer, songwriter, dancer, and philanthropist. Dubbed the "King of Pop", he is regarded as Cultural impact of Michael Jackson, one of the most culturally significan ...
's " P.Y.T. (Pretty Young Thing)". features a vocoder ("Pretty young thing/You make me sing"), courtesy of session musician Michael Boddicker. Among the most consistent users of the vocoder in emulating the human voice are
Daft Punk Daft Punk were a French electronic music duo formed in 1993 in Paris by Thomas Bangalter and Guy-Manuel de Homem-Christo. They achieved popularity in the late 1990s as part of the French house movement, combining house music, funk, disco, tech ...
, who have used this instrument from their first album ''
Homework Homework is a set of tasks assigned to students by their teachers to be completed at home. Common homework assignments may include required reading, a writing or typing project, Exercise (mathematics), math problems to be completed, informatio ...
'' (1997) to their latest work ''
Random Access Memories ''Random Access Memories'' is the fourth and final studio album by the French electronic music duo Daft Punk, released on 17 May 2013 through Columbia Records. It pays tribute to late Music history of the United States in the 1970s, 1970s and e ...
'' (2013) and consider the convergence of technological and human voice "the identity of their musical project". For instance, the lyrics of " Around the World" (1997) are integrally vocoder-processed, " Get Lucky" (2013) features a mix of natural and processed human voices, and " Instant Crush" (2013) features
Julian Casablancas Julian Fernando Casablancas (born August 23, 1978) is an American musician. He is the lead vocalist and primary songwriter of the Rock music, rock band the Strokes, with whom he has released six studio albums since their founding in 1998. Casabl ...
singing into a vocoder.


Voice effects in other arts

Robot voices became a recurring element in popular music during the 20th century. Apart from vocoders, several other methods of producing variations on this effect include: the Sonovox, Talk box,
Auto-Tune Auto-Tune is audio processor software released on September 19, 1997, by the American company Antares Audio Technologies. It uses a proprietary device to measure and Pitch correction, correct pitch in music. It operates on different principles ...
, linear prediction vocoders,
speech synthesis Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal langua ...
, ring modulation and comb filter. Vocoders are used in
television production A television show, TV program (), or simply a TV show, is the general reference to any content produced for viewing on a television set that is broadcast via over-the-air, satellite, and cable, or distributed digitally on streaming plat ...
,
filmmaking Filmmaking or film production is the process by which a Film, motion picture is produced. Filmmaking involves a number of complex and discrete stages, beginning with an initial story, idea, or commission. Production then continues through screen ...
and games, usually for robots or talking computers. The robot voices of the Cylons in ''
Battlestar Galactica ''Battlestar Galactica'' is an American science fiction media franchise created by Glen A. Larson. It began with the original television series in 1978, and was followed by a short-run sequel series, '' Galactica 1980'', a line of book adaptat ...
'' were created with an EMS Vocoder 2000. The 1980 version of the ''
Doctor Who ''Doctor Who'' is a British science fiction television series broadcast by the BBC since 1963. The series, created by Sydney Newman, C. E. Webber and Donald Wilson (writer and producer), Donald Wilson, depicts the adventures of an extraterre ...
'' theme, as arranged and recorded by Peter Howell, has a section of the main melody generated by a Roland SVC-350 vocoder. A similar
Roland VP-330 The Roland VP-330 is a paraphonic ten-band vocoder and string machine manufactured by Roland Corporation from 1979 to 1980. While there are several string machines and vocoders, a single device combining the two is rare, despite the advantag ...
vocoder was used to create the voice of Soundwave, a character from the ''
Transformers ''Transformers'' is a media franchise produced by American toy company Hasbro and Japanese toy company Tomy, Takara Tomy. It primarily follows the heroic Autobots and the villainous Decepticons, two Extraterrestrials in fiction, alien robot fac ...
'' series.


See also

* Audio time stretching and pitch scaling * List of vocoders * Silent speech interface


Notes


References

;Multimedia references


External links

*
Description, photographs, and diagram for the vocoder at 120years.net



Object of Interest: The Vocoder The New Yorker Magazine mini documentary
{{Authority control Audio effects Electronic musical instruments Music hardware Lossy compression algorithms Speech codecs Robotics engineering