Kraftwerk Vocoder custom made in early1970s

A vocoder (, a

portmanteau A portmanteau word, or portmanteau (, ) is a blend of wordscategory Category, plural categories, may refer to: Philosophy and general uses *Categorization, categories in cognitive science, information science and generally * Category of being * ''Categories'' (Aristotle) * Category (Kant) * Categories (Peirce) ...

speech coding Speech coding is an application of data compression of digital audio signals containing speech. Speech coding uses speech-specific parameter estimation using audio signal processing techniques to model the speech signal, combined with generic d ...

that analyzes and synthesizes the human voice signal for audio data compression, multiplexing, voice encryption or voice transformation. The vocoder was invented in 1938 by Homer Dudley at Bell Labs as a means of synthesizing human speech. This work was developed into the channel vocoder which was used as a voice

codec A codec is a device or computer program that encodes or decodes a data stream or signal. ''Codec'' is a portmanteau of coder/decoder. In electronic communications, an endec is a device that acts as both an encoder and a decoder on a signal or ...

for

telecommunications Telecommunication is the transmission of information by various types of technologies over wire, radio, optical, or other electromagnetic systems. It has its origin in the desire of humans for communication over a distance greater than that fe ...

for speech coding to conserve

bandwidth Bandwidth commonly refers to: * Bandwidth (signal processing) or ''analog bandwidth'', ''frequency bandwidth'', or ''radio bandwidth'', a measure of the width of a frequency range * Bandwidth (computing), the rate of data transfer, bit rate or thr ...

in transmission. By

encrypting In cryptography, encryption is the process of Code, encoding information. This process converts the original representation of the information, known as plaintext, into an alternative form known as ciphertext. Ideally, only authorized parties can ...

the control signals, voice transmission can be secured against interception. Its primary use in this fashion is for secure radio communication. The advantage of this method of encryption is that none of the original signal is sent, only envelopes of the bandpass filters. The receiving unit needs to be set up in the same filter configuration to re-synthesize a version of the original signal spectrum. The vocoder has also been used extensively as an electronic musical instrument. The decoder portion of the vocoder, called a voder, can be used independently for speech synthesis.

Theory

The human voice consists of sounds generated by the opening and closing of the glottis by the

vocal cords In humans, vocal cords, also known as vocal folds or voice reeds, are folds of throat tissues that are key in creating sounds through vocalization. The size of vocal cords affects the pitch of voice. Open when breathing and vibrating for speec ...

, which produces a periodic waveform with many harmonics. This basic sound is then

filter Filter, filtering or filters may refer to: Science and technology Computing * Filter (higher-order function), in functional programming * Filter (software), a computer program to process a data stream * Filter (video), a software component tha ...

ed by the nose and throat (a complicated

resonant Resonance describes the phenomenon of increased amplitude that occurs when the frequency of an applied periodic force (or a Fourier component of it) is equal or close to a natural frequency of the system on which it acts. When an oscilla ...

piping system) to produce differences in harmonic content (

formant In speech science and phonetics, a formant is the broad spectral maximum that results from an acoustic resonance of the human vocal tract. In acoustics, a formant is usually defined as a broad peak, or local maximum, in the spectrum. For harmoni ...

s) in a controlled way, creating the wide variety of sounds used in speech. There is another set of sounds, known as the unvoiced and plosive sounds, which are created or modified by the mouth in different fashions. The vocoder examines speech by measuring how its spectral characteristics change over time. This results in a series of signals representing these frequencies at any particular time as the user speaks. In simple terms, the signal is split into a number of frequency bands (the larger this number, the more accurate the analysis) and the level of signal present at each frequency band gives the instantaneous representation of the spectral energy content. To recreate speech, the vocoder simply reverses the process, processing a broadband noise source by passing it through a stage that filters the frequency content based on the originally recorded series of numbers. Specifically, in the encoder, the input is passed through a multiband

, then the output of each band is measured using an

envelope follower An envelope detector (sometimes called a peak detector) is an electronic circuit that takes a (relatively) high-frequency amplitude modulated signal as input and provides an output, which is the demodulated ''envelope'' of the original signal. ...

, and the signals from the envelope followers are transmitted to the decoder. The decoder applies these as control signals to corresponding amplifiers of the output filter channels. Information about the instantaneous frequency of the original voice signal (as distinct from its spectral characteristic) is discarded; it was not important to preserve this for the vocoder's original use as an encryption aid. It is this dehumanizing aspect of the vocoding process that has made it useful in creating special voice effects in popular music and audio entertainment. Instead of a point-by-point recreation of the waveform, the vocoder process sends only the parameters of the vocal model over the communication link. Since the parameters change slowly compared to the original speech waveform, the bandwidth required to transmit speech can be reduced. This allows more speech channels to utilize a given

communication channel A communication channel refers either to a physical transmission medium such as a wire, or to a logical connection over a multiplexed medium such as a radio channel in telecommunications and computer networking. A channel is used for informa ...

, such as a radio channel or a

submarine cable Submarine cable is any electrical cable that is laid on the seabed, although the term is often extended to encompass cables laid on the bottom of large freshwater bodies of water. Examples include: *Submarine communications cable *Submarine power ...

. Analog vocoders typically analyze an incoming signal by splitting the signal into multiple tuned frequency bands or ranges. To reconstruct the signal, a

carrier signal In telecommunications, a carrier wave, carrier signal, or just carrier, is a waveform (usually sinusoidal) that is modulated (modified) with an information-bearing signal for the purpose of conveying information. This carrier wave usually has ...

is sent through a series of these tuned

bandpass filter A band-pass filter or bandpass filter (BPF) is a device that passes frequencies within a certain range and rejects ( attenuates) frequencies outside that range. Description In electronics and signal processing, a filter is usually a two-p ...

s. In the example of a typical robot voice the carrier is noise or a sawtooth waveform. There are usually between 8 and 20 bands. The amplitude of the modulator for each of the individual analysis bands generates a voltage that is used to control amplifiers for each of the corresponding carrier bands. The result is that frequency components of the modulating signal are mapped onto the carrier signal as discrete amplitude changes in each of the frequency bands. Often there is an unvoiced band or

sibilance Sibilants are fricative consonants of higher amplitude and pitch, made by directing a stream of air with the tongue towards the teeth. Examples of sibilants are the consonants at the beginning of the English words ''sip'', ''zip'', ''ship'', and ...

channel. This is for frequencies that are outside the analysis bands for typical speech but are still important in speech. Examples are words that start with the letters ''s'', ''f'', ''ch'' or any other sibilant sound. Using this band produces recognizable speech, although somewhat mechanical sounding. Vocoders often include a second system for generating unvoiced sounds, using a noise generator instead of the

fundamental frequency The fundamental frequency, often referred to simply as the ''fundamental'', is defined as the lowest frequency of a periodic waveform. In music, the fundamental is the musical pitch of a note that is perceived as the lowest partial present. I ...

. This is mixed with the carrier output to increase clarity. In the channel vocoder algorithm, among the two components of an

analytic signal In mathematics and signal processing, an analytic signal is a complex-valued function that has no negative frequency components. The real and imaginary parts of an analytic signal are real-valued functions related to each other by the Hil ...

, considering only the amplitude component and simply ignoring the phase component tends to result in an unclear voice; on methods for rectifying this, see

phase vocoder A phase vocoder is a type of vocoder-purposed algorithm which can interpolate information present in the frequency and time domains of audio signals by using phase information extracted from a frequency transform. The computer algorithm allows freq ...

History

The development of a vocoder was started in 1928 by Bell Labs engineer Homer Dudley, who was granted patents for it on March 21, 1939, (filed October 30, 1935) and Nov 16, 1937. To demonstrate the speech synthesis ability of its decoder section, the voder (voice operating demonstrator) was introduced to the public at the AT&T building at the 1939–1940 New York World's Fair. The voder consisted of an electronic oscillator a sound source of pitched tone and noise generator for hiss, a 10-band resonator filters with variable-gain amplifiers as a vocal tract, and the manual controllers including a set of pressure-sensitive keys for filter control, and a foot pedal for

pitch control A variable speed pitch control (or vari-speed) is a control on an audio device such as a turntable, tape recorder, or CD player that allows the operator to deviate from a standard speed (such as 33, 45 or even 78 rpm on a turntable), resulting ...

of tone. Based on
See
schematic diagram of the Voder synthesizer
The filters controlled by keys convert the tone and the hiss into

vowel A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity (leng ...

consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are and pronounced with the lips; and pronounced with the front of the tongue; and pronounced wi ...

s, and

inflection In linguistic morphology, inflection (or inflexion) is a process of word formation in which a word is modified to express different grammatical categories such as tense, case, voice, aspect, person, number, gender, mood, animacy, and ...

s. This was a complex machine to operate, but a skilled operator could produce recognizable speech. A demonstration of the voder (not the vocoder). Dudley's vocoder was used in the SIGSALY system, which was built by Bell Labs engineers in 1943. SIGSALY was used for encrypted voice communications during World War II. The KO-6 voice coder was released in 1949 in limited quantities; it was a close approximation to the SIGSALY at 1200 bit/s. In 1953, KY-9 THESEUS 1650 bit/s voice coder used solid-state logic to reduce the weight to from SIGSALY's , and in 1961 the HY-2 voice coder, a 16-channel 2400 bit/s system, weighed and was the last implementation of a channel vocoder in a secure speech system. Later work in this field has since used digital

. The most widely used speech coding technique is

linear predictive coding Linear predictive coding (LPC) is a method used mostly in audio signal processing and speech processing for representing the spectral envelope of a digital signal of speech in compressed form, using the information of a linear predictive mod ...

(LPC). Another speech coding technique, adaptive differential pulse-code modulation (ADPCM), was developed by P. Cummiskey, Nikil S. Jayant and James L. Flanagan at Bell Labs in 1973.

Applications

* Terminal equipment for systems based on

digital mobile radio Digital mobile radio (DMR) is a specification for commercial products so they can interoperate. It is defined by a standard created by the European Telecommunications Standards Institute (ETSI), and is designed to be low-cost and easy to use. DMR, ...

(DMR). * Digital voice scrambling and encryption *

Cochlear implant A cochlear implant (CI) is a surgically implanted neuroprosthesis that provides a person who has moderate-to-profound sensorineural hearing loss with sound perception. With the help of therapy, cochlear implants may allow for improved speech unde ...

s: noise and tone vocoding is used to simulate the effects of cochlear implants. * Musical and other artistic effects

Modern implementations

Even with the need to record several frequencies, and additional unvoiced sounds, the compression of vocoder systems is impressive. Standard speech-recording systems capture frequencies from about 500 Hz to 3,400 Hz, where most of the frequencies used in speech lie, typically using a sampling rate of 8 kHz (slightly greater than the

Nyquist rate In signal processing, the Nyquist rate, named after Harry Nyquist, is a value (in units of samples per second or hertz, Hz) equal to twice the highest frequency ( bandwidth) of a given function or signal. When the function is digitized at a hi ...

). The sampling resolution is typically 12 or more bits per sample resolution (16 is standard), for a final data rate in the range of 96–128 kbit/s, but a good vocoder can provide a reasonably good simulation of voice with as little as 2.4 kbit/s of data. ''Toll quality'' voice coders, such as ITU G.729, are used in many telephone networks. G.729 in particular has a final data rate of 8 kbit/s with superb voice quality. G.723 achieves slightly worse quality at data rates of 5.3 kbit/s and 6.4 kbit/s. Many voice vocoder systems use lower data rates, but below 5 kbit/s voice quality begins to drop rapidly. Several vocoder systems are used in NSA encryption systems: * LPC-10, FIPS Pub 137, 2400 bit/s, which uses

Code-excited linear prediction Code-excited linear prediction (CELP) is a linear predictive speech coding algorithm originally proposed by Manfred R. Schroeder and Bishnu S. Atal in 1985. At the time, it provided significantly better quality than existing low bit-rate algori ...

(CELP), 2400 and 4800 bit/s, Federal Standard 1016, used in

STU-III STU-III (Secure Telephone Unit - third generation) is a family of secure telephones introduced in 1987 by the NSA for use by the United States government, its contractors, and its allies. STU-III desk units look much like typical office telephone ...

Continuously variable slope delta modulation Continuously variable slope delta modulation (CVSD or CVSDM) is a voice coding method. It is a delta modulation with variable step size (i.e., special case of adaptive delta modulation), first proposed by Greefkes and Riemens in 1970. CVSD encode ...

(CVSD), 16 kbit/s, used in wide band encryptors such as the KY-57. * Mixed-excitation linear prediction (MELP), MIL STD 3005, 2400 bit/s, used in the Future Narrowband Digital Terminal FNBDT, NSA's 21st century secure telephone. *

Adaptive Differential Pulse Code Modulation Adaptive differential pulse-code modulation (ADPCM) is a variant of differential pulse-code modulation (DPCM) that varies the size of the quantization step, to allow further reduction of the required data bandwidth for a given signal-to-noise ratio ...

(

ADPCM Adaptive differential pulse-code modulation (ADPCM) is a variant of differential pulse-code modulation (DPCM) that varies the size of the quantization step, to allow further reduction of the required data bandwidth for a given signal-to-noise ratio ...

), former ITU-T G.721, 32 kbit/s used in STE secure telephone (ADPCM is not a proper vocoder but rather a waveform codec. ITU has gathered G.721 along with some other ADPCM codecs into G.726.) Vocoders are also currently used in developing psychophysics,

linguistics Linguistics is the science, scientific study of human language. It is called a scientific study because it entails a comprehensive, systematic, objective, and precise analysis of all aspects of language, particularly its nature and structure ...

computational neuroscience Computational neuroscience (also known as theoretical neuroscience or mathematical neuroscience) is a branch of neuroscience which employs mathematical models, computer simulations, theoretical analysis and abstractions of the brain to u ...

and

cochlear implant A cochlear implant (CI) is a surgically implanted neuroprosthesis that provides a person who has moderate-to-profound sensorineural hearing loss with sound perception. With the help of therapy, cochlear implants may allow for improved speech unde ...

research. Modern vocoders that are used in communication equipment and in voice storage devices today are based on the following algorithms: *

Algebraic code-excited linear prediction Algebraic code-excited linear prediction (ACELP) is a speech coding algorithm in which a limited set of pulses is distributed as excitation to a linear prediction filter. It is a linear predictive coding (LPC) algorithm that is based on the cod ...

(ACELP 4.7 kbit/s – 24 kbit/s) * Mixed-excitation linear prediction (MELPe 2400, 1200 and 600 bit/s) * Multi-band excitation (AMBE 2000 bit/s – 9600 bit/s) * Sinusoidal-Pulsed Representation (SPR 600 bit/s – 4800 bit/s) * Robust Advanced Low-complexity Waveform Interpolation (RALCWI 2050bit/s, 2400bit/s and 2750bit/s) * Tri-Wave Excited Linear Prediction (TWELP 600 bit/s – 9600 bit/s) * Noise Robust Vocoder (NRV 300 bit/s and 800 bit/s)

Linear prediction-based

Since the late 1970s, most non-musical vocoders have been implemented using

linear prediction Linear prediction is a mathematical operation where future values of a discrete-time signal are estimated as a linear function of previous samples. In digital signal processing, linear prediction is often called linear predictive coding (LPC) and ...

, whereby the target signal's spectral envelope (formant) is estimated by an all-pole IIR

. In linear prediction coding, the all-pole filter replaces the bandpass filter bank of its predecessor and is used at the encoder to ''whiten'' the signal (i.e., flatten the spectrum) and again at the decoder to re-apply the spectral shape of the target speech signal. One advantage of this type of filtering is that the location of the linear predictor's spectral peaks is entirely determined by the target signal, and can be as precise as allowed by the time period to be filtered. This is in contrast with vocoders realized using fixed-width filter banks, where spectral peaks can generally only be determined to be within the scope of a given frequency band. LP filtering also has disadvantages in that signals with a large number of constituent frequencies may exceed the number of frequencies that can be represented by the linear prediction filter. This restriction is the primary reason that LP coding is almost always used in tandem with other methods in high-compression voice coders.

Waveform-interpolative

Waveform-interpolative (WI) vocoder was developed in AT&T Bell Laboratories around 1995 by W.B. Kleijn, and subsequently, a low- complexity version was developed by AT&T for the DoD secure vocoder competition. Notable enhancements to the WI coder were made at the University of California, Santa Barbara. AT&T holds the core patents related to WI, and other institutes hold additional patents.

Artistic effects

Uses in music

For

music Music is generally defined as the art of arranging sound to create some combination of form, harmony, melody, rhythm or otherwise expressive content. Exact definitions of music vary considerably around the world, though it is an aspe ...

al applications, a source of musical sounds is used as the carrier, instead of extracting the fundamental frequency. For instance, one could use the sound of a synthesizer as the input to the filter bank, a technique that became popular in the 1970s.

History

Werner Meyer-Eppler, a German scientist with a special interest in electronic voice synthesis, published a thesis in 1948 on electronic music and speech synthesis from the viewpoint of

sound synthesis A synthesizer (also spelled synthesiser) is an electronic musical instrument that generates audio signals. Synthesizers typically create sounds by generating waveforms through methods including subtractive synthesis, additive synthesis and f ...

. Later he was instrumental in the founding of the Studio for Electronic Music of WDR in Cologne, in 1951. Siemens Tonstudio, Deutsches Museum

One of the first attempts to use a vocoder in creating music was the "Siemens Synthesizer" at the Siemens Studio for Electronic Music, developed between 1956 and 1959.
(See also excerpt of pp
157

160
from th
3rd edition in 2008
())
Details of the Siemens Electronic Music Studio, exhibited at the Deutsches Museum. In 1968,

Robert Moog Robert Arthur Moog ( ; May 23, 1934 – August 21, 2005) was an American engineer and electronic music pioneer. He was the founder of the synthesizer manufacturer Moog Music and the inventor of the first commercial synthesizer, the Moog synthesi ...

developed one of the first solid-state musical vocoders for the electronic music studio of the

University at Buffalo The State University of New York at Buffalo, commonly called the University at Buffalo (UB) and sometimes called SUNY Buffalo, is a public research university with campuses in Buffalo and Amherst, New York. The university was founded in 18 ...

. In 1968, Bruce Haack built a prototype vocoder, named ''Farad'' after Michael Faraday. It was first featured on "The Electronic Record For Children" released in 1969 and then on his rock album '' The Electric Lucifer'' released in 1970.
A sample of earlier Vocoder. In 1970,

Wendy Carlos Wendy Carlos (born Walter Carlos, November 14, 1939) is an American musician and composer best known for her electronic music and film scores. Born and raised in Rhode Island, Carlos studied physics and music at Brown University before moving ...

and

built another musical vocoder, a ten-band device inspired by the vocoder designs of Homer Dudley. It was originally called a spectrum encoder-decoder and later referred to simply as a vocoder. The carrier signal came from a Moog modular synthesizer, and the modulator from a microphone input. The output of the ten-band vocoder was fairly intelligible but relied on specially articulated speech. Some vocoders use a high-pass filter to let some

through from the microphone; this ruins the device for its original speech-coding application, but it makes the talking synthesizer effect much more intelligible. In 1972, Isao Tomita's first electronic music album ''Electric Samurai: Switched on Rock'' was an early attempt at applying speech synthesis technique in

electronic rock Electronic rock is a music genre that involves a combination of rock music and electronic music, featuring instruments typically found within both genres. It originates from the late 1960s, when rock bands began incorporating electronic instrum ...

and

pop music Pop music is a genre of popular music that originated in its modern form during the mid-1950s in the United States and the United Kingdom. The terms ''popular music'' and ''pop music'' are often used interchangeably, although the former descri ...

. The album featured electronic renditions of contemporary rock and pop songs, while utilizing synthesized voices in place of human voices. In 1974, he utilized synthesized voices in his popular classical music album '' Snowflakes are Dancing'', which became a worldwide success and helped to popularize electronic music. In 1973, the british band

Emerson, Lake and Palmer Emerson, Lake & Palmer (informally known as ELP) were an English progressive rock supergroup formed in London in 1970. The band consisted of Keith Emerson (keyboards), Greg Lake (vocals, bass, guitar, producer) and Carl Palmer (drums, percuss ...

used a vocoder on their album ''

Brain Salad Surgery ''Brain Salad Surgery'' is the fourth studio album by English progressive rock band Emerson, Lake & Palmer, released on 19 November 1973 by their record label, Manticore Records, and distributed by Atlantic Records. Following the tour in suppo ...

'', for the song " Karn Evil 9: 3rd Impression". The 1975 song "

The Raven "The Raven" is a narrative poem by American writer Edgar Allan Poe. First published in January 1845, the poem is often noted for its musicality, stylized language, and supernatural atmosphere. It tells of a distraught lover who is paid a myst ...

" from the album ''

Tales of Mystery and Imagination ''Tales of Mystery & Imagination'' (often rendered as ''Tales of Mystery and Imagination'') is a popular title for posthumous compilations of writings by American author, essayist and poet Edgar Allan Poe and was the first complete collection of ...

'' by The Alan Parsons Project features Alan Parsons performing vocals through an EMI vocoder. According to the album's liner notes, "The Raven" was the first rock song to feature a digital vocoder. Pink Floyd also used a vocoder on three of their albums, first on their 1977 ''

Animals Animals are multicellular, eukaryotic organisms in the biological kingdom Animalia. With few exceptions, animals consume organic material, breathe oxygen, are able to move, can reproduce sexually, and go through an ontogenetic stage in ...

'' for the songs "Sheep" and "Pigs (Three Different Ones)", then on ''

A Momentary Lapse of Reason ''A Momentary Lapse of Reason'' is the thirteenth studio album by the English progressive rock band Pink Floyd, released in the UK on 7 September 1987 by EMI and the following day in the US on Columbia. It was recorded primarily on guitarist ...

'' on "A New Machine Part 1" and "A New Machine Part 2" (1987), and finally on 1994's '' The Division Bell'', on "Keep Talking". The Electric Light Orchestra was among the first to use the vocoder in a commercial context, with their 1977 album '' Out of the Blue''. The band extensively uses it on the album, including on the hits "Sweet Talkin' Woman" and "Mr. Blue Sky". On following albums, the band made sporadic use of it, notably on their hits "The Diary of Horace Wimp" and "Confusion (Electric Light Orchestra song), Confusion" from their 1979 album ''Discovery (Electric Light Orchestra album), Discovery'', the tracks "Prologue", "Yours Truly, 2095", and "Epilogue" on their 1981 album ''Time (ELO album), Time'', and "Calling America" from their 1986 album ''Balance of Power (album), Balance of Power''. In the late 1970s, French duo Space Art (band), Space Art used a vocoder during the recording of their second album, ''Trip in the Centre Head''. Phil Collins used a vocoder to provide a vocal effect for his 1981 international hit single "In the Air Tonight". Vocoders have appeared on pop recordings from time to time, most often simply as a special effect rather than a featured aspect of the work. However, many experimental electronic artists of the new-age music genre often utilize vocoder in a more comprehensive manner in specific works, such as Jean-Michel Jarre (on ''Zoolook'', 1984) and Mike Oldfield (on ''QE2 (album), QE2'', 1980 and ''Five Miles Out'', 1982). Vocoder module and use by M. Oldfield can be clearly seen on his ''Live At Montreux 1981'' DVD (Track "Sheba"). There are also some artists who have made vocoders an essential part of their music, overall or during an extended phase. Examples include the German synthpop group Kraftwerk, the Japanese New wave music, new wave group Polysics, Stevie Wonder ("Send One Your Love", "A Seed's a Star") and jazz/fusion keyboardist Herbie Hancock during his late 1970s period. In 1982 Neil Young used a Sennheiser Vocoder VSM201 on six of the nine tracks on ''Trans (album), Trans''. The chorus and bridge of Michael Jackson's "P.Y.T. (Pretty Young Thing)". features a vocoder ("Pretty young thing/You make me sing"), courtesy of session musician Michael Boddicker. Coldplay have used a vocoder in some of their songs. For example, in "Major Minus" and "Hurts Like Heaven", both from the album ''Mylo Xyloto'' (2011), Chris Martin's vocals are mostly vocoder-processed. "Midnight (Coldplay song), Midnight", from ''Ghost Stories (Coldplay album), Ghost Stories'' (2014), also features Martin singing through a vocoder. The hidden track "X Marks the Spot" from ''A Head Full of Dreams'' was also recorded through a vocoder. Noisecore band Atari Teenage Riot have used vocoders in variety of their songs and live performances such as ''Live at Brixton Academy (Atari Teenage Riot album), Live at the Brixton Academy'' (2002) alongside other digital audio technology both old and new. The Red Hot Chili Peppers song "By the Way (Red Hot Chili Peppers song), By the Way" uses a vocoder effect on Anthony Kiedis' vocals. Among the most consistent users of the vocoder in emulating the human voice are Daft Punk, who have used this instrument from their first album ''Homework (Daft Punk album), Homework'' (1997) to their latest work ''Random Access Memories'' (2013) and consider the convergence of technological and human voice "the identity of their musical project". For instance, the lyrics of "Around the World (Daft Punk song), Around the World" (1997) are integrally vocoder-processed, "Get Lucky (Daft Punk song), Get Lucky" (2013) features a mix of natural and processed human voices, and "Instant Crush" (2013) features Julian Casablancas singing into a vocoder. Producer Zedd, American country singer Maren Morris and American musical duo Grey (duo), Grey made a song titled "The Middle (Zedd, Maren Morris and Grey song), The Middle" which featured a vocoder and reached the top ten of the charts in 2018.

Voice effects in other arts

Robot voices became a recurring element in popular music during the 20th century. Apart from vocoders, several other methods of producing variations on this effect include: the Sonovox, Talk box, and Auto-Tune,
A sample of Auto-Tune effect (a.k.a. ''T-Pain effect''). linear prediction vocoders, speech synthesis,
A sample of earlier computer-based speech synthesis and song synthesis, by John Larry Kelly, Jr. and Louis Gerstman at Bell Labs, using IBM 704 computer. The demo song "Daisy Bell", musical accompanied by Max Mathews, impressed Arthur C. Clarke and later he used it in the climactic scene of the screenplay for his novel ''2001: A Space Odyssey''.
A sample of speech synthesis. ring modulation and comb filter. Vocoders are used in television production, filmmaking and games, usually for robots or talking computers. The robot voices of the Cylon (1978), Cylons in ''Battlestar Galactica (1978 TV series), Battlestar Galactica'' were created with an EMS Vocoder 2000. The Doctor Who theme#History, 1980 version of the ''Doctor Who'' theme, as arranged and recorded by Peter Howell (musician), Peter Howell, has a section of the main melody generated by a Roland SVC-350 vocoder. A similar Roland VP-330 vocoder was used to create the voice of Soundwave (Transformers), Soundwave, a character from the Transformers series.

References

;Multimedia references

External links

*
Description, photographs, and diagram for the vocoder at 120years.net
* Description of a modern Vocoder.

Object of Interest: The Vocoder The New Yorker Magazine mini documentary
{{Authority control Audio effects Electronic musical instruments Music hardware Lossy compression algorithms Speech codecs Robotics