PSOLA

PSOLA (Pitch Synchronous Overlap and Add) is a digital signal processing technique used for speech processing and, more specifically, speech synthesis. It can be used to modify the pitch and duration of a speech signal. It was invented around 1986.

PSOLA works by dividing the speech waveform into small overlapping segments. To change the pitch of the signal, the segments are moved further apart (to decrease the pitch) or closer together (to increase the pitch). To change the duration of the signal, segments are repeated (to increase the duration) or some are eliminated (to decrease the duration). The segments are then combined using the overlap-add technique. In this way PSOLA can be used to change the prosody of a speech signal.
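The steps described above can be illustrated with a small sketch. It is not a reference implementation: it assumes the pitch period is already known and constant (real speech needs a pitch-mark detector, which is out of scope here), uses a Hann window two periods long, and divides by the summed window weights to keep the amplitude steady. The names `psola`, `hann`, `period`, `pitch_factor` and `duration_factor` are hypothetical and chosen for illustration only.

```python
import math


def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]


def psola(signal, period, pitch_factor=1.0, duration_factor=1.0):
    """Toy PSOLA for a signal whose pitch period (in samples) is known and constant.

    pitch_factor > 1 raises the pitch; duration_factor > 1 lengthens the signal.
    """
    if len(signal) <= 2 * period:
        raise ValueError("signal must be longer than two pitch periods")

    # Analysis pitch marks: one segment centre per pitch period (assumption).
    marks = list(range(period, len(signal) - period, period))
    window = hann(2 * period)

    out_len = int(round(len(signal) * duration_factor))
    out = [0.0] * out_len
    norm = [0.0] * out_len  # summed window weights, used for normalisation

    # Segments placed closer together raise the pitch, further apart lower it.
    new_period = max(1, int(round(period / pitch_factor)))

    t = new_period  # current synthesis mark in the output
    while t < out_len:
        # Map the output position back to the input and take the nearest
        # analysis segment. Stretching reuses segments, shrinking skips some.
        src = t / duration_factor
        m = min(marks, key=lambda a: abs(a - src))

        # Overlap-add the windowed two-period segment centred on mark m.
        for j in range(-period, period):
            i = t + j
            if 0 <= i < out_len:
                out[i] += signal[m + j] * window[j + period]
                norm[i] += window[j + period]
        t += new_period

    return [o / n if n > 1e-9 else 0.0 for o, n in zip(out, norm)]
```

For example, `psola(samples, 50, pitch_factor=2.0)` places the segments 25 samples apart instead of 50, and `psola(samples, 50, duration_factor=2.0)` uses each segment twice to double the length while leaving the spacing unchanged.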


See also

* Audio time stretching and pitch scaling


External links


* Changing Pitch with PSOLA for Voice Conversion (archived from the original)
* A thesis that discusses PSOLA with diagrams (PDF format; see page 35, which is page 44 of the PDF) (archived)