Speech enhancement aims to improve
speech
Speech is a human vocal communication using language. Each language uses Phonetics, phonetic combinations of vowel and consonant sounds that form the sound of its words (that is, all English words sound different from all French words, even if ...
quality by using various algorithms. The objective of enhancement is improvement in
intelligibility and/or overall perceptual quality of degraded
speech
Speech is a human vocal communication using language. Each language uses Phonetics, phonetic combinations of vowel and consonant sounds that form the sound of its words (that is, all English words sound different from all French words, even if ...
signal using
audio signal processing techniques.
Enhancing of speech degraded by noise, or noise reduction, is the most important field of speech enhancement, and used for many applications such as
mobile phones,
VoIP,
teleconferencing systems,
speech recognition,
speaker diarization
Speaker diarisation ( or diarization) is the process of partitioning an audio stream containing human speech into homogeneous segments according to the identity of each speaker. It can enhance the readability of an automatic speech transcription b ...
, and
hearing aids.
Algorithms
The algorithms of speech enhancement for noise reduction can be categorized into three fundamental classes: filtering techniques, spectral restoration, and model-based methods.
[J. Benesty, M. M. Sondhi, Y. Huang (ed). ''Springer Handbook of Speech Processing''. pp.843-869. Springer, 2007. .]
* Filtering Techniques
:* Spectral Subtraction Method
:*
Wiener Filtering
In signal processing, the Wiener filter is a filter used to produce an estimate of a desired or target random process by linear time-invariant ( LTI) filtering of an observed noisy process, assuming known stationary signal and noise spectra, and ...
:*
Signal subspace
In signal processing, signal subspace methods are empirical linear methods for dimensionality reduction and noise reduction. These approaches have attracted significant interest and investigation recently in the context of speech enhancement, speec ...
approach (SSA)
* Spectral Restoration
:* Minimum Mean-Square-Error Short-Time Spectral Amplitude Estimator (MMSE-STSA)
* Speech-Model-Based
See also
*
Audio noise reduction
*
Speech coding
*
Speech interface guideline
Speech interface guideline is a guideline with the aim for guiding decisions and criteria regarding designing interfaces operated by human voice. Speech interface system has many advantages such as consistent service and saving cost. However, for ...
*
Speech processing
*
Speech recognition
*
Voice analysis
References
* J. Benesty, M. M. Sondhi, Y. Huang (ed). ''Springer Handbook of Speech Processing''. Springer, 2007. .
* J. Benesty, S. Makino, J. Chen (ed). ''Speech Enhancement''. Springer, 2005. .
* P. C. Loizou. ''Speech Enhancement: Theory and Practice''. CRC Press, 2013. .
Speech processing
{{Sound-tech-stub