Noise shaping is a technique typically used in

digital audio Digital audio is a representation of sound recorded in, or converted into, digital form. In digital audio, the sound wave of the audio signal is typically encoded as numerical samples in a continuous sequence. For example, in CD audio, samp ...

image An image is a visual representation of something. It can be two-dimensional, three-dimensional, or somehow otherwise feed into the visual system to convey information. An image can be an artifact, such as a photograph or other two-dimensio ...

, and

video processing In electronics engineering, video processing is a particular case of signal processing, in particular image processing, which often employs video filters and where the input and output signals are video files or video streams. Video processing ...

, usually in combination with

dithering Dither is an intentionally applied form of noise used to randomize quantization error, preventing large-scale patterns such as color banding in images. Dither is routinely used in processing of both digital audio and video data, and is often ...

, as part of the process of quantization or bit-depth reduction of a digital signal. Its purpose is to increase the apparent

signal-to-noise ratio Signal-to-noise ratio (SNR or S/N) is a measure used in science and engineering that compares the level of a desired signal to the level of background noise. SNR is defined as the ratio of signal power to the noise power, often expressed in de ...

of the resultant signal. It does this by altering the spectral shape of the error that is introduced by dithering and quantization; such that the

noise power In telecommunication, the term noise power has the following meanings: # The measured total noise in a given bandwidth at the input or output of a device when the signal is not present; the integral of noise spectral density over the bandwidth # T ...

is at a lower level in frequency bands at which noise is considered to be less desirable and at a correspondingly higher level in bands where it is considered to be more desirable. A popular noise shaping algorithm used in image processing is known as ‘ Floyd Steinberg dithering’; and many noise shaping algorithms used in audio processing are based on an ‘

Absolute threshold of hearing The absolute threshold of hearing (ATH) is the minimum sound level of a pure tone that an average human ear with normal hearing can hear with no other sound present. The absolute threshold relates to the sound that can just be heard by the organi ...

’ model.

Operation

Noise shaping works by putting the quantization error in a

feedback Feedback occurs when outputs of a system are routed back as inputs as part of a chain of cause-and-effect that forms a circuit or loop. The system can then be said to ''feed back'' into itself. The notion of cause-and-effect has to be handled ...

loop. Any feedback loop functions as a filter, so by creating a feedback loop for the error itself, the error can be filtered as desired. For example, consider the feedback system: :

\ y = x + e -1

where is the output sample value that is to be quantized, is the input sample value, is the sample number, and is the quantization error introduced at sample : :

\ e = y_\text - y

In this model, when any sample's bit depth is reduced, the quantization error between the quantized value and the original value is measured and stored. That "error value" is then re-added into the next sample prior to its quantization. The effect is that the quantization error is low-pass filtered by a 2-sample

boxcar A boxcar is the North American (AAR) term for a railroad car that is enclosed and generally used to carry freight. The boxcar, while not the simplest freight car design, is considered one of the most versatile since it can carry most ...

filter (also known as a simple moving average filter). As a result, compared to before, the quantization error has lower power at higher frequencies and higher power at lower frequencies. Note that we can adjust the cutoff frequency of the filter by modifying the proportion, , of the error from the previous sample that is fed back: :

\ y = x + b e -1

More generally, any

FIR filter In signal processing, a finite impulse response (FIR) filter is a filter whose impulse response (or response to any finite length input) is of ''finite'' duration, because it settles to zero in finite time. This is in contrast to infinite impulse ...

or IIR filter can be used to create a more complex

frequency response In signal processing and electronics, the frequency response of a system is the quantitative measure of the magnitude and phase of the output as a function of input frequency. The frequency response is widely used in the design and analysis of s ...

curve. Such filters can be designed using the

weighted least squares Weighted least squares (WLS), also known as weighted linear regression, is a generalization of ordinary least squares and linear regression in which knowledge of the variance of observations is incorporated into the regression. WLS is also a speci ...

method. In the case of digital audio, typically the weighting function used is one divided by the absolute threshold of hearing curve, i.e. :

\ W(f) = \frac.

Noise shaping should also always involve an appropriate amount of

dither Dither is an intentionally applied form of noise used to randomize quantization error, preventing large-scale patterns such as color banding in images. Dither is routinely used in processing of both digital audio and video data, and is often ...

within the process itself so as to prevent determinable and correlated errors to the signal itself. If dither is not used then noise shaping effectively functions merely as distortion shaping — pushing the distortion energy around to different frequency bands, but it is still distortion. If dither is added to the process as :

+ \mathrm,

then the quantization error truly becomes noise, and the process indeed yields noise shaping.

In digital audio

Noise shaping in audio is most commonly applied as a bit-reduction scheme. The most basic form of dither is flat, white noise. The ear, however, is less sensitive to certain frequencies than others at low levels (see

Equal-loudness contour An equal-loudness contour is a measure of sound pressure level, over the frequency spectrum, for which a listener perceives a constant loudness when presented with pure steady tones. The unit of measurement for loudness levels is the phon and ...

). By using noise shaping the quantization error can be effectively spread around so that more of it is focused on frequencies that can't be heard as well and less of it is focused on frequencies that can. The result is that where the ear is most critical the quantization error can be reduced greatly and where the ears are less sensitive the noise is much greater. This can give a perceived noise reduction of 4 bits compared to straight dither. While 16-bit audio is typically thought to have 96 dB of dynamic range (see quantization distortion calculations), it can actually be increased to 120 dB using noise-shaped dither.

Noise shaping and 1-bit converters

Since around 1989, 1 bit

delta-sigma modulator Delta-sigma (ΔΣ; or sigma-delta, ΣΔ) modulation is a method for encoding analog signals into digital signals as found in an analog-to-digital converter (ADC). It is also used to convert high bit-count, low-frequency digital signals into l ...

s have been used in

analog-to-digital converter In electronics, an analog-to-digital converter (ADC, A/D, or A-to-D) is a system that converts an analog signal, such as a sound picked up by a microphone or light entering a digital camera, into a digital signal. An ADC may also provide ...

s. This involves sampling the audio at a very high rate (2.8224 million samples per second, for example) but only using a single bit. Because only 1 bit is used, this converter only has 6.02 dB of

dynamic range Dynamic range (abbreviated DR, DNR, or DYR) is the ratio between the largest and smallest values that a certain quantity can assume. It is often used in the context of signals, like sound and light. It is measured either as a ratio or as a base ...

. The noise floor, however, is spread throughout the entire non- aliased frequency range below the

Nyquist frequency In signal processing, the Nyquist frequency (or folding frequency), named after Harry Nyquist, is a characteristic of a sampler, which converts a continuous function or signal into a discrete sequence. In units of cycles per second ( Hz), it ...

of 1.4112 MHz. Noise shaping is used to lower the noise present in the audible range (20 Hz to 20 kHz) and increase the noise above the audible range. This results in a broadband dynamic range of only 7.78 dB, but it is not consistent among frequency bands, and in the lowest frequencies (the audible range) the dynamic range is much greater — over 100 dB. Noise shaping is inherently built into the delta-sigma modulators. The 1 bit converter is the basis of the DSD format by Sony. One criticism of the 1 bit converter (and thus the DSD system) is that because only 1 bit is used in both the signal and the feedback loop, adequate amounts of dither cannot be used in the feedback loop and distortion can be heard under some conditions. Most A/D converters made since 2000 use multi-bit or multi-level delta sigma modulators that yield more than 1 bit output so that proper dither can be added in the feedback loop. For traditional

PCM Pulse-code modulation (PCM) is a method used to digitally represent sampled analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the am ...

sampling the signal is then

decimate Decimation, Decimate, or variants may refer to: * Decimation (punishment), punitive discipline * Decimation (signal processing), reduction of digital signal's sampling rate * Decimation (comics), 2006 Marvel crossover spinoff ''House of M'' * ...

d to 44.1 kHz or other appropriate sample rates.

In modern ADCs

Analog Devices Analog Devices, Inc. (ADI), also known simply as Analog, is an American multinational semiconductor company specializing in data conversion, signal processing and power management technology, headquartered in Wilmington, Massachusetts. The ...

uses what they refer to as "Noise Shaping Requantizer", and

Texas Instruments Texas Instruments Incorporated (TI) is an American technology company headquartered in Dallas, Texas, that designs and manufactures semiconductors and various integrated circuits, which it sells to electronics designers and manufacturers globa ...

uses what they refer to as "SNRBoost"Understanding Low-Amplitude Behavior of 11-bit ADCs
(PDF) to lower the noise floor approximately 30db compared to the surrounding frequencies. This comes at a cost of non-continuous operation but produces a nice bathtub shape to the spectrum floor. This can be combined with other techniques such as Bit-Boost to further enhance the resolution of the spectrum.

References

{{Noise Audio engineering Digital signal processing Noise (electronics)