In
signal processing
Signal processing is an electrical engineering subfield that focuses on analyzing, modifying and synthesizing ''signals'', such as audio signal processing, sound, image processing, images, Scalar potential, potential fields, Seismic tomograph ...
, sampling is the reduction of a
continuous-time signal to a
discrete-time signal. A common example is the conversion of a
sound wave to a sequence of "samples".
A sample is a value of the
signal
A signal is both the process and the result of transmission of data over some media accomplished by embedding some variation. Signals are important in multiple subject fields including signal processing, information theory and biology.
In ...
at a point in time and/or space; this definition differs from
the term's usage in statistics, which refers to a set of such values.
A sampler is a subsystem or operation that extracts samples from a
continuous signal
In mathematical dynamics, discrete time and continuous time are two alternative frameworks within which variables that evolve over time are modeled.
Discrete time
Discrete time views values of variables as occurring at distinct, separate "poi ...
. A theoretical ideal sampler produces samples equivalent to the instantaneous value of the continuous signal at the desired points.
The original signal can be reconstructed from a sequence of samples, up to the
Nyquist limit, by passing the sequence of samples through a
reconstruction filter.
Theory
Functions of space, time, or any other dimension can be sampled, and similarly in two or more dimensions.
For functions that vary with time, let
be a continuous function (or "signal") to be sampled, and let sampling be performed by measuring the value of the continuous function every
seconds, which is called the sampling interval or sampling period.
Then the sampled function is given by the sequence:
:
, for integer values of
.
The sampling frequency or sampling rate,
, is the average number of samples obtained in one second, thus
, with the unit ''samples per second'', sometimes referred to as
hertz
The hertz (symbol: Hz) is the unit of frequency in the International System of Units (SI), often described as being equivalent to one event (or Cycle per second, cycle) per second. The hertz is an SI derived unit whose formal expression in ter ...
, for example 48 kHz is 48,000 ''samples per second''.
Reconstructing a continuous function from samples is done by interpolation algorithms. The
Whittaker–Shannon interpolation formula is mathematically equivalent to an ideal
low-pass filter whose input is a sequence of
Dirac delta functions that are modulated (multiplied) by the sample values. When the time interval between adjacent samples is a constant
, the sequence of delta functions is called a
Dirac comb. Mathematically, the modulated Dirac comb is equivalent to the product of the comb function with
. That mathematical abstraction is sometimes referred to as ''impulse sampling''.
Most sampled signals are not simply stored and reconstructed. The fidelity of a theoretical reconstruction is a common measure of the effectiveness of sampling. That fidelity is reduced when
contains frequency components whose cycle length (period) is less than 2 sample intervals (see ''
Aliasing''). The corresponding frequency limit, in ''cycles per second'' (
hertz
The hertz (symbol: Hz) is the unit of frequency in the International System of Units (SI), often described as being equivalent to one event (or Cycle per second, cycle) per second. The hertz is an SI derived unit whose formal expression in ter ...
), is
cycle/sample ×
samples/second =
, known as the
Nyquist frequency of the sampler. Therefore,
is usually the output of a
low-pass filter, functionally known as an ''anti-aliasing filter''. Without an anti-aliasing filter, frequencies higher than the Nyquist frequency will influence the samples in a way that is misinterpreted by the interpolation process.
Practical considerations
In practice, the continuous signal is sampled using an
analog-to-digital converter (ADC), a device with various physical limitations. This results in deviations from the theoretically perfect reconstruction, collectively referred to as
distortion
In signal processing, distortion is the alteration of the original shape (or other characteristic) of a signal. In communications and electronics it means the alteration of the waveform of an information-bearing signal, such as an audio signal ...
.
Various types of distortion can occur, including:
*
Aliasing. Some amount of aliasing is inevitable because only theoretical, infinitely long, functions can have no frequency content above the Nyquist frequency. Aliasing can be made
arbitrarily small by using a
sufficiently large
In the mathematical areas of number theory and analysis, an infinite sequence or a function is said to eventually have a certain property, if it does not have the said property across all its ordered instances, but will after some instances have ...
order of the anti-aliasing filter.
*
Aperture error results from the fact that the sample is obtained as a time average within a sampling region, rather than just being equal to the signal value at the sampling instant. In a
capacitor-based
sample and hold circuit, aperture errors are introduced by multiple mechanisms. For example, the capacitor cannot instantly track the input signal and the capacitor can not instantly be isolated from the input signal.
*
Jitter or deviation from the precise sample timing intervals.
*
Noise
Noise is sound, chiefly unwanted, unintentional, or harmful sound considered unpleasant, loud, or disruptive to mental or hearing faculties. From a physics standpoint, there is no distinction between noise and desired sound, as both are vibrat ...
, including thermal sensor noise,
analog circuit noise, etc..
*
Slew rate limit error, caused by the inability of the ADC input value to change sufficiently rapidly.
*
Quantization as a consequence of the finite precision of words that represent the converted values.
* Error due to other
non-linear effects of the mapping of input voltage to converted output value (in addition to the effects of quantization).
Although the use of
oversampling can completely eliminate aperture error and aliasing by shifting them out of the passband, this technique cannot be practically used above a few GHz, and may be prohibitively expensive at much lower frequencies. Furthermore, while oversampling can reduce quantization error and non-linearity, it cannot eliminate these entirely. Consequently, practical ADCs at audio frequencies typically do not exhibit aliasing, aperture error, and are not limited by quantization error. Instead, analog noise dominates. At RF and microwave frequencies where oversampling is impractical and filters are expensive, aperture error, quantization error and aliasing can be significant limitations.
Jitter, noise, and quantization are often analyzed by modeling them as random errors added to the sample values. Integration and zero-order hold effects can be analyzed as a form of
low-pass filtering. The non-linearities of either ADC or DAC are analyzed by replacing the ideal
linear function
In mathematics, the term linear function refers to two distinct but related notions:
* In calculus and related areas, a linear function is a function whose graph is a straight line, that is, a polynomial function of degree zero or one. For di ...
mapping with a proposed
nonlinear function.
Applications
Audio sampling
Digital audio
Digital audio is a representation of sound recorded in, or converted into, digital signal (signal processing), digital form. In digital audio, the sound wave of the audio signal is typically encoded as numerical sampling (signal processing), ...
uses
pulse-code modulation (PCM) and digital signals for sound reproduction. This includes analog-to-digital conversion (ADC), digital-to-analog conversion (DAC), storage, and transmission. In effect, the system commonly referred to as digital is in fact a discrete-time, discrete-level analog of a previous electrical analog. While modern systems can be quite subtle in their methods, the primary usefulness of a digital system is the ability to store, retrieve and transmit signals without any loss of quality.
When it is necessary to capture audio covering the entire 20–20,000 Hz range of
human hearing such as when recording music or many types of acoustic events, audio waveforms are typically sampled at 44.1 kHz (
CD), 48 kHz, 88.2 kHz, or 96 kHz. The approximately double-rate requirement is a consequence of the
Nyquist theorem. Sampling rates higher than about 50 kHz to 60 kHz cannot supply more usable information for human listeners. Early
professional audio equipment manufacturers chose sampling rates in the region of 40 to 50 kHz for this reason.
There has been an industry trend towards sampling rates well beyond the basic requirements: such as 96 kHz and even 192 kHz Even though
ultrasonic frequencies are inaudible to humans, recording and mixing at higher sampling rates is effective in eliminating the distortion that can be caused by
foldback aliasing. Conversely, ultrasonic sounds may interact with and modulate the audible part of the frequency spectrum (
intermodulation distortion), ''degrading'' the fidelity. One advantage of higher sampling rates is that they can relax the low-pass filter design requirements for
ADCs and
DACs, but with modern oversampling
delta-sigma-converters this advantage is less important.
The
Audio Engineering Society recommends 48 kHz sampling rate for most applications but gives recognition to 44.1 kHz for CD and other consumer uses, 32 kHz for transmission-related applications, and 96 kHz for higher bandwidth or relaxed
anti-aliasing filtering.
Both Lavry Engineering and J. Robert Stuart state that the ideal sampling rate would be about 60 kHz, but since this is not a standard frequency, recommend 88.2 or 96 kHz for recording purposes.
A more complete list of common audio sample rates is:
Bit depth
Audio is typically recorded at 8-, 16-, and 24-bit depth; which yield a theoretical maximum
signal-to-quantization-noise ratio (SQNR) for a pure
sine wave
A sine wave, sinusoidal wave, or sinusoid (symbol: ∿) is a periodic function, periodic wave whose waveform (shape) is the trigonometric function, trigonometric sine, sine function. In mechanics, as a linear motion over time, this is ''simple ...
of, approximately; 49.93
dB, 98.09 dB, and 122.17 dB. CD quality audio uses 16-bit samples.
Thermal noise
A thermal column (or thermal) is a rising mass of buoyant air, a convective current in the atmosphere, that transfers heat energy vertically. Thermals are created by the uneven heating of Earth's surface from solar radiation, and are an example ...
limits the true number of bits that can be used in quantization. Few analog systems have
signal to noise ratios (SNR) exceeding 120 dB. However,
digital signal processing
Digital signal processing (DSP) is the use of digital processing, such as by computers or more specialized digital signal processors, to perform a wide variety of signal processing operations. The digital signals processed in this manner are a ...
operations can have very high dynamic range, consequently it is common to perform mixing and mastering operations at 32-bit precision and then convert to 16- or 24-bit for distribution.
Speech sampling
Speech signals, i.e., signals intended to carry only human
speech
Speech is the use of the human voice as a medium for language. Spoken language combines vowel and consonant sounds to form units of meaning like words, which belong to a language's lexicon. There are many different intentional speech acts, suc ...
, can usually be sampled at a much lower rate. For most
phoneme
A phoneme () is any set of similar Phone (phonetics), speech sounds that are perceptually regarded by the speakers of a language as a single basic sound—a smallest possible Phonetics, phonetic unit—that helps distinguish one word fr ...
s, almost all of the energy is contained in the 100 Hz – 4 kHz range, allowing a sampling rate of 8 kHz. This is the sampling rate used by nearly all
telephony
Telephony ( ) is the field of technology involving the development, application, and deployment of telecommunications services for the purpose of electronic transmission of voice, fax, or data, between distant parties. The history of telephony is ...
systems, which use the
G.711 sampling and quantization specifications.
Video sampling
Standard-definition television
Standard-definition television (SDTV; also standard definition or SD) is a television system that uses a resolution that is not considered to be either high or enhanced definition. ''Standard'' refers to offering a similar resolution to the ...
(SDTV) uses either 720 by 480
pixels (US
NTSC 525-line) or 720 by 576 pixels (UK
PAL 625-line) for the visible picture area.
High-definition television
High-definition television (HDTV) describes a television or video system which provides a substantially higher image resolution than the previous generation of technologies. The term has been used since at least 1933; in more recent times, it ref ...
(HDTV) uses
720p
720p (720 lines progressive) is a progressive HD signal format with 720 horizontal lines/1280 columns and an aspect ratio (AR) of 16:9, normally known as widescreen HD (1.78:1). All major HD broadcasting standards (such as SMPTE 292M) includ ...
(progressive),
1080i
In high-definition television (HDTV) and video display technology, 1080i is a video display format with 1080 lines of vertical resolution and Interlaced video, interlaced scanning method. This format was once a standard in HDTV. It was particular ...
(interlaced), and
1080p
1080p (1920 × 1080 progressively displayed pixels; also known as Full HD or FHD, and BT.709) is a set of HDTV high-definition video modes characterized by 1,920 pixels displayed across the screen horizontally and 1,080 pixels down the sc ...
(progressive, also known as Full-HD).
In
digital video, the temporal sampling rate is defined as the
frame rateor rather the
field raterather than the notional
pixel clock. The image sampling frequency is the repetition rate of the sensor integration period. Since the integration period may be significantly shorter than the time between repetitions, the sampling frequency can be different from the inverse of the sample time:
* 50 Hz –
PAL video
* 60 / 1.001 Hz ~= 59.94 Hz –
NTSC video
Video
digital-to-analog converters operate in the megahertz range (from ~3 MHz for low quality composite video scalers in early game consoles, to 250 MHz or more for the highest-resolution VGA output).
When analog video is converted to
digital video, a different sampling process occurs, this time at the pixel frequency, corresponding to a spatial sampling rate along
scan lines. A common pixel sampling rate is:
* 13.5 MHz –
CCIR 601,
D1 video
Spatial sampling in the other direction is determined by the spacing of scan lines in the
raster. The sampling rates and resolutions in both spatial directions can be measured in units of lines per picture height.
Spatial
aliasing of high-frequency
luma or
chroma video components shows up as a
moiré pattern.
3D sampling
The process of
volume rendering samples a 3D grid of
voxels to produce 3D renderings of sliced (tomographic) data. The 3D grid is assumed to represent a continuous region of 3D space. Volume rendering is common in medical imaging,
X-ray computed tomography (CT/CAT),
magnetic resonance imaging
Magnetic resonance imaging (MRI) is a medical imaging technique used in radiology to generate pictures of the anatomy and the physiological processes inside the body. MRI scanners use strong magnetic fields, magnetic field gradients, and ...
(MRI),
positron emission tomography (PET) are some examples. It is also used for
seismic tomography and other applications.
Undersampling
When a
bandpass signal is sampled slower than its
Nyquist rate, the samples are indistinguishable from samples of a low-frequency
alias of the high-frequency signal. That is often done purposefully in such a way that the lowest-frequency alias satisfies the
Nyquist criterion, because the bandpass signal is still uniquely represented and recoverable. Such
undersampling is also known as ''bandpass sampling'', ''harmonic sampling'', ''IF sampling'', and ''direct IF to digital conversion.''
Oversampling
Oversampling is used in most modern analog-to-digital converters to reduce the distortion introduced by practical
digital-to-analog converters, such as a
zero-order hold instead of idealizations like the
Whittaker–Shannon interpolation formula.
Complex sampling
Complex sampling (or I/Q sampling) is the simultaneous sampling of two different, but related, waveforms, resulting in pairs of samples that are subsequently treated as
complex numbers. When one waveform,
, is the
Hilbert transform of the other waveform,
, the complex-valued function,
, is called an
analytic signal, whose Fourier transform is zero for all negative values of frequency. In that case, the
Nyquist rate for a waveform with no frequencies ≥ ''B'' can be reduced to just ''B'' (complex samples/sec), instead of
(real samples/sec). More apparently, the
equivalent baseband waveform,
, also has a Nyquist rate of
, because all of its non-zero frequency content is shifted into the interval