Affine shape adaptation is a methodology for iteratively adapting the shape of the smoothing kernels in an
affine group
In mathematics, the affine group or general affine group of any affine space over a field is the group of all invertible affine transformations from the space into itself.
It is a Lie group if is the real or complex field or quaternions.
...
of smoothing kernels to the local image structure in neighbourhood region of a specific image point. Equivalently, affine shape adaptation can be accomplished by iteratively warping a local image patch with affine transformations while applying a rotationally symmetric filter to the warped image patches. Provided that this iterative process converges, the resulting fixed point will be ''affine invariant''. In the area of
computer vision
Computer vision is an Interdisciplinarity, interdisciplinary scientific field that deals with how computers can gain high-level understanding from digital images or videos. From the perspective of engineering, it seeks to understand and automate t ...
, this idea has been used for defining affine invariant interest point operators as well as affine invariant texture analysis methods.
Affine-adapted interest point operators
The interest points obtained from the scale-adapted Laplacian
blob detector or the multi-scale Harris
corner detector with automatic scale selection are invariant to translations, rotations and uniform rescalings in the spatial domain. The images that constitute the input to a computer vision system are, however, also subject to perspective distortions. To obtain interest points that are more robust to perspective transformations, a natural approach is to devise a feature detector that is ''invariant to affine transformations''.
Affine invariance can be accomplished from measurements of the same multi-scale windowed second moment matrix
as is used in the multi-scale Harris operator provided that we extend the regular
scale space
Scale-space theory is a framework for multi-scale signal representation developed by the computer vision, image processing and signal processing communities with complementary motivations from physics and biological vision. It is a formal theor ...
concept obtained by
convolution
In mathematics (in particular, functional analysis), convolution is a mathematical operation on two functions ( and ) that produces a third function (f*g) that expresses how the shape of one is modified by the other. The term ''convolution' ...
with rotationally symmetric Gaussian kernels to an ''affine Gaussian scale-space'' obtained by shape-adapted Gaussian kernels (; ). For a two-dimensional image
, let
and let
be a positive definite 2×2 matrix. Then, a non-uniform Gaussian kernel can be defined as
:
and given any input image
the affine Gaussian scale-space is the three-parameter scale-space defined as
:
Next, introduce an affine transformation
where
is a 2×2-matrix, and define a transformed image
as
:
.
Then, the affine scale-space representations
and
of
and
, respectively, are related according to
:
provided that the affine shape matrices
and
are related according to
:
.
Disregarding mathematical details, which unfortunately become somewhat technical if one aims at a precise description of what is going on, the important message is that ''the affine Gaussian scale-space is closed under affine transformations''.
If we, given the notation
as well as local shape matrix
and an integration shape matrix
, introduce an ''affine-adapted multi-scale second-moment matrix'' according to
:
it can be shown that under any affine transformation
the affine-adapted multi-scale second-moment matrix transforms according to
:
.
Again, disregarding somewhat messy technical details, the important message here is that ''given a correspondence between the image points
and
, the affine transformation
can be estimated from measurements of the multi-scale second-moment matrices
and
in the two domains.
An important consequence of this study is that if we can find an affine transformation
such that
is a constant times the unit matrix, then we obtain a ''fixed-point that is invariant to affine transformations'' (; ). For the purpose of practical implementation, this property can often be reached by in either of two main ways. The first approach is based on ''transformations of the smoothing filters'' and consists of:
* estimating the second-moment matrix
in the image domain,
* determining a new adapted smoothing kernel with covariance matrix proportional to
,
* smoothing the original image by the shape-adapted smoothing kernel, and
* repeating this operation until the difference between two successive second-moment matrices is sufficiently small.
The second approach is based on ''warpings in the image domain'' and implies:
* estimating
in the image domain,
* estimating a local affine transformation proportional to
where
denotes the square root matrix of
,
* warping the input image by the affine transformation
and
* repeating this operation until
is sufficiently close to a constant times the unit matrix.
This overall process is referred to as ''affine shape adaptation'' (; ; ; ; ; ). In the ideal continuous case, the two approaches are mathematically equivalent. In practical implementations, however, the first filter-based approach is usually more accurate in the presence of noise while the second warping-based approach is usually faster.
In practice, the affine shape adaptation process described here is often combined with interest point detection automatic scale selection as described in the articles on
blob detection
In computer vision, blob detection methods are aimed at detecting regions in a digital image that differ in properties, such as brightness or color, compared to surrounding regions. Informally, a blob is a region of an image in which some propert ...
and
corner detection
Corner detection is an approach used within computer vision systems to extract certain kinds of features and infer the contents of an image. Corner detection is frequently used in motion detection, image registration, video tracking, image mo ...
, to obtain interest points that are invariant to the full affine group, including scale changes. Besides the commonly used multi-scale Harris operator, this affine shape adaptation can also be applied to other types of interest point operators such as the Laplacian/Difference of Gaussian blob operator and the determinant of the Hessian (). Affine shape adaptation can also be used for affine invariant texture recognition and affine invariant texture segmentation.
Closely related to the notion of affine shape adaptation is the notion of ''affine normalization'', which defines an ''affine invariant reference frame'' as further described in Lindeberg (,, :Appendix I.3), such that any image measurement performed in the affine invariant reference frame is affine invariant.
See also
*
Blob detection
In computer vision, blob detection methods are aimed at detecting regions in a digital image that differ in properties, such as brightness or color, compared to surrounding regions. Informally, a blob is a region of an image in which some propert ...
*
Corner detection
Corner detection is an approach used within computer vision systems to extract certain kinds of features and infer the contents of an image. Corner detection is frequently used in motion detection, image registration, video tracking, image mo ...
*
Gaussian function
In mathematics, a Gaussian function, often simply referred to as a Gaussian, is a function of the base form
f(x) = \exp (-x^2)
and with parametric extension
f(x) = a \exp\left( -\frac \right)
for arbitrary real constants , and non-zero . It i ...
*
Harris affine region detector
In the fields of computer vision and image analysis, the Harris affine region detector belongs to the category of feature detection. Feature detection is a preprocessing step of several algorithms that rely on identifying characteristic points or ...
*
Hessian affine region detector
*
Scale space
Scale-space theory is a framework for multi-scale signal representation developed by the computer vision, image processing and signal processing communities with complementary motivations from physics and biological vision. It is a formal theor ...
References
*
*
*
*
*
*
*
*
*
*
{{DEFAULTSORT:Affine Shape Adaptation
Feature detection (computer vision)