HOME

TheInfoList



OR:

Camera resectioning is the process of estimating the parameters of a pinhole camera model approximating the camera that produced a given photograph or video; it determines which incoming light ray is associated with each pixel on the resulting image. Basically, the process determines the
pose Human positions refer to the different physical configurations that the human body can take. There are several synonyms that refer to human positioning, often used interchangeably, but having specific nuances of meaning. *''Position'' is a gen ...
of the pinhole camera. Usually, the camera parameters are represented in a 3 × 4 projection matrix called the '' camera matrix''. The extrinsic parameters define the camera ''
pose Human positions refer to the different physical configurations that the human body can take. There are several synonyms that refer to human positioning, often used interchangeably, but having specific nuances of meaning. *''Position'' is a gen ...
'' (position and orientation) while the intrinsic parameters specify the camera image format (focal length, pixel size, and image origin). This process is often called geometric camera calibration or simply camera calibration, although that term may also refer to photometric camera calibration or be restricted for the estimation of the intrinsic parameters only. Exterior orientation and interior orientation refer to the determination of only the extrinsic and intrinsic parameters, respectively. The classic camera calibration requires special objects in the scene, which is not required in '' camera auto-calibration''. Camera resectioning is often used in the application of stereo vision where the camera projection matrices of two cameras are used to calculate the 3D world coordinates of a point viewed by both cameras.


Formulation

The camera projection matrix is derived from the intrinsic and extrinsic parameters of the camera, and is often represented by the series of transformations; e.g., a matrix of camera intrinsic parameters, a 3 × 3 rotation matrix, and a translation vector. The camera projection matrix can be used to associate points in a camera's image space with locations in 3D world space.


Homogeneous coordinates

In this context, we use \ v\ 1T to represent a 2D point position in ''pixel'' coordinates and _w\ y_w\ z_w\ 1T is used to represent a 3D point position in ''world'' coordinates. In both cases, they are represented in homogeneous coordinates (i.e. they have an additional last component, which is initially, by convention, a 1), which is the most common notation in robotics and rigid body transforms.


Projection

Referring to the pinhole camera model, a camera matrix M is used to denote a projective mapping from ''world'' coordinates to ''pixel'' coordinates. :z_\begin u\\ v\\ 1\end=K\, \begin R & T\end\begin x_\\ y_\\ z_\\ 1\end =M \begin x_\\ y_\\ z_\\ 1\end where M = K\, \begin R & T\end. u,v by convention are the x and y coordinates of the pixel in the camera, K is the intrinsic matrix as described below, and R\,T form the extrinsic matrix as described below. x_,y_,z_ are the coordinates of the source of the light ray which hits the camera sensor in world coordinates, relative to the origin of the world. By dividing the matrix product by z_, the z-coordinate of the camera relative to the world origin, the theoretical value for the pixel coordinates can be found.


Intrinsic parameters

:K=\begin \alpha_ & \gamma & u_ & 0\\ 0 & \alpha_ & v_ & 0\\ 0 & 0 & 1 & 0\end The K contains 5 intrinsic parameters of the specific camera model. These parameters encompass
focal length The focal length of an optical system is a measure of how strongly the system converges or diverges light; it is the inverse of the system's optical power. A positive focal length indicates that a system converges light, while a negative foca ...
,
image sensor format In digital photography, the image sensor format is the shape and size of the image sensor. The image sensor format of a digital camera determines the angle of view of a particular lens when used with a particular sensor. Because the image se ...
, and camera principal point. The parameters \alpha_ = f \cdot m_ and \alpha_ = f \cdot m_ represent focal length in terms of pixels, where m_ and m_ are the inverses of the width and height of a pixel on the projection plane and f is the
focal length The focal length of an optical system is a measure of how strongly the system converges or diverges light; it is the inverse of the system's optical power. A positive focal length indicates that a system converges light, while a negative foca ...
in terms of distance. \gamma represents the skew coefficient between the x and the y axis, and is often 0. u_ and v_ represent the principal point, which would be ideally in the center of the image. Nonlinear intrinsic parameters such as
lens distortion In geometric optics, distortion is a deviation from rectilinear projection; a projection in which straight lines in a scene remain straight in an image. It is a form of aberration in optical systems, optical aberration. Radial distortion Al ...
are also important although they cannot be included in the linear camera model described by the intrinsic parameter matrix. Many modern camera calibration algorithms estimate these intrinsic parameters as well in the form of non-linear optimisation techniques. This is done in the form of optimising the camera and distortion parameters in the form of what is generally known as bundle adjustment.


Extrinsic parameters

\beginR_ & T_ \\ 0_ & 1\end_ R,T are the extrinsic parameters which denote the coordinate system transformations from 3D world coordinates to 3D camera coordinates. Equivalently, the extrinsic parameters define the position of the camera center and the camera's heading in world coordinates. T is the position of the origin of the world coordinate system expressed in coordinates of the camera-centered coordinate system. T is often mistakenly considered the position of the camera. The position, C, of the camera expressed in world coordinates is C = -R^T = -R^T T (since R is a rotation matrix). Camera calibration is often used as an early stage in
computer vision Computer vision is an interdisciplinary scientific field that deals with how computers can gain high-level understanding from digital images or videos. From the perspective of engineering, it seeks to understand and automate tasks that the hum ...
. When a camera is used, light from the environment is focused on an image plane and captured. This process reduces the dimensions of the data taken in by the camera from three to two (light from a 3D scene is stored on a 2D image). Each pixel on the image plane therefore corresponds to a shaft of light from the original scene.


Algorithms

There are many different approaches to calculate the intrinsic and extrinsic parameters for a specific camera setup. The most common ones are: # Direct linear transformation (DLT) method # Zhang's method # Tsai's method # Selby's method (for X-ray cameras)


Zhang's method

Zhang model is a camera calibration method that uses traditional calibration techniques (known calibration points) and self-calibration techniques (correspondence between the calibration points when they are in different positions). To perform a full calibration by the Zhang method at least three different images of the calibration target/gauge are required, either by moving the gauge or the camera itself. If some of the intrinsic parameters are given as data (orthogonality of the image or optical center coordinates) the number of images required can be reduced to two. In a first step, an approximation of the estimated projection matrix H between the calibration target and the image plane is determined using DLT method. Subsequently, applying self-calibration techniques to obtained the image of the absolute conic matrix ink The main contribution of Zhang method is how to extract a constrained instrinsic K and n numbers of R and T calibration parameters from n pose of the calibration target.


Derivation

Assume we have a homography \textbf that maps points x_\pi on a "probe plane" \pi to points x on the image. The circular points I, J = \begin1 & \pm j & 0\end^ lie on both our probe plane \pi and on the absolute conic \Omega_\infty. Lying on \Omega_\infty of course means they are also projected onto the ''image'' of the absolute conic (IAC) \omega, thus x_1^T \omega x_1= 0 and x_2^T \omega x_2= 0. The circular points project as : \begin x_1 & = \textbf I = \begin h_1 & h_2 & h_3 \end \begin 1 \\ j \\ 0 \end = h_1 + j h_2 \\ x_2 & = \textbf J = \begin h_1 & h_2 & h_3 \end \begin 1 \\ -j \\ 0 \end = h_1 - j h_2 \end . We can actually ignore x_2 while substituting our new expression for x_1 as follows: : \begin x_1^T \omega x_1 &= \left ( h_1 + j h_2 \right )^T \omega \left ( h_1 + j h_2 \right ) \\ &= \left ( h_1^T + j h_2^T \right ) \omega \left ( h_1 + j h_2 \right ) \\ &= h_1^T \omega h_1 + j \left ( h_2^T \omega h_2 \right ) \\ &= 0 \end


Tsai's Algorithm

It is a 2-stage algorithm, calculating the pose (3D Orientation, and x-axis and y-axis translation) in first stage. In second stage it computes the focal length, distortion coefficients and the z-axis translation.


Selby's method (for X-ray cameras)

Selby's camera calibration methodBoris Peter Selby et al.
"Patient positioning with X-ray detector self-calibration for image guided therapy"
Australasian Physical & Engineering Science in Medicine, Vol.34, No.3, pages 391–400, 2011
addresses the auto-calibration of X-ray camera systems. X-ray camera systems, consisting of the X-ray generating tube and a solid state detector can be modelled as pinhole camera systems, comprising 9 intrinsic and extrinsic camera parameters. Intensity based registration based on an arbitrary X-ray image and a reference model (as a tomographic dataset) can then be used to determine the relative camera parameters without the need of a special calibration body or any ground-truth data.


See also

*
3D pose estimation 3D pose estimation is a process of predicting the transformation of an object from a user-defined reference pose, given an image or a 3D scan. It arises in computer vision or robotics where the pose or transformation of an object can be used for ...
*
Augmented reality Augmented reality (AR) is an interactive experience that combines the real world and computer-generated content. The content can span multiple sensory modalities, including visual, auditory, haptic, somatosensory and olfactory. AR can be de ...
* Augmented virtuality * Eight-point algorithm * Mixed reality * Pinhole camera model *
Perspective-n-Point Perspective-''n''-Point is the problem of estimating the pose of a calibrated camera given a set of 3D points in the world and their corresponding 2D projections in the image. The camera pose consists of 6 degrees-of-freedom (DOF) which are made up ...
* Rational polynomial coefficient


References


External links

{{external cleanup, date=July 2015
Zhang's Camera Calibration and Tsai's Calibration Software on LGPL licence

Zhang's Camera Calibration Method with Software

C++ Camera Calibration Toolbox with source code

Camera Calibration Toolbox for Matlab

The DLR CalDe and DLR CalLab Camera Calibration Toolbox

Camera Calibration
- Augmented reality lecture at TU Muenchen, Germany

(using ARToolKit)
A Four-step Camera Calibration Procedure with Implicit Image Correction

mrcal: a high-fidelity calibration toolkit with thorough uncertainty propagation
Geometry in computer vision Mixed reality Stereophotogrammetry