In computer vision, the trifocal tensor (also tritensor) is a 3×3×3 array of numbers (i.e., a tensor) that incorporates all projective geometric relationships among three views. It relates the coordinates of corresponding points or lines in three views, and it is independent of the scene structure, depending only on the relative motion (i.e., pose) among the three views and their intrinsic calibration parameters. Hence, the trifocal tensor can be considered the generalization of the fundamental matrix to three views. Although the tensor has 27 elements, only 18 of them are actually independent.
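The claim that the tensor ties together corresponding points in three views can be checked numerically. The following is a minimal sketch, not a definitive implementation. It assumes the standard textbook construction (as given, for example, by Hartley and Zisserman) in which the first camera is P1 = [I | 0] and the i-th 3×3 slice is T_i = a_i b_4^T - a_4 b_i^T, with a_i and b_i the columns of the second and third projection matrices. That formula is not stated in this passage, and the helper names `skew` and `trifocal_from_canonical`, as well as the random cameras, are purely illustrative.

```python
import numpy as np


def skew(v):
    """Return the cross-product matrix [v]_x, so skew(v) @ w equals np.cross(v, w)."""
    return np.array([[0.0, -v[2], v[1]],
                     [v[2], 0.0, -v[0]],
                     [-v[1], v[0], 0.0]])


def trifocal_from_canonical(P2, P3):
    """Build the 3x3x3 tensor, ASSUMING the first camera is P1 = [I | 0].

    Slice i is T[i] = a_i b_4^T - a_4 b_i^T, where a_i and b_i are the
    columns of P2 and P3 (assumed textbook formula, see lead-in).
    """
    T = np.empty((3, 3, 3))
    for i in range(3):
        T[i] = np.outer(P2[:, i], P3[:, 3]) - np.outer(P2[:, 3], P3[:, i])
    return T


# Illustrative random cameras; the first one is in canonical form.
rng = np.random.default_rng(0)
P1 = np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = rng.standard_normal((3, 4))
P3 = rng.standard_normal((3, 4))
T = trifocal_from_canonical(P2, P3)

# Project one random 3D point (homogeneous coordinates) into all three views.
X = np.append(rng.standard_normal(3), 1.0)
x1, x2, x3 = P1 @ X, P2 @ X, P3 @ X

# Point-point-point incidence: [x2]_x (sum_i x1[i] * T[i]) [x3]_x should vanish.
M = np.einsum("i,ijk->jk", x1, T)
residual = skew(x2) @ M @ skew(x3)

# Each 3x3 slice is a difference of two outer products, so its rank is at most 2.
slice_ranks = [int(np.linalg.matrix_rank(T[i])) for i in range(3)]
```

In this sketch `residual` is zero up to floating-point error for any scene point, which illustrates that the constraint depends on the cameras rather than on the scene structure.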
There is also a so-called calibrated trifocal tensor, which relates the coordinates of points and lines in three views given their intrinsic parameters and encodes the relative pose of the cameras up to global scale, for a total of 11 independent elements, or degrees of freedom. Having fewer degrees of freedom allows the model to be fitted from fewer correspondences, at the cost of increased nonlinearity.
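The two counts of independent elements can be recovered by a standard parameter-counting argument, sketched here as a plausibility check rather than a derivation. It assumes that a 3×4 projection matrix has 11 degrees of freedom (12 entries up to scale), that a calibrated camera pose has 6 (rotation and translation), and that the overall coordinate frame is unobservable: a 4×4 projective transformation (15 degrees of freedom) in the uncalibrated case, and a rigid motion plus global scale (6 + 1) in the calibrated case.

```latex
\begin{align*}
\text{uncalibrated:} \quad & 3 \times 11 - 15 = 18, \\
\text{calibrated:}   \quad & 3 \times 6 - (6 + 1) = 11.
\end{align*}
```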
Correlation slices
The tensor can also be seen as a collection of three rank-two 3×3 matrices known as its ''correlation slices''. Assuming that the projection matrices of three views are