The Teknomo–Fernandez algorithm (TF algorithm), is an efficient algorithm for generating the background image of a given video sequence. By assuming that the background image is shown in the majority of the video, the algorithm is able to generate a good background image of a video in

O(R)

-time using only a small number of

binary operations In mathematics, a binary operation or dyadic operation is a rule for combining two elements (called operands) to produce another element. More formally, a binary operation is an operation of arity two. More specifically, an internal binary op ...

and Boolean Bit operations, which require a small amount of memory and has built-in operators found in many programming languages such as C, C++, and

Java Java (; id, Jawa, ; jv, ꦗꦮ; su, ) is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea to the north. With a population of 151.6 million people, Java is the world's mo ...

History

People tracking from videos usually involves some form of

background subtraction Background may refer to: Performing arts and stagecraft * Background actor * Background artist * Background light * Background music * Background story * Background vocals * ''Background'' (play), a 1950 play by Warren Chetham-Strode Record ...

to segment foreground from background. Once foreground images are extracted, then desired algorithms (such as those for motion tracking, object tracking, and

facial recognition Facial recognition or face recognition may refer to: * Face detection, often a step done before facial recognition * Face perception, the process by which the human brain understands and interprets the face * Pareidolia, which involves, in part, se ...

) may be executed using these images. However,

requires that the background image is already available and unfortunately, this is not always the case. Traditionally, the background image is searched for manually or automatically from the video images when there are no objects. More recently, automatic background generation through

object detection Object detection is a computer technology related to computer vision and image processing that deals with detecting instances of semantic objects of a certain class (such as humans, buildings, or cars) in digital images and videos. Well-researched ...

, medial filtering, medoid filtering,

approximated median filtering An approximation is anything that is intentionally similar but not exactly equal to something else. Etymology and usage The word ''approximation'' is derived from Latin ''approximatus'', from ''proximus'' meaning ''very near'' and the prefix '' ...

, linear predictive filter, non-parametric model,

Kalman filter For statistics and control theory, Kalman filtering, also known as linear quadratic estimation (LQE), is an algorithm that uses a series of measurements observed over time, including statistical noise and other inaccuracies, and produces estima ...

, and adaptive smoothening have been suggested; however, most of these methods have high computational complexity and are resource-intensive. The Teknomo–Fernandez algorithm is also an automatic background generation algorithm. Its advantage, however, is its computational speed of only

O(R)

-time, depending on the resolution

R

of an image and its accuracy gained within a manageable number of frames. Only at least three frames from a video is needed to produce the background image assuming that for every pixel position, the background occurs in the majority of the videos. Furthermore, it can be performed for both grayscale and colored videos.

Assumptions

* The camera is stationary. * The light of the environment changes only slowly relative to the motions of the people in the scene. * The number of people does not occupy the scene for the most of the time at the same place. Generally, however, the algorithm will certainly work whenever the following single important assumption holds:

For each pixel position, the majority of the pixel values in the entire video contain the pixel value of the actual background image (at that position).

As long as each part of the background is shown in the majority of the video, the entire background image needs not to appear in any of its frames. The algorithm is expected to work accurately.

Background image generation

Equations

# For three frames of image sequence

x_1

x_2

, and

x_3

, the background image

B

is obtained using

B = x_3(x_1\oplus x_2)+x_1x_2

#The Boolean mode function

S

of the table occurs when the number of 1 entries is larger than half of the number of images such that

S=\begin 1, & \text \sum_^n x_i\ge\left \lceil \frac n 2 + 1 \right\rceil, \text n\ge 3 \\
0, &\text \end

# For three images, the background image

B

can be taken as the value :::

\bar_1 x_2x_3+x_1\bar_2 x_3+x_1x_2\bar_3+x_1x_2x_3

Background generation algorithm

At the first level, three frames are selected at random from the image sequence to produce a background image by combining them using the first equation. This yields a better background image at the second level. The procedure is repeated until desired level

L

Theoretical accuracy

At level

\ell

, the probability

p_\ell

that the modal bit predicted is the actual modal bit is represented by the equation

p_\ell = (p_)^3 + 3(p_)^2(1-p_)

. The table below gives the computed probability values across several levels using some specific initial probabilities. It can be observed that even if the modal bit at the considered position is at a low 60% of the frames, the probability of accurate modal bit determination is already more than 99% at 6 levels. Probability Table

Space complexity

The space requirement of the Teknomo–Fernandez algorithm is given by the function

O(RF+R3^L)

, depending on the resolution

R

of the image, the number

F

of frames in the video, and the desired number

L

of levels. However, the fact that

L

will probably not exceed 6 reduces the space complexity to

O(RF)

Time complexity

The entire algorithm runs in

O(R)

-time, only depending on the resolution of the image. Computing the modal bit for each bit can be done in

O(1)

-time while the computation of the resulting image from the three given images can be done in

O(R)

-time. The number of the images to be processed in

L

levels is

O(3^L)

. However, since

L \le 6

, then this is actually

O(1)

, thus the algorithm runs in

O(R)

Variants

A variant of the Teknomo–Fernandez algorithm that incorporates the

Monte-Carlo method Monte Carlo methods, or Monte Carlo experiments, are a broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. The underlying concept is to use randomness to solve problems that might be determini ...

named CRF has been developed. Two different configurations of CRF were implemented: CRF9,2 and CRF81,1. Experiments on some colored video sequences showed that the CRF configurations outperform the TF algorithm in terms of accuracy. However, the TF algorithm remains more efficient in terms of processing time.

Applications

Object detection Object detection is a computer technology related to computer vision and image processing that deals with detecting instances of semantic objects of a certain class (such as humans, buildings, or cars) in digital images and videos. Well-researched ...

Face detection Face detection is a computer technology being used in a variety of applications that identifies human faces in digital images. Face detection also refers to the psychological process by which humans locate and attend to faces in a visual scene. ...

Face recognition A facial recognition system is a technology capable of matching a human face from a digital image or a video frame against a database of faces. Such a system is typically employed to authenticate users through ID verification services, an ...

* Pedestrian detection *

Video surveillance Closed-circuit television (CCTV), also known as video surveillance, is the use of video cameras to transmit a signal to a specific place, on a limited set of monitors. It differs from broadcast television in that the signal is not openly tr ...

Motion capture Motion capture (sometimes referred as mo-cap or mocap, for short) is the process of recording the movement of objects or people. It is used in military, entertainment, sports, medical applications, and for validation of computer vision and robo ...

* Human-computer interaction * Content-based video coding * Traffic monitoring * Real-time

gesture recognition Gesture recognition is a topic in computer science and language technology with the goal of interpreting human gestures via mathematical algorithms. It is a subdiscipline of computer vision. Gestures can originate from any bodily motion or sta ...

References

External links

Background Image Generation Using Boolean Operations
– describes the TF algorithm, its assumptions, processes, accuracy, time and space complexity, and sample results.
A Monte-Carlo-based Algorithm for Background Generation
– a variant of the Teknomo–Fernandez algorithm that incorporates the

was developed in this study. {{DEFAULTSORT:Teknomo-Fernandez algorithm Mathematical examples Image processing Computer vision