Biological motion perception is the act of perceiving the fluid unique motion of a biological agent. The phenomenon was first documented by Swedish perceptual psychologist, Gunnar Johansson, in 1973.
There are many
brain areas involved in this process, some similar to those used to perceive
faces. While humans complete this process with ease, from a
computational neuroscience perspective there is still much to be learned as to how this complex perceptual problem is solved. One tool which many research studies in this area use is a display
stimuli called a point light walker. Point light walkers are coordinated moving dots that simulate biological motion in which each dot represents specific joints of a human performing an action.
Currently a large topic of research, many different models of biological motion/perception have been proposed. The following models have shown that both
form
Form is the shape, visual appearance, or configuration of an object. In a wider sense, the form is the way something happens.
Form also refers to:
*Form (document), a document (printed or electronic) with spaces in which to write or enter data
...
and
motion are important components of biological motion perception. However, to what extent each of the components play is contrasted upon the models.
Neuroanatomy
Research in this area seeks to identify the specific brain regions or circuits responsible for processing the information which the
visual system perceives in the world. And in this case, specifically recognizing motion created by biological agents.
Single Cell Recording
The most precise research is done using
single-cell recordings in the primate brain. This research has yielded areas important to motion perception in primates such as area MT (middle
temporal visual area), also referred to as V5, and area MST (
medial superior temporal area The medial superior temporal (MST) area is a part of the cerebral cortex, which lies in the dorsal stream of the visual area of the primate
Primates are a diverse order of mammals. They are divided into the strepsirrhines, which include the ...
). These areas contain cells characterized as direction cells, expansion/contraction cells, and rotation cells, which react to certain classes of movement.
Neuroimaging
Additionally, research on human participants is being conducted as well. While single-cell recording is not conducted on humans, this research uses
neuroimaging methods such as
fMRI,
PET,
EEG/
ERP to collect information on what brain areas become active when executing biological motion perception tasks, such as viewing point light walker stimuli. Areas uncovered from this type of research are the dorsal visual pathway,
extrastriate body area,
fusiform gyrus,
superior temporal sulcus, and
premotor cortex. The
dorsal visual pathway (sometimes referred to as the “where” pathway), as contrasted with the
ventral visual pathway (“what” pathway), has been shown to play a significant role in the perception of motion cues. While the ventral pathway is more responsible for form cues.
Neuropsychological Damage
Valuable information can also be learned from cases where a patient has suffered from some sort of
neurological
Neurology (from el, νεῦρον (neûron), "string, nerve" and the suffix -logia, "study of") is the branch of medicine dealing with the diagnosis and treatment of all categories of conditions and disease involving the brain, the spinal ...
damage and consequently loses certain functionalities of neural processing. One patient with bilateral
lesions that included the human homologue of area MT, lost their ability to see biological motion when the stimulus was embedded in noise, a task which the average observer is able to complete. Another study on
stroke
A stroke is a medical condition in which poor blood flow to the brain causes cell death. There are two main types of stroke: ischemic, due to lack of blood flow, and hemorrhagic, due to bleeding. Both cause parts of the brain to stop functionin ...
patients sustaining lesions to their superior temporal and premotor
frontal areas showed deficits in their processing of biological motion stimuli, thereby implicating these areas as important to that perception process. A
case study conducted on a patient with bilateral lesions involving the posterior
visual pathways
The visual system comprises the sensory organ (the eye) and parts of the central nervous system (the retina containing photoreceptor cells, the optic nerve, the optic tract and the visual cortex) which gives organisms the sense of sight (th ...
and effecting the lateral
parietal-temporal-occipital
The parietal-temporal-occipital (PTO) association area is located in the cerebral cortex which is bounded by the anterior somatic association area and posterior visual cortex of the human brain. As its name implies, the PTO includes portions of t ...
cortex struggled with early motion tasks, and yet was able to perceive the biological motion of a point light walker, a higher-order task. This may be due to the fact that area V3B and area KO were still intact, suggesting their possible roles in biological motion perception.
Biological Motion Perception Models
Cognitive Model of Biological Motion Form (Lange & Lappe, 2006)
Background
The relative roles of form cues compared to motion cues in the process of perceiving biological motion is unclear. Previous research has not untangled the circumstances under which local motion cues are needed or only additive. This model looks at how form-only cues can replicate psychophysical results of biological motion perception.
Model
''Template Creation''
Same as below. See 2.2.2 Template Generation
''Stage 1''
The first stage compares stimulus images to the assumed library of upright human walker templates in memory. Each dot in a given stimulus frame is compared to the nearest limb location on a template and these combined, weighted distances are outputted by the function:
::
where
gives the position of a particular stimulus dot and
represents the nearest limb position in the template.
represents the size of the receptor field to adjust for the size of the stimulus figure.
The best fitting template was then selected by a
winner-takes-all
Winner(s) take(s) (it) all may refer to:
Competition, economics and politics
* Winner-takes-all voting
* Winner-take-all (computing)
* Winner-take-all market
Books Fiction
* Winner Takes All (novel), ''Winner Takes All'' (novel), a BBC Books Doc ...
mechanism and entered into a
leaky integrator
In mathematics, a leaky integrator equation is a specific differential equation, used to describe a component or system that takes the integral of an input, but gradually leaks a small amount of input over time. It appears commonly in hydraulics, ...
:
::
where
and
are the weights for lateral excitation and inhibition, respectively, and the activities
provide the left/right decision for which direction the stimulus is facing.
''Stage 2''
The second stage attempts to use the
temporal order of the stimulus frames to change the expectations of what frame would be coming next. The equation
::
takes into account
bottom-up input from stage 1
, the activities in decision stage 2 for the possible responses
, and weights the difference between selected frame
and previous frame
.
''Implications''
This model highlights the abilities of form-related cues to detect biological motion and orientation in a neurologically feasible model. The results of the Stage 1 model showed that all behavioral data could be replicated by using form information alone - global motion information was not necessary to detect figures and their orientation. This model shows the possibility of the use of form cues, but can be criticized for a lack of
ecological validity
In the behavioral sciences, ecological validity is often used to refer to the judgment of whether a given study's variables and conclusions (often collected in lab) are sufficiently relevant to its population (e.g. the "real world" context). Psycho ...
. Humans do not detect biological figures in static environments and motion is an inherent aspect in upright figure recognition.
Action Recognition by Motion Detection in Posture Space (Theusner, Lussanet, and Lappe, 2014)
Overview
Old models of
biological motion perception are concerned with
tracking
Tracking may refer to:
Science and technology Computing
* Tracking, in computer graphics, in match moving (insertion of graphics into footage)
* Tracking, composing music with music tracker software
* Eye tracking, measuring the position of t ...
joint and limb motion relative to one another over time.
However, recent experiments in biological motion perception have suggested that motion information is unimportant for action recognition.
This model shows how biological motion may be perceived from sequences of
posture recognition, rather than from the direct perception of motion information. An experiment was conducted to test the validity of this model, in which subjects are presented moving point-light and stick-figure walking
stimuli. Each frame of the walking stimulus is matched to a posture template, the progression of which is recorded on a 2D posture–time
plot
Plot or Plotting may refer to:
Art, media and entertainment
* Plot (narrative), the story of a piece of fiction
Music
* ''The Plot'' (album), a 1976 album by jazz trumpeter Enrico Rava
* The Plot (band), a band formed in 2003
Other
* ''Plot' ...
that implies motion recognition.
Posture Model
''Template Generation''
Posture templates for stimulus matching were constructed with motion tracking data from nine people walking.
3D coordinates
A Cartesian coordinate system (, ) in a plane is a coordinate system that specifies each point uniquely by a pair of numerical coordinates, which are the signed distances to the point from two fixed perpendicular oriented lines, measured in t ...
of the twelve major joints (feet, knees, hips, hands, elbows, and shoulders) were tracked and
interpolated between to generate limb motion. Five sets of 2D projections were created: leftward, frontward, rightward, and the two 45° intermediate orientations. Finally, projections of the nine walkers were
normalized for walking speed (1.39 seconds at 100 frames per cycle), height, and hip location in posture space. One of the nine walkers was chosen as the walking stimulus, and the remaining eight were kept as templates for matching.
''Template Matching''
Template matching is computed by simulating posture selective
neurons
A neuron, neurone, or nerve cell is an electrically excitable cell that communicates with other cells via specialized connections called synapses. The neuron is the main component of nervous tissue in all animals except sponges and placozoa. N ...
as described by
A neuron is excited by similarity to a static frame of the walker stimulus. For this experiment, 4,000 neurons were generated (8 walkers times 100 frames per cycle times 5 2D projections). A neuron's similarity to a frame of the stimulus is calculated as follows:
::
where
describe a stimulus point and
describe the limb location at time
;
describes the preferred posture;
describes a neuron's response to a stimulus of
points; and
describes limb width.
''Response Simulation''
The neuron most closely resembling the posture of the walking stimulus changes over time. The neural activation pattern can be graphed in a 2D plot, called a posture-time plot. Along the x axis, templates are sorted chronologically according to a forward walking pattern. Time progresses along the y axis with the beginning corresponding to the origin. The perception of forward walking motion is represented as a line with a positive
slope from the origin, while backward walking is conversely represented as a line with a negative slope.
Motion Model
''Motion Detection in Posture Space''
The posture-time plots used in this model follow the established space-time plots used for describing object motion.
Space-time plots with time at the y axis and the spatial dimension at the x axis, define
velocity of an object by the slope of the line. Information about an object's motion can be detected by spatio-temporal
filters
Filter, filtering or filters may refer to:
Science and technology
Computing
* Filter (higher-order function), in functional programming
* Filter (software), a computer program to process a data stream
* Filter (video), a software component tha ...
.
In this biological motion model, motion is detected similarly but replaces the spatial dimension for posture space along the x axis, and body motion is detected by using posturo-temporal filters rather than spatio-temporal filters.
''Posturo-Temporal Filters''
Neural responses are first normalized as described by
::
where
describes the neural response;
describes the preferred posture at time
;
describes the mean neural response over all neurons over
; and
describes the normalized response. The filters are defined for forward and backward walking (
respectively). The response of the posturo-temporal filter is described
::
where
is the response of the filter at time
; and
describes the posture dimension. The response of the filter is normalized by
::