Imagined speech (also called silent speech, covert speech, inner speech, or, in the original Latin terminology used by clinicians, endophasia) is thinking in the form of sound – "hearing" one's own voice silently to oneself, without the intentional movement of any extremities such as the lips, tongue, or hands.
[Brigham, K.; Vijaya Kumar, B.V.K., "Imagined Speech Classification with EEG Signals for Silent Communication: A Preliminary Investigation into Synthetic Telepathy", June 2010.] Logically, imagined speech has been possible since the emergence of language; however, the phenomenon is most associated with its investigation through signal processing [Brigham, K.; Vijaya Kumar, B.V.K., "Subject Identification from Electroencephalogram (EEG) Signals During Imagined Speech", September 2010.] and detection within electroencephalograph (EEG) data [A. Porbadnigk; M. Wester; Schultz, T., "EEG-Based Speech Recognition: Impact of Temporal Effects", 2009.] as well as data obtained using alternative non-invasive brain–computer interface (BCI) devices. [Robert Bogue, "Brain-computer interfaces: control by thought", Industrial Robot, Vol. 37 Iss: 2, pp. 126–132, 2010.]
History
In 2008, the US Defense Advanced Research Projects Agency (DARPA) provided a $4 million grant to the University of California (Irvine), with the intent of providing a foundation for synthetic telepathy. According to DARPA, the project "will allow user-to-user communication on the battlefield without the use of vocalized speech through neural signals analysis. The brain generates word-specific signals prior to sending electrical impulses to the vocal cords. These ''imagined speech'' signals would be analyzed and translated into distinct words allowing covert person-to-person communication."
In his "Impossible Languages" (2016), Andrea Moro discusses the "sound of thoughts" and the relationship between linguistic units and imagined speech, drawing mainly on Magrassi et al. (2015), "Sound representation in higher language areas during language production".
DARPA's program outline has three major goals:
:* To attempt to identify EEG patterns unique to individual words
:* To ensure these patterns are common to different users to avoid extensive device training
:* To construct a prototype that would decode the signals and transmit them over a limited range
Detection methods
The process for analyzing subjects' ''silent speech'' consists of recording the subjects' brain waves and then using a computer to process the data and determine the content of the subjects' ''covert speech''.
Recording
Subjects' neural patterns (brain waves) can be recorded using BCI devices. Currently, non-invasive devices, specifically EEG, are of greater interest to researchers than invasive and partially invasive types, because non-invasive types pose the least risk to subject health. EEGs have attracted the greatest interest because they offer the most user-friendly approach and have far less complex instrumentation than functional magnetic resonance imaging (fMRI), another commonly used non-invasive BCI.
Processing
The first step in processing non-invasive data is to remove artifacts such as eye movement and blinking, as well as other electromyographic activity. After artifact removal, a series of algorithms is used to translate the raw data into the ''imagined speech'' content.
Processing is also intended to occur in real time: the information is processed as it is recorded, which allows for near-simultaneous viewing of the content as the subject imagines it.
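As a rough illustration of the artifact-removal step, the sketch below discards fixed-length epochs of a single-channel recording whose peak-to-peak amplitude exceeds a threshold, a simple heuristic for blinks and muscle bursts. The function name `reject_artifact_epochs`, the epoch length, and the 100 µV threshold are assumptions chosen for illustration and are not taken from the cited studies; real pipelines typically use more elaborate methods.

```python
from typing import List


def reject_artifact_epochs(
    samples: List[float],
    epoch_len: int,
    max_peak_to_peak_uv: float = 100.0,  # assumed threshold, in microvolts
) -> List[List[float]]:
    """Split a single-channel recording into epochs and keep the clean ones."""
    if epoch_len <= 0:
        raise ValueError("epoch_len must be positive")
    clean = []
    # Step through the recording in non-overlapping windows; a trailing
    # partial window is ignored.
    for start in range(0, len(samples) - epoch_len + 1, epoch_len):
        epoch = samples[start:start + epoch_len]
        # Large swings within one epoch are treated as artifacts.
        if max(epoch) - min(epoch) <= max_peak_to_peak_uv:
            clean.append(epoch)
    return clean


# Toy recording (hypothetical values): two quiet epochs around one that
# contains a blink-like spike.
recording = [1.0, -2.0, 3.0, 0.5, 2.0, 250.0, -40.0, 1.0, -1.0, 2.0, 0.0, 1.5]
kept = reject_artifact_epochs(recording, epoch_len=4)
```

In this toy case the middle epoch is dropped and the two quiet epochs are kept. Because each epoch is judged on its own, the same check could in principle run as data arrives, which fits the real-time goal described above.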
Decoding
Presumably, "thinking in the form of sound" recruits auditory and language areas whose activation profiles may be extracted from the EEG, given adequate processing. The goal is to relate these signals to a template that represents "what the person is thinking about". This template could, for instance, be the acoustic envelope (energy) time series corresponding to the sound as if it were physically uttered. Such a linear mapping from EEG to stimulus is an example of neural decoding.
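A linear mapping from EEG to an acoustic envelope can be sketched as an ordinary least-squares fit. The example below is a minimal illustration under stated assumptions: each time point is a row of channel values, the decoder is a single weight per channel with no time lags, and the helper names (`solve_linear_system`, `fit_linear_decoder`, `decode`) and the toy data are hypothetical. Decoders used in practice usually add time-lagged features and regularization.

```python
from typing import List

Matrix = List[List[float]]


def solve_linear_system(a: Matrix, b: List[float]) -> List[float]:
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the inputs are left untouched.
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular system")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(col + 1, n):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[row][k] -= factor * aug[col][k]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(aug[row][k] * x[k] for k in range(row + 1, n))
        x[row] = (aug[row][n] - tail) / aug[row][row]
    return x


def fit_linear_decoder(eeg: Matrix, envelope: List[float],
                       ridge: float = 0.0) -> List[float]:
    """Weights w minimizing sum_t (sum_c w[c] * eeg[t][c] - envelope[t])^2."""
    n_ch = len(eeg[0])
    # Normal equations: (X^T X + ridge * I) w = X^T y
    xtx = [[sum(row[i] * row[j] for row in eeg) + (ridge if i == j else 0.0)
            for j in range(n_ch)] for i in range(n_ch)]
    xty = [sum(row[i] * y for row, y in zip(eeg, envelope))
           for i in range(n_ch)]
    return solve_linear_system(xtx, xty)


def decode(eeg: Matrix, weights: List[float]) -> List[float]:
    """Apply the fitted weights to reconstruct an envelope estimate."""
    return [sum(w * v for w, v in zip(weights, row)) for row in eeg]


# Toy data (hypothetical): two channels, four time points, and an envelope
# constructed as 2 * channel0 - 1 * channel1.
eeg = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 1.0]]
envelope = [2.0, -1.0, 1.0, 3.0]
weights = fit_linear_decoder(eeg, envelope)
reconstructed = decode(eeg, weights)
```

On this noise-free toy data the fit recovers the constructing weights; with real EEG the reconstruction is only partial, which is why the quality of the fit is usually reported as a correlation with the target envelope.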
A major problem, however, is the many variations that the very same message can have under diverse physical conditions (speaker or noise, for example). Hence one can have the same EEG signal, yet it remains uncertain, at least in acoustic terms, which stimulus to map it to. This in turn makes it difficult to train the relevant decoder.
This process could instead be approached using higher-order ('linguistic') representations of the message. The mappings to such representations are non-linear and can be heavily context-dependent, so further research may be necessary. Nevertheless, an 'acoustic' strategy can still be maintained by pre-setting a "template", that is, by making known to the listener exactly what message to think about, even if passively and in a non-explicit form. In these circumstances it is possible to partially decode the acoustic envelope of the speech message from neural time series if the listener is induced to think in the form of sound.
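One way to picture the pre-set template strategy is to compare a decoded envelope against a small set of known candidate envelopes and pick the closest one. The sketch below uses Pearson correlation for that comparison; the function names, the template labels, and all numeric values are hypothetical, and the source does not specify this particular matching rule.

```python
import math
from typing import Dict, List


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation coefficient between two equal-length series."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("series must have equal length of at least 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0.0 or var_y == 0.0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_x * var_y)


def best_matching_template(decoded: List[float],
                           templates: Dict[str, List[float]]) -> str:
    """Return the name of the template most correlated with the decoding."""
    return max(templates, key=lambda name: pearson(decoded, templates[name]))


# Hypothetical acoustic envelopes for two pre-set messages, and a noisy,
# partially decoded envelope that resembles the first one.
templates = {
    "message_a": [0.1, 0.9, 0.2, 0.8, 0.1],
    "message_b": [0.9, 0.1, 0.8, 0.2, 0.9],
}
decoded = [0.2, 0.7, 0.3, 0.6, 0.2]
best = best_matching_template(decoded, templates)
```

Correlation ignores overall scale and offset, so a decoding that recovers only the shape of the envelope can still be matched to the right template.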
Challenges
In the detection of other imagined actions, such as imagined physical movements, greater brain activity occurs in one hemisphere than in the other. This asymmetrical activity is a major aid in identifying the subject's imagined action. In imagined speech detection, equal levels of activity commonly occur in the left and right hemispheres simultaneously. This lack of lateralization poses a significant challenge in analyzing neural signals of this type.
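Hemispheric asymmetry is often summarized with a laterality index, (L − R) / (L + R), computed from some activity measure per hemisphere. The sketch below uses mean signal power as that measure; the function names and sample values are assumptions for illustration only.

```python
from typing import List


def mean_power(samples: List[float]) -> float:
    """Average squared amplitude of a signal."""
    if not samples:
        raise ValueError("samples must not be empty")
    return sum(s * s for s in samples) / len(samples)


def laterality_index(left: List[float], right: List[float]) -> float:
    """(L - R) / (L + R): +1 is fully left, -1 fully right, 0 symmetric."""
    p_left = mean_power(left)
    p_right = mean_power(right)
    if p_left + p_right == 0.0:
        raise ValueError("both signals have zero power")
    return (p_left - p_right) / (p_left + p_right)


# Hypothetical signals: an asymmetric case (as in imagined movement) and a
# symmetric case (as commonly reported for imagined speech).
li_movement = laterality_index([3.0, -3.0, 3.0, -3.0], [1.0, -1.0, 1.0, -1.0])
li_speech = laterality_index([2.0, -2.0, 2.0, -2.0], [2.0, -2.0, 2.0, -2.0])
```

An index near zero, as in the second case, gives a classifier no hemispheric cue to work with, which is the difficulty described above.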
Another challenge is the relatively low signal-to-noise ratio (SNR) of the recorded data. The SNR compares the amount of meaningful signal in a data set with the amount of arbitrary or useless signal (noise) in the same set. Artifacts present in EEG data are just one of many significant sources of noise.
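SNR is commonly expressed in decibels as 10 · log10(P_signal / P_noise), where P is mean power. The sketch below computes it for two separately available series; in practice the signal and noise components of an EEG recording are not directly observable and must be estimated, so the inputs and the function name here are assumptions for illustration.

```python
import math
from typing import List


def snr_db(signal: List[float], noise: List[float]) -> float:
    """Signal-to-noise ratio in decibels from mean signal and noise power."""
    if not signal or not noise:
        raise ValueError("signal and noise must not be empty")
    p_signal = sum(s * s for s in signal) / len(signal)
    p_noise = sum(n * n for n in noise) / len(noise)
    if p_signal == 0.0 or p_noise == 0.0:
        raise ValueError("SNR in dB is undefined for zero power")
    return 10.0 * math.log10(p_signal / p_noise)


# Hypothetical example: noise amplitude is one tenth of the signal amplitude,
# so the power ratio is 100, i.e. about 20 dB.
example_snr = snr_db([2.0, -2.0, 2.0, -2.0], [0.2, -0.2, 0.2, -0.2])
```

A value of 0 dB means signal and noise carry equal power, and negative values mean the noise dominates.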
To further complicate matters, the relative placement of EEG electrodes varies among subjects. This is because the anatomical details of people's heads differ; therefore, the recorded signals vary from subject to subject, regardless of individual-specific imagined speech characteristics.