Predictive learning is a

machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...

(ML) technique where an

artificial intelligence Artificial intelligence (AI) is the capability of computer, computational systems to perform tasks typically associated with human intelligence, such as learning, reasoning, problem-solving, perception, and decision-making. It is a field of re ...

model is fed new data to develop an understanding of its environment, capabilities, and limitations. This technique finds application in many areas, including

neuroscience Neuroscience is the scientific study of the nervous system (the brain, spinal cord, and peripheral nervous system), its functions, and its disorders. It is a multidisciplinary science that combines physiology, anatomy, molecular biology, ...

business Business is the practice of making one's living or making money by producing or Trade, buying and selling Product (business), products (such as goods and Service (economics), services). It is also "any activity or enterprise entered into for ...

robotics Robotics is the interdisciplinary study and practice of the design, construction, operation, and use of robots. Within mechanical engineering, robotics is the design and construction of the physical structures of robots, while in computer s ...

, and

computer vision Computer vision tasks include methods for image sensor, acquiring, Image processing, processing, Image analysis, analyzing, and understanding digital images, and extraction of high-dimensional data from the real world in order to produce numerical ...

. This concept was developed and expanded by French computer scientist

Yann LeCun Yann André Le Cun ( , ; usually spelled LeCun; born 8 July 1960) is a French-American computer scientist working primarily in the fields of machine learning, computer vision, mobile robotics and computational neuroscience. He is the Silver Pr ...

in 1988 during his career at

Bell Labs Nokia Bell Labs, commonly referred to as ''Bell Labs'', is an American industrial research and development company owned by Finnish technology company Nokia. With headquarters located in Murray Hill, New Jersey, Murray Hill, New Jersey, the compa ...

, where he trained models to detect handwriting so that financial companies could automate check processing. The mathematical foundation for predictive learning dates back to the 17th century, where British insurance company Lloyd's used

predictive analytics Predictive analytics encompasses a variety of Statistics, statistical techniques from data mining, Predictive modelling, predictive modeling, and machine learning that analyze current and historical facts to make predictions about future or other ...

to make a profit. Starting out as a mathematical concept, this method expanded the possibilities of artificial intelligence. Predictive learning is an attempt to learn with a minimum of pre-existing mental structure. It was inspired by

Jean Piaget Jean William Fritz Piaget (, ; ; 9 August 1896 – 16 September 1980) was a Swiss psychologist known for his work on child development. Piaget's theory of cognitive development and epistemological view are together called genetic epistemology. ...

's account of children constructing knowledge of the world through interaction.

Gary Drescher Gary L. Drescher is a scientist in the field of artificial intelligence (AI), and author of multiple books on AI, including ''Made-Up Minds: A Constructivist Approach to Artificial Intelligence''. His book describes a theory of how a computer pr ...

's book ''Made-up Minds'' was crucial to the development of this concept. The idea that predictions and unconscious inference are used by the brain to construct a model of the world, in which it can identify causes of

percepts Perception () is the organization, identification, and interpretation of sensory information in order to represent and understand the presented information or environment. All perception involves signals that go through the nervous system ...

, goes back even further to

Hermann von Helmholtz Hermann Ludwig Ferdinand von Helmholtz (; ; 31 August 1821 – 8 September 1894; "von" since 1883) was a German physicist and physician who made significant contributions in several scientific fields, particularly hydrodynamic stability. The ...

's iteration of this study. These ideas were further developed by the field of

predictive coding In neuroscience, predictive coding (also known as predictive processing) is a theory of brain function which postulates that the brain is constantly generating and updating a " mental model" of the environment. According to the theory, such a men ...

. Another related predictive learning theory is

Jeff Hawkins Jeffrey Hawkins is an American businessman, computer scientist, neuroscientist and engineer. He co-founded Palm Computing — where he co-created the PalmPilot and Treo — and Handspring. He subsequently turned to work on neuroscience, fou ...

memory-prediction framework The memory-prediction framework is a theory of brain function created by Jeff Hawkins and described in his 2004 book ''On Intelligence''. This theory concerns the role of the mammalian neocortex and its associations with the hippocampus, hippocamp ...

, which is laid out in his book ''

On Intelligence ''On Intelligence: How a New Understanding of the Brain will Lead to the Creation of Truly Intelligent Machines'' is a 2004 book by Jeff Hawkins and Sandra Blakeslee. The book explains Hawkins' memory-prediction framework theory of the brain an ...

''.

Mathematical procedures

Training process

Similar to ML, predictive learning aims to extrapolate the value of an unknown dependent variable

Y

, given independent input data

X=(x_1,x_2,\dots,x_n)

. A set of attributes can be classified into categorical data (discrete factors such as race, sex, or affiliation) or numerical data (continuous values such as temperature, annual income, or speed). Every set of input values is fed into a

neural network A neural network is a group of interconnected units called neurons that send signals to one another. Neurons can be either biological cells or signal pathways. While individual neurons are simple, many of them together in a network can perfor ...

to predict a value

y

. In order to predict the output accurately, the weights of the neural network (which represent how much each predictor variable affects the outcome) must be incrementally adjusted via

backpropagation In machine learning, backpropagation is a gradient computation method commonly used for training a neural network to compute its parameter updates. It is an efficient application of the chain rule to neural networks. Backpropagation computes th ...

to produce estimates closer to the actual data. Once an ML model is given enough adjustments through training to predict values closer to the

ground truth Ground truth is information that is known to be real or true, provided by direct observation and measurement (i.e. empirical evidence) as opposed to information provided by inference. Etymology The ''Oxford English Dictionary'' (s.v. ''ground ...

, it should be able to correctly predict outputs of new data with little

error An error (from the Latin , meaning 'to wander'Oxford English Dictionary, s.v. “error (n.), Etymology,” September 2023, .) is an inaccurate or incorrect action, thought, or judgement. In statistics, "error" refers to the difference between t ...

Maximizing accuracy

In order to ensure maximum accuracy for a predictive learning model, the predicted values

\hat=F(x)

must not exceed a certain error threshold when compared to actual values

y

by the risk formula: :

R(F) = E_L(y, F(x))

, where

L

is the

loss function In mathematical optimization and decision theory, a loss function or cost function (sometimes also called an error function) is a function that maps an event or values of one or more variables onto a real number intuitively representing some "cost ...

y

is the ground truth, and

F(x)

is the predicted data. This error function is used to make incremental adjustments to the model's weights to eventually reach a well-trained prediction of: :

F^(x) = \underset\, E_

L(y,F(x))

Once the error is negligible or considered small enough after training, the model is said to have converged.

Ensemble learning

In some cases, using a singular machine learning approach is not enough to create an accurate estimate for certain data.

Ensemble learning In statistics and machine learning, ensemble methods use multiple learning algorithms to obtain better predictive performance than could be obtained from any of the constituent learning algorithms alone. Unlike a statistical ensemble in statist ...

is the combination of several ML algorithms to create a stronger model. Each model is represented by the function :

F(x)=a_0 + \sum_^M a_m f_m(x)

, where

M

is the number of ensemble models,

a_0

is the bias,

a_m

is the weight corresponding to each

m

-th variable, and

f_m(x)

is the

activation function The activation function of a node in an artificial neural network is a function that calculates the output of the node based on its individual inputs and their weights. Nontrivial problems can be solved using only a few nodes if the activation f ...

corresponding to each variable. An ensemble learning model is represented as a

linear combination In mathematics, a linear combination or superposition is an Expression (mathematics), expression constructed from a Set (mathematics), set of terms by multiplying each term by a constant and adding the results (e.g. a linear combination of ''x'' a ...

of the predictions from each constituent approach, :

\hat_m = \underset \sum_^N L\left(y_i, a_0 + \sum_^M a_m f_m(x_i)\right) + \lambda \sum_^M , a_m,

where

y_i

is the actual value, the second parameter is the value predicted by each constituent method, and

\lambda

is a coefficient representing each model's variation for a certain predictor variable.

Applications

Cognitive development

Sensorimotor signals are neural impulses sent to the brain upon physical touch. Using predictive learning to detect sensorimotor signals plays a key role in early

cognitive development Cognitive development is a field of study in neuroscience and psychology focusing on a child's development in terms of information processing, conceptual resources, perceptual skill, language learning, and other aspects of the developed adult bra ...

, as the human brain represents sensorimotor signals in a predictive manner (it attempts to minimize prediction error between incomin
sensory signals
an
top–down prediction
. In order to update an unadjusted predictor, it must be trained through sensorimotor experiences because it does not inherently have prediction ability. In a recent research paper, Dr. Yukie Nagai suggested
new architecture
in predictive learning to predict sensorimotor signals based on a two-module approach: a sensorimotor system which interacts with the environment and a predictor which simulates the sensorimotor system in the brain.

Spatiotemporal memory

Computers use predictive learning in spatiotemporal memory to completely create an image given constituent frames. This implementation uses predictive

recurrent neural networks Recurrent neural networks (RNNs) are a class of artificial neural networks designed for processing sequential data, such as text, speech, and time series, where the order of elements is important. Unlike feedforward neural networks, which proces ...

, which are neural networks designed to work with sequential data, such as a

time series In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. ...

. Using predictive learning in conjunction with computer vision enables computers to create images of their own, which can be helpful when replicating sequential phenomena such as replicating DNA strands, face recognition, or even creating X-ray images.

Social media consumer behavior

In a recent study, data on consumer behavior was collected from various social media platforms such as Facebook, Twitter, LinkedIn, YouTube, Instagram, and Pinterest. The usage of predictive learning analytics led researchers to discover various trends in consumer behavior, such as determining how successful a campaign could be, estimating a fair price for a product to attract consumers, assessing how secure data is, and analyzing the specific audience of the consumers they could target for specific products.

References

{{DEFAULTSORT:Predictive Learning Machine learning