The primary value learned value (PVLV)

model A model is an informative representation of an object, person, or system. The term originally denoted the plans of a building in late 16th-century English, and derived via French and Italian ultimately from Latin , . Models can be divided in ...

is a possible explanation for the reward-predictive firing properties of

dopamine Dopamine (DA, a contraction of 3,4-dihydroxyphenethylamine) is a neuromodulatory molecule that plays several important roles in cells. It is an organic chemical of the catecholamine and phenethylamine families. It is an amine synthesized ...

(DA) neurons. It simulates behavioral and neural data on

Pavlovian conditioning Classical conditioning (also respondent conditioning and Pavlovian conditioning) is a behavioral procedure in which a biologically potent stimulus (e.g. food, a puff of air on the eye, a potential rival) is paired with a neutral stimulus (e.g. ...

and the

midbrain The midbrain or mesencephalon is the uppermost portion of the brainstem connecting the diencephalon and cerebrum with the pons. It consists of the cerebral peduncles, tegmentum, and tectum. It is functionally associated with vision, hearing, mo ...

dopaminergic neurons that fire in proportion to unexpected rewards. It is an alternative to the temporal-differences (TD) algorithm. It is used as part of Leabra.

References

Computational neuroscience Machine learning algorithms {{neuroscience-stub