Background
The "self-administration" behavioral paradigm serves as an animal behavioral model of the human pathology of addiction. During the task, animal subjects are operant conditioned to perform one action, typically a lever press, in order to receive a drug. Reinforcement (through the use of the drug) occurs contingent upon the subject performing the desired behavior. Drug dosing in self-administration studies is response-dependent. This is an important element of creating a disease model of drug addiction in humans because response-independent drug administration is associated with increased toxicity and different neurobiological, neurochemical, and behavioral effects. In summary, the effects of response-dependent drug dosing greatly differ from response-independent drug dosing and self-administration studies appropriately capture this distinction.History
As far back as the mid-20th century, researchers have investigated animals’ drive to consume addictive substances in order to better understand human addictive processes. Spragg was one of the first researchers to create a model of chronic morphinism in a chimpanzee to explore the role of operant conditioning in relation to a drug dependency. When deprived of both food and morphine, chimpanzees would repeatedly attempt to seek out the drug of choice, even doing so much as to physically pull the experimenter into the room housing morphine and syringes. Weeks (1962) published an account of the first true use of the intravenous self-administration paradigm in a study aiming to model morphine addiction in unrestrained rats. For the first time, an addictive substance served as an operant reinforcer and rats self-administered morphine to satiety in stereotyped response patterns. The scientific community quickly adopted the self-administration paradigm as a behavioral means to examine addictive processes and adapted it to non-human primates. Thompson and Schuster (1964) studied the relative reinforcement properties of morphine in restrained rhesus monkeys using intravenous self-administration. Significant changes in response to other types of reinforcers (i.e., food, shock avoidance) were observed in drug-dependent subjects. In 1969, Deneau, Yanagita and Seevers provided macaque monkeys free access to a variety of addictive substances for investigating whether nonhuman primates would voluntarily initiate self-administration of these substances. Initiation and maintenance of self-administration produced dependence and toxicity in monkeys, thereby more closely approximating important aspects of drug addiction in humans and allowing for the first of modern self-administration studies. The procedure of testing the efficacy of a pharmacological agent as a reinforcer would soon become a standard assay. Most frequently, studies were performed in nonhuman primates to identify addictive potential, as required by the drug development process. In 1983, Collins ''et al.'' published a landmark paper in which rats were exposed to a battery of 27 psychoactive substances. The team compared test drug self-administration rates with saline vehicle self-administration rates. If animals self-administered at a rate significantly greater than vehicles, the drug was considered an active reinforcer with addictive potential. With few exceptions, the abuse liability observed in rats paralleled that observed from previous research in monkeys. In light of these similarities between the different animal models, it was identified that the addictive potential of psychoactive substances could be investigated using rats instead of nonhuman primates.Technique
Acquisition
Operant conditioning represents the behavioral paradigm underlying self-administration studies. Although not always required, subjects may be first pre-trained to perform some action, such as a lever press or nose poke to receive a food or water reward (under food- or water- restricted conditions, respectively). Following this initial training, the reinforcer is replaced by a test drug to be administered by one of the following methods: oral, inhalation, intracerebral, intravenous. Intravenous catheterization is used most commonly because it maximizes bioavailability and has a rapid onset, although is inappropriate for drugs taken orally, such as alcohol. Humans suffering from addiction often resort to intravenous drug use for similar reasons, so this route of administration increases the face validity of the construct.Maintenance
Upon presentation of the drug to the subject, a number of experimental variables might be manipulated to test hypotheses:Dose-response relationship
Reinforcement schedules
Continuous reinforcement: A single operant response triggers the dispense of a single dose of the reinforcer. A time-out period may follow each operant response that successfully yields a dose of reinforcer; during this period the lever used in training may be retracted preventing the animal from making further responses. Alternatively, operant responses will fail to produce drug administration allowing previous injections to take effect. Moreover, time-outs also help prevent subjects from overdosing during self-administration experiments. Fixed-ratio studies require a predefined number of operant responses to dispense one unit of the reinforcer. Standard fixed ratio reinforcement schedules include FR5 and FR10, requiring 5 and 10 operant responses to dispense a unit of reinforcer, respectively. Progressive ratio reinforcement schedules utilize a multiplicative increase in the number of operant responses required to dispense a unit of the reinforcer. For example, successive trials might require 5 operant responses per unit of reward, then 10 responses per unit of reward, then 15, and so on. The number of operant responses required per unit of reinforcer may be altered after each trial, each session, or any other time period as defined by the experimenter. Progressive ratio reinforcement schedules provide information about the extent that a pharmacological agent is reinforcing through the breakpoint. The breakpoint is the number of operant responses at which the subject ceases engaging in self-administration, defined by some period of time between operant responses (generally up to an hour). Fixed interval (FI) schedules require that a set amount of time pass between drug infusions, regardless of the number of times that the desired response is performed. This “refractory” period can prevent the animal from overdosing on a drug. Variable interval (VI) schedules of reinforcement are identical to FI schedules, except that the amount of time between reinforced operant responses varies, making it more difficult for the animal to predict when the drug will be delivered. Second-order reinforcement schedules build on basic reinforcement schedules by introducing a conditioned stimulus that has previously been paired with the reinforcer (such as the illumination of a light). Second-order schedules are built from two simpler schedules; completion of the first schedule results in the presentation of an abbreviated version conditioned stimulus, following completion of a fixed-interval, the drug is delivered, alongside the full-length conditioned stimulus. Second-order schedules result in a very high rate of operant responding at the presentation of the conditioned reinforcer becomes reinforcing in its own right. Benefits of this schedule include the ability to investigate the motivation to seek the drug, without the interference of the drug's own pharmacological effects, maintaining a high level of responding with relatively few drug infusions, reduced risk of self-administered overdose, and external validity to human populations where environmental context can provide a strong reinforcing effect for drug use.Extinction and reinstatement
Extinction involves the discontinuation of a particular reinforcer in response to operant behavior, such as replacing a reinforcing drug infusion with a saline vehicle. When the reinforcing element of the operant paradigm is no longer present, a gradual reduction in operant responses results in the eventual cessation or “extinction” of the operant behavior. Reinstatement is the restoration of operant behavior to acquire a reinforcer, often triggered by external events/cues or exposure to the original reinforcer itself. Reinstatement can be broken into a few broad categories: Drug-induced reinstatement: exposure to a reinforcing drug after the extinction of drug-seeking operant behavior can often reinstate drug-seeking, and can even occur when the new drug of exposure is different from the original reinforcer. This is thought to be strongly linked to drug sensitization Cue-induced reinstatement: environmental cues associated with drug administration can trigger drug reinstatement by acting as conditioned stimuli, even during drug abstinence 1. Environmental surroundings, as well as drug-associated behavior or actions, can function as environmental cues. 2. Stress-induced reinstatement: in many cases, a stressor can reinstate drug-seeking in a drug-abstinent animal. This can include (but is not limited to) acute stressors such as foot-shock or social defeat stress. In many cases, it appears that social stress can potentiate drug reinstatement just as strongly as exposure to the drug itselfApparatus
Significant findings
Self-administration studies have long been considered the “standard” in addiction research using both animal and human models. Conducting self-administration studies in animal models provides a much greater level of experimental flexibility than in humans because investigating the effects of novel pharmacological drug treatments poses significantly fewer ethical and practical barriers. In 1999, Pilla and colleagues published in Nature a study documenting the efficacy of a partial D3-agonist (BP-897) in reducing environmental cue-induced cocaine craving and vulnerability to relapse. An interesting aspect of this study was the use of second-order reinforcement schedules to identify a dissociation in the effects of BP-897 in that the drug inhibits cue-induced cocaine-seeking but has no primary reinforcement effect. This latter condition is important for any pharmacological agent to be used in the treatment of addiction—drugs used to treat addiction should be less reinforcing than the drug whose addiction they treat and optimally have no reinforcing effects.