Operant conditioning is a form of associative learning and
results from the contingency established between a behavioural
response and the presentation of a reinforcing stimulus
(Mackintosh, 1974). The response–reinforcer contingency can
be defined as the probability of receiving the reinforcement
for performing the specific behaviour compared with the
probability of receiving the reinforcing stimulus in the absence
of the behavioural response. In operant conditioning, the
frequency of emitting the behavioural responses can be
increased or decreased as a result of applying positive or
negative reinforcing stimuli, respectively (Domjan and
Burkhard, 1993). For optimal conditioning, it is important that
the reinforcing stimulus not be presented when the subject is
not performing the particular behaviour. Operant conditioning,
therefore, results in a specific association between a
behavioural response and an external stimulus.
Different schedules of reinforcement are used in studies of
operant conditioning and memory of the association. A
schedule of continuous reinforcement (CR) involves 100 %
contingency between the behaviour and the reinforcement; that
is, reinforcement is presented every time the animal performs
the behaviour. Partial reinforcement (PR), however, refers to
any schedule in which there is less than 100 % contingency so
that there are instances when the animal’s behaviour is not
reinforced. In some operant learning experiments, PR may
interfere with the initial acquisition of the operant response,
especially when a negative stimulus is used as the reinforcing
stimulus; in others, PR has been found to lead eventually to
superior performance (e.g. Weinstock, 1958).
If, following conditioning, animals receive extinction trials
(in which there is no reinforcement), the acquired association
is lost, resulting in a behavioural phenotype that resembles
the naïve state (Pavlov, 1927). The most important variable
determining the magnitude of the behavioural effects of an
extinction procedure is the schedule of reinforcement used in
the acquisition phase of learning (Domjan and Burkhard,
1993). A PR-induced behaviour is more resistant to extinction
than a CR-induced behaviour. This phenomenon has been
termed the partial reinforcement extinction effect (PREE) and
was first described in the work of both Skinner and Humphreys
in the late 1930s (Skinner, 1938; Humphreys, 1939). How can
partial, but not continuous, reinforcement offer resistance
to extinction? One suggestion proposed by Amsel (1972) is
that a ‘disruptive process’, based on non-reward, emerges in
partial reinforcement acquisition. This disruptive process
does not occur in ordinary CR training, so there is no
‘counterconditioning’; extinction is therefore rapid after CR
training. This has sometimes been referred to as the ‘frustration
theory’. Another possibility is that, if the subject does not
receive reinforcement after each response during training, it
may not immediately ‘notice’ when reinforcement ceases, as
in extinction training. The change in reinforcement conditions
is more dramatic and noticeable if reinforcement ceases after
continuous reinforcement. This particular explanation of the
PREE is called the discrimination hypothesis (Domjan and
Burkhard, 1993) and is somewhat similar to the Pearce–Hall
model in that a PR schedule maintains attention because trial
outcomes are always ‘surprising’ (Bouton and Sunsay, 2001).
Operant conditioning protocols have been used in both
vertebrates (Chen and Wolpaw, 1995; Feng-Chen and Wolpaw,
1171 The Journal of Experimental Biology 205, 1171–1178 (2002)
Printed in Great Britain © The Company of Biologists Limited 2002
JEB3885
A continuous schedule of reinforcement (CR) in an
operant conditioning procedure results in the acquisition
of associative learning and the formation of long-term
memory. A 50 % partial reinforcement (PR) schedule does
not result in learning. The sequence of PR–CR training
has different and significant effects on memory retention
and resistance to extinction. A CR/PR schedule results in a
longer-lasting memory than a PR/CR schedule. Moreover,
the memory produced by the CR/PR schedule is resistant
to extinction training. In contrast, extinction occurs
following the PR/CR schedule.
Key words: operant conditioning, partial reinforcement, extinction,
long-term memory, Lymnaea stagnalis.
Summary
Introduction
The effects of continuous versus partial reinforcement schedules on associative
learning, memory and extinction in Lymnaea stagnalis
Susan Sangha*, Chloe McComb*, Andi Scheibenstock, Christine Johannes and Ken Lukowiak
†
Department of Physiology and Biophysics, University of Calgary, Faculty of Medicine, 3330 Hospital Drive NW,
Calgary, Alberta, Canada T2N 4N1
*Contributed equally to this study
†
Author for correspondence (e-mail: lukowiak@acs.ucalgary.ca)
Accepted 1 February 2002