Handbook of Psychology, Volume 4: Experimental Psychology


Conditioning and Learning


the presence of a different stimulus. Subsequently, one of
the outcomes was devalued by pairing with LiCl. The rats
were then given a test in which they could perform either re-
sponse in the presence of each of the stimuli. The result was
that responding was selectively suppressed; the response that
led to the devalued outcome in the presence of the particular
stimulus occurred less frequently. This result cannot be ex-
plained in terms of binary associations because individual
stimuli and responses were paired equally often with both
outcomes. It suggests that the rats had formed hierarchical
associations, which encoded each three-term contingency
[i.e., S–(R-O)]. Thus, the role of instrumental discrimina-
tive stimuli may be similar to occasion setters in Pavlovian
conditioning (Davidson, Aparicio, & Rescorla, 1988).


Incentive Learning


Associations between stimuli, responses, and outcomes may
comprise part of what is learned in instrumental condition-
ing, but clearly the organism must also be motivated to per-
form the response. Although motivation was an important
topic for the neobehaviorists of the 1930s and 1940s (e.g.,
Hull, 1943), the shift towards more cognitively oriented ex-
planations of behavior in the 1960s led to a relative neglect
of motivation. More recently, however, Dickinson and col-
leagues (see Dickinson & Balleine, 1994, for review) have
provided evidence that in some circumstances, subjects must
learn the incentive properties of outcomes in instrumental
conditioning.
For example, Balleine (1992) trained sated rats to press
a lever for a novel food item. Half of the rats were later
exposed to the novel food while hungry. Subsequently, an
extinction test was conducted in which half of the rats were
hungry (thus generating four groups, depending on whether
the rats had been preexposed to the novel food while hungry,
and whether they were hungry during the extinction test).
The results were that the rats given preexposure to the novel
food item while hungry and tested in a deprived state re-
sponded at the highest rate during the extinction test.
This suggests that exposure to the novel food while in the de-
prived state contributed to that food’s serving as an effective
reinforcer. However, Dickinson, Balleine, Watt, Gonzalez,
and Boakes (1995) found that the magnitude of the incentive
learning effect diminished when subjects received extended
instrumental training prior to test. Thus, motivational control
of behavior may change, depending on experience with the
instrumental contingency.
In summary, efforts to elucidate the nature of associative
structures underlying instrumental conditioning have found


evidence for all the possible binary associations (i.e., stimulus-
response, response-outcome, and stimulus-outcome), as well
as for a hierarchical association involving all three elements
(stimulus: response-outcome). Additionally, in some situations,
whether an outcome has incentive value is apparently
learned. From this perspective, it seems reasonable to assume
that these associations are acquired in the same fashion as
stimulus-outcome associations in Pavlovian conditioning. In
this view, instrumental conditioning may be considered an
elaboration of fundamental associative processes.

Functional Analyses of Instrumental Conditioning

A second approach to instrumental conditioning is derived
from Skinner’s (1938) interpretation of the law of effect.
Rather than construe the law literally in terms of S-R connec-
tions, Skinner interpreted the law of effect to mean only that
response strength increases with reinforcement and decreases
with punishment. Exactly how response strength could be
measured thus became a major concern. Skinner (1938) de-
veloped an apparatus (i.e., experimental chambers called
Skinner boxes and cumulative recorders) that allowed the
passage of time as well as lever presses and reward deliveries
to be recorded. This allowed a shift in the dependent variable
from the probability of a response’s occurring on a particular
trial to the rate of that response over a sustained period of
time. Such procedures are sometimes called free-operant (as
opposed to discrete-trial). The ability to study intensively the
behavior of individual organisms has led researchers in the
Skinnerian tradition to emphasize molar rather than molecu-
lar measures of responding (i.e., response rate aggregated
over several sessions), to examine responding at stability
(i.e., asymptote) rather than during acquisition, and to use a
relatively small number of subjects in their research designs
(Sidman, 1960). This research tradition, often called the
experimental analysis of behavior, has led to an emphasis on
various formal arrangements for instrumental conditioning
(e.g., reinforcement schedules) and to the export of technologies
for effective behavior modification (e.g., Sulzer-Azaroff &
Mayer, 1991).
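The shift in dependent variable described above, from the probability of a response on a discrete trial to response rate over a sustained period, can be sketched in code. The timestamps below are hypothetical example data, not values from any study cited in the text:

```python
# Sketch of the free-operant dependent variable: given timestamped lever
# presses from a session (hypothetical data, in seconds), compute a molar
# measure -- overall response rate -- rather than a per-trial probability.

def response_rate(press_times, session_length_s):
    """Responses per minute over the whole session (a molar measure)."""
    return len(press_times) / (session_length_s / 60.0)

if __name__ == "__main__":
    presses = [2.1, 5.4, 9.0, 12.3, 30.8, 41.5]  # hypothetical timestamps
    # 6 presses in a 120-s session -> 3.0 responses per minute
    print(f"{response_rate(presses, 120):.1f} responses/min")
```

Aggregating such rates over several sessions gives the molar measures emphasized in the Skinnerian tradition.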

Choice and the Matching Law

Researchers have attempted to quantify the law of effect by
articulating the functional relationships between behavior
(measured as response rate) and parameters of reinforcement
(specifically, the rate, magnitude, delay, and probability of
reinforcement). The goal has been to obtain a quantitative
expression that summarizes these relationships and that is
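One widely cited quantitative expression of this kind is the matching law (Herrnstein, 1961), under which the relative rate of responding on an alternative matches the relative rate of reinforcement it produces. A minimal sketch, using made-up reinforcement rates rather than data from the studies discussed here:

```python
# Illustrative sketch of the matching law (Herrnstein, 1961) for two-choice
# ("concurrent") schedules: relative response rate is predicted to match
# relative reinforcement rate.  The rates below are hypothetical examples.

def matching_law_share(r1, r2):
    """Predicted proportion of responses allocated to alternative 1,
    given reinforcement rates r1 and r2 (e.g., reinforcers per hour)."""
    return r1 / (r1 + r2)

if __name__ == "__main__":
    # If alternative 1 yields 120 reinforcers/hr and alternative 2 yields
    # 60/hr, the law predicts a 2:1 response allocation (share = 2/3).
    share = matching_law_share(120, 60)
    print(f"Predicted share on richer alternative: {share:.2f}")
```

The appeal of such an expression is exactly the goal stated above: a single functional relationship between response rate and a parameter of reinforcement that holds across procedures.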