+ All Categories
Home > Technology > Contingency and Continuity

Contingency and Continuity

Date post: 25-Apr-2015
Category:
Upload: david-a-townsend
View: 1,129 times
Download: 2 times
Share this document with a friend
Description:
 
56
Operant Conditioning: Causal Factors and Explanations Contingency and Continuity
Transcript
Page 1: Contingency and Continuity

Operant Conditioning:

Causal Factors and Explanations

Contingency and Continuity

Page 2: Contingency and Continuity

Conditioning and Learning

Conditioning is the process by which an activity originates or is changed through reacting to an encountered situation provided that the change in activity can not be explained on the basis or native tendencies, maturation or temporary states. Hilgard (1956)

Conditioning is the learning of relations among events so as to allow the organism to represent its environment. Rescorla (1988)

S-R

S-S

Page 3: Contingency and Continuity

WHAT PRODUCES CONDITIONING: CONTIGUITY OR CONTINGENCY? Power of reinforcement to shape and

sustain operant behavior is pervasive. So, too, is its potential as a tool in a

wide range of practical applications. But, what is it about response-reinforcer

relation that produces conditioning?

Page 4: Contingency and Continuity

Operant Conditioning

S+ R O*

Page 5: Contingency and Continuity

WHAT PRODUCES CONDITIONING: CONTIGUITY OR CONTINGENCY? Two answers have been offered: Contiguity

– Temporal proximity between response and reinforcer

Contingency– Probabilistic relation with reinforcement of

responding and not responding

temporal relationship; temporal contiguity refers to the delivery of the reinforcer immediately after the

response

causal relationship; response-reinforcer contingency refers to the extent to which the response is necessaryand sufficient for the occurrence of the reinforcer

Page 6: Contingency and Continuity

Evidence for Contiguity

Generally, as delay between response and reinforcer increases, rate of operant responding decreases.

Page 7: Contingency and Continuity

Evidence for Contiguity

Generally, as delay between response and reinforcer increases, rate of operant responding decreases.

Page 8: Contingency and Continuity

Evidence for Contiguity

Generally, as delay between response and reinforcer increases, rate of operant responding decreases.

Temporal contiguity is thus necessary for conditioning to occur.

But, is temporal contiguity also sufficient?

Can’t tell because most studies require responses to produce reinforcers.

Page 9: Contingency and Continuity

Evidence for Contiguity Skinner’s 1948

superstition project : Studied 8 hungry

pigeons. Food given every 15 sec

regardless of pigeon’s behavior.

6/8 pigeons performed idiosyncratic patterns of unnecessary behavior.

Responding rose as time to food approached.

Why did all of this happen?

Page 10: Contingency and Continuity

Evidence for Contiguity

Food happened to follow something each pigeon was doing.

Different behaviors were strengthened for different pigeons.

Higher the rate of response, the more likely food would again follow response.

Responding rose as time to food neared because P of response-food pairing rose the longer the time since last food.

Page 11: Contingency and Continuity

Evidence for Contiguity

Skinner thus concluded that necessary and sufficient condition for operant conditioning was that a reinforcer closely follow a response.

Why is response-reinforcer contingency so effective?

It guarantees response-reinforcer contiguity.

Page 12: Contingency and Continuity

Evidence for Contiguity

Skinner’s results and conclusions have been questioned.

His results may be difficult to replicate. His conclusions may not be general. But, beyond superstition experiment,

there may be good evidence to support importance of response-reinforcer contiguity.

Page 13: Contingency and Continuity

What leads to conditioning?

Contiguity– Stimuli that are close

to one another in time and in space become associated

Co-occurrence– Proximity critical

Contingency– When one stimulus

depends on the other, they will become associated

Information – Predictive value

critical

Page 14: Contingency and Continuity

Evidence for Contiguity

Thomas (1981) study on contiguity-promoting schedules.

P(Food|Press) = P(Food|No Press)

Page 15: Contingency and Continuity

Trial 1 Trial 2

Trial 1 Trial 2

20s

No Response:

Subject Responds:

RewardS*

RewardS*

RewardS*

RewardS*

Bar pressR

Bar pressR

Page 16: Contingency and Continuity

Thomas Schedule

p p pp ----/----/----/---- f f f f

Page 17: Contingency and Continuity

Evidence for Contiguity

Thomas (1981) study on contiguity-promoting schedules.

P(Food|Press) = P(Food|No Press) No press-food contingency. But, response-food contiguity was

promoted by novel schedule. So too was lever pressing. Rats

increased lever pressing

Page 18: Contingency and Continuity

Evidence for Contiguity

Extra wrinkle of Thomas (1981) study: P(Food|Press) < P(Food|No Press) Thus, negative press-food contingency.

Page 19: Contingency and Continuity

Trial 1 Trial 2

Trial 1 Trial 2

20s

No Response:

Subject Responds:

RewardS*

NOS*

RewardS*

RewardS*

Bar pressR

Bar pressR

Rewarded response causes next 20s trial to be unrewarded

Trial 4Trial 3

Page 20: Contingency and Continuity

Second Thomas Schedule

p p pp ----/----/----/---- f f f

Page 21: Contingency and Continuity

Evidence for Contiguity

Extra wrinkle of Thomas (1981) study: P(Food|Press) < P(Food|No Press) Thus, negative press-food contingency. Response-food contiguity was still promoted

by second schedule. So too was lever pressing. Rats increased

lever pressing Power of contiguity is very strong; can even

override effects of contingency.

Page 22: Contingency and Continuity

What leads to conditioning?

Contiguity– Stimuli that are close

to one another in time and in space become associated

Co-occurrence– Proximity critical

Contingency– When one stimulus

depends on the other, they will become associated

Information – Predictive value

critical

Page 23: Contingency and Continuity

Operant Conditioning: Contiguity

S+ R O*

O* stamps in S+-R relationship

Pavlov

Hull

Skinner

Page 24: Contingency and Continuity

CONTINGENCY LEARNING

Attempts to assess contingency learning in operant conditioning parallel studies in Pavlovian conditioning.

Operant studies suggest that organisms can distinguish dependence from independence between response and reinforcer.

Cause and effect

Page 25: Contingency and Continuity

What leads to conditioning?

Contiguity– Stimuli that are close

to one another in time and in space become associated

Co-occurrence– Proximity critical

Contingency– When one stimulus

depends on the other, they will become associated

Information – Predictive value

critical

Page 26: Contingency and Continuity

CONTINGENCY LEARNING Figure 8.4

Page 27: Contingency and Continuity
Page 28: Contingency and Continuity

Contiguity without Contingency

10 20

20 40

airplane

no plane

no

S* 2 S* 2 a b

c d

S+ 1

No S+ 1

bird andplane are paired

A quick test for contingency

a·d > c·bthen positive

a·d = c·bzero contingency

a·d < c·bthen negative

no bird bird

prob.(birdplane) = .33prob.(birdno plane) = .33

10/30 20/60

Page 29: Contingency and Continuity

You can have a positive contingency even whenpairing is the least frequent possibility

Example: can you learn that

and “cat” are associated?

“cat” no “cat”

100 900 1,000

200 9,800 10,000

see

no

p (“cat” ) = .10

p (“cat”no ) = .02

hear

positive contingency

Learning:

Seeking cause and effect

relationships

Page 30: Contingency and Continuity

CONTINGENCY LEARNING

Head turn, mobile, infants given positive contingency procedure (Watson, 1967):– Infants’ head turning increased, plus they

smiled when mobile moved Infants put on zero contingency

procedure:– Infants’ head turning did not increase, plus

they stopped smiling when mobile moved

Apple martinis

Page 31: Contingency and Continuity

Carolyn Rovee-Collier

445-3364

Page 32: Contingency and Continuity

CONTINGENCY LEARNING

Infants discriminate response-dependent from response-independent reinforcement: shown by head turning.

Infants differentially enjoy response-dependent and response-independent reinforcers: shown by smiling and cooing.

Both cognition and affect may be changed by control by consequence.

Cause and effect

Page 33: Contingency and Continuity

Learned Helplessness (Seligman)

Page 34: Contingency and Continuity

panel

Learned Helplessness (Seligman)

Phase I - Learning to Escape

Control Dogs Yoked Dogs

Shock

•A long lasting shock is given to both groups

every once in a while

•Control dogs can turn shock off by pushing a

panel

•Yoked dogs’ shock turns off too, when control

dog pushes panel

•Yoked dogs can do nothing themselves to escape

shock

Page 35: Contingency and Continuity

Contiguity or Contingency?

Spot Periodically shocked Can terminate shock

by pressing lever with his nose

Lassie Periodically shocked Has no control over

shocks, but when Spot’s shock is terminated, so is Lassie’s

Page 36: Contingency and Continuity

Phase 2 - Avoidance Learning

•shock delivered to one side of box•if dog jumps hurdle to other side there is no shock

Control dogs learn to avoid shockYoked dogs don’t

Yoked dogs have learned that they can’t stop shockThey have learned to be helpless

hurdle

Page 37: Contingency and Continuity

Learned Helplessness

Yoked dog seems to have learned that its behavior does not matter:– It not only fails to learn– It stops reacting to shock

Phenomenon of learned helplessness strongly suggests that organisms can discriminate response-dependent from response-independent events.

Page 38: Contingency and Continuity

Learned Helplessness

Animals must learn to jump barrier to avoid shock

Results– Spot learns, Lassie yelps

but eventually becomes passive and accepts shocks

Contingency– Spot learns his actions

matter– Lassie learned that it was

helpless Contiguity

– Spot learned to press lever– Lassie learned to act

passively

Page 39: Contingency and Continuity

Seligman’s Learned Helplessness Study

Two groups of dogs are exposed to shock– control group could escape shock– “no escape” group could NOT escape

shock Later, when escape was possible, “no

escape” dogs didn’t even try Learned that they had NO CONTROL

Page 40: Contingency and Continuity

OPERANT CONDITIONING: WHAT IS LEARNED? In any operant conditioning study, three

events need to be considered: – Response (R)– Reinforcer or punisher it produces (O*) – Stimulus situation in which response occurs (S) – Three occur in S-R-S* sequence

What associations among three elements are formed when animal learns to make operant response?

Page 41: Contingency and Continuity

OPERANT CONDITIONING: WHAT IS LEARNED? R-O* association Seems to require foresight: acting in

accord with future consequences. Thorndike famously denied that

animals know what consequence of their behavior will be.

Law of effect thus emphasized past consequences.

Page 42: Contingency and Continuity

Operant Conditioning

S+ (R O*)

Page 43: Contingency and Continuity

OPERANT CONDITIONING: WHAT IS LEARNED? S+-R association Thorndike’s idea Situation evokes behavior (S-R). Reinforcers strengthen S-R bond. S+ becomes more likely to evoke R.

Page 44: Contingency and Continuity

Operant Conditioning: Contiguity

S+ R O*

O* stamps in S+-R relationship

Pavlov

Hull

Skinner

Page 45: Contingency and Continuity

OPERANT CONDITIONING: WHAT IS LEARNED? Two-process theory:

– S-R association (operant)– S-O* association (Pavlovian)

Sight of lever not only triggers lever pressing, but it also makes animal “think” about upcoming food.

Anticipation of reinforcer motivates operant response.

Page 46: Contingency and Continuity

OPERANT CONDITIONING: R-S* Learning Strongest evidence comes from studies

using devaluation procedure (Colwill & Rescorla, 1985).

Chain PullSugar Water Lever PressFood Pellet Food PelletIllness (Devaluation) Choice: chain pull versus lever press Rats pull chain much more than press

lever.

Page 47: Contingency and Continuity

R-O* association

Colwill & Rescorla (1985)Training Devaluation Test

R1 O1

R2 O2

O1 LiCL

O2 nothing

R1 and R2

1

2

3

4

5

6

7

Meanresp/min

R1 -outcomewas devalued

Time

R2 -outcomenot devalued

Page 48: Contingency and Continuity

OPERANT CONDITIONING: R-O* Learning Association of food with illness does not

change stimulus aspects of situation that might generate responses.

Lever press does not occur because it is associated with chamber (S-R), but because it is associated with reinforcer (R-S*).

When value of reinforcer is eliminated, so too is impetus for response.

Page 49: Contingency and Continuity

OPERANT CONDITIONING: R-O* Learning Operant conditioning involves learning to

expect responses to produce reward. Rats not only expect reward, but a

particular kind of reward. Devaluation procedure could not work

unless rats had specifically remembered that one response produced food pellets and other produced sugar water.

Page 50: Contingency and Continuity

Operant Conditioning

(S+ ) R (O*)

Page 51: Contingency and Continuity

OPERANT CONDITIONING: S-O* Learning Rats trained to panel press. A light or noise was always present. S1 = sugar water and S2 = food pellets. Lever press = sugar water and chain pull =

food pellets. S1 increased lever pressing, but not chain

pulling. S2 increased chain pulling, but not lever

pressing.

Page 52: Contingency and Continuity

S-O* Learning

Page 53: Contingency and Continuity

OPERANT CONDITIONING: S-O* Learning For rat to show these selective

increases in responding, it must have learned which stimulus was associated with which reward.

Therefore, this study provides evidence of S-O* associations in operant conditioning (Colwill & Rescorla, 1988).

Page 54: Contingency and Continuity

S-O association

Colwill & Rescorla (1988)

Sd training Response training Test

S1 R1 O1

S2 R2 O2

R3 O1

R4 O2

S1: R3 vs R4

S2: R3 vs R4

2

4

6

10

Meanresp/min Different

outcome

Trials

Sameoutcome

8

Page 55: Contingency and Continuity

OPERANT CONDITIONING: S-R Learning Devaluation studies find a reduction in

response that leads to devalued reward. But, response is rarely eliminated. Residual responding may represent

behavior triggered by stimulus situation in which responding was rewarded.

Page 56: Contingency and Continuity

OPERANT CONDITIONING: WHAT IS LEARNED? Summary statement: Research suggests organisms learn

associations between response and reinforcer (R-O*), environmental stimuli and reinforcer (S-O*), and stimuli and response (S-R).

The “simple” process of operant conditioning is not so simple after all.


Recommended