+ All Categories
Home > Documents > Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice...

Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice...

Date post: 21-Jan-2016
Category:
Upload: susan-ellis
View: 220 times
Download: 0 times
Share this document with a friend
37
Instrumental Conditioning II
Transcript
Page 1: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Instrumental Conditioning II

Page 2: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Delay of Reinforcement

StartDelayChoice

Correct

Incorrect

Grice (1948)

Goal

Reward or No Reward

Page 3: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Grice (1948) Results

2030405060708090

10025 100

175

250

325

400

475

550

625

700

Trials

Per

cen

t C

orr

ect

0s

5s

2s1.2s

0.5s

10s

Page 4: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Overcoming the effects of delay

• Secondary reinforcers

• “Marking” procedure

Page 5: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Lieberman, McIntosh & Thomas (1979)

Page 6: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Reinforcement Punishment

Positive contingency

Negative contingency

Chocolate Bar Electric Shock

Excused from Chores

No TV privileges

Effect on Rate Behavior

Page 7: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.
Page 8: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.
Page 9: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.
Page 10: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Professor Drew

Page 11: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Anticipatory Contrast - Crespi (1942)

00.5

11.5

22.5

33.5

44.5

2 4 6 8 10 12 14 16 18 20 2 4 6 8

Trials

Run

ning

Spe

ed (f

t/se

c)

256-16 Pellets16-16 Pellets1 - 16 Pellets

Rats run down maze to find food pellets in goal arm.

Page 12: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

What is a reinforcer?

Operational Definition (behaviorists): That which increases the probability of the response that preceded it.

Thorndike: A stimulus that produces a “satisfying state of affairs”

Page 13: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Drive Reduction Theory

Amt of H2O in body

Compare with Set Point

Seek water/ don’t seek water

drives

Page 14: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Drive Reduction Considered: Are reinforcers necessary for survival?

– Eating to excess

– Drugs of Abuse

– “Pleasure centers” of the brain

Page 15: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.
Page 16: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Behavioral Regulation View: The Premack Principle

• Behaviors are reinforcing, not stimuli

• To predict what will be reinforcing, observe the baseline frequency of different behaviors

• Highly probable behaviors will reinforce less probable behaviors

Page 17: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Premack Revised: The Response Deprivation Hypothesis

• Low frequency behaviors can reinforce high frequency behaviors (and vice versa)

• All behaviors have a preferred frequency = the behavioral bliss point

• Deprivation below that frequency is aversive, and organisms will work to remedy this

Timberlake & Allison (1974)

Page 18: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Response deprivation hypothesis

.25 .5 .75

The ice cream scale (in pints)

1.0 1.25 1.5 1.75 2.0 2.25 2.5

Bliss point

(1.0 pints/night)

Will work to avoid ice creamWill work to obtain

Page 19: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Contiguity versus Contingency in operant conditioning

Page 20: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Degraded Contingency Effect

= bar press = food

Perfect contingency

Strong Responding

Degraded contingency

Weak Responding

Page 21: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

G.V. Thomas (1983)

Contiguity pitted against contingency

“Free” reinforcers given every 20s

Lever press advances delivery of pellet, but cancels pellet for next 20-s interval

So if you press at second 2, you get a pellet immediately, but you get no pellet during seconds 3-20 and 21-40.

20s 40s 60s

Page 22: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

G.V. Thomas (1983)

Contiguity pitted against contingency

So if you press at second 2, you get a pellet immediately, but you get no pellet during seconds 3-20 and 21-40.

20s 40s 60s

Lever press here

Lose this pellet

Page 23: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

“Superstitious Behavior”

• Suggested that temporal contiguity more important than contingency

• 15-s FT, no response requirement

• “adventitious reinforcement”

“In 6 out of 8 cases the resulting responses were so clearly defined that two observers could agree perfectly in counting instances. One bird was conditioned to turn counter-clockwise about the cage, making 2 or 3 turns between reinforcements. Another repeatedly thrust its head into one of the upper corners of the cage….”

Page 24: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Orienting toward feeder

Pecking near feeder

Moving along wall

¼ turn

Page 25: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

“Misbehavior” and the limits of operant conditioning

Page 26: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Limits of Operant Conditioning

• Some behaviors can’t be conditioned– Yawning– Scratching

• Belongingness– Presentation of a female won’t reinforce biting

• “Misbehavior”

Page 27: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Marian Breland Bailey – How to train a chicken

Page 28: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

The famous dancing chicken

Page 29: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

What is learned in operant conditioning?

Page 30: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

S R

What is learned?

Edwin Guthrie: mere contiguity of a stimulus and a behavior stamps in that S-R; reinforcement is not necessary

Page 31: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

S R

What is learned?

Thorndike:Reinforcement “stamps in” this connection

Page 32: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

S R

O

What is learned?

?

Page 33: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

S R O

2-Process Theory

operant

Pavlovian

Page 34: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

S R

CR

2-Process Theory

operant

Pavlovian

Page 35: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Evidence for 2-process theoryPavlovian-Instrumental Transfer

Phase 1 Phase 2 Test

LeverFood LightFood Light: #Presses?No Light: #Presses?

# Presses

Light No CS

The presence of the CS intensifies operant responding

Page 36: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

S R O?

?

What is learned?

Does the Pavlovian S-O association activate a vague emotional state or a specific mental representation of the outcome?

Page 37: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward.

Specific Outcome RepresentationsTrapold

Phase 1 Phase 2 Test

(operant) (classical)

R LeverPellet TonePellet Tone:Left? Right?

L LeverSucrose LightSucrose Light:Left? Right?

# Presses

Light Noise

Left

Right


Recommended