Download - 6 Learning
-
8/6/2019 6 Learning
1/60
-
8/6/2019 6 Learning
2/60
Learning
Learning is a lasting change in behavior or mentalprocess as the result of an experience.
There are two important parts: a lasting changea simple reflexive reaction is not learning
learning regarding mental process is much harder toobserve and study.
-
8/6/2019 6 Learning
3/60
Life without Learning
Learning is more than school, books and tests.
Without learning our lives would simply be a seriesof reflexes and instincts.
We would not be able to communicate, we would have nomemory of our past or goals for the future.
-
8/6/2019 6 Learning
4/60
Learnings Effects on Behavior
In humans, learning has a much larger influence onbehavior than instincts.
Learning represents an evolutionary advance over instincts.
-
8/6/2019 6 Learning
5/60
S I M P L E A N D C O M P L E X L E A R N I N G
Types of Learning
-
8/6/2019 6 Learning
6/60
Simple Learning
Habituation: Learning not to respond to therepeated presentation of a stimulus. Ex: Emergency sirens in the city
How often do youlook when a caralarm goes off?
-
8/6/2019 6 Learning
7/60
Simple Learning
Mere Exposure Effect: A learned preference for stimuli towhich we have been previously exposed. Ex: A coach/parents voice
Which do you prefer?
Which did your parentsdrink when you were alittle kid?
-
8/6/2019 6 Learning
8/60
Complex Learning
Behavioral Learning: Forms of learning, suchas classicaland operant conditioningwhich can bedescribed in terms of stimuli and responses.
Classical conditioning is more simple learning, operantconditioning is more complex learning.
-
8/6/2019 6 Learning
9/60
Ivan Pavlov and Classical Conditioning
One of most famous people in the study of learning isIvan Pavlov.
Originally studying salivation and digestion, Pavlov
stumbled upon classical conditioning while he wasexperimenting on his dog.
Classical Conditioning: A form of learning in which a previouslyneutral stimulus (stimuli w/o reflex provoking power) acquires thepower to elicit the same innate reflex produced by another stimulus.
-
8/6/2019 6 Learning
10/60
Pavlovs Experiment
Ivan Pavlov
-
8/6/2019 6 Learning
11/60
Pavlovs Findings Explained
Pavlov discovered that a neutral stimulus, when paired witha natural reflex-producing stimulus, will begin to produce alearned response, even when it is presented by itself.
Neutral Stimulus: Any stimulus that produces noconditioned response prior to learning.
-
8/6/2019 6 Learning
12/60
Pavlovs Experiment
-
8/6/2019 6 Learning
13/60
Components of Conditioning
There are 5 main components of conditioning.Classical Conditioning always involves these parts.They are:
Neutral StimulusUnconditioned Stimulus (UCS)
Unconditioned Response (UCR)
Conditioned Stimulus (CS)
Conditioned Response (CR)
-
8/6/2019 6 Learning
14/60
Unconditioned Stimulus (UCS)
UCS: A stimulus thatautomatically-without conditioningor learning- provokes a reflexiveresponse.
In Pavlovs experiment, food wasused as the UCS because itproduced a salivation reflex.
Classical conditioning cannot happen without UCS. The onlybehaviors that can be classically conditioned are those that areproduced by unconditioned stimulus.
-
8/6/2019 6 Learning
15/60
Unconditioned Response (UCR)
UCR: A response resulting froman unconditioned stimuluswithout prior learning.
In Pavlovs experiment, the UCRwas the dog salivating when itstongue touched food.
Realize that the UCS-UCR connection involves nolearning or acquisition.
-
8/6/2019 6 Learning
16/60
From Unconditioned to Conditioned
During acquisition, a neutral stimulus is paired with theunconditioned stimulus.
After several trials the neutral stimulus will gradually begin to elicit the sameresponse as the UCS.
Acquisition: The learning stage during which a conditionedresponse comes to be elicited by the conditioned stimulus.
=
-
8/6/2019 6 Learning
17/60
Conditioned Stimulus
A CS is the originally neutral stimulus that gains thepower to cause the response.
In Pavlovs experiment, the bell/tone began to producethe same response that the food once did.
-
8/6/2019 6 Learning
18/60
Conditioned Response
A CR is a response elicited by a previously neutral stimulusthat has become associated with the unconditionedstimulus.
Although the response to the CS is essentially the same asthe response originally produced by the UCS, we now call ita conditioned response.
-
8/6/2019 6 Learning
19/60
Extinction
Extinction: The diminishing (or lessening) of alearned response, when an unconditioned stimulusdoes not follow a conditioned stimulus.
To acquire a CR, we repeatedly pair a neutral stimulus with theUCS. But, if we want to reverse this learning, we must weaken thestrength of the connection between the two stimuli.
It is important to realize that extinction does not mean completeelimination of a response.
-
8/6/2019 6 Learning
20/60
Spontaneous Recovery
Extinction merely suppresses the conditioned response,and the CR can reappear during spontaneous recovery.
Spontaneous Recovery: The response after a rest period ofan extinguished conditioned response. Spontaneous recovery is weaker than the original CR.
-
8/6/2019 6 Learning
21/60
Classical Conditioning
Strengthof CR
Pause
Acquisition(CS+UCS)
Extinction(CS alone)
Extinction(CS alone)
Spontaneousrecovery ofCR
-
8/6/2019 6 Learning
22/60
Reinforcement Procedures
What if we could not distinguish between stimuli that weresimilar? The bell ending class vs. fire alarm
The door bell vs. our cell phones
Discrimination: The ability to distinguish between twosimilar signals stimulus.
-
8/6/2019 6 Learning
23/60
-
8/6/2019 6 Learning
24/60
-
8/6/2019 6 Learning
25/60
Operant Conditioning
-
8/6/2019 6 Learning
26/60
-
8/6/2019 6 Learning
27/60
Operant Conditioning
An operant is an observable behavior that anorganism uses to operate in the environment.
Operant Conditioning: A form of learning inwhich the probability of a response is changed byits consequencesthat is, by the stimuli thatfollows the response.
-
8/6/2019 6 Learning
28/60
B.F. Skinner
B.F. Skinner became famousfor his ideas in behaviorismand his work with rats.
Law of Effect: The idea thatresponses that produced desirableresults would be learned, orstamped into the organism.
-
8/6/2019 6 Learning
29/60
B.F. Skinner and The Skinner Box
-
8/6/2019 6 Learning
30/60
Reinforcement
Areinforcer is a condition in which thepresentation or removal of a stimulus, that occursafter a response (behavior) and strengthens thatresponse, or makes it more likely to happen again
in the future.
Positive Reinforcement: A stimulus presentedafter a response that increases the probability ofthat response happening again.
Ex: Getting paid for good grades
-
8/6/2019 6 Learning
31/60
Negative Reinforcement
Negative Reinforcement: The removal of anunpleasant or averse stimulus that increases theprobability of that response happening again.
Ex: Taking Advil to get rid of a headache.
Ex: Putting on a seatbelt to make the annoying seatbeltbuzzer stop.
The word positive means add or apply;negative is used to mean subtract or remove.
-
8/6/2019 6 Learning
32/60
Reinforcement Schedules
Continuous Reinforcement: A reinforcementschedule under which all correct responses arereinforced.
This is a useful tactic early in the learning process.It also helps when shaping new behavior.
Shaping: A technique where new behavior is
produced by reinforcing responses that are similarto the desired response.
-
8/6/2019 6 Learning
33/60
Punishment
Apunishment is an averse/disliked stimuluswhich occurs after a behavior, and decreases theprobability it will occur again.
Positive Punishment: An undesirable event thatfollows a behavior: getting spanked after telling a lie.
+ Punishment - Punishment
-
8/6/2019 6 Learning
34/60
Punishment
Negative Punishment: When a desirable eventends or is taken away after a behavior.
Example: getting grounded from your cell phone afterfailing your progress report.
Think of a time-out (taking away time from a fun activity withthe hope that it will stop the unwanted behavior in the future.)
-
8/6/2019 6 Learning
35/60
The consequence
provides something
($, a spanking)
The consequence
takes something away
(removes headache,
timeout)
Positive
ReinforcementNegative
Reinforcement
The consequence
makes the behavior
more likely to happen
in the future.
Positive
PunishmentNegative
Punishment
The consequence
makes the behavior
less likely to happen in
the future.
Reinforcement/Punishment Matrix
-
8/6/2019 6 Learning
36/60
Reinforcement vs. Punishment
Unlike reinforcement, punishment must beadministered consistently. Intermittent punishmentis far less effective than punishment delivered afterevery undesired behavior.
In fact, not punishing every misbehavior can havethe effect of rewarding the behavior.
-
8/6/2019 6 Learning
37/60
Punishment vs. Negative Reinforcement
Punishment and negative reinforcement areused to produce opposite effects on behavior.
Punishment is used to decrease a behavior or reduce its
probability of reoccurring.
Negative reinforcement always increases a behaviorsprobability of happening in the future (by taking away anunwanted stimuli).
Remember, positive means adding something andnegative means removing something.
-
8/6/2019 6 Learning
38/60
Uses and Abuses of Punishment
Punishment often produces an immediate change inbehavior, which ironically reinforces the punisher.
However, punishment rarely works in the long run for four reasons:
1. The power of punishment to suppress behavior usually disappearswhen the threat of punishment is gone.
2. Punishment triggers escape or aggression
3. Punishment makes the learner apprehensive: inhibits learning.
4. Punishment is often applied unequally.
-
8/6/2019 6 Learning
39/60
-
8/6/2019 6 Learning
40/60
Reinforcement Schedules
Intermittent Reinforcement: A type ofreinforcement schedule by which some, but not all,correct responses are reinforced.
Intermittent reinforcement is the most effective wayto maintain a desired behavior that has already beenlearned.
-
8/6/2019 6 Learning
41/60
-
8/6/2019 6 Learning
42/60
Schedules of Intermittent Reinforcement
Interval schedule: rewards subjects after acertain time interval.
Ratio schedule: rewards subjects after a certainnumber of responses.
There are 4 types of intermittent reinforcement:
Fixed Interval Schedule (FI)
Variable Interval Schedule (VI)
Fixed Ratio Schedule (FR)
Variable Ratio Schedule (VR)
-
8/6/2019 6 Learning
43/60
Interval Schedules
Fixed Interval Schedule (FI): A schedule that a rewards a learner only for the first correct
response after some defined period of time.
Example: B.F. Skinner put rats in a box with a lever connected to a feeder. Itonly provided a reinforcement after 60 seconds. The rats quickly learned that itdidnt matter how early or often it pushed the lever, it had to wait a set amount oftime. As the set amount of time came to an end, the rats became more active inhitting the lever.
-
8/6/2019 6 Learning
44/60
Interval Schedules
Variable Interval Schedule (VI):
A reinforcement system that rewards a correctresponse after an unpredictable amount of time.
Example: A pop-quiz
-
8/6/2019 6 Learning
45/60
Ratio Schedules
Fixed Ratio Schedule (FR):
A reinforcement schedule that rewards a responseonly after a defined number of correct answers. Example: A used car salesman earning commission after every
5th car sold.
-
8/6/2019 6 Learning
46/60
Ratio Schedules
Variable Ratio Schedule (VR):
A reinforcement schedule that rewards anunpredictable number of correct responses. Example: Buying lottery tickets or slot machines
-
8/6/2019 6 Learning
47/60
Schedules of Reinforcement
Variable Interval
Number ofresponses
1000
750
500
250
0
10 20 30 40 50 60 70
Time (minutes)
Fixed Ratio
Variable Ratio
Fixed Interval
Steady responding
Rapid respondingnear time forreinforcement
80
Intermittent ReinforcementSchedules-
Skinners laboratory pigeons
produced these responsespatterns to each of four
reinforcement schedules
For people, as for pigeons,research linked to number ofresponses (ratio) produces ahigher response rate thanreinforcement linked to timeelapsed (interval).
-
8/6/2019 6 Learning
48/60
Primary and Secondary reinforcement
Primary reinforcement: something that is naturallyreinforcing: food, warmth, water
Secondary reinforcement: something you have learned is areward because it is paired with a primary reinforcement in thelong run: good grades.
-
8/6/2019 6 Learning
49/60
Two Important Theories
Token Economy: A therapeutic method based on operantconditioning that where individuals are rewarded withtokens, which act as a secondary reinforcer. The tokens canbe redeemed for a variety of rewards.
Premack Principle: The idea that a more preferred activitycan be used to reinforce a less-preferred activity.
Operant and Classical Conditioning
-
8/6/2019 6 Learning
50/60
Operant and Classical Conditioning
Classical Conditioning Operant Conditioning
Behavior is controlled by the stimulithat precedethe response (by theCS and the UCS).
Behavior is controlled byconsequences (rewards,punishments) that follow theresponse.
No reward or punishment is involved(although pleasant and averse
stimuli may be used).
Often involves rewards(reinforcement) and punishments.
Through conditioning, a newstimulus (CS) comes to produce theold (reflexive) behavior.
Through conditioning, a newstimulus (reinforcer) produces a newbehavior.
Extinction is produced bywithholding the UCS.
Extinction is produced bywithholding reinforcement.
Learner is passive (acts reflexively):Responses are involuntary. That isbehavior is elicitedby stimulation.
Learner is active: Responses arevoluntary. That is behavior isemittedby the organism.
-
8/6/2019 6 Learning
51/60
A Third Type of Learning
Sometimes we have flashes of insight when dealingwith a problem where we have been experiencingtrial and error.
This type of learning is called cognitive learning,which is explained as changes in mental processes,rather than as changes in behavior alone.
-
8/6/2019 6 Learning
52/60
Wolfgang Kohler and Sultan
Kohler believed that chimps could solve complexproblems by combining simpler behaviors they hadpreviously learned separately.
Kohler taught Sultan the chimp how to stack boxesto obtain bananas that were over his head and howto use a stick to obtain something that was out of
his reach. He taught Sultan these skills in separatesituations.
-
8/6/2019 6 Learning
53/60
Sultans Situation
When Sultan was put in a situation where the bananas werestill out of his reach after stacking the boxes, Sultan becamefrustrated. He threw the stick and kicked the wall beforesitting down.
Suddenly, he jumped up and dragged the boxes and stickunder the bananas. He then climbed up the boxes andwhacked the fruit down with the stick.
This suggested to Kohler that the animals were not
mindlessly using conditioned behavior, but were learningby reorganizing their perceptions of problems.
-
8/6/2019 6 Learning
54/60
Sultan the Chimp
-
8/6/2019 6 Learning
55/60
Cognitive Learning
Sultan was not the only animal to demonstratecognitive learning. When rats were put into a mazewith multiple routes to the reinforcer, the ratswould repeatedly attempt the shortest route.
If their preferred route was blocked, they wouldchose the next shortest route to the reward.
Cognition Map: A mental representation of a place.
-
8/6/2019 6 Learning
56/60
Latent Learning
In a similar study, rats were allowed to wanderaround a maze, without reinforcements, forseveral hours. It formerly was thought thatreinforcements were essential for learning.
However, the rats later were able to negotiate themaze for food more quickly than rats that hadnever seen the maze before.
Latent learning: Learning that occurs but is notapparent until the learner has an incentive todemonstrate it.
-
8/6/2019 6 Learning
57/60
Latent Learning
-
8/6/2019 6 Learning
58/60
Observational Learning
You can think of observational learning as anextension of operant conditioning, in which weobserve someone else getting rewarded but act asthought we had also received the reward.
Observational learning: Learning in which newresponses are acquired after others behavior and the
consequences of their behavior are observed.
-
8/6/2019 6 Learning
59/60
Observational Learning
After observing adults seeming to enjoy punching,hitting and kicking an inflated doll called Bobo, thechildren later showed similar aggressive behaviortoward the doll.
Significantly, these children were more aggressivethan those in a control condition who did not
witness the adults violence.
-
8/6/2019 6 Learning
60/60