Chapter 6 and 7 Psych Reviewer
Psych11 Reviewer
Chapter 6 - Learning
Classical Conditioning (by Ivan Pavlov)- an originally neutral stimulus is repeatedly paired with a
stimulus that naturally elicits a response. With repeated pairings, the neutral stimulus begins to elicit a
similar or even identical response.
Basic Stages/Processes in Classical Conditioning:
1. ACQUISITION
Before Conditioning- the unconditioned stimulus (US) elicits the unconditioned response
(UR), but the neutral stimulus does not
During Conditioning- the neutral stimulus is paired with the US
After Conditioning- the conditioned stimulus (CS) elicits the conditioned response (CR) even without the US
*Little Albert experiment in 1920 by John B. Watson and Rosalie Rayner, involving a rat and a
loud noise to evoke fear
2. EXTINCTION- weakening conditioned responses
If the CS occurs repeatedly without the US, the response gradually weakens and is
eventually eliminated
3. SPONTANEOUS RECOVERY- recovering conditioned responses
The CS-US association is not permanently destroyed in an extinction procedure, only actively
inhibited by the organism, so the extinguished CR can reappear after a rest period
4. Higher Order Conditioning- when an established CS (CS1) is paired with a new neutral stimulus
(CS2), the new stimulus comes to elicit the CR
Secondary conditional reflex- happens when a new CS bonds with an old CS
5. Generalization and Discrimination
Stimulus generalization- CRs occur in the face of similar stimuli
Stimulus discrimination- the learned tendency to respond only to the stimulus used in training
6. Contiguity and Predictability
Important factors in conditioning:
Contiguity (closeness in time between CS and US)
Frequency of pairings
Predictability of CS
Blocking- an established CS prevents conditioning to a new stimulus presented together with it,
because the new stimulus offers no new or useful information about the US
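The acquisition and extinction stages described above can be illustrated with a small simulation. This is only a sketch under stated assumptions: the update rule (associative strength moves a fixed fraction of the way toward a target on every trial), the names update_strength and run_trials, and the learning rate of 0.3 are illustrative choices and do not come from this reviewer. The sketch also treats extinction as simple weakening, so it does not capture spontaneous recovery, where the association is inhibited rather than erased.

```python
def update_strength(strength, us_present, learning_rate=0.3):
    """Move CS-US associative strength toward 1.0 when the US follows the CS,
    and toward 0.0 when the CS appears alone (assumed linear update rule)."""
    target = 1.0 if us_present else 0.0
    return strength + learning_rate * (target - strength)


def run_trials(strength, n_trials, us_present, learning_rate=0.3):
    """Return the associative strength after each of n_trials trials."""
    history = []
    for _ in range(n_trials):
        strength = update_strength(strength, us_present, learning_rate)
        history.append(strength)
    return history


# Acquisition: the CS is paired with the US on every trial.
acquisition = run_trials(0.0, 10, us_present=True)

# Extinction: the CS is then presented repeatedly without the US.
extinction = run_trials(acquisition[-1], 10, us_present=False)

for trial, value in enumerate(acquisition + extinction, start=1):
    phase = "acquisition" if trial <= 10 else "extinction"
    print(f"trial {trial:2d} ({phase}): strength = {value:.3f}")
```

Strength rises quickly and then levels off during acquisition, and declines gradually during extinction, which matches the verbal description of the two stages.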
Operant Conditioning
Thorndike’s Law of Effect (1898)
- By Edward Thorndike, studied animal intelligence
- Basis of operant conditioning
- The greater the satisfaction or discomfort generated by a response, the greater the strengthening
or weakening of the bond
The Research of B.F. Skinner
*Project Pigeon- pigeons were trained to guide missiles to their targets
- Operant Conditioning (OC) – behavior is strengthened through reinforcement
- No reinforcer will occur until the subject makes the required response
- Instrumental conditioning- subject is instrumental to obtaining the reinforcer; organism has an
active role
The three-term contingency
- A contingency rule states that some event B will occur if and only if event A occurs
- In OC, the reinforcer occurs only if the response occurs
Components:
1. Discriminative stimulus (context or situation in which a response occurs)
2. Response
3. Stimulus reinforcer
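The three components above can be read as a simple rule. The sketch below is one hypothetical way to write it; the function name reinforcer_delivered and the example of a lit lever box are assumptions made for illustration, not part of the reviewer.

```python
def reinforcer_delivered(stimulus_present, response_made):
    """Three-term contingency: in the presence of the discriminative stimulus,
    the reinforcer is delivered if and only if the response is made."""
    return stimulus_present and response_made


# Hypothetical trials: (light on?, lever pressed?)
trials = [(True, True), (True, False), (False, True), (False, False)]
outcomes = [reinforcer_delivered(light, press) for light, press in trials]

for (light, press), outcome in zip(trials, outcomes):
    print(f"light on={light}, lever pressed={press} -> reinforcer={outcome}")
```

Only the first trial, where the response occurs in the right context, produces the reinforcer.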
Shaping- involves reinforcing successive approximations of the desired behavior until the desired behavior occurs
Kinds of reinforcement:
Positive Reinforcement- an event which, when it follows an operant response, increases the
likelihood that the response will recur.
Negative Reinforcement- an event whose termination, when it occurs following an operant
response, increases the likelihood that the response will recur
o Primary reinforcer- biologically significant appetitive stimulus (e.g. food, water)
o Conditioned reinforcer- surrogate for the primary reinforcer, increasing the strength of any response that it
follows (e.g. grades, tokens, money)
o Back-up reinforcers – in a token economy, these are the items (e.g. ice cream, candy, toys) for
which the tokens, administered immediately after a response, can be exchanged
Schedules of Reinforcement
- A rule that states under what conditions a reinforcer will be delivered
- Continuous reinforcement (CRF)- every occurrence of the operant response is reinforced (e.g. putting
coins in vending machines)
- *Skinner realized that most behavior is reinforced only intermittently or partially
4 simple schedules:
1. fixed-ratio – fixed number of responses must be made before the reward is administered (e.g.
factory worker is paid P20 after every 12 shirts)
2. variable-ratio – number of responses determines the delivery of reinforcement, but the ratio
changes from reinforcement to reinforcement (e.g. slot machines, gambling joints- keeps people
coming back and guessing when the next pay-off will be)
3. fixed-interval – after a reinforced response, some interval of time passes during which
reinforcement is unavailable. Once the interval is over, the next response is reinforced, thereby
triggering the non-reinforcement interval to start again (e.g. salaried employees who receive their
paycheck every week)
*Scalloping effect- responding drops off just after a reinforcement and picks up again as the time of the
next reinforcement approaches
4. variable-interval – period of non-reinforcement varies after each reinforced response (e.g.
waiting for a bus that arrives every 10 minutes on average, so your waiting behavior is reinforced
after about 10 minutes)
*Ratio schedule- delivery of reinforcement depends on the number of times the learner makes the
response
*Interval schedule- based on the passage of time
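Since a schedule is a rule for when a reinforcer is delivered, the four simple schedules can be sketched as two small classes. This is a minimal illustration under assumptions: the class names, the respond method, and the way the variable requirement is drawn (uniformly around the mean) are hypothetical choices and are not specified in the reviewer.

```python
import random


class RatioSchedule:
    """Reinforce after a required number of responses (fixed or variable)."""

    def __init__(self, mean_ratio, variable=False, seed=0):
        self.mean_ratio = mean_ratio
        self.variable = variable
        self.rng = random.Random(seed)
        self.count = 0
        self.required = self._next_requirement()

    def _next_requirement(self):
        # Variable-ratio: requirement changes after each reinforcement
        # but averages mean_ratio (assumed uniform draw).
        if self.variable:
            return self.rng.randint(1, 2 * self.mean_ratio - 1)
        return self.mean_ratio

    def respond(self, time=None):
        """Register one response; return True if it is reinforced."""
        self.count += 1
        if self.count >= self.required:
            self.count = 0
            self.required = self._next_requirement()
            return True
        return False


class IntervalSchedule:
    """Reinforce the first response made after a waiting period has passed."""

    def __init__(self, mean_interval, variable=False, seed=0):
        self.mean_interval = mean_interval
        self.variable = variable
        self.rng = random.Random(seed)
        self.last_reinforced = 0.0
        self.wait = self._next_wait()

    def _next_wait(self):
        # Variable-interval: waiting period changes after each reinforcement
        # but averages mean_interval (assumed uniform draw).
        if self.variable:
            return self.rng.uniform(0, 2 * self.mean_interval)
        return self.mean_interval

    def respond(self, time):
        """Register a response at the given time; return True if reinforced."""
        if time - self.last_reinforced >= self.wait:
            self.last_reinforced = time
            self.wait = self._next_wait()
            return True
        return False


# Fixed-ratio 12: the factory worker is paid after every 12th shirt.
fr = RatioSchedule(12)
pay = [fr.respond() for _ in range(24)]

# Fixed-interval 7: only the first response after 7 time units is reinforced.
fi = IntervalSchedule(7)
fi_outcomes = [fi.respond(t) for t in (1, 3, 7, 8, 13, 14)]
```

In the fixed-ratio example only the 12th and 24th responses are reinforced. In the fixed-interval example the responses at times 1, 3, 8, and 13 go unreinforced because the interval has not yet elapsed. Passing variable=True gives the variable-ratio and variable-interval versions.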
The Use of Negative Reinforcement
Escape learning- a specific behavior is made to terminate or end an aversive event (e.g. dogs
jump to escape electric shock; turning off the volume of the tv to escape loud sound)
Avoidance learning- a specific behavior is made to prevent or avoid an aversive event (e.g. study
to avoid failing, paying bills on time to avoid cut-off)
Punishment
- An aversive stimulus event is presented after a response, with the end view of suppressing that
response
- The goal is to decrease a response (unlike NR where the goal is to increase a response)
- Effects of punishment can be permanent, like those of positive reinforcement
6 Variables that lead to the effectiveness of punishment:
1. Manner of introduction – punishment works best when introduced at its full intensity (subjects
get used to mild punishment, and a gradual increase lets them adapt, so it has little
effect on behavior)
2. Immediacy of Punishment – the more immediate the punishment, the greater the decrease in
responding
3. Schedule of Punishment – the most effective way to eliminate a behavior is to punish every
response
4. Motivation to Respond – the effectiveness of a punishment is inversely related to the intensity of
the subjects’ motivation to respond
5. Availability of Alternative Behaviors – punishment works better when the subject is provided with an
alternative way to obtain the reinforcer that has been maintaining some inappropriate behavior (e.g.
children punished for fighting should be reinforced for cooperative play)
6. Punishment as a discriminative stimulus – punishment is ineffective when it functions as a
discriminative stimulus—a signal predicting the availability of a reinforcer or a cue for something
pleasant (e.g. children may get punished but may later be lavished with attention)
Contiguity vs. Contingency: Learned Helplessness
Temporal contiguity- an operant behavior is conditioned when reinforcement immediately follows the
behavior
Controllability – alternative to contiguity; the extent to which the situation or event is seen to be
under or out of one’s control
Learned helplessness effect – when response and consequence are independent, an organism learns
that important environmental events are not subject to its control, thereby producing an inability to
learn in situations where important events may be controllable
*Operant conditioning occurs when an organism perceives contingency or relationship between its
response and reinforcement
Biological Constraints in Operant Conditioning
- In escape learning where an animal acquires a response that is reinforced by the termination of
shock, pigeons learn faster if the response is wing flapping rather than pecking a key.
- However, in the case of reward training where the reinforcer is food, pigeons learn pecking faster
than wing flapping, since pecking is part of the birds’ natural eating activity
- These findings do not support the assumption of equipotentiality in learning, or the idea that the same laws of
behavior apply to all situations.
Learning by Observation
- Also called vicarious learning
- Possibly accounts for most human learning, because learning would be a very slow process if we had to
experience the consequences of every behavior ourselves
- *Social learning theorist Albert Bandura conducted several studies that show how we acquire
operants by observing others. He holds that in learning through imitation, 2 important points
must be considered:
The consequences- there is less imitation when a child sees the model punished rather
than reinforced
Expectancy of reinforcement- direct reinforcement is not necessary
- 4 factors that determine the occurrence of imitative behavior:
Attentional processes – pay attention to the appropriate features of the model’s
behavior
Retentional processes – retain some of the information gathered through observation
if imitation is to occur at a later time. Rehearsal may be important.
*Observational learning in humans involves 2 representational systems:
imaginal and verbal
*verbal descriptions can guide behavior
Motor reproductive processes – translate some general knowledge into a
coordinated pattern of muscle movements
Incentive and motivational processes – not necessary for learning, but without them the
behavior may not be performed. The individual must have an expectation that the
performance of the new behavior will result in some type of reinforcement.
3 important elements in observational learning:
a) Type and power of the model – models who are authoritative, nurturant, and rewarding are more likely to be imitated
b) Learner’s personality and degree of independence – the less self-confidence a person
has, the more likely the person is to imitate a model
c) Situation – imitation is more likely when there is uncertainty about what is considered proper behavior (e.g.
asking your peers for dating tips)
Cognitive Learning
The Gestaltists – Max Wertheimer, Wolfgang Kohler, Franz Koffka, believed that behaviorism could
not account for much of human learning
Phi phenomenon – by Wertheimer, an optical illusion of perceiving a series of still images, when
viewed in rapid succession, as continuous motion (e.g. advertising lights)
- Wertheimer argued that this cannot be explained by simple stimulus-response
- During information processing, subjects add something to incoming data to form the
perception of movement
Animal Experiments
Insight Learning – grasping the relationships inherent in the problem and achieving the solution
through insight
Sultan (by Kohler) – the most celebrated chimpanzee. He wanted to get the fruit
outside of his cage, and he used a short stick to pull a long stick towards him
so he could reach the fruit. In another case, he stacked crates to get a
bunch of bananas hanging from the ceiling.
Pigeons (by Epstein) – Conditioned to climb, peck, and push. Eventually, the pigeons
used all of these conditioned behaviors to reach a hanging banana.
Latent Learning – takes place even without any reward. When the reward appears, what has been
learned (latent) is suddenly demonstrated (manifest)
Rats – In a maze, group 1 was given food every time it reached the end, group 2 was never given food, and group 3
was given no food for the first 10 days and then given food on the next 7 days. At first, group 1 performed better than
the other 2 groups. However, once group 3 began receiving food on the 11th day, it started performing
as well as group 1.
*They used a cognitive map (schema) – abstract mental representation used to organize
knowledge and make sense of real-world situations
Cognitive Viewpoint in Learning
1. Learning is a constructive, not a receptive, process – active integration of previous
knowledge with incoming information; construction of personal meaning
2. Structuring knowledge is essential – schemas tell us what to pay attention to, help
us remember relevant things, and guide us in comprehending the world. We often view situations
in terms of our personal schemas.
3. Self-awareness and self-regulation are emphasized – metacognition (thinking about
thinking) has 2 components: a) the knowledge we have about our thinking and b) ability to
regulate our thinking
4. Learning is influenced by motivation and beliefs – successful learners do not just have
mastery of content, but are also active, motivated, and confident learners
5. Social Interaction is integral to cognitive development – cooperative learning and peer
discussions stimulate learners to clarify and reconceptualize their information via bouncing ideas
off each other and generating constant feedback
Meaningful Learning
- The acquisition of new meanings (David Ausubel, 1970)
Material to be learned is potentially appropriate for learners
Learners should transform such material so that it will have personal meaning
- Related to what learners already know
- Discovery Learning (Jerome Bruner) – we tend to remember and comprehend better the
things we have discovered for ourselves
Chapter 7 - Memory
Memory Disorders
Anterograde amnesia – difficulty forming new memories
Korsakoff syndrome – symptoms of anterograde amnesia found among certain alcoholic
patients
Retrograde amnesia – memory loss for events prior to the event that caused the amnesia
Alzheimer’s disease – “eating away” of the brain by plaques and tangles, usually starting
with the hippocampus and later on affecting other areas of the brain including those with
memory functions