Chapter 6 copy
- Pavlov's research dogs were routinely given meat powder, and this routine led the dogs to salivate before the
meat powder was presented to them
○ Then he started to serve meat powder while a metronome was ticking, and later on the dogs salivated when they
just heard the metronome; so the dogs were salivating in anticipation of food
- Classical conditioning/Pavlovian conditioning is a form of associative learning in which an organism learns to
associate a neutral stimulus (e.g. sound) with a biologically relevant stimulus (e.g. food); this results in a change in the
response to the previously neutral stimulus (e.g. salivation)
- Unconditioned stimulus (US) is a stimulus that elicits a reflexive response without learning (e.g. food, water, pain)
- Unconditioned response (UR) is a reflexive, unlearned reaction to a US (e.g. hunger, drooling, expressions of pain)
- So US and UR are both unlearned and something that happens naturally
○ E.g. The meat powder elicits unconditioned salivation (Pavlov's dogs)
- Conditioned stimulus (CS) is a neutral stimulus that later starts to elicit a conditioned response due to being
associated with an unconditioned stimulus for a period of time (e.g. the metronome in Pavlov's dogs)
- Conditioned response (CR) is a learned response that occurs due to CS (e.g. dogs salivate due to metronome)
- CS can only have an effect if it becomes associated with the US
- CR is learned and UR is naturally occurring
- UR and CR are not always the same response
○ Many animals "freeze" (i.e. become motionless) when they are scared, since predators can detect movement
▪ Lab rats that hear a tone and get an electric shock on their feet have a UR of jumping, flinching, and pain,
but once they associate the tone with the shock, they just freeze when they hear the tone even if there
is no shock (CR)
- So conditioning has an evolutionary function
○ Evolutionary function of the CR can be seen as a way for the organism to interact adaptively with the US
- According to Hebb's rule, a weak connection b/w neurons becomes strengthened when it is stimulated at the
same time as a strong connection
○ So in conditioning, suppose a puff of air and the blinking response is a strong connection, and a sound and blinking
is a weak connection. If you repeatedly perceive the puff of air and hear the sound at the same time (the blinking
response would still occur), the sound and the blinking response will become a strong connection, and you will blink
when you hear the sound alone
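The air puff example can be sketched in a few lines of Python. This is only an illustration of Hebb's rule, not a model from the chapter: the function name `train`, the starting weights, the learning rate, and the blink threshold are all made-up values chosen so the effect is easy to see.

```python
def train(trials, puff_present, w_sound=0.05, w_puff=1.0, rate=0.1, threshold=0.5):
    """Return the sound->blink connection strength after a number of trials.

    All numbers are hypothetical; the sound is presented on every trial.
    """
    for _ in range(trials):
        puff = 1.0 if puff_present else 0.0
        sound = 1.0
        # The blink fires only if the summed input reaches the threshold.
        blink = 1.0 if (puff * w_puff + sound * w_sound) >= threshold else 0.0
        # Hebbian update: the sound connection grows only when the sound and
        # the blink are active at the same time (capped at 1.0).
        w_sound = min(1.0, w_sound + rate * sound * blink)
    return w_sound


paired = train(20, puff_present=True)       # sound + air puff together
sound_only = train(20, puff_present=False)  # sound without the air puff
print(paired, sound_only)
```

With pairing, the weak connection ends up above the blink threshold, so the sound alone would now trigger a blink. Without the air puff, the blink never fires, so the weak connection never changes.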
- CR can diminish over time, or it may occur with a new stimulus with which the response has never been paired
- Acquisition is the initial phase of learning in which a response is established
○ Thus, in classical conditioning, acquisition is when the neutral stimulus is repeatedly paired with the US
▪ If it's not repeatedly paired (e.g. dogs were given food only sometimes when the metronome was ticking),
then conditioning would not occur, or would be very weak
- Extinction is the reduction of a conditioned response when a US and CS no longer occur together
○ So if the dogs didn't receive food when the metronome was ticking, and this happened frequently, then the
salivation would occur less and less until they didn't salivate at all
▪ Biologically makes sense, since metronome is no longer a good predictor of food
○ Rate of firing in the brain areas related to this association decreases over the course of extinction
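A rough way to picture acquisition and extinction is a single "associative strength" number that moves toward 1 when the CS is paired with the US and toward 0 when the CS appears alone. The sketch below uses a simple error-correcting update in the style of the Rescorla-Wagner model, which these notes do not mention. The function names and the learning rate of 0.3 are assumptions for illustration.

```python
def update(strength, us_present, rate=0.3, max_strength=1.0):
    """Move associative strength a fraction of the way toward its target."""
    target = max_strength if us_present else 0.0
    return strength + rate * (target - strength)


def run_phase(strength, trials, us_present, rate=0.3):
    """Run a block of trials and return the strength after each one."""
    history = []
    for _ in range(trials):
        strength = update(strength, us_present, rate)
        history.append(strength)
    return history


acquisition = run_phase(0.0, 10, us_present=True)               # metronome + food
extinction = run_phase(acquisition[-1], 10, us_present=False)   # metronome alone
print([round(s, 2) for s in acquisition])
print([round(s, 2) for s in extinction])
```

The sketch treats extinction as simple unlearning, which is probably too simple: it cannot produce a CR that comes back after a rest period.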
- Spontaneous recovery is the reoccurrence of a conditioned response after some time of extinction
○ So dogs that go back to the experiment room after quite some time start to salivate to the metronome again
○ Possibly the animal is not able to retrieve the memory of extinction and goes back to the memory of the conditioned response
- Generalization is a process in which a response that occurred for a specific stimulus also occurs for different
but similar stimuli (e.g. dogs salivating not only to the metronome, but also to similar sounds)
○ When we perceive a stimulus, it activates our brain's representation for that item and representations of other
related items
▪ So according to Hebb's rule, the synapses of those additional representations would also fire at the same time
as the synapses involved in the conditioned response, and the connections of those additional synapses
would therefore be strengthened.
- Discrimination is when an organism responds to an original conditioned stimulus but not to new stimuli that may
be similar to the original stimulus
○ If stimuli similar to the CS are presented without a US, it becomes less likely that those stimuli will lead to
stimulus generalization
○ So the dogs would hear these other tones, which would have their own memory representation in which they
did not receive any food
- Conditioned emotional responses consist of emotional and physiological responses that develop to a
specific object/situation
○ Ex: Watson & Rayner conditioned an 11-month-old child, Albert, to fear white rats. Before conditioning, Albert
showed no fear. Then they startled Albert with a loud noise, made by striking a steel bar with a hammer, when
Albert was with the rat (US is the loud noise, UR is the fear elicited by the noise). After repeated pairings
of rat and noise, he started to fear the rat even w/o the noise. The rat became the CS and the fear it elicited became
the CR. Albert's emotional conditioning also generalized to other white furry objects (like rabbits).
▪ Emotional conditioning doesn't have to take place in an experiment; it can also happen naturally
- Conditioned emotional responses offer a possible explanation for many phobias
- If an organism learns a fear-related association, activity occurs in the amygdala (brain area related to fear)
- If an organism learns to fear a particular location (like a certain cage associated with electrical shock), then
context-related activity in the hippocampus will communicate with the amygdala to produce contextual fear
conditioning
- Neural connections related to fear conditioning remain intact after extinction
○ Other neurons suppress the activity of brain areas related to fear responses
▪ But if the CS and US are paired again, this suppression is gone and fear-conditioned responses will occur again
- People with psychopathy (antisocial personality disorder; notorious for disregarding the feelings of others) are
not really affected by emotional conditioning
○ In an experiment, people were shown a face (neutral stimulus) followed by pain (US), which elicits a
pain response (UR). After repeated pairings, the face became the CS and people had a negative emotional
reaction to those faces (CR). But people with psychopathy showed little physiological arousal, their emotional
brain areas were quiet when looking at the CS, and they didn't really mind looking at the faces, unlike the other participants.
- Conditioned taste aversion is the acquired dislike or disgust for a food or drink because it was paired with illness
○ CS is the food, and US is whatever is in the food (like bacteria) and in the environment that makes you sick
○ Conditioned aversions only occur for the flavour of a particular food and not any other stimuli during that time
▪ If you listen to a particular song while eating a 2-week-old tuna sandwich, your aversion would develop to
the tuna sandwich and not to the song
○ You can develop a conditioned aversion after a single exposure, and even if you only feel sick a couple of hours
after eating the food
▪ E.g. the gap between the food (CS) and feeling sick because of food poisoning (UR) can be a matter of hours
□ Most conditioning only happens if the CS, US, and UR occur within a short period of time
- Usually conditioned taste aversion develops to something we ingested that has an unfamiliar flavour, since
these unfamiliar flavours stick out and are much easier to remember
○ If someone eats a Swiss cheese sandwich for lunch every day and suddenly feels ill during the afternoon,
that person will be less likely to develop a conditioned taste aversion
▪ This scenario can be explained by latent inhibition
- Latent inhibition occurs when frequent experience with a stimulus before it is paired with a US makes it less
likely that conditioning will occur after a single episode of illness
- Conditioned emotional responses are also being created by negative political advertisements
○ CS would be the attacked politician. US would be the negative imagery (black and white image, grainy, poor
quality). UR would be the negative emotional response to the imagery. The people who made the ads hope
that the attacked politician alone will then produce a negative emotional response (CR)
▪ This is similar to evaluative conditioning. Evaluative conditioning is when you pair a stimulus (e.g. shape)
with a positive or negative stimulus (happy/angry face). Repeated association of a stimulus with an emotion
leads people to develop positive or negative feelings towards the stimulus. This is what negative
political ads are trying to accomplish.
□ Ultimately, the actual effect of the ads was on voters who already agreed with the views
expressed in the ads. The goal of creating a negative opinion of the attacked politician instead motivated people
who already had a negative view of the attacked politician to go and vote for the party that made the ad.
- Classical conditioning can explain drug-related phenomena, such as craving and tolerance
○ Cues that accompany drug use (cigarette lighter, smell of tobacco smoke) can be conditioned stimuli that
elicit cravings
○ When a person takes a drug, the body attempts to metabolize the substance. However, over time the
paraphernalia associated with the drug serve as cues (CS) that the drug (US) is about to be processed by the body (UR), and
the processes involved in metabolizing the drug begin before the drug is consumed (CR)
▪ So over time you need a larger dose of the drug to override the body's preparedness
□ This is called conditioned drug tolerance
◆ This is dangerous, since if you change the environment where you normally take the drug,
there will be fewer CSs to trigger the CR (the body's metabolizing activity that prepares for the drug's
arrival), and you can actually overdose
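The overdose risk comes down to simple arithmetic, sketched below. The function `net_drug_effect`, its units, and the numbers are hypothetical and not taken from the chapter; the only idea borrowed from the notes is that the body's preparatory response (CR) scales with how many of the usual cues (CSs) are present.

```python
def net_drug_effect(dose, conditioned_compensation, cue_fraction):
    """Effect felt = dose minus the body's cue-triggered preparatory response.

    cue_fraction is the share of the usual drug cues present (0.0 to 1.0).
    All quantities are in arbitrary, made-up units.
    """
    return dose - conditioned_compensation * cue_fraction


usual_place = net_drug_effect(dose=10, conditioned_compensation=6, cue_fraction=1.0)
new_place = net_drug_effect(dose=10, conditioned_compensation=6, cue_fraction=0.2)
print(usual_place, new_place)
```

The same dose produces a much larger net effect in the new environment, because most of the cues that normally trigger the body's preparation are missing.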
- Rats in the operant chamber don't immediately go and press the lever; they must first learn that it accomplishes something
○ So getting the rat to press the lever can be done by reinforcing behaviours that approximate (or lead up to)
lever pressing, such as standing up, facing the lever, putting paws on the lever, and pressing downward
▪ This process is known as shaping
- Shaping is the process of reinforcing successive approximations of a specific operant response
○ This is done in a step-by-step fashion until the desired response occurs
- Chaining involves linking together two or more shaped behaviours into a more complex action or sequence of actions
- Applied behaviour analysis (ABA) involves using close observation, prompting, and reinforcement to teach
behaviours, often to people who experience difficulties and challenges owing to a developmental condition such as
autism
○ An autistic child may have trouble clearing the dishes from the dining table
▪ So you use prompts (stand up, gather silverware, gather plate, etc.) and give a verbal reward for each
step completed. The desired behaviour is thereby shaped
- Primary reinforcers consist of reinforcing stimuli that satisfy basic motivational needs—needs that affect an
individual’s ability to survive (and, if possible, reproduce)
○ E.g. Food, water, shelter, sexual contact
- Secondary reinforcers consist of stimuli that acquire their reinforcing effects only after we learn that they have value
○ More abstract and do not directly influence survival-related behaviours
○ E.g. Instagram likes, money, etc.
- Nucleus accumbens becomes activated during the processing of rewards, including primary ones (eating, sex)
and "artificial" rewards like cocaine or smoking a cigarette
○ Variations in this area might account for why individuals differ so much in their drive for reinforcers
▪ People who are prone to risky behaviours like gambling and alcohol abuse are more likely to have
inherited particular copies of genes that code for dopamine and other reward-based chemicals in
the brain
▪ People who are impulsive (vulnerable to gambling and drug abuse) release more dopamine and
have trouble removing dopamine
- Secondary reinforcers also trigger the release of dopamine in reward areas of the brain
○ E.g. Monetary rewards cause dopamine release in basal ganglia and frontal lobes
- When a behaviour is rewarded for the first time, dopamine is released, which reinforces reward-producing behaviours
○ Dopamine-releasing neurons and the nucleus accumbens keep track of which behaviours are associated with rewards
▪ They alter their rate of firing when there is a need to update which actions lead to rewards
- Discriminative stimulus is a cue or event that indicates that if a response is made, it will be reinforced
○ E.g. You ask to borrow your parents' car only when they are in a good mood
▪ So your parents' mood will dictate whether you perform a behaviour (asking to borrow the car)
- Discrimination occurs when an organism learns to respond to one original discriminative stimulus but not to
new stimuli that may be similar
○ E.g. A pigeon may learn that it will receive a reward if it pecks the key after a 1000 Hz tone, but it won't receive
the reward if it pecks the key after a 2000 Hz tone. As a result, the pigeon won't peck the key after a 2000 Hz tone
- Generalization takes place when an operant response occurs in response to a new stimulus that is similar to
the stimulus present during original learning
○ E.g. A pigeon that learns to peck the key after a 1000 Hz tone may attempt to peck a key whenever any tone is presented
○ E.g. If a child pets a neighbour's dog and this leads to the child laughing and playing with the dog, then the child might be
more likely to pet other dogs and furry animals
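Generalization and discrimination can be pictured as a curve of response strength around the trained tone. The sketch below assumes a bell-shaped (Gaussian) curve, which is an illustrative choice and not something stated in these notes. The function name and the width values are hypothetical. A wide curve corresponds to generalization; a narrow curve corresponds to discrimination.

```python
import math


def response_strength(test_hz, trained_hz=1000.0, width_hz=300.0):
    """Relative tendency to peck (0 to 1) for a tone of test_hz.

    Assumes a Gaussian fall-off around the trained tone; width_hz is a
    made-up parameter controlling how broadly the response generalizes.
    """
    return math.exp(-((test_hz - trained_hz) ** 2) / (2 * width_hz ** 2))


for hz in (1000.0, 1100.0, 2000.0):
    broad = response_strength(hz, width_hz=300.0)   # generalizing pigeon
    narrow = response_strength(hz, width_hz=50.0)   # after discrimination training
    print(hz, round(broad, 3), round(narrow, 3))
```

In this picture, a pigeon that has learned the 2000 Hz tone is never rewarded has a narrow curve, so it pecks at 1000 Hz and hardly at all at 2000 Hz.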
- Thorndike said that reinforcement was more effective if there was very little time b/w action and consequence
○ A study showed that pigeons would peck the key less frequently if the amount of time to get the reward was increased
- Delayed reinforcement influences human behaviours as well
○ Drugs that hit you as soon as you take them are more addictive than drugs that take a while to have a noticeable effect
▪ So you are more likely to get addicted to drugs that have a rapid effect than to drugs that have a delayed effect
- Extinction is the weakening of an operant response when reinforcement is no longer available
○ E.g. if your parents no longer let you borrow the car no matter how nicely you ask, you may persist
in your behaviour for a while but you will eventually stop asking
○ If you expect a reward for your behaviour and nothing comes, the amount of dopamine release decreases
▪ Dopamine will increase again if there is a new behaviour-reward relationship to learn
- It's possible for a complex behaviour to be influenced by both classical conditioning and operant conditioning
○ E.g. Consider gambling; slot machines use a variable-ratio schedule of reinforcement, a type of operant
conditioning that leads to a high response rate. The flashy lights and dinging sounds from the machine serve as
a CS that elicits the excitement associated with gambling (CR)
▪ Classical conditioning produces an emotional response and operant conditioning maintains the behaviour
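A variable-ratio schedule rewards a response after an unpredictable number of responses that averages out to some value. The sketch below is one simple way to simulate it, not how any real slot machine works. The function name, the uniform draw between 1 and `2 * mean_ratio - 1`, and the seed are assumptions made for illustration.

```python
import random


def variable_ratio_schedule(presses, mean_ratio, seed=0):
    """Return one True/False per lever press (True = that press paid out).

    The number of presses needed for each payout is drawn uniformly from
    1 .. 2*mean_ratio - 1, so it averages mean_ratio (hypothetical choice).
    """
    rng = random.Random(seed)
    outcomes = []
    required = rng.randint(1, 2 * mean_ratio - 1)
    count = 0
    for _ in range(presses):
        count += 1
        if count >= required:
            outcomes.append(True)
            count = 0
            # Draw a fresh, unpredictable requirement for the next payout.
            required = rng.randint(1, 2 * mean_ratio - 1)
        else:
            outcomes.append(False)
    return outcomes


outcomes = variable_ratio_schedule(presses=1000, mean_ratio=5)
print(sum(outcomes), "payouts in", len(outcomes), "presses")
```

Because the player can never tell which press will pay out, every press might be the winning one, which is one way to see why this schedule keeps the response rate high.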