Chapter 9 - Conceptual Knowledge
Chapter 9 - Conceptual Knowledge
9
Basic Properties of Concepts and Categories The Connectionist Approach
What Is a Connectionist Model?
How Are Objects Placed into Categories?
Why Definitions Don’t Work for Categories How Are Concepts Represented in a Connectionist Network?
263
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
We answer questions such as these all the time, usually without even realizing it. For
example, imagine that you find yourself in an unfamiliar town, where you have never been
before. As you walk down the street, you notice that many things are not exactly the same
as what you would encounter if you were in your own town. On the other hand, there are
many things that seem familiar. Cars pass by, there are buildings on either side of the street
and a gas station on the corner, and a cat dashes across the street and makes it safely to the
other side. Luckily, you know a lot about cars, buildings, gas stations, and cats, so you have
no trouble understanding what is going on.
This chapter is about the conceptual knowledge that enables you to recognize and
understand the objects in the street scene and the world. This knowledge exists in the form
of concepts. Concepts have been defined in a number of ways, including “the mental rep-
resentation of a class or individual” (Smith, 1989) and “categories of objects, events, and
abstract ideas” (Kiefer & Pulvermüller, 2012). To express this in concrete terms, we can say
that the concept “cat” is the answer to the question “What is a cat?” If your answer is that a
cat is an animal that is furry, meows, can be petted, moves, and catches mice, you will have
described some aspects of the concept “cat” (Kiefer & Pulvermüller, 2012).
Because we are interested in our knowledge about the world, we need to go beyond
cats! When we start expanding our scope to dogs, automobiles, can openers, radishes, and
264 roses, things start to become both more complicated and more interesting, because the
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
question then becomes “How are all of these things organized in the mind?” One way we
organize concepts is in terms of categories.
A category includes all possible examples of a particular concept. Thus, the category
“cats” includes tabbies, Siamese cats, Persian cats, wildcats, leopards, and so on. Looked at
in this way, concepts provide the rules for creating categories. Thus, the mental represen-
tation for “cat” would affect what animals we place in the “cat” category. Because concepts
provide rules for sorting objects into categories, concepts and categories are often discussed
together, and a great deal of research has focused on the process of categorization—the
process by which things are placed in categories.
Categorization is something we do every time we place an object into a category, and
once we have assigned an object to a category, we know a lot about it. For example, being
able to say that the furry animal across the street is a “cat” provides a great deal of informa-
tion about it (Figure 9.1). Categories have therefore been called “pointers to knowledge”
(Yamauchi & Markman, 2000). Once you know that something is in a category, whether
“cat,” “gas station,” or “impressionist painting,” you can focus your energy on specifying
what’s special about this particular object (see Solomon et al., 1999).
Categorization not only helps understand what is happening in the environment, it also
plays an essential role in enabling us to take action. For example, to spread jam on bread, you
must recognize the jam jar, the bread, and the knife; realize their relevant properties (bread
is soft unless it is toasted; knives are rigid, and jam is sticky); and know how to grasp the
knife handle with the right grip to scrape the jam from its jar (Lambon Ralph et al., 2017).
Being able to place things in categories can also help us understand behaviors that we
might otherwise find baffling. For example, if we see a man with the left side of his face
painted black and the right side painted gold, we might wonder what is going on. However,
once we note that the person is heading toward the football stadium and it is Sunday
afternoon, we can categorize the person as a “Pittsburgh Steelers fan.” Placing him in that
Difficult to train
Catches mice
Has whiskers
Likes milk,
fish
Sleeps a lot,
but more active
at night
Likes to
rub up against
people and other
objects
Bruce Goldstein
➤ Figure 9.1 Knowing that something is in a category provides a great deal of information about it.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
category explains his painted face and perhaps other strange behaviors that happen to be
normal on game day in Pittsburgh (Solomon et al., 1999).
These various uses of categories testify to their importance in everyday life. It is no
exaggeration to say that if there were no such thing as categories, we would have a very
difficult time dealing with the world. Consider what it would mean if every time you saw a
different object, you knew nothing about it other than what you could find out by investi-
gating it individually. Clearly, life would become extremely complicated if we weren’t able
to rely on the knowledge provided to us by categories.
Yes, categories are important. But what do we need to know in order to understand cat-
egories? The answer to this question isn’t obvious, because we routinely categorize things,
so it seems like an automatic process. It’s obvious that there’s a cat sitting in a chair, across
the room—cat, chair, and room being different categories. These things, and thousands of
others, are so easy to categorize that there seems to be no problem to solve.
But as with other cognitive abilities, just because it’s easy doesn’t mean it’s simple.
Categorization becomes more difficult if you encounter something unfamiliar. “What is
that over there? An aardvark? Very strange.” Or things become even more difficult when
a person suffers a brain injury that makes it difficult to identify different objects or know
what they are used for. Once we understand that there are situations in which categorization
becomes difficult, things become more interesting, because recognizing and understanding
these difficulties is the first step to uncovering the mechanisms of categorization.
This chapter considers, in three major sections, the difficulties of categorization and
the mechanisms involved in day-to-day categorization. Each of the three sections tells
a story that involves a different approach to categorization. First, in the section “Basic
Properties of Concepts and Categories,” we consider a behavioral approach that originated
with a series of experiments begun in the 1970s, which have helped us understand how
we place objects in different categories and which have shown that “not all objects are
created equal.” In the second section, “Network Models of Categorization,” we consider
the network approach to categorization that began in the 1960s, inspired by the emerging
field of computer science, which created computer models of how categories are repre-
sented in the mind. In the third section, “How Categories Are Represented in the Brain,”
we take a physiological approach, which looks at the relationship between categories and
the brain. We will see that each approach provides its own perspective on categorization,
and that all three together provide a more complete explanation of categorization than
any one approach alone.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
(a) (b)
Bruce Goldstein
(c) (d)
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
Wittgenstein proposed the idea of family resemblance to deal with the problem that
definitions often do not include all members of a category. Family resemblance refers to
the idea that things in a particular category resemble one another in a number of ways.
Thus, instead of setting definite criteria that every member of a category must meet, the
family resemblance approach allows for some variation within a category. Chairs may come
in many different sizes and shapes and may be made of different materials, but every chair
does resemble other chairs in some way. Looking at category membership in this way, we
can see that the chair in Figure 9.2a and the chair in Figure 9.2c do have in common that
they offer a place to sit, a way to support a person’s back, and perhaps a place to rest the arms
while sitting.
In a series of experiments beginning in the 1970s, Eleanor Rosch and coworkers used
the idea of family resemblance as a jumping off point for experiments that investigated the
basic nature of categories. One of the early ideas to emerge from these experiments is the
idea of prototypes.
➤ Figure 9.3 Three real birds—a sparrow, a robin, and a blue jay—and a “prototype” bird that is the average
representation of the category “birds.”
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
D E M O N S T R AT I O N Family Resemblance
Rosch and Mervis’s (1975) instructions were as follows: For each of the following
common objects, list as many characteristics and attributes as you can that you feel
are common to these objects. For example, common characteristics for bicycles
are two wheels, pedals, handlebars, you ride on them, they don’t use fuel, and so
on. Give yourself about a minute to write down the characteristics for each of the
following items:
1. chair 3. sofa
2. mirror 4. telephone
If you responded like Rosch and Mervis’s participants, you assigned many of the same
characteristics to chair and sofa. For example, chairs and sofas share the characteristics of
having legs, having backs, you sit on them, they can have cushions, and so on. When an
item’s characteristics have a large amount of overlap with the characteristics of many other
items in a category, this means that the family resemblance of these items is high. But when
we consider mirror and telephone, we find that there is far less overlap, even though they
were both classified by Rosch and Mervis as “furniture” (Figure 9.4b). Little overlap with
other members of a category means the family resemblance is low.
Rosch and Mervis concluded from their results that there is a strong relationship
between family resemblance and prototypicality. Thus, good examples of the category “fur-
niture,” such as chair and sofa, share many attributes with other members of this category;
poor examples, like mirror and telephone, do not. In addition to the connection between
prototypicality and family resemblance, researchers have determined a number of other
connections between prototypicality and behavior.
Statements About Prototypical Objects Are Verified Rapidly Edward Smith and
coworkers (1974) used a procedure called the sentence verification technique to deter-
mine how rapidly people could answer questions about an object’s category.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
two statements:
Prototypical Objects Are Affected More by Priming Priming occurs when presenta-
tion of one stimulus facilitates the response to another stimulus that usually follows closely
in time (see Chapter 6, page 182). Rosch (1975b) demonstrated that prototypical members
of a category are more affected by a priming stimulus than are nonprototypical members.
The procedure for Rosch’s experiment is shown in Figure 9.6. Participants first heard the
prime, which was the name of a color, such as “green.” Two seconds later they saw a pair of
colors side by side and indicated, by pressing a key as quickly as possible, whether the two
colors were the same or different.
The side-by-side colors that participants saw after hearing the prime were paired in
three different ways: (1) colors were the same and were good examples of the category
(primary reds, blues, greens, etc.; Figure 9.6a); (2) colors were the same but were poor
examples of the category (less rich versions of the good colors, such as light blue, light green,
etc.; Figure 9.6b); (3) colors were different, with the two colors coming from different cat-
egories (for example, pairing red with blue; Figure 9.6c).
The most important result occurred for the two “same” groups. In this condition,
priming resulted in faster “same” judgments for the prototypical (good) colors (reaction
time, RT = 610 ms) than for the nonprototypical (poor) colors (RT = 780 ms). Thus,
when participants heard the word green, they judged two patches of primary green as being
the same more rapidly than two patches of light green.
Rosch explains this result as follows: When participants hear the word green, they
imagine a “good” (highly prototypical) green (Figure 9.7a). The principle behind priming
is that the prime will facilitate the participants’ response to a stimulus if it contains some of
the information needed to respond to the stimulus. This apparently occurs when the good
greens are presented in the test (Figure 9.7b), but not when the poor greens are presented
(Figure 9.7c). Thus, the results of the priming experiments support the idea that partic-
ipants create images of prototypes in response to color names. Table 9.1 summarizes the
various ways that prototypicality affects behavior.
The prototype approach to categorization, and in particular Rosch’s pioneering
research, represented a great advance over the definitional approach because it provided a
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
Reaction time
Prime Test for “same”
610 ms
(a)
Hear
“green” 780 ms
(b)
Different color
(c)
➤ Figure 9.6 Procedure for Rosch’s (1975b) priming experiment. Results for the conditions
when the test colors were the same are shown on the right. (a) The person’s “green”
prototype matches the good green but (b) is a poor match for the light green; (c) shows
the condition in which colors were different.
Good match
(b)
Prototype
for green
(c)
Poor
match
“Green”
(a)
➤ Figure 9.7 How Rosch explains the finding that priming resulted in faster “same”
judgments for prototypical colors than for nonprototypical colors.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
TABLE 9.1
Some Effects of Prototypicality
Effect Description Experimental Result
Family Things in a category resemble each Higher ratings for high-prototypical items
resemblance other in a number of ways. when people rate how “good” a member of
the category it is (Rosch, 1975a).
Typicality People react rapidly to members of Faster reaction time to statements like
a category that are “typical” of the “A _______ is a bird” for high-prototypical
category. items (like robin) than for low-prototypical
items (like ostrich) (Smith et al., 1974).
Naming People are more likely to list some High-prototypical items are named first
objects than others when asked to when people list examples of a category
name objects in a category. (Mervis et al., 1976).
Priming Presentation of one stimulus affects Faster same–different color judgments for
responses to a stimulus that follows. high-prototypical items (Rosch, 1975b).
wealth of experimental evidence that all items within a category are not the same. Another
approach to categorization, called the exemplar approach, also takes into account the wide
variation among items that belong to a particular category.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
Some researchers have concluded that people may use both approaches. It has been
proposed that as we initially learn about a category, we may average exemplars into a pro-
totype; then, later in learning, some of the exemplar information becomes stronger (Keri
et al., 2002; Malt, 1989). Thus, early in learning, we would be poor at taking into account
“exceptions” such as ostriches or penguins, but later, exemplars for these cases would be
added to the category. We know generally what cats are—the prototype—but we know our
own specific cat the best—an exemplar (Minda & Smith, 2001; Smith & Minda, 2000).
A recent survey considering the virtues of both prototypes and exemplars ends with the
following conclusion: “The two kinds of information work together to produce our rich
store of conceptual knowledge allowing each kind of knowledge to explain the tasks that are
most suited for it” (Murphy, 2016).
Kitchen Dining Kitchen Dining Single Double Specific Ford Chevy Pickup Van Road Trail
room room (Subordinate)
(a) (b)
➤ Figure 9.8 Levels of categories for (a) furniture and (b) vehicle. Rosch provided evidence for the idea that the basic level
is “psychologically privileged.”
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
D E M O N S T R AT I O N Naming Things
Look at Figure 9.10 and, as quickly as possible, write down or say a word that identifies
each picture.
What names did you assign to each object? When Rosch and coworkers (1976) did
a similar experiment, they found that people tended to pick a basic level name. They said
guitar (basic level) rather than musical instrument (global) or rock guitar (specific), fish
rather than animal or trout, and pants rather than clothing or jeans.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
Percent responses
campus, 75 percent of the responses used labels like “tree,” rather than
more specific labels like “oak.”
50
But instead of asking college undergraduates to name plants, what if
Coley had taken a group of horticulturalists around campus? Do you think
they would have said “tree” or “oak”? An experiment by James Tanaka and
Marjorie Taylor (1991) asked a similar question for birds. They asked bird 0
experts and nonexperts to name pictures of objects. There were objects from sic ific sic ific
Ba Spec Ba Spec
many different categories (tools, clothing, flowers, etc.), but Tanaka and Taylor Categories used to name
were interested in how the participants responded to the four bird pictures. pictures of birds
The results (Figure 9.11) show that the experts responded by spec-
ifying the birds’ species (robin, sparrow, jay, or cardinal), but the nonex- ➤ Figure 9.11 Results of Tanaka and Taylor’s (1991)
perts responded by saying “bird.” Apparently, the experts had learned to “expert” experiment. Experts (left pair of bars) used
pay attention to features of birds that nonexperts were unaware of. Thus, more specific categories to name birds, whereas
in order to fully understand how people categorize objects, we need to nonexperts (right pair of bars) used more basic
consider not only the properties of the objects but also the learning and categories.
experience of the people perceiving those objects (also see Johnson &
Mervis, 1997).
From the result of Tanaka’s bird experiment, we can guess that a horticulturist walking
around campus would be likely to label plants more specifically than people who had little
specific knowledge about plants. In fact, members of the Guatemalan Itzaj culture, who live
in close contact with their natural environment, call an oak tree an “oak,” not a “tree” (Coley
et al., 1997).
Thus, the level that is “special”—meaning that people tend to focus on it—is not the
same for everyone. Generally, people with more expertise and familiarity with a particular
category tend to focus on the more specific information that Rosch associated with the
specific level. This result isn’t surprising, because our ability to categorize is learned from
experience and depends on which objects we typically encounter and what characteristics
of these objects we pay attention to.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
living thing
can
is grow
living
IS A
IS AN
➤ Figure 9.12 Collins and Quillian’s (1969) semantic network. Specific concepts
are indicated in color. Properties of concepts are indicated at the nodes for each
concept. Additional properties of a concept can be determined by moving up
the network, along the lines connecting the concepts. For example, moving from
“canary” up to “bird” indicates that canaries have feathers and wings and can
fly. The dashed lines indicate the distance in the network from canary to bird and
from bird to animal.
(Source: Adapted from T. T. Rogers & J. L. McClelland, 2004)
To access more information about “canary,” we move up the link and learn that a canary is
a bird and that a bird has wings, can fly, and has feathers. Moving up another level, we find
that a canary is also an animal, which has skin and can move, and finally we reach the level
of living things, which tells us it can grow and is living.
You might wonder why we have to travel from “canary” to “bird” to find out that a
canary can fly. That information could have been placed at the canary node, and then we
would know it right away. But Collins and Quillian proposed that including “can fly” at the
node for every bird (canary, robin, vulture, etc.) was inefficient and would use up too much
storage space. Thus, instead of indicating the properties “can fly” and “has feathers” for
every kind of bird, these properties are placed at the node for “bird” because this property
holds for most birds. This way of storing shared properties just once at a higher-level node
is called cognitive economy.
Although cognitive economy makes the network more efficient, it does create a problem
because not all birds fly. To deal with this problem while still achieving the advantages of cog-
nitive economy, Collins and Quillian added exceptions at lower nodes. For example, the node
for “ostrich,” which is not shown in this network, would indicate the property “can’t fly.”
How do the elements in this semantic network correspond to the actual operation of
the brain? The links and nodes we have been describing do not necessarily correspond to
specific nerve fibers or locations in the brain. The Collins and Quillian model is not meant
to mirror physiology but to indicate how concepts and their properties are associated in the
mind, and to make predictions about how we retrieve properties associated with a concept.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
Animal
Primed
Robin
Canary Ostrich
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
The idea that spreading activation can influence priming was studied by David Meyer
and Roger Schvaneveldt (1971) in a paper published shortly after Collins and Quillian’s
model was proposed. They used a method called the lexical decision task.
Meyer and Schvaneveldt used a variation of the lexical decision task by presenting par-
ticipants with pairs of words, one above the other, as shown below:
Criticism of the Collins and Quillian Model ➤ Figure 9.15 Results of Meyer and
Schvaneveldt’s (1971) experiment.
Although Collins and Quillian’s model was supported by the results of a number Participants responded faster for words that
of experiments, such as their reaction time experiment (Figure 9.13) and Meyer were more closely associated (left bar).
and Schvaneveldt’s priming experiment, it didn’t take long for other researchers to call the
theory into question. They pointed out that the theory couldn’t explain the typicality effect,
in which reaction times for statements about an object are faster for more typical members
of a category than for less typical members (see page 270; Rips et al., 1973). Thus, the state-
ment “A canary is a bird” is verified more quickly than “An ostrich is a bird,” but the model
predicts equally fast reaction times because “canary” and “ostrich” are both one node away
from “bird.”
Researchers also questioned the concept of cognitive economy because of evidence that
people may, in fact, store specific properties of concepts (like “has wings” for “canary”) right
at the node for that concept (Conrad, 1972). In addition, Lance Rips and coworkers (1973)
obtained sentence verification results such as the following:
A pig is a mammal. RT = 1,476 ms
A pig is an animal. RT = 1,268 ms
“A pig is an animal” is verified more quickly, but as we can see from the network in
Figure 9.16, the Collins and Quillian model predicts that “A pig is a mammal” should be
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
verified more quickly because a link leads directly from “pig” to “mammal,” but we
Animal
need to travel one link past the “mammal” node to get to “animal.” Sentence veri-
fication results such as these, plus the other criticisms of the theory, led research-
ers to look for alternative ways to using networks to describe how concepts are
organized (Glass & Holyoak, 1975; Murphy et al., 2012) and eventually, in the
1980s, to the proposal of a new approach to networks, called connectionism.
Bird Fish Mammal
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
Activation of units in a network therefore depends on two things: (1) the signal that origi-
nates in the input units and (2) the connection weights throughout the network.
In the network of Figure 9.17, two of the input units are receiving stimuli. Activation
of each of the hidden and output units is indicated by the shading, with darker shading
indicating more activation. These differences in activation, and the pattern of activ-
ity they create, are responsible for a basic principle of connectionism: A stimulus pre-
sented to the input units is represented by the pattern of activity that is distributed
across the other units. If this sounds familiar, it is because it is similar to the distributed
representations in the brain we described in Chapters 2 (page 43) and 5 (page 154),
Now that we have used the simple network in Figure 9.17, we will consider how some
specific concepts are represented in the more complex connectionist network shown
in Figure 9.18.
Property
living thing
plant
animal
tree
flower
bird
flower
pine is a
Concept
Representation oak
pine Hidden
rose
oak daisy
rose robin
daisy canary
robin sunfish
canary salmon
sunfish
salmon pretty
tall
living
is
green
Relation red
IS A yellow
is
can grow
has move
swim can
fly
sing
bark
petals
wings
feathers
has
scales
gills
roots
skin
➤ Figure 9.18 A connectionist network. Activation of an item unit (“canary”) and a relation
unit (can) causes activity to travel through the network that eventually results in activation
of the property units grow, move, fly, and sing, associated with “canary can.” Shading
indicates the activity of the units, with darker shading indicating more activity. Note that
only a few of the units and connections that would be activated by “canary” and can are
shown as being activated. In the actual network, many more units and connections would
be activated.
(Source: T. T. Rogers & J. L. McClelland, 2004)
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
Training a Network According to the above description, the answer to “A canary can …”
is represented in the network by activation of the property units plus the pattern of activa-
tion of the network’s representation and hidden units. But according to connectionism, this
network had to be trained in order to achieve this result.
We can appreciate the need for training by considering Figure 9.19, which indicates
how the network might have responded before training had occurred. In the untrained
network, stimulating the canary and can units sends activity out to the rest of the network,
with the effect of this activation depending on the connection weights between the units.
Let’s assume that in our untrained network, all of the connection weights are 1.0.
Because the connection weights are the same, activity spreads throughout the network,
and property nodes such as flower, pine, and bark, which have nothing to do with canar-
ies, are activated. For the network to operate properly, the connection weights have to be
adjusted so that activating the concept unit “canary” and the relation unit can activates only
the property units grow, move, fly, and sing. This adjustment of weights is achieved by a
learning process. The learning process occurs when the erroneous responses in the property
units cause an error signal to be sent back through the network, by a process called back
propagation (since the signals are being sent backward in the network starting from the
property units). The error signals that are sent back to the hidden units and the represen-
tation units provide information about how the connection weights should be adjusted so
that the correct property units will be activated.
To explain the idea behind activation and back propagation, let’s consider a behavioral
example. A young child is watching a robin sitting on a branch, when suddenly the robin flies
away. This simple observation, which strengthens the association between “robin” and can fly,
would involve activation. But if the child were to see a canary and say “robin,” the child’s parent
might correct her and say “That is a canary” and “Robins have red breasts.” The information
provided by the parent is similar to the idea of feedback provided by back propagation.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
Property
living thing
plant
animal
tree
flower
bird
flower
pine is a
Concept
Representation oak
pine Hidden
rose
oak daisy
rose robin
daisy canary
robin sunfish
canary salmon
sunfish
salmon pretty
tall
living
is
green
Relation red
IS A yellow
is
can grow
has move
swim can
fly
sing
bark
petals
wings
feathers
has
scales
gills
roots
skin
➤ Figure 9.19 How the connectionist network in Figure 9.18 might respond before training,
when all of the connection weights are 1.0. See text for details.
Thus, a child’s learning about concepts begins with little information and some incor-
rect ideas, which are slowly modified in response both to observation of the environment
and to feedback from others. Similarly, the connectionist network’s learning about concepts
begins with incorrect connection weights that result in the activation shown in Figure 9.19,
which are slowly modified in response to error signals to create the correctly functioning
network in Figure 9.18.
Although this “educated” network might work well for canaries, what happens when a
robin flies by and alights on the branch of a pine tree? To be useful, this network needs to be
able to represent not just canaries but also robins and pine trees. Thus, to create a network
that can represent many different concepts, the network is not trained just on “canary.”
Instead, presentations of “canary” are interleaved with presentations of “robin,” “pine tree,”
and so on, with small changes in connection weights made after each presentation.
We can appreciate how this learning process occurs over many trials by looking at the
results of a computer simulation (McClelland & Rogers, 2003). The network in Figure 9.18
was presented with a number of different concepts and relation statements, one after another,
and the activity of the units and connection weights between units were calculated by the
computer. Figure 9.20 indicates the activation of the eight representation units in response to
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
Learning trials
0 250 750 2,500
Rose
Activity in hidden units
Daisy
Canary
Hidden units
➤ Figure 9.20 Learning in a connectionist network. Bars represent activity in the eight
representation units. Notice how the pattern of activation changes as learning
progresses.
(Source: Adapted from J. L. McClelland & T. T. Rogers, 2003)
the concepts “canary,” “rose,” and “daisy.” At the beginning of the process, the experimenter set
the connection weights so that activity was about the same in each unit (Learning trials = 0).
This corresponds to the initially weak and undifferentiated activation we discussed earlier.
As learning progressed, with each concept being presented one after another and the
computer changing the weights just slightly after each trial in response to error signals, the
patterns became adjusted, so by Trial 250, the patterns for “canary” and “daisy” begin to look
different. By Trial 2,500, it is easy to tell the difference between the patterns for “canary” and
“daisy,” while the two flowers, “daisy” and “rose,” have similar but slightly different patterns.
Although our description has been based on one particular connectionist network,
most networks have similar properties. Connectionist networks are created by a learning
process that shapes the networks so information about each concept is contained in the
distributed pattern of activity across a number of units.
Notice how different this operation of the connectionist network is from the operation
of Collins and Quillian’s hierarchical network, in which concepts and their properties are
represented by activation of different nodes. Representation in a connectionist network is
far more complex, involving many more units for each concept, but it is also much more like
what happens in the brain.
Because of the resemblance between connectionist networks and the brain, and the
fact that connectionist networks have been developed that can simulate normal cognitive
functioning for processes such as language processing, memory, and cognitive development
(Rogers & McClelland, 2004; Seidenberg & Zevin, 2006), many researchers believe that
the idea that knowledge is represented by distributed activity holds great promise. The fol-
lowing results also support the idea of connectionism:
1. The operation of connectionist networks is not totally disrupted by damage. Because
information in the network is distributed across many units, damage to the system
does not completely disrupt its operation. This property, in which disruption of
performance occurs only gradually as parts of the system are damaged, is called
graceful degradation. It is similar to what often happens in actual cases of brain
damage, in which damage to the brain causes only a partial loss of functioning.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
Some researchers have suggested that studying the way networks respond to dam-
age may suggest strategies for rehabilitation of human patients (Farah et al., 1993;
Hinton & Shallice, 1991; Olson & Humphreys, 1997; Plaut, 1996).
2. Connectionist networks can explain generalization of learning. Because similar
concepts have similar patterns, training a system to recognize the properties of
one concept (such as “canary”) also provides information about other, related
concepts (such as “robin” or “sparrow”). This is similar to the way we actually
learn about concepts because learning about canaries enables us to predict proper-
ties of different types of birds we’ve never seen (see McClelland et al., 1995).
While active research on connectionism continues in many laboratories, some research-
ers point out that there are limits to what connectionist networks can explain. Whatever
the final verdict on connectionism, this approach has stimulated a great deal of research,
some of which has added to our understanding of both normal cognition and how brain
damage affects cognition. In the next section, we will focus even more directly on the brain
by considering neuropsychological and brain imaging research on how concepts are repre-
sented in the brain.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
80 patients were able to identify nonanimals, like furniture and tools, as well as
fruits and vegetables, but had impaired ability to identify animals (Figure 9.21).
60 (As we discuss various cases next, we will use the term artifacts to refer to non-
living things, which would include furniture and tools.)
40
To explain why this selective impairment occurred, Warrington and Shal-
lice considered properties that people use to distinguish between artifacts and
20
living things. They noted that distinguishing living things depends on perceiv-
ing their sensory features. For example, distinguishing between a tiger and a
0
Patient K.C. Patient E.W. leopard depends on perceiving stripes and spots. Artifacts, in contrast, are more
likely to be distinguished by their function. For example, a screwdriver, chisel,
and hammer are all tools but are used for different purposes (turning screws,
➤ Figure 9.21 Performance on a naming task
scraping, and pounding nails) (Table 9.2).
for patients K.C. and E.W., both of whom
had category-specific memory impairment. The observation that living things are distinguished by sensory properties
They were able to correctly name pictures of and artifacts by functions led to the sensory-functional (S-F) hypothesis,
nonliving things (such as car and table) and which states that our ability to differentiate living things and artifacts depends
fruits and vegetables (such as tomato and pear) on a memory system that distinguishes sensory attributes and a system that dis-
but performed poorly when asked to name tinguishes functions.
pictures of animals. While the S-F hypothesis explained the behavior of Warrington and Shallice’s
(Source: B. Z. Mahon & A. Caramazza, 2009) patients, plus dozens of other patients, researchers began describing cases that
couldn’t be explained by this hypothesis. For example, Matthew Lambon Ralph and cowork-
ers (1998) studied a patient who had a sensory deficit—she performed poorly on perceptual
tests—yet she was better at identifying animals than artifacts, which is the opposite of what
the S-F hypothesis predicts. In addition, there are patients who are able to identify mechanical
devices even if they perform poorly for other types of artifacts. For example, Hoffman and
Lambon Ralph describe patients who have poor comprehension of small artifacts like tools
but better knowledge of larger artifacts, such as vehicles (Cappa et al., 1998; Hillis et al., 1990;
Warrington & McCarthy, 1987). Thus, “artifacts” are not a single homogeneous category as
hypothesized by the S-F hypothesis. Results such as these have led many researchers to con-
clude that many effects of brain damage can’t be explained by the simple distinction between
sensory and function (Hoffman & Lambon, 2013).
TABLE 9.2
The Sensory-Functional Hypothesis
Relevant Information Relevant to … Deficit in ability to process relevant
information causes…
Sensory Living things Problems identifying living things
(Ex: a tiger has stripes)
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
TABLE 9.3
Sample Stimuli and Question Used in the Hoffman and Lambon Ralph (2013)
Experiment
A. A Few of the 160 Items Presented to Subjects
Mammal Machine Clothing
Color Taste
Sound Performed action (in which you interact with the object)
following question: Assume that we start with a large number of items selected from lists of
different types of animals, plants, and artifacts. If you wanted to arrange them in terms of
how similar they are to each other, how would you do it? You could arrange them by shape,
but then items like a pencil, a screwdriver, a person’s finger, and a breakfast sausage might be
grouped together. Or considering just color, you could end up placing fir trees, leprechauns,
and Kermit the Frog together. Although it is true that members of specific categories do
share similar perceptual attributes, it is also clear that we need to use more than just one or
two features when grouping objects in terms of similarity.
Taking this idea as their starting point, researchers picked a number of different
features and had participants rate a large number of items with regard to these fea-
tures. This was the idea behind an experiment by Paul Hoffman and
Matthew Lambon Ralph (2013), who used 160 items like the ones shown
in Table 9.3a. The participants’ task was to rate each item on the features Animals
shown in Table 9.3b. For example, for the concept “door,” the participant Artifacts
would be asked “How much do you associate door with a particular color 1
The results, shown in Figure 9.22, indicate that animals were more highly
0
associated with motion and color compared to artifacts, and artifacts were
more highly associated with performed actions (actions associated with using –0.5
or interacting with an object). This result conforms to the S-F hypothesis, but
when Hoffman and Lambon Ralph looked at the groupings more closely, they –1
found some interesting results. Mechanical devices such as machines, vehicles, Color Motion Performed
action
and musical instruments overlapped with both artifacts (involving performed
actions) and animals (involving sound and motion). For example, musical
instruments are associated with specific actions (how you play them), which ➤ Figure 9.22 How participants rated animals
and artifacts on color, motion, and performed
goes with artifacts, and are also associated with sensory properties (their visual
actions. Animals are rated higher on color
form and the sounds they create), which goes with animals. Thus, musical and motion; artifacts are rated higher on
instruments and some mechanical devices occupy a middle ground between performed actions.
artifacts and living things, because they can involve both action knowledge (Source: Based on data in P. Hoffman & M. A. Lambon
and sensory attributes. Ralph, 2013)
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
Bruce Goldstein
Albert Pego/Shutterstock.com
➤ Figure 9.23 Some animals and vehicles. Notice that the animals are more similar to each
other than are the vehicles. This higher similarity of animals is called crowding.
Another factor that researchers have proposed to differentiate between animals and
artifacts is crowding, which refers to the fact that animals tend to share many properties
(like eyes, legs, and the ability to move). In contrast, artifacts like cars and boats share fewer
properties, other than that they are both vehicles (Figure 9.23) (Rogers & Cox, 2015;
Tyler & Moss 2001). This has led some researches to propose that patients who appear
to have category-specific impairments, such as difficulty recognizing living things but not
artifacts, don’t really have a category-specific impairment at all. They propose that these
patients have difficulty recognizing living things because they have difficulty distinguishing
between items that share similar features. According to this idea, because animals tend to be
more similar then artifacts, these patients find animals harder to recognize (Cree & McRae,
2003; Lambon Ralph et al., 2007).
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
➤ Figure 9.24 Results of the Huth et al. (2016) experiment in which participants listened to
stories in a scanner. (a) Words that activated different places on the cortex. (b) Close-up
of a smaller area of cortex. Note that a particular area usually responded to a number of
different words, as indicated in Figure 9.25.
cortex color-coded to indicate where different categories of words activate the cortex.
For example, the light area in the back of the brain is activated by words associated with
violence. The words that activated a single voxel in that area are indicated on the right.
Another voxel, which is activated by words associated with visual qualities, is shown in the
green area near the top of the brain. An interesting aspect of Huth’s results is that the maps
were very similar for each of the seven participants.
While the semantic category approach focuses on areas of the brain that are specialized
to respond to specific types of stimuli, it also emphasizes that the brain’s response to items
ribbons sleeve
violence
colour satin
coloured tunic
social
body part
glossy colored strips person
visual
brown cap red mental
inch gold
outdoor
leather yellow
thinner
believed
pregnant
judge guilty
death claims
innocent child
killed victim
murderer
stated
false birth
claim
believes
claiming
accusation
Courtesy of Alexander Huth
➤ Figure 9.25 More results from the Huth et al. (2016) experiment. Colors on the cortex
indicate where different categories of words caused activation, as indicated on the legend in
the upper right. The pink words activated a voxel that responded to words related to violence.
The green words activated a voxel that responded to visual qualities.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
from a particular category is distributed over a number of different cortical areas (Mahon
et al., 2007; Mahon & Caramazza, 2011). Thus, identifying faces may be based on activity
in the face area in the temporal lobe (see Chapter 2, page 42), but it also depends on activity
in areas that respond to emotions, facial expressions, where the face is looking, and the face’s
attractiveness (see page 43).
Similarly, the response to a hammer activates visual areas that respond to the hammer’s
shape and color, but it also causes activity in areas that respond to how a hammer is used and
to a hammer’s typical motions. This idea that some objects, like hammers, cause activity in
areas of the brain associated with actions, brings us to the embodied approach to categorization.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
question whether the embodied approach offers a complete explanation of how the brain
processes concepts (Almeida et al., 2013; Chatterjee, 2010; Dravida et al., 2013; Goldinger
et al., 2016). For example, Frank Garcea and coworkers (2013) tested patient AA, who had
suffered a stroke that affected his ability to produce actions associated with various objects.
Thus, when AA was asked to use hand motions to indicate how he would use objects such
as a hammer, scissors, and a feather duster, he was impaired compared to normal control
participants in producing these actions. According to the embodied approach, a person
who has trouble producing actions associated with objects should have trouble recognizing
the objects. AA was, however, able to identify pictures of the objects. Garcea and cowork-
ers concluded from this result that the ability to represent motor activity associated with
actions is not necessary for recognizing objects, as the embodied approach would predict.
Another criticism of the embodied approach is that it isn’t well suited to explaining our
knowledge of abstract concepts such as “democracy” or “truth.” However, proponents of
the embodied approach have offered explanations in response to these criticisms (which we
won’t go into here; see Barsalou, 2005; Chatterjee, 2010).
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
20
12
8
IPL TMS
4
–4
Manmade Living Low High
Category Item manipulability (Praxis)
20
12
ATL TMS 8
–4
Manmade Living Low High
Category Item manipulability (Praxis)
(a) Neuroanatomical sketch of the (b) Effect of TMS stimulation on (c) Effect of TMS stimulation on
hub-and-spoke framework manmade and living things. manipulability Top: Parietal
Top: Parietal TMS; Bottom: ATL TMS. TMS; Bottom: ATL TMS.
➤ Figure 9.27 (a) The hub and spoke model proposes that areas of the brain specialized for different functions are linked to
the anterior temporal lobe (purple), which integrates the information from areas that serve the following functions: Valence
(yellow); speech (pink); auditory (red); praxis (dark blue); functionality (light blue); visual (green). The dark blue area is in the
parietal cortex. (b) Effect of TMS stimulation on man-made objects versus living things. Top: Parietal stimulation causes more
slowing of RT to man-made objects than to living things. Bottom: ATL stimulation causes the same effect on both. (c) Effect of
TMS stimulation on low versus high manipulability objects. Top: Parietal stimulation causes more slowing of RT to high
manipulability objects compared to low. Bottom: ATL stimulation causes the same effect on both.
(Source: Adapted from Lambon Ralph et al., 2017. Supplementary Figure 5. Based on data from Pobric et al., 2013.)
Gorana Pobric and coworkers (2010) presented pictures of living things and artifacts to
participants and measured the response time for naming each picture. They then repeated
this procedure while TMS was being applied either to the ATL or to an area in the pari-
etal lobe that is normally activated when a person is manipulating an object. Figure 9.27b
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
indicates that parietal inactivation (top) slowed reaction times for manmade objects but
not for living objects, whereas ATL inactivation (bottom) caused the same effect for
both manmade and living objects. Figure 9.27c indicates that parietal activation caused a
large slowing in reaction time for highly manipulatable objects (like some tools) but not
for non-manipulatable objects (like furniture), whereas ATL inactivation affected both
types of objects equally. This result—a general effect of stimulating the “hub” (ATL),
but a more specific effect of stimulating an area that would be associated with one of the
“spokes” (parietal cortex)—supports the idea of a hub with general functions and spokes
with more specific functions ( Jefferies, 2013; Lambon-Ralph et al., 2017).
Most researchers agree that the ATL plays a role in integrating information from dif-
ferent areas. But it has also been suggested that other structures may be “hubs,” or that
the most important way concepts are represented is not by “hubs” but by the pattern of
connections formed between the “spokes” (Pulvermüller, 2013). Thus, as we noted at the
end of the last section, research on how concepts are represented in the brain is still a
“work in progress.”
CHAPTER SUMMARY
1. Semantic memory is our memory for facts and conform to the definition. The philosopher Wittgenstein
knowledge. proposed the idea of family resemblances to deal with
2. Categories are “pointers to knowledge.” Once you know the fact that definitions do not include all members of
something is in a category, you know a lot of general things a category.
about it and can focus your energy on specifying what is 4. The idea behind the prototypical approach to categorization
special about this particular object. is that we decide whether an object belongs to a category by
3. The definitional approach to categorization doesn’t work deciding whether it is similar to a standard representative of
because most categories contain members that do not the category, called a prototype. A prototype is formed by
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
averaging category members a person has encountered in are organized in the mind. Collins and Quillian’s model
the past. is a network that consists of nodes connected by links.
5. Prototypicality is a term used to describe how well an object Concepts and properties of concepts are located at
resembles the prototype of a particular category. the nodes. Properties that hold for most members of a
concept are stored at higher level nodes. This is called
6. The following is true of high-prototypical objects: (a) they
cognitive economy.
have high family resemblance; (b) statements about them
are verified rapidly; (c) they are named first; and (d) they are 13. Collins and Quillian’s model is supported by the results of
affected more by priming. experiments using the sentence verification technique. The
spreading activation feature of the model is supported by
7. The exemplar approach to categorization involves
priming experiments.
determining whether an object is similar to an exemplar.
An exemplar is an actual member of a category that a person 14. The Collins and Quillian model has been criticized for
has encountered in the past. several reasons: It can’t explain the typicality effect, the
idea of cognitive economy doesn’t always hold, and it can’t
8. An advantage to the exemplar approach is that it doesn’t
explain all results of sentence verification experiments.
discard information about atypical cases within a category,
such as penguin in the “bird” category. The exemplar 15. The connectionist approach proposes that concepts are
approach can also deal more easily with categories that represented in networks that consist of input units, hidden
contain widely varying members, such as games. units, and output units, and that information about concepts
is represented in these networks by a distributed activation
9. Researchers have concluded that people use both approaches
of these units. This approach is also called the parallel
to categorization. Prototypes may be more important
distributed processing (PDP) approach.
as people initially learn about categories; later, exemplar
information may become more important. 16. Connectionist networks learn the correct distributed pattern
for a particular concept through a gradual learning process
10. The kind of organization in which larger, more general
that involves adjusting the weights that determine how
categories are divided into smaller, more specific categories
activation is transferred from one unit to another.
is called hierarchical organization. Experiments by Rosch
indicate that a basic level of categories (such as guitar, as 17. Connectionist networks have a number of features that
opposed to musical instrument or rock guitar) is a “basic” enable them to reproduce many aspects of human concept
level of categorization that reflects people’s everyday formation.
experience. 18. Four approaches to explaining how concepts are represented
11. Experiments in which experts were tested show that the in the brain are the sensory-functional hypothesis, the
basic level of categorization can depend on a person’s degree semantic category approach, the multiple-factor approach,
of expertise. and the embodied approach.
12. The semantic network approach proposes that concepts 19. The hub and spoke model proposes that different functions in
are arranged in networks that represent the way concepts the brain are integrated by the anterior temporal lobe (ATL).
THINK ABOUT IT
1. In this chapter, we have seen how networks can be people to name, as quickly as possible, three typical “birds”
constructed that link different levels of concepts. In or “vehicles” or “beverages.” What do the results of this
Chapter 7 we saw how networks can be constructed survey tell you about what level is “basic” for different
that organize knowledge about a particular topic (see people? What do the results tell you about the variability of
Figure 7.5). Create a network that represents the different people’s conception of categories?
material in this chapter by linking together things that 3. Try asking a number of people to name the objects pictured
are related. How is this network similar to or different in Figure 9.10. Rosch, who ran her experiment in the early
from the semantic network in Figure 9.12? Is your 1970s, found that the most common responses were guitar,
network hierarchical? What information does it contain fish, and pants. Notice whether the responses you receive
about each concept? are the same as or different from the responses reported
2. Do a survey to determine people’s conception of “typical” by Rosch. If they are different, explain why you think this
members of various categories. For example, ask several might have occurred.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
KEY TERMS
Anterior temporal lobe (ATL), 291 Exemplar, 272 Prototype approach to
Back propagation, 282 Exemplar approach to categorization, 268
Basic level, 273 categorization, 272 Semantic category approach, 288
Categorization, 265 Family resemblance, 268 Semantic dementia, 291
Category, 265 Global level, 273 Semantic network approach, 276
Category-specific memory Graceful degradation, 284 Semantic somatotopy, 290
impairment, 286 Hidden units, 280 Sensory-functional (S-F)
Cognitive economy, 277 Hierarchical model, 276 hypothesis, 286
Concept, 264 Hierarchical organization, 273 Sentence verification technique, 269
Conceptual knowledge, 264 Hub and spoke model, 291 Specific level, 273
Connection weight, 280 Input units, 280 Spreading activation, 278
Connectionism, 280 Lexical decision task, 279 Subordinate (specific) level, 273
Connectionist network, 280 Multiple-factor approach, 286 Superordinate (global) level, 273
Crowding, 288 Output units, 280 Transcranial magnetic stimulation
(TMS), 291
Definitional approach to Parallel distributed processing
categorization, 266 (PDP), 280 Typicality, 268
Embodied approach, 290 Priming, 270 Typicality effect, 270
Error signal, 282 Prototype, 268 Unit (in a connectionist network), 280
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203
“Visual imagery” occurs when a person sees in his or her mind something that isn’t physically
present. This picture symbolizes the finding that although visual perception and visual imagery
share many properties, experiences associated with visual imagery can be less detailed and more
fragile than experiences associated with visual perception.
Copyright 2019 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. WCN 02-200-203