0% found this document useful (0 votes)

5 views

19

Uploaded by

nidhaldoctorat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

19

Uploaded by

nidhaldoctorat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

Speech and Language Processing. Daniel Jurafsky & James H. Martin. Copyright © 2021.

All
rights reserved. Draft of September 21, 2021.

CHAPTER

Semantic Role Labeling

19
“Who, What, Where, When, With what, Why, How”
The seven circumstances, associated with Hermagoras and Aristotle (Sloan, 2010)

Sometime between the 7th and 4th centuries BCE, the Indian grammarian Pān.ini1
wrote a famous treatise on Sanskrit grammar, the As.t.ādhyāyı̄ (‘8 books’), a treatise
that has been called “one of the greatest monuments of hu-
man intelligence” (Bloomfield, 1933, 11). The work de-
scribes the linguistics of the Sanskrit language in the form
of 3959 sutras, each very efficiently (since it had to be
memorized!) expressing part of a formal rule system that
brilliantly prefigured modern mechanisms of formal lan-
guage theory (Penn and Kiparsky, 2012). One set of rules
describes the kārakas, semantic relationships between a
verb and noun arguments, roles like agent, instrument, or
destination. Pān.ini’s work was the earliest we know of
that modeled the linguistic realization of events and their
participants. This task of understanding how participants relate to events—being
able to answer the question “Who did what to whom” (and perhaps also “when and
where”)—is a central question of natural language processing.
Let’s move forward 2.5 millennia to the present and consider the very mundane
goal of understanding text about a purchase of stock by XYZ Corporation. This
purchasing event and its participants can be described by a wide variety of surface
forms. The event can be described by a verb (sold, bought) or a noun (purchase),
and XYZ Corp can be the syntactic subject (of bought), the indirect object (of sold),
or in a genitive or noun compound relation (with the noun purchase) despite having
notionally the same role in all of them:
• XYZ corporation bought the stock.
• They sold the stock to XYZ corporation.
• The stock was bought by XYZ corporation.
• The purchase of the stock by XYZ corporation...
• The stock purchase by XYZ corporation...
In this chapter we introduce a level of representation that captures the common-
ality between these sentences: there was a purchase event, the participants were
XYZ Corp and some stock, and XYZ Corp was the buyer. These shallow semantic
representations , semantic roles, express the role that arguments of a predicate take
in the event, codified in databases like PropBank and FrameNet. We’ll introduce
semantic role labeling, the task of assigning roles to spans in sentences, and selec-
tional restrictions, the preferences that predicates express about their arguments,
such as the fact that the theme of eat is generally something edible.
1 Figure shows a birch bark manuscript from Kashmir of the Rupavatra, a grammatical textbook based
on the Sanskrit grammar of Panini. Image from the Wellcome Collection.
2 C HAPTER 19 • S EMANTIC ROLE L ABELING

19.1 Semantic Roles

Consider how in Chapter 15 we represented the meaning of arguments for sentences
like these:
(19.1) Sasha broke the window.
(19.2) Pat opened the door.
A neo-Davidsonian event representation of these two sentences would be

∃e, x, y Breaking(e) ∧ Breaker(e, Sasha)

∧BrokenT hing(e, y) ∧Window(y)
∃e, x, y Opening(e) ∧ Opener(e, Pat)
∧OpenedT hing(e, y) ∧ Door(y)

In this representation, the roles of the subjects of the verbs break and open are
deep roles Breaker and Opener respectively. These deep roles are specific to each event; Break-
ing events have Breakers, Opening events have Openers, and so on.
If we are going to be able to answer questions, perform inferences, or do any
further kinds of semantic processing of these events, we’ll need to know a little more
about the semantics of these arguments. Breakers and Openers have something in
common. They are both volitional actors, often animate, and they have direct causal
responsibility for their events.
thematic roles Thematic roles are a way to capture this semantic commonality between Break-
agents ers and Openers. We say that the subjects of both these verbs are agents. Thus,
AGENT is the thematic role that represents an abstract idea such as volitional causa-
tion. Similarly, the direct objects of both these verbs, the BrokenThing and OpenedThing,
are both prototypically inanimate objects that are affected in some way by the action.
theme The semantic role for these participants is theme.

Thematic Role Definition

AGENT The volitional causer of an event
EXPERIENCER The experiencer of an event
FORCE The non-volitional causer of the event
THEME The participant most directly affected by an event
RESULT The end product of an event
CONTENT The proposition or content of a propositional event
INSTRUMENT An instrument used in an event
BENEFICIARY The beneficiary of an event
SOURCE The origin of the object of a transfer event
GOAL The destination of an object of a transfer event
Figure 19.1 Some commonly used thematic roles with their definitions.

Although thematic roles are one of the oldest linguistic models, as we saw above,
their modern formulation is due to Fillmore (1968) and Gruber (1965). Although
there is no universally agreed-upon set of roles, Figs. 19.1 and 19.2 list some the-
matic roles that have been used in various computational papers, together with rough
definitions and examples. Most thematic role sets have about a dozen roles, but we’ll
see sets with smaller numbers of roles with even more abstract meanings, and sets
with very large numbers of roles that are specific to situations. We’ll use the general
semantic roles term semantic roles for all sets of roles, whether small or large.
19.2 • D IATHESIS A LTERNATIONS 3

Thematic Role Example

AGENT The waiter spilled the soup.
EXPERIENCER John has a headache.
FORCE The wind blows debris from the mall into our yards.
THEME Only after Benjamin Franklin broke the ice...
RESULT The city built a regulation-size baseball diamond...
CONTENT Mona asked “You met Mary Ann at a supermarket?”
INSTRUMENT He poached catfish, stunning them with a shocking device...
BENEFICIARY Whenever Ann Callahan makes hotel reservations for her boss...
SOURCE I flew in from Boston.
GOAL I drove to Portland.
Figure 19.2 Some prototypical examples of various thematic roles.

19.2 Diathesis Alternations

The main reason computational systems use semantic roles is to act as a shallow
meaning representation that can let us make simple inferences that aren’t possible
from the pure surface string of words, or even from the parse tree. To extend the
earlier examples, if a document says that Company A acquired Company B, we’d
like to know that this answers the query Was Company B acquired? despite the fact
that the two sentences have very different surface syntax. Similarly, this shallow
semantics might act as a useful intermediate language in machine translation.
Semantic roles thus help generalize over different surface realizations of pred-
icate arguments. For example, while the AGENT is often realized as the subject of
the sentence, in other cases the THEME can be the subject. Consider these possible
realizations of the thematic arguments of the verb break:
(19.3) John broke the window.
AGENT THEME
(19.4) John broke the window with a rock.
AGENT THEME INSTRUMENT
(19.5) The rock broke the window.
INSTRUMENT THEME
(19.6) The window broke.
THEME
(19.7) The window was broken by John.
THEME AGENT
These examples suggest that break has (at least) the possible arguments AGENT,
THEME , and INSTRUMENT. The set of thematic role arguments taken by a verb is
thematic grid often called the thematic grid, θ -grid, or case frame. We can see that there are
case frame (among others) the following possibilities for the realization of these arguments of
break:
AGENT/Subject, THEME /Object
AGENT/Subject, THEME /Object, INSTRUMENT/PPwith
INSTRUMENT/Subject, THEME /Object
THEME /Subject

It turns out that many verbs allow their thematic roles to be realized in various
syntactic positions. For example, verbs like give can realize the THEME and GOAL
arguments in two different ways:
4 C HAPTER 19 • S EMANTIC ROLE L ABELING

(19.8) a. Doris gave the book to Cary.

AGENT THEME GOAL

b. Doris gave Cary the book.

AGENT GOAL THEME

These multiple argument structure realizations (the fact that break can take AGENT,
INSTRUMENT , or THEME as subject, and give can realize its THEME and GOAL in
verb either order) are called verb alternations or diathesis alternations. The alternation
alternation
dative we showed above for give, the dative alternation, seems to occur with particular se-
alternation
mantic classes of verbs, including “verbs of future having” (advance, allocate, offer,
owe), “send verbs” (forward, hand, mail), “verbs of throwing” (kick, pass, throw),
and so on. Levin (1993) lists for 3100 English verbs the semantic classes to which
they belong (47 high-level classes, divided into 193 more specific classes) and the
various alternations in which they participate. These lists of verb classes have been
incorporated into the online resource VerbNet (Kipper et al., 2000), which links each
verb to both WordNet and FrameNet entries.

19.3 Semantic Roles: Problems with Thematic Roles

Representing meaning at the thematic role level seems like it should be useful in
dealing with complications like diathesis alternations. Yet it has proved quite diffi-
cult to come up with a standard set of roles, and equally difficult to produce a formal
definition of roles like AGENT, THEME, or INSTRUMENT.
For example, researchers attempting to define role sets often find they need to
fragment a role like AGENT or THEME into many specific roles. Levin and Rappa-
port Hovav (2005) summarize a number of such cases, such as the fact there seem
to be at least two kinds of INSTRUMENTS, intermediary instruments that can appear
as subjects and enabling instruments that cannot:
(19.9) a. The cook opened the jar with the new gadget.
b. The new gadget opened the jar.
(19.10) a. Shelly ate the sliced banana with a fork.
b. *The fork ate the sliced banana.
In addition to the fragmentation problem, there are cases in which we’d like to
reason about and generalize across semantic roles, but the finite discrete lists of roles
don’t let us do this.
Finally, it has proved difficult to formally define the thematic roles. Consider the
AGENT role; most cases of AGENTS are animate, volitional, sentient, causal, but any
individual noun phrase might not exhibit all of these properties.
semantic role These problems have led to alternative semantic role models that use either
many fewer or many more roles.
The first of these options is to define generalized semantic roles that abstract
proto-agent over the specific thematic roles. For example, PROTO - AGENT and PROTO - PATIENT
proto-patient are generalized roles that express roughly agent-like and roughly patient-like mean-
ings. These roles are defined, not by necessary and sufficient conditions, but rather
by a set of heuristic features that accompany more agent-like or more patient-like
meanings. Thus, the more an argument displays agent-like properties (being voli-
tionally involved in the event, causing an event or a change of state in another par-
ticipant, being sentient or intentionally involved, moving) the greater the likelihood
19.4 • T HE P ROPOSITION BANK 5

that the argument can be labeled a PROTO - AGENT. The more patient-like the proper-
ties (undergoing change of state, causally affected by another participant, stationary
relative to other participants, etc.), the greater the likelihood that the argument can
be labeled a PROTO - PATIENT.
The second direction is instead to define semantic roles that are specific to a
particular verb or a particular group of semantically related verbs or nouns.
In the next two sections we describe two commonly used lexical resources that
make use of these alternative versions of semantic roles. PropBank uses both proto-
roles and verb-specific semantic roles. FrameNet uses semantic roles that are spe-
cific to a general semantic idea called a frame.

19.4 The Proposition Bank

PropBank The Proposition Bank, generally referred to as PropBank, is a resource of sen-
tences annotated with semantic roles. The English PropBank labels all the sentences
in the Penn TreeBank; the Chinese PropBank labels sentences in the Penn Chinese
TreeBank. Because of the difficulty of defining a universal set of thematic roles,
the semantic roles in PropBank are defined with respect to an individual verb sense.
Each sense of each verb thus has a specific set of roles, which are given only numbers
rather than names: Arg0, Arg1, Arg2, and so on. In general, Arg0 represents the
PROTO - AGENT, and Arg1, the PROTO - PATIENT . The semantics of the other roles
are less consistent, often being defined specifically for each verb. Nonetheless there
are some generalization; the Arg2 is often the benefactive, instrument, attribute, or
end state, the Arg3 the start point, benefactive, instrument, or attribute, and the Arg4
the end point.
Here are some slightly simplified PropBank entries for one sense each of the
verbs agree and fall. Such PropBank entries are called frame files; note that the
definitions in the frame file for each role (“Other entity agreeing”, “Extent, amount
fallen”) are informal glosses intended to be read by humans, rather than being formal
definitions.
(19.11) agree.01
Arg0: Agreer
Arg1: Proposition
Arg2: Other entity agreeing

Ex1: [Arg0 The group] agreed [Arg1 it wouldn’t make an offer].

Ex2: [ArgM-TMP Usually] [Arg0 John] agrees [Arg2 with Mary]
[Arg1 on everything].
(19.12) fall.01
Arg1: Logical subject, patient, thing falling
Arg2: Extent, amount fallen
Arg3: start point
Arg4: end point, end state of arg1
Ex1: [Arg1 Sales] fell [Arg4 to $25 million] [Arg3 from $27 million].
Ex2: [Arg1 The average junk bond] fell [Arg2 by 4.2%].
Note that there is no Arg0 role for fall, because the normal subject of fall is a
PROTO - PATIENT.
6 C HAPTER 19 • S EMANTIC ROLE L ABELING

The PropBank semantic roles can be useful in recovering shallow semantic in-
formation about verbal arguments. Consider the verb increase:
(19.13) increase.01 “go up incrementally”
Arg0: causer of increase
Arg1: thing increasing
Arg2: amount increased by, EXT, or MNR
Arg3: start point
Arg4: end point
A PropBank semantic role labeling would allow us to infer the commonality in
the event structures of the following three examples, that is, that in each case Big
Fruit Co. is the AGENT and the price of bananas is the THEME, despite the differing
surface forms.
(19.14) [Arg0 Big Fruit Co. ] increased [Arg1 the price of bananas].
(19.15) [Arg1 The price of bananas] was increased again [Arg0 by Big Fruit Co. ]
(19.16) [Arg1 The price of bananas] increased [Arg2 5%].
PropBank also has a number of non-numbered arguments called ArgMs, (ArgM-
TMP, ArgM-LOC, etc.) which represent modification or adjunct meanings. These
are relatively stable across predicates, so aren’t listed with each frame file. Data
labeled with these modifiers can be helpful in training systems to detect temporal,
location, or directional modification across predicates. Some of the ArgM’s include:
TMP when? yesterday evening, now
LOC where? at the museum, in San Francisco
DIR where to/from? down, to Bangkok
MNR how? clearly, with much enthusiasm
PRP/CAU why? because ... , in response to the ruling
REC themselves, each other
ADV miscellaneous
PRD secondary predication ...ate the meat raw
NomBank While PropBank focuses on verbs, a related project, NomBank (Meyers et al.,
2004) adds annotations to noun predicates. For example the noun agreement in
Apple’s agreement with IBM would be labeled with Apple as the Arg0 and IBM as
the Arg2. This allows semantic role labelers to assign labels to arguments of both
verbal and nominal predicates.

19.5 FrameNet
While making inferences about the semantic commonalities across different sen-
tences with increase is useful, it would be even more useful if we could make such
inferences in many more situations, across different verbs, and also between verbs
and nouns. For example, we’d like to extract the similarity among these three sen-
tences:
(19.17) [Arg1 The price of bananas] increased [Arg2 5%].
(19.18) [Arg1 The price of bananas] rose [Arg2 5%].
(19.19) There has been a [Arg2 5%] rise [Arg1 in the price of bananas].
Note that the second example uses the different verb rise, and the third example
uses the noun rather than the verb rise. We’d like a system to recognize that the
19.5 • F RAME N ET 7

price of bananas is what went up, and that 5% is the amount it went up, no matter
whether the 5% appears as the object of the verb increased or as a nominal modifier
of the noun rise.
FrameNet The FrameNet project is another semantic-role-labeling project that attempts
to address just these kinds of problems (Baker et al. 1998, Fillmore et al. 2003,
Fillmore and Baker 2009, Ruppenhofer et al. 2016). Whereas roles in the PropBank
project are specific to an individual verb, roles in the FrameNet project are specific
to a frame.
What is a frame? Consider the following set of words:
reservation, flight, travel, buy, price, cost, fare, rates, meal, plane
There are many individual lexical relations of hyponymy, synonymy, and so on
between many of the words in this list. The resulting set of relations does not,
however, add up to a complete account of how these words are related. They are
clearly all defined with respect to a coherent chunk of common-sense background
information concerning air travel.
frame We call the holistic background knowledge that unites these words a frame (Fill-
more, 1985). The idea that groups of words are defined with respect to some back-
ground information is widespread in artificial intelligence and cognitive science,
model where besides frame we see related works like a model (Johnson-Laird, 1983), or
script even script (Schank and Abelson, 1977).
A frame in FrameNet is a background knowledge structure that defines a set of
frame elements frame-specific semantic roles, called frame elements, and includes a set of predi-
cates that use these roles. Each word evokes a frame and profiles some aspect of the
frame and its elements. The FrameNet dataset includes a set of frames and frame
elements, the lexical units associated with each frame, and a set of labeled exam-
ple sentences. For example, the change position on a scale frame is defined as
follows:
This frame consists of words that indicate the change of an Item’s posi-
tion on a scale (the Attribute) from a starting point (Initial value) to an
end point (Final value).
Some of the semantic roles (frame elements) in the frame are defined as in
core roles Fig. 19.3. Note that these are separated into core roles, which are frame specific, and
non-core roles non-core roles, which are more like the Arg-M arguments in PropBank, expressing
more general properties of time, location, and so on.
Here are some example sentences:
(19.20) [I TEM Oil] rose [ATTRIBUTE in price] [D IFFERENCE by 2%].
(19.21) [I TEM It] has increased [F INAL STATE to having them 1 day a month].
(19.22) [I TEM Microsoft shares] fell [F INAL VALUE to 7 5/8].
(19.23) [I TEM Colon cancer incidence] fell [D IFFERENCE by 50%] [G ROUP among
men].
(19.24) a steady increase [I NITIAL VALUE from 9.5] [F INAL VALUE to 14.3] [I TEM
in dividends]
(19.25) a [D IFFERENCE 5%] [I TEM dividend] increase...
Note from these example sentences that the frame includes target words like rise,
fall, and increase. In fact, the complete frame consists of the following words:
8 C HAPTER 19 • S EMANTIC ROLE L ABELING

Core Roles
ATTRIBUTE The ATTRIBUTE is a scalar property that the I TEM possesses.
D IFFERENCE The distance by which an I TEM changes its position on the scale.
F INAL STATE A description that presents the I TEM’s state after the change in the ATTRIBUTE’s
value as an independent predication.
F INAL VALUE The position on the scale where the I TEM ends up.
I NITIAL STATE A description that presents the I TEM’s state before the change in the AT-
TRIBUTE ’s value as an independent predication.
I NITIAL VALUE The initial position on the scale from which the I TEM moves away.
I TEM The entity that has a position on the scale.
VALUE RANGE A portion of the scale, typically identified by its end points, along which the
values of the ATTRIBUTE fluctuate.
Some Non-Core Roles
D URATION The length of time over which the change takes place.
S PEED The rate of change of the VALUE.
G ROUP The G ROUP in which an I TEM changes the value of an
ATTRIBUTE in a specified way.
Figure 19.3 The frame elements in the change position on a scale frame from the FrameNet Labelers
Guide (Ruppenhofer et al., 2016).

VERBS: dwindle move soar escalation shift

advance edge mushroom swell explosion tumble
climb explode plummet swing fall
decline fall reach triple fluctuation ADVERBS:
decrease fluctuate rise tumble gain increasingly
diminish gain rocket growth
dip grow shift NOUNS: hike
double increase skyrocket decline increase
drop jump slide decrease rise

FrameNet also codes relationships between frames, allowing frames to inherit

from each other, or representing relations between frames like causation (and gen-
eralizations among frame elements in different frames can be representing by inher-
itance as well). Thus, there is a Cause change of position on a scale frame that is
linked to the Change of position on a scale frame by the cause relation, but that
adds an AGENT role and is used for causative examples such as the following:
(19.26) [AGENT They] raised [I TEM the price of their soda] [D IFFERENCE by 2%].
Together, these two frames would allow an understanding system to extract the
common event semantics of all the verbal and nominal causative and non-causative
usages.
FrameNets have also been developed for many other languages including Span-
ish, German, Japanese, Portuguese, Italian, and Chinese.

19.6 Semantic Role Labeling

semantic role
labeling Semantic role labeling (sometimes shortened as SRL) is the task of automatically
finding the semantic roles of each argument of each predicate in a sentence. Cur-
rent approaches to semantic role labeling are based on supervised machine learning,
often using the FrameNet and PropBank resources to specify what counts as a pred-
icate, define the set of roles used in the task, and provide training and test sets.
19.6 • S EMANTIC ROLE L ABELING 9

Recall that the difference between these two models of semantic roles is that
FrameNet (19.27) employs many frame-specific frame elements as roles, while Prop-
Bank (19.28) uses a smaller number of numbered argument labels that can be inter-
preted as verb-specific labels, along with the more general ARGM labels. Some
examples:
[You] can’t [blame] [the program] [for being unable to identify it]
(19.27)
COGNIZER TARGET EVALUEE REASON
[The San Francisco Examiner] issued [a special edition] [yesterday]
(19.28)
ARG 0 TARGET ARG 1 ARGM - TMP

19.6.1 A Feature-based Algorithm for Semantic Role Labeling

A simplified feature-based semantic role labeling algorithm is sketched in Fig. 19.4.
Feature-based algorithms—from the very earliest systems like (Simmons, 1973)—
begin by parsing, using broad-coverage parsers to assign a parse to the input string.
Figure 19.5 shows a parse of (19.28) above. The parse is then traversed to find all
words that are predicates.
For each of these predicates, the algorithm examines each node in the parse
tree and uses supervised classification to decide the semantic role (if any) it plays
for this predicate. Given a labeled training set such as PropBank or FrameNet, a
feature vector is extracted for each node, using feature templates described in the
next subsection. A 1-of-N classifier is then trained to predict a semantic role for
each constituent given these features, where N is the number of potential semantic
roles plus an extra NONE role for non-role constituents. Any standard classification
algorithms can be used. Finally, for each test sentence to be labeled, the classifier is
run on each relevant constituent.

function S EMANTIC ROLE L ABEL(words) returns labeled tree

parse ← PARSE(words)
for each predicate in parse do
for each node in parse do
featurevector ← E XTRACT F EATURES(node, predicate, parse)
C LASSIFY N ODE(node, featurevector, parse)

Figure 19.4 A generic semantic-role-labeling algorithm. C LASSIFY N ODE is a 1-of-N clas-

sifier that assigns a semantic role (or NONE for non-role constituents), trained on labeled data
such as FrameNet or PropBank.

Instead of training a single-stage classifier as in Fig. 19.5, the node-level classi-

fication task can be broken down into multiple steps:
1. Pruning: Since only a small number of the constituents in a sentence are
arguments of any given predicate, many systems use simple heuristics to prune
unlikely constituents.
2. Identification: a binary classification of each node as an argument to be la-
beled or a NONE.
3. Classification: a 1-of-N classification of all the constituents that were labeled
as arguments by the previous stage
The separation of identification and classification may lead to better use of fea-
tures (different features may be useful for the two tasks) or to computational effi-
ciency.
10 C HAPTER 19 • S EMANTIC ROLE L ABELING

NP-SBJ = ARG0 VP

DT NNP NNP NNP

The San Francisco Examiner

VBD = TARGET NP = ARG1 PP-TMP = ARGM-TMP

issued DT JJ NN IN NP

a special edition around NN NP-TMP

noon yesterday

Figure 19.5 Parse tree for a PropBank sentence, showing the PropBank argument labels. The dotted line
shows the path feature NP↑S↓VP↓VBD for ARG0, the NP-SBJ constituent The San Francisco Examiner.

Global Optimization
The classification algorithm of Fig. 19.5 classifies each argument separately (‘lo-
cally’), making the simplifying assumption that each argument of a predicate can be
labeled independently. This assumption is false; there are interactions between argu-
ments that require a more ‘global’ assignment of labels to constituents. For example,
constituents in FrameNet and PropBank are required to be non-overlapping. More
significantly, the semantic roles of constituents are not independent. For example
PropBank does not allow multiple identical arguments; two constituents of the same
verb cannot both be labeled ARG 0 .
Role labeling systems thus often add a fourth step to deal with global consistency
across the labels in a sentence. For example, the local classifiers can return a list of
possible labels associated with probabilities for each constituent, and a second-pass
Viterbi decoding or re-ranking approach can be used to choose the best consensus
label. Integer linear programming (ILP) is another common way to choose a solution
that conforms best to multiple constraints.

Features for Semantic Role Labeling

Most systems use some generalization of the core set of features introduced by
Gildea and Jurafsky (2000). Common basic features templates (demonstrated on
the NP-SBJ constituent The San Francisco Examiner in Fig. 19.5) include:
• The governing predicate, in this case the verb issued. The predicate is a cru-
cial feature since labels are defined only with respect to a particular predicate.
• The phrase type of the constituent, in this case, NP (or NP-SBJ). Some se-
mantic roles tend to appear as NPs, others as S or PP, and so on.
• The headword of the constituent, Examiner. The headword of a constituent
can be computed with standard head rules, such as those given in Chapter 12
in Fig. ??. Certain headwords (e.g., pronouns) place strong constraints on the
possible semantic roles they are likely to fill.
• The headword part of speech of the constituent, NNP.
• The path in the parse tree from the constituent to the predicate. This path is
marked by the dotted line in Fig. 19.5. Following Gildea and Jurafsky (2000),
we can use a simple linear representation of the path, NP↑S↓VP↓VBD. ↑ and
↓ represent upward and downward movement in the tree, respectively. The
19.6 • S EMANTIC ROLE L ABELING 11

path is very useful as a compact representation of many kinds of grammatical

function relationships between the constituent and the predicate.
• The voice of the clause in which the constituent appears, in this case, active
(as contrasted with passive). Passive sentences tend to have strongly different
linkings of semantic roles to surface form than do active ones.
• The binary linear position of the constituent with respect to the predicate,
either before or after.
• The subcategorization of the predicate, the set of expected arguments that
appear in the verb phrase. We can extract this information by using the phrase-
structure rule that expands the immediate parent of the predicate; VP → VBD
NP PP for the predicate in Fig. 19.5.
• The named entity type of the constituent.
• The first words and the last word of the constituent.
The following feature vector thus represents the first NP in our example (recall
that most observations will have the value NONE rather than, for example, ARG 0,
since most constituents in the parse tree will not bear a semantic role):

ARG 0: [issued, NP, Examiner, NNP, NP↑S↓VP↓VBD, active, before, VP → NP PP,

ORG, The, Examiner]
Other features are often used in addition, such as sets of n-grams inside the
constituent, or more complex versions of the path features (the upward or downward
halves, or whether particular nodes occur in the path).
It’s also possible to use dependency parses instead of constituency parses as the
basis of features, for example using dependency parse paths instead of constituency
paths.

19.6.2 A Neural Algorithm for Semantic Role Labeling

A simple neural approach to SRL is to treat it as a sequence labeling task like named-
entity recognition, using the BIO approach. Let’s assume that we are given the
predicate and the task is just detecting and labeling spans. Recall that with BIO
tagging, we have a begin and end tag for each possible role (B - ARG 0, I - ARG 0; B -
ARG 1, I - ARG 1, and so on), plus an outside tag O .

B-ARG0 I-ARG0 B-PRED B-ARG1

Softmax

FFN FFN FFN FFN FFN

concatenate
with predicate

ENCODER

[CLS] the cats love hats [SEP] love [SEP]

Figure 19.6 A simple neural approach to semantic role labeling. The input sentence is
followed by [SEP] and an extra input for the predicate, in this case love. The encoder outputs
are concatenated to an indicator variable which is 1 for the predicate and 0 for all other words
After He et al. (2017) and Shi and Lin (2019).
12 C HAPTER 19 • S EMANTIC ROLE L ABELING

As with all the taggers, the goal is to compute the highest probability tag se-
quence ŷ, given the input sequence of words w:

ŷ = argmax P(y|w)
y∈T

Fig. 19.6 shows a sketch of a standard algorithm from He et al. (2017). Here each
input word is mapped to pretrained embeddings, and then each token is concatenated
with the predicate embedding and then passed through a feedforward network with
a softmax which outputs a distribution over each SRL label. For decoding, a CRF
layer can be used instead of the MLP layer on top of the biLSTM output to do global
inference, but in practice this doesn’t seem to provide much benefit.

19.6.3 Evaluation of Semantic Role Labeling

The standard evaluation for semantic role labeling is to require that each argument
label must be assigned to the exactly correct word sequence or parse constituent, and
then compute precision, recall, and F-measure. Identification and classification can
also be evaluated separately. Two common datasets used for evaluation are CoNLL-
2005 (Carreras and Màrquez, 2005) and CoNLL-2012 (Pradhan et al., 2013).

19.7 Selectional Restrictions

We turn in this section to another way to represent facts about the relationship be-
selectional tween predicates and arguments. A selectional restriction is a semantic type con-
restriction
straint that a verb imposes on the kind of concepts that are allowed to fill its argument
roles. Consider the two meanings associated with the following example:
(19.29) I want to eat someplace nearby.
There are two possible parses and semantic interpretations for this sentence. In
the sensible interpretation, eat is intransitive and the phrase someplace nearby is
an adjunct that gives the location of the eating event. In the nonsensical speaker-as-
Godzilla interpretation, eat is transitive and the phrase someplace nearby is the direct
object and the THEME of the eating, like the NP Malaysian food in the following
sentences:
(19.30) I want to eat Malaysian food.
How do we know that someplace nearby isn’t the direct object in this sentence?
One useful cue is the semantic fact that the THEME of E ATING events tends to be
something that is edible. This restriction placed by the verb eat on the filler of its
THEME argument is a selectional restriction.
Selectional restrictions are associated with senses, not entire lexemes. We can
see this in the following examples of the lexeme serve:
(19.31) The restaurant serves green-lipped mussels.
(19.32) Which airlines serve Denver?
Example (19.31) illustrates the offering-food sense of serve, which ordinarily re-
stricts its THEME to be some kind of food Example (19.32) illustrates the provides a
commercial service to sense of serve, which constrains its THEME to be some type
of appropriate location.
19.7 • S ELECTIONAL R ESTRICTIONS 13

Selectional restrictions vary widely in their specificity. The verb imagine, for
example, imposes strict requirements on its AGENT role (restricting it to humans
and other animate entities) but places very few semantic requirements on its THEME
role. A verb like diagonalize, on the other hand, places a very specific constraint
on the filler of its THEME role: it has to be a matrix, while the arguments of the
adjectives odorless are restricted to concepts that could possess an odor:
(19.33) In rehearsal, I often ask the musicians to imagine a tennis game.
(19.34) Radon is an odorless gas that can’t be detected by human senses.
(19.35) To diagonalize a matrix is to find its eigenvalues.
These examples illustrate that the set of concepts we need to represent selectional
restrictions (being a matrix, being able to possess an odor, etc) is quite open ended.
This distinguishes selectional restrictions from other features for representing lexical
knowledge, like parts-of-speech, which are quite limited in number.

19.7.1 Representing Selectional Restrictions

One way to capture the semantics of selectional restrictions is to use and extend the
event representation of Chapter 15. Recall that the neo-Davidsonian representation
of an event consists of a single variable that stands for the event, a predicate denoting
the kind of event, and variables and relations for the event roles. Ignoring the issue of
the λ -structures and using thematic roles rather than deep event roles, the semantic
contribution of a verb like eat might look like the following:

∃e, x, y Eating(e) ∧ Agent(e, x) ∧ T heme(e, y)

With this representation, all we know about y, the filler of the THEME role, is that
it is associated with an Eating event through the Theme relation. To stipulate the
selectional restriction that y must be something edible, we simply add a new term to
that effect:

∃e, x, y Eating(e) ∧ Agent(e, x) ∧ T heme(e, y) ∧ EdibleT hing(y)

When a phrase like ate a hamburger is encountered, a semantic analyzer can form
the following kind of representation:

∃e, x, y Eating(e) ∧ Eater(e, x) ∧ T heme(e, y) ∧ EdibleT hing(y) ∧ Hamburger(y)

This representation is perfectly reasonable since the membership of y in the category

Hamburger is consistent with its membership in the category EdibleThing, assuming
a reasonable set of facts in the knowledge base. Correspondingly, the representation
for a phrase such as ate a takeoff would be ill-formed because membership in an
event-like category such as Takeoff would be inconsistent with membership in the
category EdibleThing.
While this approach adequately captures the semantics of selectional restrictions,
there are two problems with its direct use. First, using FOL to perform the simple
task of enforcing selectional restrictions is overkill. Other, far simpler, formalisms
can do the job with far less computational cost. The second problem is that this
approach presupposes a large, logical knowledge base of facts about the concepts
that make up selectional restrictions. Unfortunately, although such common-sense
knowledge bases are being developed, none currently have the kind of coverage
necessary to the task.
14 C HAPTER 19 • S EMANTIC ROLE L ABELING

Sense 1
hamburger, beefburger --
(a fried cake of minced beef served on a bun)
=> sandwich
=> snack food
=> dish
=> nutriment, nourishment, nutrition...
=> food, nutrient
=> substance
=> matter
=> physical entity
=> entity
Figure 19.7 Evidence from WordNet that hamburgers are edible.

A more practical approach is to state selectional restrictions in terms of WordNet

synsets rather than as logical concepts. Each predicate simply specifies a WordNet
synset as the selectional restriction on each of its arguments. A meaning representa-
tion is well-formed if the role filler word is a hyponym (subordinate) of this synset.
For our ate a hamburger example, for instance, we could set the selectional
restriction on the THEME role of the verb eat to the synset {food, nutrient}, glossed
as any substance that can be metabolized by an animal to give energy and build
tissue. Luckily, the chain of hypernyms for hamburger shown in Fig. 19.7 reveals
that hamburgers are indeed food. Again, the filler of a role need not match the
restriction synset exactly; it just needs to have the synset as one of its superordinates.
We can apply this approach to the THEME roles of the verbs imagine, lift, and di-
agonalize, discussed earlier. Let us restrict imagine’s THEME to the synset {entity},
lift’s THEME to {physical entity}, and diagonalize to {matrix}. This arrangement
correctly permits imagine a hamburger and lift a hamburger, while also correctly
ruling out diagonalize a hamburger.

19.7.2 Selectional Preferences

In the earliest implementations, selectional restrictions were considered strict con-
straints on the kind of arguments a predicate could take (Katz and Fodor 1963,
Hirst 1987). For example, the verb eat might require that its THEME argument
be [+FOOD]. Early word sense disambiguation systems used this idea to rule out
senses that violated the selectional restrictions of their governing predicates.
Very quickly, however, it became clear that these selectional restrictions were
better represented as preferences rather than strict constraints (Wilks 1975b, Wilks
1975a). For example, selectional restriction violations (like inedible arguments of
eat) often occur in well-formed sentences, for example because they are negated
(19.36), or because selectional restrictions are overstated (19.37):
(19.36) But it fell apart in 1931, perhaps because people realized you can’t eat
gold for lunch if you’re hungry.
(19.37) In his two championship trials, Mr. Kulkarni ate glass on an empty
stomach, accompanied only by water and tea.
Modern systems for selectional preferences therefore specify the relation be-
tween a predicate and its possible arguments with soft constraints of some kind.
19.7 • S ELECTIONAL R ESTRICTIONS 15

Selectional Association

selectional
One of the most influential has been the selectional association model of Resnik
preference (1993). Resnik defines the idea of selectional preference strength as the general
strength
amount of information that a predicate tells us about the semantic class of its argu-
ments. For example, the verb eat tells us a lot about the semantic class of its direct
objects, since they tend to be edible. The verb be, by contrast, tells us less about
its direct objects. The selectional preference strength can be defined by the differ-
ence in information between two distributions: the distribution of expected semantic
classes P(c) (how likely is it that a direct object will fall into class c) and the dis-
tribution of expected semantic classes for the particular verb P(c|v) (how likely is
it that the direct object of the specific verb v will fall into semantic class c). The
greater the difference between these distributions, the more information the verb
is giving us about possible objects. The difference between these two distributions
relative entropy can be quantified by relative entropy, or the Kullback-Leibler divergence (Kullback
KL divergence and Leibler, 1951). The Kullback-Leibler or KL divergence D(P||Q) expresses the
difference between two probability distributions P and Q
X P(x)
D(P||Q) = P(x) log (19.38)
x
Q(x)

The selectional preference SR (v) uses the KL divergence to express how much in-
formation, in bits, the verb v expresses about the possible semantic class of its argu-
ment.

SR (v) = D(P(c|v)||P(c))
X P(c|v)
= P(c|v) log (19.39)
c
P(c)
selectional Resnik then defines the selectional association of a particular class and verb as the
association
relative contribution of that class to the general selectional preference of the verb:
1 P(c|v)
AR (v, c) = P(c|v) log (19.40)
SR (v) P(c)
The selectional association is thus a probabilistic measure of the strength of asso-
ciation between a predicate and a class dominating the argument to the predicate.
Resnik estimates the probabilities for these associations by parsing a corpus, count-
ing all the times each predicate occurs with each argument word, and assuming
that each word is a partial observation of all the WordNet concepts containing the
word. The following table from Resnik (1996) shows some sample high and low
selectional associations for verbs and some WordNet semantic classes of their direct
objects.
Direct Object Direct Object
Verb Semantic Class Assoc Semantic Class Assoc
read WRITING 6.80 ACTIVITY -.20
write WRITING 7.26 COMMERCE 0
see ENTITY 5.79 METHOD -0.01

Selectional Preference via Conditional Probability

An alternative to using selectional association between a verb and the WordNet class
of its arguments is to use the conditional probability of an argument word given a
16 C HAPTER 19 • S EMANTIC ROLE L ABELING

predicate verb, directly modeling the strength of association of one verb (predicate)
with one noun (argument).
The conditional probability model can be computed by parsing a very large cor-
pus (billions of words), and computing co-occurrence counts: how often a given
verb occurs with a given noun in a given relation. The conditional probability of an
argument noun given a verb for a particular relation P(n|v, r) can then be used as a
selectional preference metric for that pair of words (Brockmann and Lapata 2003,
Keller and Lapata 2003):
(
C(n,v,r)
P(n|v, r) = C(v,r) if C(n, v, r) > 0
0 otherwise

The inverse probability P(v|n, r) was found to have better performance in some cases
(Brockmann and Lapata, 2003):
(
C(n,v,r)
P(v|n, r) = C(n,r) if C(n, v, r) > 0
0 otherwise

An even simpler approach is to use the simple log co-occurrence frequency of

the predicate with the argument log count(v, n, r) instead of conditional probability;
this seems to do better for extracting preferences for syntactic subjects rather than
objects (Brockmann and Lapata, 2003).

Evaluating Selectional Preferences

pseudowords One way to evaluate models of selectional preferences is to use pseudowords (Gale
et al. 1992, Schütze 1992). A pseudoword is an artificial word created by concate-
nating a test word in some context (say banana) with a confounder word (say door)
to create banana-door). The task of the system is to identify which of the two words
is the original word. To evaluate a selectional preference model (for example on the
relationship between a verb and a direct object) we take a test corpus and select all
verb tokens. For each verb token (say drive) we select the direct object (e.g., car),
concatenated with a confounder word that is its nearest neighbor, the noun with the
frequency closest to the original (say house), to make car/house). We then use the
selectional preference model to choose which of car and house are more preferred
objects of drive, and compute how often the model chooses the correct original ob-
ject (e.g., car) (Chambers and Jurafsky, 2010).
Another evaluation metric is to get human preferences for a test set of verb-
argument pairs, and have them rate their degree of plausibility. This is usually done
by using magnitude estimation, a technique from psychophysics, in which subjects
rate the plausibility of an argument proportional to a modulus item. A selectional
preference model can then be evaluated by its correlation with the human prefer-
ences (Keller and Lapata, 2003).

19.8 Primitive Decomposition of Predicates

One way of thinking about the semantic roles we have discussed through the chapter
is that they help us define the roles that arguments play in a decompositional way,
based on finite lists of thematic roles (agent, patient, instrument, proto-agent, proto-
patient, etc.). This idea of decomposing meaning into sets of primitive semantics
19.8 • P RIMITIVE D ECOMPOSITION OF P REDICATES 17

componential
analysis elements or features, called primitive decomposition or componential analysis,
has been taken even further, and focused particularly on predicates.
Consider these examples of the verb kill:
(19.41) Jim killed his philodendron.
(19.42) Jim did something to cause his philodendron to become not alive.
There is a truth-conditional (‘propositional semantics’) perspective from which these
two sentences have the same meaning. Assuming this equivalence, we could repre-
sent the meaning of kill as:
(19.43) KILL(x,y) ⇔ CAUSE(x, BECOME(NOT(ALIVE(y))))
thus using semantic primitives like do, cause, become not, and alive.
Indeed, one such set of potential semantic primitives has been used to account
for some of the verbal alternations discussed in Section 19.2 (Lakoff 1965, Dowty
1979). Consider the following examples.
(19.44) John opened the door. ⇒ CAUSE(John, BECOME(OPEN(door)))
(19.45) The door opened. ⇒ BECOME(OPEN(door))
(19.46) The door is open. ⇒ OPEN(door)
The decompositional approach asserts that a single state-like predicate associ-
ated with open underlies all of these examples. The differences among the meanings
of these examples arises from the combination of this single predicate with the prim-
itives CAUSE and BECOME.
While this approach to primitive decomposition can explain the similarity be-
tween states and actions or causative and non-causative predicates, it still relies on
having a large number of predicates like open. More radical approaches choose to
break down these predicates as well. One such approach to verbal predicate decom-
position that played a role in early natural language systems is conceptual depen-
conceptual
dependency dency (CD), a set of ten primitive predicates, shown in Fig. 19.8.

Primitive Definition
ATRANS The abstract transfer of possession or control from one entity to
another
P TRANS The physical transfer of an object from one location to another
M TRANS The transfer of mental concepts between entities or within an
entity
M BUILD The creation of new information within an entity
P ROPEL The application of physical force to move an object
M OVE The integral movement of a body part by an animal
I NGEST The taking in of a substance by an animal
E XPEL The expulsion of something from an animal
S PEAK The action of producing a sound
ATTEND The action of focusing a sense organ
Figure 19.8 A set of conceptual dependency primitives.

Below is an example sentence along with its CD representation. The verb brought
is translated into the two primitives ATRANS and PTRANS to indicate that the waiter
both physically conveyed the check to Mary and passed control of it to her. Note
that CD also associates a fixed set of thematic roles with each primitive to represent
the various participants in the action.
(19.47) The waiter brought Mary the check.
18 C HAPTER 19 • S EMANTIC ROLE L ABELING

∃x, y Atrans(x) ∧ Actor(x,Waiter) ∧ Ob ject(x,Check) ∧ To(x, Mary)

∧Ptrans(y) ∧ Actor(y,Waiter) ∧ Ob ject(y,Check) ∧ To(y, Mary)

19.9 Summary
• Semantic roles are abstract models of the role an argument plays in the event
described by the predicate.
• Thematic roles are a model of semantic roles based on a single finite list of
roles. Other semantic role models include per-verb semantic role lists and
proto-agent/proto-patient, both of which are implemented in PropBank,
and per-frame role lists, implemented in FrameNet.
• Semantic role labeling is the task of assigning semantic role labels to the
constituents of a sentence. The task is generally treated as a supervised ma-
chine learning task, with models trained on PropBank or FrameNet. Algo-
rithms generally start by parsing a sentence and then automatically tag each
parse tree node with a semantic role. Neural models map straight from words
end-to-end.
• Semantic selectional restrictions allow words (particularly predicates) to post
constraints on the semantic properties of their argument words. Selectional
preference models (like selectional association or simple conditional proba-
bility) allow a weight or probability to be assigned to the association between
a predicate and an argument word or class.

Bibliographical and Historical Notes

Although the idea of semantic roles dates back to Pān.ini, they were re-introduced
into modern linguistics by Gruber (1965), Fillmore (1966) and Fillmore (1968). Fill-
more had become interested in argument structure by studying Lucien Tesnière’s
groundbreaking Éléments de Syntaxe Structurale (Tesnière, 1959) in which the term
‘dependency’ was introduced and the foundations were laid for dependency gram-
mar. Following Tesnière’s terminology, Fillmore first referred to argument roles as
actants (Fillmore, 1966) but quickly switched to the term case, (see Fillmore (2003))
and proposed a universal list of semantic roles or cases (Agent, Patient, Instrument,
etc.), that could be taken on by the arguments of predicates. Verbs would be listed in
the lexicon with their case frame, the list of obligatory (or optional) case arguments.
The idea that semantic roles could provide an intermediate level of semantic
representation that could help map from syntactic parse structures to deeper, more
fully-specified representations of meaning was quickly adopted in natural language
processing, and systems for extracting case frames were created for machine transla-
tion (Wilks, 1973), question-answering (Hendrix et al., 1973), spoken-language pro-
cessing (Nash-Webber, 1975), and dialogue systems (Bobrow et al., 1977). General-
purpose semantic role labelers were developed. The earliest ones (Simmons, 1973)
first parsed a sentence by means of an ATN (Augmented Transition Network) parser.
B IBLIOGRAPHICAL AND H ISTORICAL N OTES 19

Each verb then had a set of rules specifying how the parse should be mapped to se-
mantic roles. These rules mainly made reference to grammatical functions (subject,
object, complement of specific prepositions) but also checked constituent internal
features such as the animacy of head nouns. Later systems assigned roles from pre-
built parse trees, again by using dictionaries with verb-specific case frames (Levin
1977, Marcus 1980).
By 1977 case representation was widely used and taught in AI and NLP courses,
and was described as a standard of natural language processing in the first edition of
Winston’s 1977 textbook Artificial Intelligence.
In the 1980s Fillmore proposed his model of frame semantics, later describing
the intuition as follows:
“The idea behind frame semantics is that speakers are aware of possi-
bly quite complex situation types, packages of connected expectations,
that go by various names—frames, schemas, scenarios, scripts, cultural
narratives, memes—and the words in our language are understood with
such frames as their presupposed background.” (Fillmore, 2012, p. 712)
The word frame seemed to be in the air for a suite of related notions proposed at
about the same time by Minsky (1974), Hymes (1974), and Goffman (1974), as
well as related notions with other names like scripts (Schank and Abelson, 1975)
and schemata (Bobrow and Norman, 1975) (see Tannen (1979) for a comparison).
Fillmore was also influenced by the semantic field theorists and by a visit to the Yale
AI lab where he took notice of the lists of slots and fillers used by early information
extraction systems like DeJong (1982) and Schank and Abelson (1977). In the 1990s
Fillmore drew on these insights to begin the FrameNet corpus annotation project.
At the same time, Beth Levin drew on her early case frame dictionaries (Levin,
1977) to develop her book which summarized sets of verb classes defined by shared
argument realizations (Levin, 1993). The VerbNet project built on this work (Kipper
et al., 2000), leading soon afterwards to the PropBank semantic-role-labeled corpus
created by Martha Palmer and colleagues (Palmer et al., 2005).
The combination of rich linguistic annotation and corpus-based approach in-
stantiated in FrameNet and PropBank led to a revival of automatic approaches to
semantic role labeling, first on FrameNet (Gildea and Jurafsky, 2000) and then on
PropBank data (Gildea and Palmer, 2002, inter alia). The problem first addressed in
the 1970s by handwritten rules was thus now generally recast as one of supervised
machine learning enabled by large and consistent databases. Many popular features
used for role labeling are defined in Gildea and Jurafsky (2002), Surdeanu et al.
(2003), Xue and Palmer (2004), Pradhan et al. (2005), Che et al. (2009), and Zhao
et al. (2009). The use of dependency rather than constituency parses was introduced
in the CoNLL-2008 shared task (Surdeanu et al., 2008). For surveys see Palmer
et al. (2010) and Màrquez et al. (2008).
The use of neural approaches to semantic role labeling was pioneered by Col-
lobert et al. (2011), who applied a CRF on top of a convolutional net. Early work
like Foland, Jr. and Martin (2015) focused on using dependency features. Later work
eschewed syntactic features altogether; Zhou and Xu (2015) introduced the use of
a stacked (6-8 layer) biLSTM architecture, and (He et al., 2017) showed how to
augment the biLSTM architecture with highway networks and also replace the CRF
with A* decoding that make it possible to apply a wide variety of global constraints
in SRL decoding.
Most semantic role labeling schemes only work within a single sentence, fo-
cusing on the object of the verbal (or nominal, in the case of NomBank) predicate.
20 C HAPTER 19 • S EMANTIC ROLE L ABELING

However, in many cases, a verbal or nominal predicate may have an implicit argu-
implicit
argument ment: one that appears only in a contextual sentence, or perhaps not at all and must
be inferred. In the two sentences This house has a new owner. The sale was finalized
10 days ago. the sale in the second sentence has no A RG 1, but a reasonable reader
would infer that the Arg1 should be the house mentioned in the prior sentence. Find-
iSRL ing these arguments, implicit argument detection (sometimes shortened as iSRL)
was introduced by Gerber and Chai (2010) and Ruppenhofer et al. (2010). See Do
et al. (2017) for more recent neural models.
To avoid the need for huge labeled training sets, unsupervised approaches for
semantic role labeling attempt to induce the set of semantic roles by clustering over
arguments. The task was pioneered by Riloff and Schmelzenbach (1998) and Swier
and Stevenson (2004); see Grenager and Manning (2006), Titov and Klementiev
(2012), Lang and Lapata (2014), Woodsend and Lapata (2015), and Titov and Khod-
dam (2014).
Recent innovations in frame labeling include connotation frames, which mark
richer information about the argument of predicates. Connotation frames mark the
sentiment of the writer or reader toward the arguments (for example using the verb
survive in he survived a bombing expresses the writer’s sympathy toward the subject
he and negative sentiment toward the bombing. See Chapter 20 for more details.
Selectional preference has been widely studied beyond the selectional associa-
tion models of Resnik (1993) and Resnik (1996). Methods have included clustering
(Rooth et al., 1999), discriminative learning (Bergsma et al., 2008), and topic mod-
els (Séaghdha 2010, Ritter et al. 2010), and constraints can be expressed at the level
of words or classes (Agirre and Martinez, 2001). Selectional preferences have also
been successfully integrated into semantic role labeling (Erk 2007, Zapirain et al.
2013, Do et al. 2017).

Exercises
Exercises 21

Agirre, E. and D. Martinez. 2001. Learning class-to-class Fillmore, C. J. 2012. Encounters with language. Computa-
selectional preferences. CoNLL. tional Linguistics, 38(4):701–718.
Baker, C. F., C. J. Fillmore, and J. B. Lowe. 1998. The Berke- Fillmore, C. J. and C. F. Baker. 2009. A frames approach to
ley FrameNet project. COLING/ACL. semantic analysis. In Bernd Heine and Heiko Narrog, ed-
Bergsma, S., D. Lin, and R. Goebel. 2008. Discriminative itors, The Oxford Handbook of Linguistic Analysis, pages
learning of selectional preference from unlabeled text. 313–340. Oxford University Press.
EMNLP. Fillmore, C. J., C. R. Johnson, and M. R. L. Petruck. 2003.
Bloomfield, L. 1933. Language. University of Chicago Background to FrameNet. International journal of lexi-
Press. cography, 16(3):235–250.
Foland, Jr., W. R. and J. H. Martin. 2015. Dependency-
Bobrow, D. G., R. M. Kaplan, M. Kay, D. A. Norman,
based semantic role labeling using convolutional neural
H. Thompson, and T. Winograd. 1977. GUS, A frame
networks. *SEM 2015.
driven dialog system. Artificial Intelligence, 8:155–173.
Gale, W. A., K. W. Church, and D. Yarowsky. 1992. Work on
Bobrow, D. G. and D. A. Norman. 1975. Some principles
statistical methods for word sense disambiguation. AAAI
of memory schemata. In Daniel G. Bobrow and Allan
Fall Symposium on Probabilistic Approaches to Natural
Collins, editors, Representation and Understanding. Aca-
Language.
demic Press.
Gerber, M. and J. Y. Chai. 2010. Beyond nombank: A study
Brockmann, C. and M. Lapata. 2003. Evaluating and com-
of implicit arguments for nominal predicates. ACL.
bining approaches to selectional preference acquisition.
EACL. Gildea, D. and D. Jurafsky. 2000. Automatic labeling of se-
mantic roles. ACL.
Carreras, X. and L. Màrquez. 2005. Introduction to
the CoNLL-2005 shared task: Semantic role labeling. Gildea, D. and D. Jurafsky. 2002. Automatic labeling of se-
CoNLL. mantic roles. Computational Linguistics, 28(3):245–288.
Chambers, N. and D. Jurafsky. 2010. Improving the use Gildea, D. and M. Palmer. 2002. The necessity of syntactic
of pseudo-words for evaluating selectional preferences. parsing for predicate argument recognition. ACL.
ACL. Goffman, E. 1974. Frame analysis: An essay on the organi-
Che, W., Z. Li, Y. Li, Y. Guo, B. Qin, and T. Liu. 2009. Mul- zation of experience. Harvard University Press.
tilingual dependency-based syntactic and semantic pars- Grenager, T. and C. D. Manning. 2006. Unsupervised dis-
ing. CoNLL. covery of a statistical verb lexicon. EMNLP.
Collobert, R., J. Weston, L. Bottou, M. Karlen, Gruber, J. S. 1965. Studies in Lexical Relations. Ph.D. thesis,
K. Kavukcuoglu, and P. Kuksa. 2011. Natural language MIT.
processing (almost) from scratch. JMLR, 12:2493–2537. He, L., K. Lee, M. Lewis, and L. Zettlemoyer. 2017. Deep
DeJong, G. F. 1982. An overview of the FRUMP system. In semantic role labeling: What works and what’s next.
Wendy G. Lehnert and Martin H. Ringle, editors, Strate- ACL.
gies for Natural Language Processing, pages 149–176. Hendrix, G. G., C. W. Thompson, and J. Slocum. 1973. Lan-
LEA. guage processing via canonical verbs and semantic mod-
Do, Q. N. T., S. Bethard, and M.-F. Moens. 2017. Improv- els. Proceedings of IJCAI-73.
ing implicit semantic role labeling by predicting semantic Hirst, G. 1987. Semantic Interpretation and the Resolution
frame arguments. IJCNLP. of Ambiguity. Cambridge University Press.
Dowty, D. R. 1979. Word Meaning and Montague Grammar. Hymes, D. 1974. Ways of speaking. In Richard Bauman and
D. Reidel. Joel Sherzer, editors, Explorations in the ethnography of
Erk, K. 2007. A simple, similarity-based model for selec- speaking, pages 433–451. Cambridge University Press.
tional preferences. ACL. Johnson-Laird, P. N. 1983. Mental Models. Harvard Univer-
Fillmore, C. J. 1966. A proposal concerning English prepo- sity Press, Cambridge, MA.
sitions. In Francis P. Dinneen, editor, 17th annual Round Katz, J. J. and J. A. Fodor. 1963. The structure of a semantic
Table, volume 17 of Monograph Series on Language and theory. Language, 39:170–210.
Linguistics, pages 19–34. Georgetown University Press. Keller, F. and M. Lapata. 2003. Using the web to obtain fre-
Fillmore, C. J. 1968. The case for case. In Emmon W. Bach quencies for unseen bigrams. Computational Linguistics,
and Robert T. Harms, editors, Universals in Linguistic 29:459–484.
Theory, pages 1–88. Holt, Rinehart & Winston. Kipper, K., H. T. Dang, and M. Palmer. 2000. Class-based
Fillmore, C. J. 1985. Frames and the semantics of under- construction of a verb lexicon. AAAI.
standing. Quaderni di Semantica, VI(2):222–254. Kullback, S. and R. A. Leibler. 1951. On information and
Fillmore, C. J. 2003. Valency and semantic roles: the sufficiency. Annals of Mathematical Statistics, 22:79–86.
concept of deep structure case. In Vilmos Ágel, Lud- Lakoff, G. 1965. On the Nature of Syntactic Irregularity.
wig M. Eichinger, Hans Werner Eroms, Peter Hellwig, Ph.D. thesis, Indiana University. Published as Irregularity
Hans Jürgen Heringer, and Henning Lobin, editors, De- in Syntax. Holt, Rinehart, and Winston, New York, 1970.
pendenz und Valenz: Ein internationales Handbuch der
Lang, J. and M. Lapata. 2014. Similarity-driven semantic
zeitgenössischen Forschung, chapter 36, pages 457–475.
role induction via graph partitioning. Computational Lin-
Walter de Gruyter.
guistics, 40(3):633–669.
22 Chapter 19 • Semantic Role Labeling

Levin, B. 1977. Mapping sentences to case frames. Techni- Schank, R. C. and R. P. Abelson. 1975. Scripts, plans, and
cal Report 167, MIT AI Laboratory. AI Working Paper knowledge. Proceedings of IJCAI-75.
143. Schank, R. C. and R. P. Abelson. 1977. Scripts, Plans, Goals
Levin, B. 1993. English Verb Classes and Alternations: A and Understanding. Lawrence Erlbaum.
Preliminary Investigation. University of Chicago Press. Schütze, H. 1992. Context space. AAAI Fall Symposium on
Levin, B. and M. Rappaport Hovav. 2005. Argument Real- Probabilistic Approaches to Natural Language.
ization. Cambridge University Press. Séaghdha, D. O. 2010. Latent variable models of selectional
Marcus, M. P. 1980. A Theory of Syntactic Recognition for preference. ACL.
Natural Language. MIT Press. Shi, P. and J. Lin. 2019. Simple BERT models for relation
Màrquez, L., X. Carreras, K. C. Litkowski, and S. Steven- extraction and semantic role labeling. ArXiv.
son. 2008. Semantic role labeling: An introduction to the Simmons, R. F. 1973. Semantic networks: Their compu-
special issue. Computational linguistics, 34(2):145–159. tation and use for understanding English sentences. In
Meyers, A., R. Reeves, C. Macleod, R. Szekely, V. Zielinska, Roger C. Schank and Kenneth Mark Colby, editors, Com-
B. Young, and R. Grishman. 2004. The nombank project: puter Models of Thought and Language, pages 61–113.
An interim report. NAACL/HLT Workshop: Frontiers in W.H. Freeman and Co.
Corpus Annotation. Sloan, M. C. 2010. Aristotle’s Nicomachean Ethics as the
Minsky, M. 1974. A framework for representing knowledge. original locus for the Septem Circumstantiae. Classical
Technical Report 306, MIT AI Laboratory. Memo 306. Philology, 105(3):236–251.
Surdeanu, M., S. Harabagiu, J. Williams, and P. Aarseth.
Nash-Webber, B. L. 1975. The role of semantics in auto-
2003. Using predicate-argument structures for informa-
matic speech understanding. In Daniel G. Bobrow and
tion extraction. ACL.
Allan Collins, editors, Representation and Understand-
ing, pages 351–382. Academic Press. Surdeanu, M., R. Johansson, A. Meyers, L. Màrquez, and
J. Nivre. 2008. The CoNLL 2008 shared task on joint
Palmer, M., D. Gildea, and N. Xue. 2010. Semantic role
parsing of syntactic and semantic dependencies. CoNLL.
labeling. Synthesis Lectures on Human Language Tech-
nologies, 3(1):1–103. Swier, R. and S. Stevenson. 2004. Unsupervised semantic
role labelling. EMNLP.
Palmer, M., P. Kingsbury, and D. Gildea. 2005. The proposi-
tion bank: An annotated corpus of semantic roles. Com- Tannen, D. 1979. What’s in a frame? Surface evidence for
putational Linguistics, 31(1):71–106. underlying expectations. In Roy Freedle, editor, New Di-
rections in Discourse Processing, pages 137–181. Ablex.
Penn, G. and P. Kiparsky. 2012. On Pān.ini and the gen-
Tesnière, L. 1959. Éléments de Syntaxe Structurale. Librairie
erative capacity of contextualized replacement systems.
C. Klincksieck, Paris.
COLING.
Titov, I. and E. Khoddam. 2014. Unsupervised induction of
Pradhan, S., A. Moschitti, N. Xue, H. T. Ng, A. Björkelund,
semantic roles within a reconstruction-error minimization
O. Uryupina, Y. Zhang, and Z. Zhong. 2013. Towards
framework. NAACL HLT.
robust linguistic analysis using OntoNotes. CoNLL.
Titov, I. and A. Klementiev. 2012. A Bayesian approach to
Pradhan, S., W. Ward, K. Hacioglu, J. H. Martin, and D. Ju-
unsupervised semantic role induction. EACL.
rafsky. 2005. Semantic role labeling using different syn-
tactic views. ACL. Wilks, Y. 1973. An artificial intelligence approach to ma-
chine translation. In Roger C. Schank and Kenneth Mark
Resnik, P. 1993. Semantic classes and syntactic ambiguity. Colby, editors, Computer Models of Thought and Lan-
Proceedings of the workshop on Human Language Tech- guage, pages 114–151. W.H. Freeman.
nology.
Wilks, Y. 1975a. Preference semantics. In Edward L.
Resnik, P. 1996. Selectional constraints: An information- Keenan, editor, The Formal Semantics of Natural Lan-
theoretic model and its computational realization. Cogni- guage, pages 329–350. Cambridge Univ. Press.
tion, 61:127–159.
Wilks, Y. 1975b. A preferential, pattern-seeking, seman-
Riloff, E. and M. Schmelzenbach. 1998. An empirical ap- tics for natural language inference. Artificial Intelligence,
proach to conceptual case frame acquisition. Proceedings 6(1):53–74.
of the Sixth Workshop on Very Large Corpora.
Winston, P. H. 1977. Artificial Intelligence. Addison Wesley.
Ritter, A., O. Etzioni, and Mausam. 2010. A latent dirichlet Woodsend, K. and M. Lapata. 2015. Distributed representa-
allocation method for selectional preferences. ACL. tions for unsupervised semantic role labeling. EMNLP.
Rooth, M., S. Riezler, D. Prescher, G. Carroll, and F. Beil. Xue, N. and M. Palmer. 2004. Calibrating features for se-
1999. Inducing a semantically annotated lexicon via EM- mantic role labeling. EMNLP.
based clustering. ACL.
Zapirain, B., E. Agirre, L. Màrquez, and M. Surdeanu. 2013.
Ruppenhofer, J., M. Ellsworth, M. R. L. Petruck, C. R. John- Selectional preferences for semantic role classification.
son, C. F. Baker, and J. Scheffczyk. 2016. FrameNet II: Computational Linguistics, 39(3):631–663.
Extended theory and practice.
Zhao, H., W. Chen, C. Kit, and G. Zhou. 2009. Multilingual
Ruppenhofer, J., C. Sporleder, R. Morante, C. F. Baker, dependency learning: A huge feature engineering method
and M. Palmer. 2010. Semeval-2010 task 10: Linking to semantic dependency parsing. CoNLL.
events and their participants in discourse. 5th Interna-
Zhou, J. and W. Xu. 2015. End-to-end learning of semantic
tional Workshop on Semantic Evaluation.
role labeling using recurrent neural networks. ACL.

2022 RaySharp Product Guide
No ratings yet
2022 RaySharp Product Guide
87 pages
SSt-6000 Install Manual-2.1
No ratings yet
SSt-6000 Install Manual-2.1
64 pages
20 Jurafsky
No ratings yet
20 Jurafsky
22 pages
Ling571 Class12 Sem SRL
No ratings yet
Ling571 Class12 Sem SRL
136 pages
Inf5830 Semanticroles
No ratings yet
Inf5830 Semanticroles
37 pages
Semantic Roles
No ratings yet
Semantic Roles
27 pages
Welcome To My Presentation
No ratings yet
Welcome To My Presentation
8 pages
Grammatical Relations and Semantic Roles
100% (1)
Grammatical Relations and Semantic Roles
8 pages
Semantic Roles: INF5830 Fall 2013
No ratings yet
Semantic Roles: INF5830 Fall 2013
39 pages
22-Semantic Role Labelling-01-10-2024
No ratings yet
22-Semantic Role Labelling-01-10-2024
46 pages
5 - Semantic Roles
No ratings yet
5 - Semantic Roles
6 pages
Semantic Roles and Sentence Meaning
No ratings yet
Semantic Roles and Sentence Meaning
7 pages
Lecture+5+-+NNLS+22
No ratings yet
Lecture+5+-+NNLS+22
30 pages
Semantic Roles
No ratings yet
Semantic Roles
2 pages
RhiandBook - Semantic
No ratings yet
RhiandBook - Semantic
14 pages
SYNTAX - Rivaldi Yudistira B - Assignment Session 6&7 - Summary
No ratings yet
SYNTAX - Rivaldi Yudistira B - Assignment Session 6&7 - Summary
4 pages
SEMANTIC ROLES New
No ratings yet
SEMANTIC ROLES New
17 pages
Sentence Semantics 2: Participants
No ratings yet
Sentence Semantics 2: Participants
32 pages
Apunts3
No ratings yet
Apunts3
14 pages
Semantic Roles PDF
100% (2)
Semantic Roles PDF
20 pages
An Empirical View On Semantic Roles: Katrin Erk Sebastian Pado Saarland University ESSLLI 2006
No ratings yet
An Empirical View On Semantic Roles: Katrin Erk Sebastian Pado Saarland University ESSLLI 2006
10 pages
The Semantic Roles: Part Two
No ratings yet
The Semantic Roles: Part Two
18 pages
2.Semantic-Roles_
No ratings yet
2.Semantic-Roles_
6 pages
Semantics: Representations and Analyses: Julia Hirschberg CS 4705
No ratings yet
Semantics: Representations and Analyses: Julia Hirschberg CS 4705
32 pages
Types of Semantic Roles
100% (1)
Types of Semantic Roles
18 pages
Dowtz Thematic Proto-Roles and Argument Selection
No ratings yet
Dowtz Thematic Proto-Roles and Argument Selection
74 pages
CM 15 Semantics PBI 2022.1 (Semantic Roles)
No ratings yet
CM 15 Semantics PBI 2022.1 (Semantic Roles)
8 pages
semanticroles-160928075018
No ratings yet
semanticroles-160928075018
9 pages
CH 9
No ratings yet
CH 9
29 pages
Unit 6 Summary
100% (1)
Unit 6 Summary
12 pages
Semantic Role
No ratings yet
Semantic Role
26 pages
Thematic role
No ratings yet
Thematic role
3 pages
Semantics Features and Semantic Roles
No ratings yet
Semantics Features and Semantic Roles
1 page
Semantic Roles Bael 1
No ratings yet
Semantic Roles Bael 1
16 pages
LCS Final Term
No ratings yet
LCS Final Term
4 pages
Unit 5 - Chapter 6
No ratings yet
Unit 5 - Chapter 6
44 pages
Semantic Case
No ratings yet
Semantic Case
10 pages
Semantics: Titik Sudartinah English Language and Literature Study Program Yogyakarta State University
No ratings yet
Semantics: Titik Sudartinah English Language and Literature Study Program Yogyakarta State University
163 pages
Semantics: Name: . NIM: . Class: .
No ratings yet
Semantics: Name: . NIM: . Class: .
3 pages
Chapter 6 Resumen Semantics PDF
No ratings yet
Chapter 6 Resumen Semantics PDF
23 pages
LIN 1080 Semantics: Albert Gatt
No ratings yet
LIN 1080 Semantics: Albert Gatt
32 pages
Report LEXICOLOGY Semantics
No ratings yet
Report LEXICOLOGY Semantics
3 pages
ThematicRoles-LinguisticTheories.Sep2 (1)
No ratings yet
ThematicRoles-LinguisticTheories.Sep2 (1)
47 pages
Role of Thematic Roles in Sentence
No ratings yet
Role of Thematic Roles in Sentence
6 pages
Kisi Kisi Mid Tes Semantic
No ratings yet
Kisi Kisi Mid Tes Semantic
10 pages
Handout 1: Basic Notions in Argument Structure 1 1.1 Some Basic Concepts
No ratings yet
Handout 1: Basic Notions in Argument Structure 1 1.1 Some Basic Concepts
10 pages
Article: "Thematic Roles Are Not Semantic Roles" Jan G. Van Voorst
No ratings yet
Article: "Thematic Roles Are Not Semantic Roles" Jan G. Van Voorst
16 pages
Thematic Relations & - Roles
No ratings yet
Thematic Relations & - Roles
4 pages
Chapter 9 Semantics Part1.pptx
No ratings yet
Chapter 9 Semantics Part1.pptx
16 pages
Semantic Roles: (Source: Yule, G. (2010) - The Study of Language (4th Ed.) - Cambridge: Cambridge University Press.)
No ratings yet
Semantic Roles: (Source: Yule, G. (2010) - The Study of Language (4th Ed.) - Cambridge: Cambridge University Press.)
2 pages
508-Quiz-2
No ratings yet
508-Quiz-2
10 pages
8 Semantic Roles
No ratings yet
8 Semantic Roles
8 pages
2 Introduction To Semantics
No ratings yet
2 Introduction To Semantics
64 pages
8 Semantic Roles 2015
No ratings yet
8 Semantic Roles 2015
8 pages
Semantics
No ratings yet
Semantics
8 pages
Linguistic Participant Roles
No ratings yet
Linguistic Participant Roles
13 pages
participantsII
No ratings yet
participantsII
48 pages
Chapter 9 (Semantics)
No ratings yet
Chapter 9 (Semantics)
42 pages
Semantic Analysis: Natural Language Processing (CSE 5321)
No ratings yet
Semantic Analysis: Natural Language Processing (CSE 5321)
35 pages
Semantics 3 SS17 PDF
No ratings yet
Semantics 3 SS17 PDF
62 pages
The Semantics OF Sentence Elements
No ratings yet
The Semantics OF Sentence Elements
36 pages
Comments on Marie George’s Essay (2019) "Aquinas Teachings on Concepts and Words"
From Everand
Comments on Marie George’s Essay (2019) "Aquinas Teachings on Concepts and Words"
Razie Mah
No ratings yet
JB Schedule Instrumentation PDF
No ratings yet
JB Schedule Instrumentation PDF
1 page
Câu So Sánh
No ratings yet
Câu So Sánh
15 pages
10818
No ratings yet
10818
1 page
UNITXPRO - Digitization at Food and Beverage Manufacturing
No ratings yet
UNITXPRO - Digitization at Food and Beverage Manufacturing
2 pages
7 Pointcontainer Inspection Report
No ratings yet
7 Pointcontainer Inspection Report
1 page
Rod Control System Coil and Cable Testing in Nucle
No ratings yet
Rod Control System Coil and Cable Testing in Nucle
10 pages
Table IV Study Guide
No ratings yet
Table IV Study Guide
6 pages
Inspection Systems
No ratings yet
Inspection Systems
10 pages
Din2632 PN10 PDF
No ratings yet
Din2632 PN10 PDF
2 pages
270 Caustic Soda Table EN 20140414
No ratings yet
270 Caustic Soda Table EN 20140414
2 pages
Translation Studies Today: Old Problems and New Challenges: Vadim V. Sdobnikov
No ratings yet
Translation Studies Today: Old Problems and New Challenges: Vadim V. Sdobnikov
33 pages
Literature Review On Single Parenthood
100% (2)
Literature Review On Single Parenthood
4 pages
Kaltekis Et Al (2023) A Review of Constitutive Models For An FEA-based Monopile Design
No ratings yet
Kaltekis Et Al (2023) A Review of Constitutive Models For An FEA-based Monopile Design
8 pages
All Questions and Answers Are Based On The 2011 NEC.: Electrical Construction and Maintenance
No ratings yet
All Questions and Answers Are Based On The 2011 NEC.: Electrical Construction and Maintenance
2 pages
Perancangan Turap Dengan Angkur - ENG PDF
No ratings yet
Perancangan Turap Dengan Angkur - ENG PDF
48 pages
Technology LEQ
No ratings yet
Technology LEQ
1 page
g
No ratings yet
g
1 page
Covert Persuasion - Reference Guide
100% (1)
Covert Persuasion - Reference Guide
10 pages
Converting Conventional Bike Into E-Bike: Design & Fabrication "A Concept"
No ratings yet
Converting Conventional Bike Into E-Bike: Design & Fabrication "A Concept"
11 pages
9A02601 Power Semiconductor Drives 2 PDF
No ratings yet
9A02601 Power Semiconductor Drives 2 PDF
1 page
15MA301 Aug16
No ratings yet
15MA301 Aug16
3 pages
Hammer Unions Fig de
No ratings yet
Hammer Unions Fig de
7 pages
ISO 9001 Process Procedure QPP-092-1 Internal Audit
No ratings yet
ISO 9001 Process Procedure QPP-092-1 Internal Audit
4 pages
Grove GMK
No ratings yet
Grove GMK
5 pages
Strategic Leadership: Michael A. Hitt R. Duane Ireland Robert E. Hoskisson
100% (1)
Strategic Leadership: Michael A. Hitt R. Duane Ireland Robert E. Hoskisson
29 pages
Group 5
No ratings yet
Group 5
72 pages
8DIO Studio Upright - User Manual
No ratings yet
8DIO Studio Upright - User Manual
20 pages
Lettresfamilire00tailgoog Djvu
No ratings yet
Lettresfamilire00tailgoog Djvu
135 pages

19

Uploaded by

19

Uploaded by

Speech and Language Processing. Daniel Jurafsky & James H. Martin. Copyright © 2021.

Semantic Role Labeling

19.1 Semantic Roles

∃e, x, y Breaking(e) ∧ Breaker(e, Sasha)

Thematic Role Definition

Thematic Role Example

19.2 Diathesis Alternations

(19.8) a. Doris gave the book to Cary.

b. Doris gave Cary the book.

19.3 Semantic Roles: Problems with Thematic Roles

19.4 The Proposition Bank

Ex1: [Arg0 The group] agreed [Arg1 it wouldn’t make an offer].

VERBS: dwindle move soar escalation shift

FrameNet also codes relationships between frames, allowing frames to inherit

19.6 Semantic Role Labeling

19.6.1 A Feature-based Algorithm for Semantic Role Labeling

function S EMANTIC ROLE L ABEL(words) returns labeled tree

Figure 19.4 A generic semantic-role-labeling algorithm. C LASSIFY N ODE is a 1-of-N clas-

Instead of training a single-stage classifier as in Fig. 19.5, the node-level classi-

DT NNP NNP NNP

The San Francisco Examiner

VBD = TARGET NP = ARG1 PP-TMP = ARGM-TMP

a special edition around NN NP-TMP

Features for Semantic Role Labeling

path is very useful as a compact representation of many kinds of grammatical

ARG 0: [issued, NP, Examiner, NNP, NP↑S↓VP↓VBD, active, before, VP → NP PP,

19.6.2 A Neural Algorithm for Semantic Role Labeling

B-ARG0 I-ARG0 B-PRED B-ARG1

FFN FFN FFN FFN FFN

[CLS] the cats love hats [SEP] love [SEP]

19.6.3 Evaluation of Semantic Role Labeling

19.7 Selectional Restrictions

19.7.1 Representing Selectional Restrictions

∃e, x, y Eating(e) ∧ Agent(e, x) ∧ T heme(e, y)

∃e, x, y Eating(e) ∧ Agent(e, x) ∧ T heme(e, y) ∧ EdibleT hing(y)

∃e, x, y Eating(e) ∧ Eater(e, x) ∧ T heme(e, y) ∧ EdibleT hing(y) ∧ Hamburger(y)

This representation is perfectly reasonable since the membership of y in the category

A more practical approach is to state selectional restrictions in terms of WordNet

19.7.2 Selectional Preferences

Selectional Preference via Conditional Probability

An even simpler approach is to use the simple log co-occurrence frequency of

Evaluating Selectional Preferences

19.8 Primitive Decomposition of Predicates

∃x, y Atrans(x) ∧ Actor(x,Waiter) ∧ Ob ject(x,Check) ∧ To(x, Mary)

Bibliographical and Historical Notes

You might also like