Lin 13

The document discusses parsing, which involves determining if a string of symbols can be generated by a context-free grammar. It also discusses treebanks, which are corpora with parsed text used for testing parsers, and provides an example of evaluating parser performance metrics on a treebank.

Uploaded by

Brock Ternov

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views

Lin 13

Uploaded by

Brock Ternov

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Spurious Ambiguity

• Most parse trees of most NL sentences make no

sense.

19
Parsing
• Given a string of non-terminals and a CFG,
determine if the string can be generated by the
CFG.
– Also return a parse tree for the string
– Also return all possible parse trees for the string
• Must search space of derivations for one that
derives the given string.
– Top-Down Parsing: Start searching space of
derivations for the start symbol.
– Bottom-up Parsing: Start search space of reverse
deivations from the terminal symbols in the string.
Parsing Example

Verb NP
book that flight
book Det Nominal

that Noun

flight
Treebanks
• English Penn Treebank: Standard corpus for
testing syntactic parsing consists of 1.2 M words
of text from the Wall Street Journal (WSJ).
• Typical to train on about 40,000 parsed sentences
and test on an additional standard disjoint test set
of 2,416 sentences.
• Chinese Penn Treebank: 100K words from the
Xinhua news service.
• Other corpora existing in many languages, see the
Wikipedia article “Treebank”

85
First WSJ Sentence

( (S
(NP-SBJ
(NP (NNP Pierre) (NNP Vinken) )
(, ,)
(ADJP
(NP (CD 61) (NNS years) )
(JJ old) )
(, ,) )
(VP (MD will)
(VP (VB join)
(NP (DT the) (NN board) )
(PP-CLR (IN as)
(NP (DT a) (JJ nonexecutive) (NN director) ))
(NP-TMP (NNP Nov.) (CD 29) )))
(. .) ))
86
Parsing Evaluation Metrics
• PARSEVAL metrics measure the fraction of the
constituents that match between the computed and
human parse trees. If P is the system’s parse tree and T
is the human parse tree (the “gold standard”):
– Recall = (# correct constituents in P) / (# constituents in T)
– Precision = (# correct constituents in P) / (# constituents in P)
• Labeled Precision and labeled recall require getting the
non-terminal label on the constituent node correct to
count as correct.
• F1 is the harmonic mean of precision and recall.

87
Computing Evaluation Metrics

Correct Tree T Computed Tree P

S S
VP
VP
Verb NP VP
book Det Nominal Verb NP
the Nominal PP book Det Nominal PP
Noun Prep NP Noun Prep NP
the
flight through Proper-Noun flight through Proper-Noun
Houston Houston
# Constituents: 12 # Constituents: 12
# Correct Constituents: 10
Recall = 10/12= 83.3% Precision = 10/12=83.3% F1 = 83.3%
Treebank Results
• Results of current state-of-the-art systems on the
English Penn WSJ treebank are slightly greater than
90% labeled precision and recall.

Teaching English Through Songs, Rhymes and Chants (I)
60% (5)
Teaching English Through Songs, Rhymes and Chants (I)
28 pages
Lecture 2
No ratings yet
Lecture 2
8 pages
What Is Parsing
No ratings yet
What Is Parsing
47 pages
NLP CHAPTER 3
No ratings yet
NLP CHAPTER 3
23 pages
4.Chapter5_ Syntactic and Semantic Representations
No ratings yet
4.Chapter5_ Syntactic and Semantic Representations
47 pages
Unit 3
No ratings yet
Unit 3
19 pages
Natural Language Processing
No ratings yet
Natural Language Processing
21 pages
CSE 12 Abstract Syntax Trees
No ratings yet
CSE 12 Abstract Syntax Trees
38 pages
13-Dependency Grammar-03-09-2024
No ratings yet
13-Dependency Grammar-03-09-2024
31 pages
c
No ratings yet
c
54 pages
Chapter 4 Syntax Directed Translation
No ratings yet
Chapter 4 Syntax Directed Translation
37 pages
Natural Language Processing: Parsing
No ratings yet
Natural Language Processing: Parsing
18 pages
Lecture15 Parsing
No ratings yet
Lecture15 Parsing
37 pages
Chapter15 NaturalLanguage
100% (1)
Chapter15 NaturalLanguage
35 pages
Mod - 3 (2)
No ratings yet
Mod - 3 (2)
51 pages
Lecture 2
No ratings yet
Lecture 2
28 pages
A Simple One-Pass Compiler (To Generate Code For The JVM)
No ratings yet
A Simple One-Pass Compiler (To Generate Code For The JVM)
70 pages
Lecture NLP
100% (1)
Lecture NLP
38 pages
CC-Lec 5 Week 5 Cfgs
No ratings yet
CC-Lec 5 Week 5 Cfgs
29 pages
Chapter 3 (1)
No ratings yet
Chapter 3 (1)
43 pages
Unit - 5 Natural Language Processing
No ratings yet
Unit - 5 Natural Language Processing
66 pages
MIT Open Access
No ratings yet
MIT Open Access
11 pages
13000121136
No ratings yet
13000121136
15 pages
Natural Language Processing
No ratings yet
Natural Language Processing
13 pages
Check Semantics - Error Reporting - Disambiguate - Type Coercion - Static Checking
No ratings yet
Check Semantics - Error Reporting - Disambiguate - Type Coercion - Static Checking
108 pages
14-syntax-1
No ratings yet
14-syntax-1
22 pages
CSC 461 Final
No ratings yet
CSC 461 Final
170 pages
NLP UNIT-II
No ratings yet
NLP UNIT-II
71 pages
1 Motivation: Setting Up To Use Pstone
No ratings yet
1 Motivation: Setting Up To Use Pstone
9 pages
Syntax_complete
No ratings yet
Syntax_complete
22 pages
Chapter 4 - 6
No ratings yet
Chapter 4 - 6
78 pages
NLP Unit Ii
No ratings yet
NLP Unit Ii
30 pages
4 Semantic Analysis
No ratings yet
4 Semantic Analysis
20 pages
7.CD Lab Manual
No ratings yet
7.CD Lab Manual
35 pages
Parsing
No ratings yet
Parsing
10 pages
CH2 2
No ratings yet
CH2 2
30 pages
SLoSP 2007 1
No ratings yet
SLoSP 2007 1
42 pages
CS 4300: Compiler Theory A Simple Syntax-Directed Translator
No ratings yet
CS 4300: Compiler Theory A Simple Syntax-Directed Translator
70 pages
Unit 2_Lecture 1
No ratings yet
Unit 2_Lecture 1
19 pages
NLP Unit 2
No ratings yet
NLP Unit 2
20 pages
COMPI-DESI-CHP-04
No ratings yet
COMPI-DESI-CHP-04
28 pages
Atural Anguage Rocessing: Chandra Prakash LPU
No ratings yet
Atural Anguage Rocessing: Chandra Prakash LPU
59 pages
NLP Module 3
No ratings yet
NLP Module 3
41 pages
Parsing Techniques
No ratings yet
Parsing Techniques
16 pages
CD UNIT 3
No ratings yet
CD UNIT 3
76 pages
ALL The ROUGH Pages Included From Lesson 1,2,3,4,5 Are Not Included in The Paper
No ratings yet
ALL The ROUGH Pages Included From Lesson 1,2,3,4,5 Are Not Included in The Paper
41 pages
Unit 5 NLP
No ratings yet
Unit 5 NLP
6 pages
Lecture 2 Hierarchy of NLP & TF-IDF
No ratings yet
Lecture 2 Hierarchy of NLP & TF-IDF
48 pages
SDT Material
No ratings yet
SDT Material
30 pages
NLP - UNIT II
No ratings yet
NLP - UNIT II
13 pages
Ai Unit 5
No ratings yet
Ai Unit 5
19 pages
Software Construction and Development: Lecture-02
No ratings yet
Software Construction and Development: Lecture-02
33 pages
Session 7 - Syntax Parsing
No ratings yet
Session 7 - Syntax Parsing
53 pages
Compiler 2
100% (1)
Compiler 2
45 pages
CD Merged
No ratings yet
CD Merged
153 pages
BNF
No ratings yet
BNF
30 pages
Compiler Design (All Modules) - 16
No ratings yet
Compiler Design (All Modules) - 16
1 page
Ch2 Modified
No ratings yet
Ch2 Modified
39 pages
Lecture 07
No ratings yet
Lecture 07
35 pages
Chapter 3 Syntax Analysis
No ratings yet
Chapter 3 Syntax Analysis
78 pages
CH 4 - Semantic Analysis PDF
100% (1)
CH 4 - Semantic Analysis PDF
36 pages
The Cyclic System of Transposition for Trumpet
From Everand
The Cyclic System of Transposition for Trumpet
Keith Doles
5/5 (1)
Iso 23308 2 2020
0% (1)
Iso 23308 2 2020
9 pages
Advanced Propulsion System GEM 423E Week 9: Podded Propulsion&Propellers
No ratings yet
Advanced Propulsion System GEM 423E Week 9: Podded Propulsion&Propellers
23 pages
Week 1. Historical Forest and Present Natural Divisions of Illinois
No ratings yet
Week 1. Historical Forest and Present Natural Divisions of Illinois
12 pages
SOC6401H Theory Historical Sociology F18 JB
No ratings yet
SOC6401H Theory Historical Sociology F18 JB
11 pages
Historical Archaeology: C S C W
No ratings yet
Historical Archaeology: C S C W
6 pages
Spring 2017 Final Presentation
No ratings yet
Spring 2017 Final Presentation
13 pages
Historical and Comparative Sociology: Winter Quarter 2008
No ratings yet
Historical and Comparative Sociology: Winter Quarter 2008
3 pages
HIST 5101 Syllabus Fall 2018
No ratings yet
HIST 5101 Syllabus Fall 2018
8 pages
American Civilization I: Bsschwantes@widener - Edu
No ratings yet
American Civilization I: Bsschwantes@widener - Edu
8 pages
HIS206 1-Unit Syllabus
No ratings yet
HIS206 1-Unit Syllabus
7 pages
Full Text
No ratings yet
Full Text
15 pages
History 302 Syllabus Fall 2018
No ratings yet
History 302 Syllabus Fall 2018
10 pages
Data
No ratings yet
Data
12 pages
Short Selling Around The 52-Week and Historical Highs: Eunju - Lee@uml - Edu Npiqueira@bauer - Uh.edu
No ratings yet
Short Selling Around The 52-Week and Historical Highs: Eunju - Lee@uml - Edu Npiqueira@bauer - Uh.edu
32 pages
British Council Historical Report
No ratings yet
British Council Historical Report
12 pages
2018 Summer Brochure
No ratings yet
2018 Summer Brochure
2 pages
Du XL AWFq 3 N
No ratings yet
Du XL AWFq 3 N
2 pages
Example Application HI 199
No ratings yet
Example Application HI 199
6 pages
D Rea Etal PVM 2013 Retrospective Time Series Analysis of Veterinary Laboratory Data
No ratings yet
D Rea Etal PVM 2013 Retrospective Time Series Analysis of Veterinary Laboratory Data
19 pages
23 Reserves On STG Briefing FINAL For NCS Approved Tagged
No ratings yet
23 Reserves On STG Briefing FINAL For NCS Approved Tagged
22 pages
Discov Er East Devon's Historical and Nature-Rich Pebblebeds During Heath Week
No ratings yet
Discov Er East Devon's Historical and Nature-Rich Pebblebeds During Heath Week
3 pages
Historical Financial Statistics: Calendar Notes
No ratings yet
Historical Financial Statistics: Calendar Notes
13 pages
81635793
No ratings yet
81635793
13 pages
Historiography & Historical Methodology: S: L: 377 University College
No ratings yet
Historiography & Historical Methodology: S: L: 377 University College
6 pages
HIST 390 Syllabus
No ratings yet
HIST 390 Syllabus
4 pages
10th February 2017
No ratings yet
10th February 2017
6 pages
DSHPFall 2016
No ratings yet
DSHPFall 2016
4 pages
U of N Historical Course Documentation Form: Purpose Statement and Procedural Explanation
No ratings yet
U of N Historical Course Documentation Form: Purpose Statement and Procedural Explanation
2 pages
Proposal - Puja Kathresna P Revisi
No ratings yet
Proposal - Puja Kathresna P Revisi
26 pages
L007E1120
No ratings yet
L007E1120
96 pages
Bbi 2420 Assignment 1 Portfolio
No ratings yet
Bbi 2420 Assignment 1 Portfolio
19 pages
ინგლისური ენის ისტორიის რენდომ გამოცდა
No ratings yet
ინგლისური ენის ისტორიის რენდომ გამოცდა
27 pages
C-4. Sentence-Combining-Practice-G8
No ratings yet
C-4. Sentence-Combining-Practice-G8
48 pages
Week 30 &31 2 Lavender 30 .8-2.9.2021 & 5.9.2021
No ratings yet
Week 30 &31 2 Lavender 30 .8-2.9.2021 & 5.9.2021
8 pages
s21 Word Choice
No ratings yet
s21 Word Choice
23 pages
Survey of Literature of Selected Countries (1)
No ratings yet
Survey of Literature of Selected Countries (1)
6 pages
Seminar Paper Prefixes Iz - and Pre - Aleksandra Pilipović
No ratings yet
Seminar Paper Prefixes Iz - and Pre - Aleksandra Pilipović
25 pages
Types of Translation Transformations
No ratings yet
Types of Translation Transformations
13 pages
Curriculum Map 3rd Quarter
No ratings yet
Curriculum Map 3rd Quarter
7 pages
Proiect de Lectie Final
No ratings yet
Proiect de Lectie Final
4 pages
Interactive Writing
No ratings yet
Interactive Writing
6 pages
ENGLISH
No ratings yet
ENGLISH
8 pages
Chapter 2
No ratings yet
Chapter 2
7 pages
Mail of Psychology
No ratings yet
Mail of Psychology
89 pages
2020 Gong, Gao, & Lyu 2020
No ratings yet
2020 Gong, Gao, & Lyu 2020
19 pages
English Las (Second Quarter)
No ratings yet
English Las (Second Quarter)
7 pages
Agreement Exercise
No ratings yet
Agreement Exercise
3 pages
English Week 4: Malabon Elementary School
No ratings yet
English Week 4: Malabon Elementary School
23 pages
Giao An Anh 9 20122013 Chuan
No ratings yet
Giao An Anh 9 20122013 Chuan
185 pages
Scheme of Work For English Language Form 4 KK
100% (1)
Scheme of Work For English Language Form 4 KK
14 pages
Latin Practice Questions
No ratings yet
Latin Practice Questions
3 pages
Kelompok 8 - TR PDF
No ratings yet
Kelompok 8 - TR PDF
11 pages
Adverbs
100% (1)
Adverbs
24 pages
Treirb Telugu JL Syllabus
No ratings yet
Treirb Telugu JL Syllabus
3 pages
L. 7 AACHCHI by Lasantha Rodrigo
33% (3)
L. 7 AACHCHI by Lasantha Rodrigo
38 pages
Third Periodical Test in English 4
No ratings yet
Third Periodical Test in English 4
7 pages

Lin 13

Uploaded by

Lin 13

Uploaded by

Spurious Ambiguity

• Most parse trees of most NL sentences make no

Correct Tree T Computed Tree P

You might also like