Unit 3-2

The document discusses parsing using Context Free Grammar (CFG), explaining the process of generating parse trees for strings based on grammatical rules. It covers various parsing strategies, including top-down and bottom-up approaches, as well as the use of dynamic programming methods like the CKY algorithm. Additionally, it highlights the importance of treebanks, such as the Penn Treebank, in providing syntactically annotated corpora that serve as grammars for language processing.


Parsing with Context Free Grammar

Sudeshna Sarkar

16 AUG 2019
Parsing
• Parsing is the process of taking a string and a
grammar and returning parse tree(s) for that string

“The old dog the footsteps of the young.”

S → NP VP          VP → V
S → Aux NP VP      VP → V PP
S → VP             PP → Prep NP
NP → Det Nom       N → old | dog | footsteps | young
NP → PropN         V → dog | eat | sleep | bark | meow
Nom → Adj N        Aux → does | can
Nom → N            Prep → from | to | on | of
Nom → N Nom        PropN → Fido | Felix
Nom → Nom PP       Det → that | this | a | the
VP → V NP          Adj → old | happy | young
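
To make the later sketches concrete, here is one possible encoding of this toy grammar as plain Python data. The names GRAMMAR and LEXICON are ours, not the slides':

# One possible encoding of the toy grammar: phrase-structure rules
# as (LHS, RHS) pairs, plus the lexicon as word -> parts of speech.
GRAMMAR = [
    ("S", ["NP", "VP"]), ("S", ["Aux", "NP", "VP"]), ("S", ["VP"]),
    ("NP", ["Det", "Nom"]), ("NP", ["PropN"]),
    ("Nom", ["Adj", "N"]), ("Nom", ["N"]),
    ("Nom", ["N", "Nom"]), ("Nom", ["Nom", "PP"]),
    ("VP", ["V", "NP"]), ("VP", ["V"]), ("VP", ["V", "PP"]),
    ("PP", ["Prep", "NP"]),
]
LEXICON = {
    "old": {"N", "Adj"}, "young": {"N", "Adj"}, "dog": {"N", "V"},
    "footsteps": {"N"}, "happy": {"Adj"},
    "eat": {"V"}, "sleep": {"V"}, "bark": {"V"}, "meow": {"V"},
    "does": {"Aux"}, "can": {"Aux"},
    "from": {"Prep"}, "to": {"Prep"}, "on": {"Prep"}, "of": {"Prep"},
    "Fido": {"PropN"}, "Felix": {"PropN"},
    "that": {"Det"}, "this": {"Det"}, "a": {"Det"}, "the": {"Det"},
}

Note that the example sentence parses only because "old" can be a noun and "dog" a verb: "the old" is the subject NP and "dog" is the main verb.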
Parsing
• Parsing with CFGs refers to the task of assigning
proper trees to input strings
• Proper: a tree that covers all and only the elements
of the input and has an S at the top

Syntactic Analysis (Parsing)
• Automatic methods of finding the syntactic structure
for a sentence
– Symbolic methods: a phrase grammar or another
description of the structure of the language is required;
e.g., the chart parser.
– Statistical methods: a text corpus with syntactic structures
is needed (a treebank)

Search Framework
• Think about parsing as a form of search…
– A search through the space of possible trees given an
input sentence and grammar

How to parse
• Top-down: Start at the top of the tree with an S
node, and work your way down to the words.

• Bottom-up: Look for small pieces that you know how
to assemble, and work your way up to larger pieces.
Top-Down Search
• Builds from the root S node to the leaves
• Expectation-based
• Common top-down search strategy
– Top-down, left-to-right, with backtracking
– Try the first rule whose LHS is S
– Next expand all constituents on RHS
– Iterate until all leaves are POS
– Backtrack when candidate POS does not match POS of
current word in input string

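As a minimal sketch of this strategy (the helper names are ours, reusing GRAMMAR and LEXICON from above), a top-down, left-to-right, backtracking recognizer can be written with generators. The depth bound is a crude guard: without it, the left-recursive rule Nom → Nom PP sends a naive top-down parser into an infinite loop, a classic weakness of this approach.

# Top-down, left-to-right, backtracking recognizer (sketch).
# parse(cat, words, i) yields every j such that words[i:j]
# can be analysed as category cat.
def parse(cat, words, i, depth=0):
    if depth > 2 * len(words):   # guard against left recursion
        return
    # lexical expansion: cat covers exactly the next word
    if i < len(words) and cat in LEXICON.get(words[i], set()):
        yield i + 1
    # grammatical expansion: try each rule whose LHS is cat;
    # backtracking happens as the generators are exhausted
    for lhs, rhs in GRAMMAR:
        if lhs == cat:
            yield from expand(rhs, words, i, depth + 1)

def expand(cats, words, i, depth):
    if not cats:
        yield i                  # the whole RHS has been matched
        return
    for mid in parse(cats[0], words, i, depth):
        yield from expand(cats[1:], words, mid, depth)

words = "the old dog the footsteps of the young".split()
print(any(j == len(words) for j in parse("S", words, 0)))  # True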
Bottom-Up Parsing
• Of course, we also want trees that cover the input
words. So we might also start with trees that link up
with the words in the right way.
• Then work your way up from there to larger and
larger trees.
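
A naive version of this idea, again over GRAMMAR and LEXICON with helper names of our own, tags the words and then backtracks over every way to reduce a span that matches some rule's right-hand side:

from functools import lru_cache

# Naive bottom-up recognizer (sketch): succeed when the whole
# input has been reduced to a single S.
@lru_cache(maxsize=None)
def reduce_to_s(seq):
    if seq == ("S",):
        return True
    for lhs, rhs in GRAMMAR:
        rhs = tuple(rhs)
        n = len(rhs)
        for i in range(len(seq) - n + 1):
            if seq[i:i + n] == rhs:
                # replace the matched span by the rule's LHS
                if reduce_to_s(seq[:i] + (lhs,) + seq[i + n:]):
                    return True
    return False

def bottom_up(words, i=0, tags=()):
    if i == len(words):
        return reduce_to_s(tags)
    # try each part of speech the next word could have
    return any(bottom_up(words, i + 1, tags + (t,))
               for t in LEXICON.get(words[i], set()))

words = "the old dog the footsteps of the young".split()
print(bottom_up(words))  # True

The memoization (lru_cache) keeps repeated sub-problems from being re-solved, foreshadowing the dynamic-programming idea below.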

Bottom-Up Search

[A sequence of figures stepped through the bottom-up search.]
Issues
• Ambiguity
• Shared subproblems

Ambiguity

[Figure illustrating syntactic ambiguity.]
Dynamic Programming

• DP search methods fill tables with partial results and thereby
– Avoid doing avoidable repeated work
– Solve exponential problems in polynomial time (ok, not
really)
– Efficiently store ambiguous structures with shared sub-parts.
• We’ll cover one approach that corresponds to a
bottom-up strategy
– CKY

CKY Algorithm

The table is filled one column at a time, left to right, and each column bottom to top:

• Loop over the columns j.
• Fill the bottom cell [j-1, j] with the parts of speech of word j.
• Fill row i in column j, moving up the column.
• Loop over the possible split locations k between i and j.
• Check the grammar for rules that link the constituents in [i,k]
with those in [k,j]. For each rule found, store the LHS of the rule
in cell [i,j].
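Putting those loops together, here is a minimal CKY recognizer. CKY requires a grammar in Chomsky Normal Form, so this sketch uses a hand-converted CNF fragment of the toy grammar (unary chains such as Nom → N are folded into the lexicon; BINARY and CNF_LEXICON are our names):

from collections import defaultdict

# CNF fragment of the toy grammar: (B, C) -> set of LHS categories.
BINARY = {
    ("NP", "VP"): {"S"},
    ("Det", "Nom"): {"NP"},
    ("V", "NP"): {"VP"},
    ("Nom", "PP"): {"Nom"},
    ("Prep", "NP"): {"PP"},
}
CNF_LEXICON = {
    "the": {"Det"}, "old": {"Nom", "Adj"}, "dog": {"V", "Nom"},
    "footsteps": {"Nom"}, "of": {"Prep"}, "young": {"Nom"},
}

def cky(words):
    n = len(words)
    table = defaultdict(set)   # table[i, j]: categories spanning words[i:j]
    for j in range(1, n + 1):                # loop over the columns
        # fill the bottom cell with the word's parts of speech
        table[j - 1, j] |= CNF_LEXICON.get(words[j - 1], set())
        for i in range(j - 2, -1, -1):       # fill row i, moving up column j
            for k in range(i + 1, j):        # possible split locations
                for b in table[i, k]:
                    for c in table[k, j]:
                        table[i, j] |= BINARY.get((b, c), set())
    return "S" in table[0, n]

print(cky("the old dog the footsteps of the young".split()))  # True

Each cell stores every category for its span exactly once, so ambiguous analyses share sub-parts instead of being rebuilt, which is the payoff promised above. Adding backpointers to the stored categories turns the recognizer into a parser.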
Treebank
• A syntactically annotated corpus where every sentence is
paired with a corresponding tree.
• The Penn Treebank project
– treebanks from the Brown, Switchboard, ATIS, and Wall
Street Journal corpora of English
– treebanks in Arabic and Chinese.
• Others
– the Prague Dependency Treebank for Czech,
– the NEGRA treebank for German,
– the Susanne treebank for English, and
– the Universal Dependencies treebanks
Penn Treebank
• The Penn Treebank is a widely used treebank.
• Its most well-known part is the Wall Street Journal
section: 1 million words from the 1987-1989 Wall Street
Journal.

Treebanks as Grammars
• The sentences in a treebank implicitly constitute a
grammar of the language represented by the corpus
being annotated.
• Simply take the local rules that make up the subtrees
in all the trees in the collection and you have a
grammar (see the sketch below).
– The WSJ section gives us about 12k rules if you do this.
• Treebanks (and head-finding) are particularly critical
to the development of statistical parsers.
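As a sketch of this extraction, assuming NLTK and its bundled 10% sample of the WSJ portion of the Penn Treebank (fetched once with nltk.download("treebank")):

from collections import Counter
from nltk.corpus import treebank   # requires nltk.download("treebank")

rules = Counter()
for tree in treebank.parsed_sents():
    # every local subtree contributes one production (rule)
    rules.update(tree.productions())

print(len(rules), "distinct rules")
for rule, count in rules.most_common(5):
    print(count, rule)

On the full WSJ section this procedure yields the roughly 12k rules mentioned above; the 10% sample gives correspondingly fewer.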
