
CFG & PCFG



Context-Free Grammar (CFG) and Probabilistic Context-Free Grammar (PCFG) are
related formalisms used in linguistics and natural language processing (NLP) to model
the syntax of languages, particularly human languages like English.
CFG
• CFG is a formal grammar that describes the syntax or structure of a
language using production rules. In CFGs, each production rule is equally
likely, and there is no notion of probability associated with the rules.
• CFGs are commonly used in syntactic parsing, where they help identify the
syntactic structure of a sentence but do not provide information about the
likelihood or probability of different parse trees.

PCFG
• PCFG is an extension of CFG that incorporates probabilities into the
grammar. In PCFGs, each production rule is associated with a probability
that represents the likelihood of choosing that rule during language
generation.
• By considering the probabilities of different parse trees or sentence
structures, PCFGs can be used to rank and choose the most likely
interpretations of a sentence.
CFG & PCFG Example
CFG Productions:
• S → NP VP
• NP → Det N
• VP → V NP
• Det → "the"
• N → "cat"
• V → "chased"

PCFG Productions (with probabilities):
• S → NP VP [0.6]
• NP → Det N [0.8]
• VP → V NP [0.7]
• Det → "the" [0.9]
• N → "cat" [0.6]
• V → "chased" [0.5]

In the PCFG example, each production rule has a probability associated with
it, indicating how likely it is to be chosen during the generation or parsing
process. This allows PCFGs to capture the likelihood of different sentence
structures or parse trees, which is valuable in various NLP tasks like
machine translation and speech recognition.
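To make the role of the rule probabilities concrete, here is a minimal sketch (not part of the original text) that scores a parse tree under the example PCFG above by multiplying the probabilities of the rules it uses. The `tree_prob` helper and the tuple encoding of trees are illustrative choices, not a standard API.

```python
# Illustrative sketch: score a parse tree under the example PCFG by
# multiplying the probabilities of the production rules it uses.
RULES = {
    ("S", ("NP", "VP")): 0.6,
    ("NP", ("Det", "N")): 0.8,
    ("VP", ("V", "NP")): 0.7,
    ("Det", ("the",)): 0.9,
    ("N", ("cat",)): 0.6,
    ("V", ("chased",)): 0.5,
}

def tree_prob(tree):
    """A tree is (label, child, ...); leaves are plain word strings."""
    label, *children = tree
    # The right-hand side is the sequence of child labels (or words).
    rhs = tuple(c if isinstance(c, str) else c[0] for c in children)
    p = RULES[(label, rhs)]
    for c in children:
        if not isinstance(c, str):
            p *= tree_prob(c)      # multiply in each subtree's rules
    return p

# Parse tree for "the cat chased the cat":
tree = ("S",
        ("NP", ("Det", "the"), ("N", "cat")),
        ("VP", ("V", "chased"),
               ("NP", ("Det", "the"), ("N", "cat"))))
print(tree_prob(tree))
```

The probability of this single tree is the product 0.6 × (0.8 × 0.9 × 0.6) × 0.7 × 0.5 × (0.8 × 0.9 × 0.6) ≈ 0.0392; a grammar with several rules for the same nonterminal would yield different scores for competing trees.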
CFG & PCFG
The simplest probabilistic model for recursive embedding is a PCFG, a
Probabilistic (sometimes also called Stochastic) Context-Free Grammar. It is
simply a CFG with probabilities added to the rules, indicating how likely
different rewritings are.
PCFGs are the simplest and most natural probabilistic model for tree
structures; the mathematics behind them is well understood, and the
algorithms for them are a natural development of the algorithms employed
with HMMs.
PCFGs provide a sufficiently general computational device that they can
simulate various other forms of probabilistic conditioning.
Some Features of PCFGs
The predictive power of a PCFG, as measured by entropy, tends to be greater
than that of a finite-state grammar (i.e., an HMM) with the same number of
parameters. (For such comparisons, we compute the number of parameters as
follows: a V-terminal, n-nonterminal PCFG in CNF has n³ + nV parameters,
while a K-state, M-output HMM has K² + MK parameters. While the exponent is
higher in the PCFG case, the number of nonterminals used is normally quite
small.)
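The parameter counts quoted above can be checked directly. This small sketch (added here, not in the original) just evaluates the two formulas for illustrative grammar sizes of my own choosing.

```python
# Parameter counts from the text: a PCFG in CNF with n nonterminals and V
# terminals has n^3 + n*V parameters; a K-state, M-output HMM has K^2 + M*K.
def pcfg_params(n, V):
    return n**3 + n * V

def hmm_params(K, M):
    return K**2 + M * K

# Illustrative sizes: few nonterminals for the PCFG, many states for the HMM.
print(pcfg_params(10, 10_000))   # 1_000 + 100_000 = 101_000
print(hmm_params(100, 10_000))   # 10_000 + 1_000_000 = 1_010_000
```

With a vocabulary of this size the lexical parameters dominate both models, which is why the cubic term matters less than it first appears.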
Inside & Outside Probability

• The Probability of a String: Using inside probabilities


• In general, we cannot efficiently calculate the probability of a string by simply
summing the probabilities of all possible parse trees for the string, as there will be
exponentially many of them.
• An efficient way to calculate the total probability of a string is by the inside
algorithm, a dynamic programming algorithm based on the inside probabilities:
The Probability of a String: Using inside probabilities
Notations
• N^j_pq: nonterminal N^j dominating the words w_p … w_q
• Definition: the inside probability β_j(p, q) = P(w_pq | N^j_pq, G) is the
probability that nonterminal N^j derives the substring w_p … w_q
• Computed recursively, base case: β_j(k, k) = P(N^j → w_k)
• Induction: β_j(p, q) = Σ_{r,s} Σ_{d=p}^{q−1} P(N^j → N^r N^s) β_r(p, d) β_s(d+1, q)
• The probability of the whole string is then β_1(1, n), the inside
probability of the start symbol spanning the entire sentence
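The base case and induction above translate directly into a dynamic program. The following is a minimal illustrative implementation (function and variable names are my own), assuming a CNF grammar given as dictionaries of lexical and binary rule probabilities.

```python
from collections import defaultdict

def inside(words, lexical, syntactic, start="S"):
    """Inside algorithm: beta[(p, q, A)] = P(A derives words[p..q]).

    lexical:   {(A, word): prob}   for rules A -> word
    syntactic: {(A, B, C): prob}   for rules A -> B C
    """
    n = len(words)
    beta = defaultdict(float)
    # Base case: beta_A(k, k) = P(A -> w_k)
    for k, w in enumerate(words):
        for (A, word), p in lexical.items():
            if word == w:
                beta[(k, k, A)] += p
    # Induction: sum over binary rules and split points d
    for span in range(2, n + 1):
        for p0 in range(n - span + 1):
            q = p0 + span - 1
            for (A, B, C), p in syntactic.items():
                for d in range(p0, q):
                    beta[(p0, q, A)] += p * beta[(p0, d, B)] * beta[(d + 1, q, C)]
    # Total string probability: inside probability of the start symbol
    return beta[(0, n - 1, start)]

# Using the earlier example grammar on "the cat chased the cat":
lexical = {("Det", "the"): 0.9, ("N", "cat"): 0.6, ("V", "chased"): 0.5}
syntactic = {("S", "NP", "VP"): 0.6, ("NP", "Det", "N"): 0.8,
             ("VP", "V", "NP"): 0.7}
print(inside("the cat chased the cat".split(), lexical, syntactic))
```

Because each of the O(n²) spans is filled by summing over rules and split points, the work is polynomial in the sentence length rather than exponential in the number of parse trees.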
Chomsky Normal Form
• Chomsky Normal Form (CNF) grammars only have unary and binary rules of
the form
  N^j → N^r N^s   for syntactic categories
  N^j → w^k       for lexical categories
• The parameters of a PCFG in CNF are
  P(N^j → N^r N^s | G): an n³ matrix of parameters (when n nonterminals)
  P(N^j → w^k | G): an n×V matrix of parameters (when n nonterminals and
  V terminals)
  giving n³ + nV parameters in total
• For each nonterminal N^j, the rule probabilities sum to one:
  Σ_{r,s} P(N^j → N^r N^s) + Σ_k P(N^j → w^k) = 1
• Any CFG can be represented by a weakly equivalent CFG in CNF
– “weakly equivalent” : “generating the same language”
• But the CNF grammar need not assign the same phrase structure to each
sentence
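One step of that conversion can be illustrated concretely: rules with long right-hand sides are binarized by introducing fresh intermediate nonterminals. The sketch below is my own helper, and it is only one of the steps a full CNF conversion needs (unary and empty rules must also be handled); the naming scheme for the new nonterminals is an arbitrary choice.

```python
def binarize(rules):
    """Binarize rules A -> X1 X2 ... Xk (k > 2) by introducing fresh
    intermediate nonterminals; the generated language is unchanged.
    rules: list of (lhs, rhs_tuple, prob) triples."""
    out = []
    for lhs, rhs, p in rules:
        while len(rhs) > 2:
            # Fresh nonterminal standing for the remainder of the RHS.
            new = f"{lhs}|{'_'.join(rhs[1:])}"
            out.append((lhs, (rhs[0], new), p))
            lhs, rhs, p = new, rhs[1:], 1.0  # remainder expands deterministically
        out.append((lhs, rhs, p))
    return out

print(binarize([("VP", ("V", "NP", "PP"), 0.3)]))
```

Giving the intermediate rule probability 1.0 keeps the probability of every derivation unchanged, which is exactly the weak equivalence described above.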
CYK Algorithm
• CYK (Cocke-Younger-Kasami) algorithm
– A bottom-up parser using the dynamic programming table
– Assume the PCFG is in Chomsky normal form (CNF)
• Definition
– w1…wn: an input string composed of n words
– wij: a string of words from words i to j
– π[i, j, a]: a table entry holding the maximum probability for a
constituent with non-terminal index a, N^a, spanning the words wi…wj of the
input w1…wn

Inside Probability Example
• Consider the following PCFG fragment
  NP → DET N   0.8
  NP → N       0.2
  DET → a      0.6
  DET → the    0.4
  N → apple    0.8
  N → orange   0.2
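Under this fragment the inside probabilities of short strings can be worked out by hand; the arithmetic below (added for illustration) simply multiplies the rule probabilities listed above.

```python
# Inside probability of "the apple" as an NP: only NP -> DET N applies.
p_the_apple = 0.8 * 0.4 * 0.8    # NP -> DET N, DET -> the, N -> apple
# Inside probability of "apple" alone as an NP: only NP -> N applies.
p_apple = 0.2 * 0.8              # NP -> N, N -> apple
print(p_the_apple, p_apple)
```

So β_NP for "the apple" is 0.256 and for "apple" alone is 0.16; if the fragment had two ways to build the same span as an NP, the inside probability would be the sum over both derivations.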
CYK Algorithm

• The algorithm finds the most likely parse for a sentence: each table
entry is initialized to zero, and a constituent A spanning a range of words
is built by splitting the span into children B and C at each word boundary
(between positions m and m+1)
• For an m-word input string and n non-terminals, the running time is
O(m³n³)
• Backpointers are kept as bookkeeping so that the best parse tree can be
reconstructed from the table
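A minimal CYK (Viterbi) implementation along these lines, with π entries defaulting to zero and backpointers as the bookkeeping, might look as follows. The function name, the dictionary grammar encoding, and the tuple trees are my own illustrative choices.

```python
from collections import defaultdict

def cyk_viterbi(words, lexical, syntactic, start="S"):
    """pi[(i, j, A)]: max probability of A spanning words[i..j];
    back[(i, j, A)]: the choice made there, for rebuilding the best tree."""
    n = len(words)
    pi = defaultdict(float)   # entries default to zero
    back = {}
    for k, w in enumerate(words):
        for (A, word), p in lexical.items():
            if word == w and p > pi[(k, k, A)]:
                pi[(k, k, A)] = p
                back[(k, k, A)] = w
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span - 1
            for (A, B, C), p in syntactic.items():
                for d in range(i, j):            # split point between d and d+1
                    cand = p * pi[(i, d, B)] * pi[(d + 1, j, C)]
                    if cand > pi[(i, j, A)]:
                        pi[(i, j, A)] = cand
                        back[(i, j, A)] = (d, B, C)

    def build(i, j, A):       # follow backpointers to rebuild the parse tree
        entry = back[(i, j, A)]
        if isinstance(entry, str):
            return (A, entry)
        d, B, C = entry
        return (A, build(i, d, B), build(d + 1, j, C))

    return pi[(0, n - 1, start)], build(0, n - 1, start)

# Reusing the earlier example grammar:
lexical = {("Det", "the"): 0.9, ("N", "cat"): 0.6, ("V", "chased"): 0.5}
syntactic = {("S", "NP", "VP"): 0.6, ("NP", "Det", "N"): 0.8,
             ("VP", "V", "NP"): 0.7}
prob, tree = cyk_viterbi("the cat chased the cat".split(), lexical, syntactic)
print(prob)
print(tree)
```

The only difference from the inside algorithm is max in place of sum, plus the backpointer table; the loop structure is identical, which is where the O(m³n³) bound comes from.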
Inside Probability Example
• Example sentence: "astronomers saw stars with ears" (words w1 … w5)
(Figure omitted: the worked inside-probability chart for this sentence.)
Inside Probability Example
• Example sentence: "fruit fly like a banana" (words w1 … w5)
(Figure omitted: the worked inside-probability chart for this sentence.)
