The Expectation Maximization (EM) Algorithm

Uploaded by

Tarun Singh

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views

The Expectation Maximization (EM) Algorithm

Uploaded by

Tarun Singh

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 10

The Expectation Maximization (EM)

Algorithm
General Idea
▪ Start by devising a noisy channel
▪ Any model that predicts the corpus observations via
some hidden structure (tags, parses, …)
▪ Initially guess the parameters of the model!
▪ Educated guess is best, but random can work

▪ Expectation step: Use current parameters (and

observations) to reconstruct hidden structure
▪ Maximization step: Use that hidden structure
(and observations) to reestimate parameters
Repeat until convergence!
General Idea
initial
guess E step

Guess of Guess of unknown

unknown hidden structure
(tags, parses, weather)
parameters
(probabilities) Observed structure
(words, ice cream)

M step
Grammar Reestimation
E step correct test trees
P
A s
c
R o
S r accuracy
e
E r
test R
sentences expensive and/or
wrong sublanguage

cheap, plentiful Grammar

and appropriate
LEARNER
training
M step trees
EM by Dynamic Programming: Two
Versions

▪ The Viterbi approximation

▪ Expectation: pick the best parse of each sentence
▪ Maximization: retrain on this best-parsed corpus
▪ Advantage: Speed!

▪ Real EM we r?
h y slo
▪ Expectation:
w find all parses of each sentence
▪ Maximization: retrain on all parses in proportion to
their probability (as if we observed fractional count)
▪ Advantage: p(training corpus) guaranteed to increase
▪ Exponentially many parses, so don’t extract them
from chart – need some kind of clever counting
Examples of EM
▪ Finite-State case: Hidden Markov Models
▪ “forward-backward” or “Baum-Welch” algorithm
▪ Applications:
▪ explain ice cream in terms of underlying weather sequence
▪ explain words in terms of underlying tag sequence
▪ explain phoneme sequence in terms of underlying word
compose ▪ explain sound sequence in terms of underlying phoneme
these?
▪Context-Free case: Probabilistic CFGs
▪ “inside-outside” algorithm: unsupervised grammar learning!
▪ Explain raw text in terms of underlying cx-free parse
▪ In practice, local maximum problem gets in the way
▪ But can improve a good starting grammar via raw text
▪ Clustering case: explain points via clusters
Our old friend PCFG
S

▪ Start with a “pretty good” grammar

▪ E.g., it was trained on supervised data (a treebank) that is small,
imperfectly annotated, or has sentences in a different style from
what you want to parse. S
▪ Parse a corpus of unparsed sentences:
AdvP S
# copies of …
this sentence 12 Today stocks were up 12 NP VP
Today
in the corpus …
stocks V PRT
▪ Reestimate:
▪ Collect counts: …; c(S  NP VP) += 12; c(S) += 2*12; …were up
▪ Divide: p(S  NP VP | S) = c(S  NP VP) / c(S)
▪ May be wise to smooth
True EM for parsing
▪ Similar, but now we consider all parses of each sentence
S
▪ Parse our corpus of unparsed sentences:
AdvP S
# copies of …
this sentence 12 Today stocks were up 10.8 NP VP
Today
in the corpus …
stocks V PRT

▪ Collect counts fractionally: were up

▪ …; c(S  NP VP) += 10.8; c(S) += 2*10.8; … 1.2 S
▪ …; c(S  NP VP) += 1.2; c(S) += 1*1.2; …
NP VP

NP NP V PRT

Today stocks were up

600.465 - Intro to NLP - J. Eisner
Why do we want this info?

▪ Grammar reestimation by EM method

▪ E step collects those expected counts
▪ M step sets

▪ Minimum Bayes Risk decoding

▪ Find a tree that maximizes expected reward,
e.g., expected total # of correct constituents
▪ CKY-like dynamic programming algorithm
▪The input specifies the probability of correctness
for each possible constituent (e.g., VP from 1 to 5)

The Expectation Maximization (EM) Algorithm
No ratings yet
The Expectation Maximization (EM) Algorithm
10 pages
The Expectation Maximization (EM) Algorithm: Continued!
No ratings yet
The Expectation Maximization (EM) Algorithm: Continued!
67 pages
Products of Random Latent Variable Grammars
No ratings yet
Products of Random Latent Variable Grammars
9 pages
Lecture 7 PDF
No ratings yet
Lecture 7 PDF
23 pages
Introduction To Computational Linguistics: Eugene Charniak and Mark Johnson
No ratings yet
Introduction To Computational Linguistics: Eugene Charniak and Mark Johnson
148 pages
Expectation Maximization: Dekang Lin Department of Computing Science University of Alberta
No ratings yet
Expectation Maximization: Dekang Lin Department of Computing Science University of Alberta
22 pages
Imitation Learning: Modeling & Learning Sequence of Decisions
No ratings yet
Imitation Learning: Modeling & Learning Sequence of Decisions
53 pages
Machine Learning and Statistical Natural Language Processing
No ratings yet
Machine Learning and Statistical Natural Language Processing
27 pages
Dependency Parsing
No ratings yet
Dependency Parsing
96 pages
Isolated-Word Speech Recognition Using Hidden Markov Models: H Akon Sandsmark December 18, 2010
No ratings yet
Isolated-Word Speech Recognition Using Hidden Markov Models: H Akon Sandsmark December 18, 2010
9 pages
Noun Phrase Extraction: A Description of Current Techniques
No ratings yet
Noun Phrase Extraction: A Description of Current Techniques
36 pages
Learning With Hidden Variables - EM Algorithm
No ratings yet
Learning With Hidden Variables - EM Algorithm
31 pages
L4 Tagging
No ratings yet
L4 Tagging
107 pages
CS 388: Natural Language Processing:: N-Gram Language Models
No ratings yet
CS 388: Natural Language Processing:: N-Gram Language Models
22 pages
Ngrams
100% (1)
Ngrams
22 pages
Probabilistic Theory in Natural Language Processing
No ratings yet
Probabilistic Theory in Natural Language Processing
15 pages
Inside-Outside and Forward-Backward Algorithms Are Just Backprop (Tutorial Paper)
No ratings yet
Inside-Outside and Forward-Backward Algorithms Are Just Backprop (Tutorial Paper)
17 pages
MLF__HMM
No ratings yet
MLF__HMM
96 pages
Spring School: Haus - House, Building, Home, Household, Shell. Multiple Translations House Building Haus Snail Shell
No ratings yet
Spring School: Haus - House, Building, Home, Household, Shell. Multiple Translations House Building Haus Snail Shell
20 pages
h3 P
No ratings yet
h3 P
6 pages
Natural Language Processing, Problem Set 3: Training Data
No ratings yet
Natural Language Processing, Problem Set 3: Training Data
6 pages
Hidden Markov Models: Ts. Nguyễn Văn Vinh Bộ môn KHMT, Trường ĐHCN, ĐH QG Hà nội
No ratings yet
Hidden Markov Models: Ts. Nguyễn Văn Vinh Bộ môn KHMT, Trường ĐHCN, ĐH QG Hà nội
51 pages
Learning Structured Models For Phone Recognition
No ratings yet
Learning Structured Models For Phone Recognition
9 pages
Hidden Markov Models in Speech Recognition: Wayne Ward
No ratings yet
Hidden Markov Models in Speech Recognition: Wayne Ward
35 pages
Lecture 07
No ratings yet
Lecture 07
35 pages
18-Graph Based Dependency Parsing-19-09-2024
No ratings yet
18-Graph Based Dependency Parsing-19-09-2024
19 pages
Dependency Parsing: Pawan Goyal
No ratings yet
Dependency Parsing: Pawan Goyal
38 pages
Unit - 3 Parsing Theory (II) : Prof. Dixita B. Kagathara
No ratings yet
Unit - 3 Parsing Theory (II) : Prof. Dixita B. Kagathara
27 pages
A probabilistic Earley parser as a psycholinguistic model 2001 N01-1021
No ratings yet
A probabilistic Earley parser as a psycholinguistic model 2001 N01-1021
8 pages
Probabilistic Language Modeling Challenges
No ratings yet
Probabilistic Language Modeling Challenges
12 pages
Statistical NLP
No ratings yet
Statistical NLP
19 pages
NLP unit-2
No ratings yet
NLP unit-2
18 pages
NLP PLM
No ratings yet
NLP PLM
35 pages
NLp
No ratings yet
NLp
12 pages
Trigram Language Models
No ratings yet
Trigram Language Models
19 pages
Unit 3
No ratings yet
Unit 3
19 pages
Stat NLP
No ratings yet
Stat NLP
19 pages
Dependency Parsing 2: CMSC 723 / LING 723 / INST 725
No ratings yet
Dependency Parsing 2: CMSC 723 / LING 723 / INST 725
52 pages
NLP Unit 4
No ratings yet
NLP Unit 4
22 pages
Readme
No ratings yet
Readme
1 page
The Expectation-Maximization Algorithm: IEEE Signal Processing Magazine December 1996
No ratings yet
The Expectation-Maximization Algorithm: IEEE Signal Processing Magazine December 1996
15 pages
NLP Unit-4
No ratings yet
NLP Unit-4
48 pages
NLP Unit 2
No ratings yet
NLP Unit 2
20 pages
Lecture05-Hmm Pos Tagging
No ratings yet
Lecture05-Hmm Pos Tagging
38 pages
Notes 4
No ratings yet
Notes 4
7 pages
Lecture 8: State-Space Models Based On Slides By: Probabilis C Graphical Models
No ratings yet
Lecture 8: State-Space Models Based On Slides By: Probabilis C Graphical Models
29 pages
Xu Ly Ngon Ngu Tu Nhien Regina Barzilay Lec20 Global Linear Models (Cuuduongthancong - Com)
No ratings yet
Xu Ly Ngon Ngu Tu Nhien Regina Barzilay Lec20 Global Linear Models (Cuuduongthancong - Com)
70 pages
3-Lecture Three - (Chapter Two-N-gram Language Models)
No ratings yet
3-Lecture Three - (Chapter Two-N-gram Language Models)
28 pages
HMM Tutorial
No ratings yet
HMM Tutorial
15 pages
Basic Parsing Techniques - Parsing
No ratings yet
Basic Parsing Techniques - Parsing
20 pages
Tutorial On Speech Recognition: Alex Acero Microsoft Research
No ratings yet
Tutorial On Speech Recognition: Alex Acero Microsoft Research
38 pages
Inducing Tree-Substitution Grammars: Trevor Cohn
No ratings yet
Inducing Tree-Substitution Grammars: Trevor Cohn
44 pages
Statistical Machine Translation
No ratings yet
Statistical Machine Translation
34 pages
Lecture09 Review
No ratings yet
Lecture09 Review
51 pages
18-Predictive parsing (1)
No ratings yet
18-Predictive parsing (1)
152 pages
SLoSP 2007 2
No ratings yet
SLoSP 2007 2
45 pages
Online Large-Margin Training of Dependency Parsers: Ryan Mcdonald Koby Crammer Fernando Pereira
No ratings yet
Online Large-Margin Training of Dependency Parsers: Ryan Mcdonald Koby Crammer Fernando Pereira
8 pages
6 Probabilisticparse
No ratings yet
6 Probabilisticparse
46 pages
Phonics Readiness, Grade PK
From Everand
Phonics Readiness, Grade PK
Spectrum
3.5/5 (2)
Uppercase Letters, Grades PK - K
From Everand
Uppercase Letters, Grades PK - K
Spectrum
No ratings yet
Unit 4 Infrastructure As A Service
No ratings yet
Unit 4 Infrastructure As A Service
37 pages
Saas Overview (Software As A Service) : Gartner
No ratings yet
Saas Overview (Software As A Service) : Gartner
15 pages
Unit III - Procedure & Macros Programs
No ratings yet
Unit III - Procedure & Macros Programs
13 pages
Lecture 24 - 8255 Various Modes of Operations
No ratings yet
Lecture 24 - 8255 Various Modes of Operations
16 pages
Unit-3: Medium Access Sublayer
No ratings yet
Unit-3: Medium Access Sublayer
21 pages
Sample Bejdi
No ratings yet
Sample Bejdi
11 pages
GAI End of Course Notes
No ratings yet
GAI End of Course Notes
3 pages
AWS ML Notes -Domain 2 - Data Transformation
No ratings yet
AWS ML Notes -Domain 2 - Data Transformation
32 pages
DNN Ho
No ratings yet
DNN Ho
8 pages
Deep Reinforcement Learning PDF
No ratings yet
Deep Reinforcement Learning PDF
150 pages
Machine Learning For Fluid Mechanics
No ratings yet
Machine Learning For Fluid Mechanics
34 pages
Assignment EE5179 ME20B145 Report
No ratings yet
Assignment EE5179 ME20B145 Report
6 pages
SLR Ocr
No ratings yet
SLR Ocr
28 pages
Dive Into Deep Learning: Aston Zhang, Zachary C. Lipton, Mu Li, and Alexander J. Smola
No ratings yet
Dive Into Deep Learning: Aston Zhang, Zachary C. Lipton, Mu Li, and Alexander J. Smola
1,222 pages
Using Deep Learning For Automatically Determining Correct Application of Basic Quranic Recitation Rules
No ratings yet
Using Deep Learning For Automatically Determining Correct Application of Basic Quranic Recitation Rules
7 pages
Deep Learning Curriculum
No ratings yet
Deep Learning Curriculum
23 pages
AI
No ratings yet
AI
12 pages
UNIT5
No ratings yet
UNIT5
60 pages
SRM Valliammai Engineering College (An Autonomous Institution)
No ratings yet
SRM Valliammai Engineering College (An Autonomous Institution)
11 pages
A Major Project Report ON "Mnist (Digit Recognisation) " Submitted To (M.P.)
No ratings yet
A Major Project Report ON "Mnist (Digit Recognisation) " Submitted To (M.P.)
21 pages
01 Basics 02knn 01
No ratings yet
01 Basics 02knn 01
7 pages
Chen Kelly Xiu LLM 2024
No ratings yet
Chen Kelly Xiu LLM 2024
62 pages
Seasonal Crops Disease Prediction and Classification Using Deep Convolutional Encoder Network
No ratings yet
Seasonal Crops Disease Prediction and Classification Using Deep Convolutional Encoder Network
19 pages
7 - 23 - Deep Image Captioning A Review of Methods, Trends and Future Challenges
No ratings yet
7 - 23 - Deep Image Captioning A Review of Methods, Trends and Future Challenges
21 pages
DL-Unit-5
No ratings yet
DL-Unit-5
2 pages
Deep Learning - IIT Ropar - - Unit 9 - Week 6
No ratings yet
Deep Learning - IIT Ropar - - Unit 9 - Week 6
4 pages
Scientific Machine Learning Through Physics-Informed Neural Networks: Where We Are and What's Next
No ratings yet
Scientific Machine Learning Through Physics-Informed Neural Networks: Where We Are and What's Next
67 pages
Fruits & Vegetable Classification and Calories Measurement System
No ratings yet
Fruits & Vegetable Classification and Calories Measurement System
2 pages
ANN_Unit
No ratings yet
ANN_Unit
40 pages
Clustering Before Classification
No ratings yet
Clustering Before Classification
3 pages
Fundamentals
No ratings yet
Fundamentals
32 pages
Vehicle Accidentand Traffic Classification Using Deep Convolutional Neural Networks
No ratings yet
Vehicle Accidentand Traffic Classification Using Deep Convolutional Neural Networks
7 pages
Image Captioning Final
No ratings yet
Image Captioning Final
31 pages
DataDriven ReservoirModeling NAGAO THESIS 2021
No ratings yet
DataDriven ReservoirModeling NAGAO THESIS 2021
119 pages
Goals and Applications of ML
No ratings yet
Goals and Applications of ML
21 pages