SlideShare a Scribd company logo
2
Most read
5
Most read
6
Most read
Named Entity
Recognition
Tomer Lieber
Moti Goldklang
5.5.2020
What is Named Entity Recognition (NER)?
● A common task of Natural Language Processing (NLP)
● Find and classify entities in text into predefined categories
● Popular categories are person names, organizations and locations
● Usages: machine translation, information retrieval, question-answering...
Example
Paris Hilton is an American singer that borned in New York 20 years ago.
Or
Another Example
Danny bought a chocolate snack of Mars yesterday in Tel Aviv.
Or
How to solve Named Entity Recognition task?
● Statistical learning models: Maximum Entropy model, Hidden Markov
Models.
● Machine Learning models: Support Vector Machines, Voted perceptron.
● Deep Learning models: Recurrent neural network (RNN), Long short term
Memory (LSTM), Bidirectional LSTM
● Tokens as words or letters
The Classes (labels)
● IO encoding (PER, LOC, OTHER)
Alex is going to Los Angeles
● IOB encoding (B-PER, I-PER, B-LOC, I-LOC, OTHER)
Alex is going to Los Angeles
PER O O O LOC LOC
B-PER O O O B-LOC I-LOC
Traditional Machine Learning
● Feature extraction
○ Word length
○ Location in the sentence
○ Previous/next word
○ Previous word label
○ Linguistics
○ Substrings
○ Regular expressions matches
● Train/Test via “traditional” models (usually trees)
Word Embeddings
Neural Networks & LSTM
● Feature extraction?
● Embeddings
● LSTM
The evaluation method - F1 score
● β is chosen such that recall is considered β times as important as precision.
● precision is the percentage of named entities found by the learning system
that are correct
● recall is the percentage of named entities present in the corpus that are
found by the system.
The evaluation problem
First Bank Chicago announced an important message last week...
The CoNLL-2003 Shared Task
● An academic conference held a competition to find a perfect method to detect
and classify entities in a given text (english and german).
● Sixteen groups have participated in the competition.
● Each group received a training file, a development file, a test file and a large
file with unannotated data (Reuters news stories and a German newspaper).
● They employed a wide variety of machine learning techniques as well as
system combination.
● The performance in this task was measured by a variant of the F1 score.
The CoNLL-2003 Shared Task - Results
The CoNLL-2003 Shared Task - Progress
References
● NLP Progress - Named Entity Recognition
● Introduction to the CoNLL-2003 Shared Task: Language-Independent
Named Entity Recognition
● Contextual String Embeddings for Sequence Labeling
● F1 Score
● Tel Aviv Meetup: Deep Learning for Named Entity Recognition - Kfir Bar
Questions?

More Related Content

What's hot (20)

PDF
Neural Architectures for Named Entity Recognition
Rrubaa Panchendrarajan
 
PDF
BERT: Bidirectional Encoder Representations from Transformers
Liangqun Lu
 
PDF
An introduction to the Transformers architecture and BERT
Suman Debnath
 
PPTX
Building Named Entity Recognition Models Efficiently using NERDS
Sujit Pal
 
PDF
Glove global vectors for word representation
hyunyoung Lee
 
PDF
IE: Named Entity Recognition (NER)
Marina Santini
 
PPTX
Word embedding
ShivaniChoudhary74
 
PDF
GPT-2: Language Models are Unsupervised Multitask Learners
Young Seok Kim
 
PDF
Deep Learning for Natural Language Processing: Word Embeddings
Roelof Pieters
 
PPTX
Bert
Abdallah Bashir
 
PDF
Natural Language Processing with Python
Benjamin Bengfort
 
PDF
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Minh Pham
 
PDF
Syntactic analysis in NLP
kartikaVashisht
 
PPT
Clustering
NLPseminar
 
PDF
Natural language processing
National Institute of Technology Durgapur
 
PPTX
BERT introduction
Hanwha System / ICT
 
PPTX
[Paper Reading] Attention is All You Need
Daiki Tanaka
 
PDF
LSTM Based Sentiment Analysis
ijtsrd
 
PDF
Introduction to Recurrent Neural Network
Yan Xu
 
PPTX
NLP_KASHK:Minimum Edit Distance
Hemantha Kulathilake
 
Neural Architectures for Named Entity Recognition
Rrubaa Panchendrarajan
 
BERT: Bidirectional Encoder Representations from Transformers
Liangqun Lu
 
An introduction to the Transformers architecture and BERT
Suman Debnath
 
Building Named Entity Recognition Models Efficiently using NERDS
Sujit Pal
 
Glove global vectors for word representation
hyunyoung Lee
 
IE: Named Entity Recognition (NER)
Marina Santini
 
Word embedding
ShivaniChoudhary74
 
GPT-2: Language Models are Unsupervised Multitask Learners
Young Seok Kim
 
Deep Learning for Natural Language Processing: Word Embeddings
Roelof Pieters
 
Natural Language Processing with Python
Benjamin Bengfort
 
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Minh Pham
 
Syntactic analysis in NLP
kartikaVashisht
 
Clustering
NLPseminar
 
Natural language processing
National Institute of Technology Durgapur
 
BERT introduction
Hanwha System / ICT
 
[Paper Reading] Attention is All You Need
Daiki Tanaka
 
LSTM Based Sentiment Analysis
ijtsrd
 
Introduction to Recurrent Neural Network
Yan Xu
 
NLP_KASHK:Minimum Edit Distance
Hemantha Kulathilake
 

Similar to Introduction to Named Entity Recognition (20)

PDF
leewayhertz.com-Named Entity Recognition NER Unveiling the value in unstructu...
KristiLBurns
 
DOCX
NLP Techniques for Named Entity Recognition.docx
KevinSims18
 
PDF
Named Entity Recognition using Bi-LSTM and Tenserflow Model
IRJET Journal
 
PDF
Domain Specific Named Entity Recognition Using Supervised Approach
Waqas Tariq
 
PDF
A survey of named entity recognition in assamese and other indian languages
ijnlc
 
PDF
SEMI-SUPERVISED BOOTSTRAPPING APPROACH FOR NAMED ENTITY RECOGNITION
kevig
 
PPTX
Named Entity Recognition - ACL 2011 Presentation
Richard Littauer
 
PDF
Named Entity Recognition from Online News
Bernardo Najlis
 
PDF
Evaluating Named Entity Recognition and Disambiguation in News and Tweets
Marieke van Erp
 
PDF
STUDY OF NAMED ENTITY RECOGNITION FOR INDIAN LANGUAGES
ijistjournal
 
PDF
STUDY OF NAMED ENTITY RECOGNITION FOR INDIAN LANGUAGES
ijistjournal
 
PDF
Handling ambiguities and unknown words in named entity recognition using anap...
ijcsa
 
PDF
HINDI NAMED ENTITY RECOGNITION BY AGGREGATING RULE BASED HEURISTICS AND HIDDE...
ijistjournal
 
PDF
HINDI NAMED ENTITY RECOGNITION BY AGGREGATING RULE BASED HEURISTICS AND HIDDE...
ijistjournal
 
PDF
HANDLING UNKNOWN WORDS IN NAMED ENTITY RECOGNITION USING TRANSLITERATION
ijnlc
 
PDF
ISSUES AND CHALLENGES IN MARATHI NAMED ENTITY RECOGNITION
ijnlc
 
PPTX
Named Entity RecognitionDatascience.pptx
Anandh798253
 
PDF
Named Entity Recognition from Online News
Bernardo Najlis
 
PPTX
Reading Group 2013 (DERI NUIG)
Bianca Pereira
 
PDF
HIDDEN MARKOV MODEL BASED NAMED ENTITY RECOGNITION TOOL
ijfcstjournal
 
leewayhertz.com-Named Entity Recognition NER Unveiling the value in unstructu...
KristiLBurns
 
NLP Techniques for Named Entity Recognition.docx
KevinSims18
 
Named Entity Recognition using Bi-LSTM and Tenserflow Model
IRJET Journal
 
Domain Specific Named Entity Recognition Using Supervised Approach
Waqas Tariq
 
A survey of named entity recognition in assamese and other indian languages
ijnlc
 
SEMI-SUPERVISED BOOTSTRAPPING APPROACH FOR NAMED ENTITY RECOGNITION
kevig
 
Named Entity Recognition - ACL 2011 Presentation
Richard Littauer
 
Named Entity Recognition from Online News
Bernardo Najlis
 
Evaluating Named Entity Recognition and Disambiguation in News and Tweets
Marieke van Erp
 
STUDY OF NAMED ENTITY RECOGNITION FOR INDIAN LANGUAGES
ijistjournal
 
STUDY OF NAMED ENTITY RECOGNITION FOR INDIAN LANGUAGES
ijistjournal
 
Handling ambiguities and unknown words in named entity recognition using anap...
ijcsa
 
HINDI NAMED ENTITY RECOGNITION BY AGGREGATING RULE BASED HEURISTICS AND HIDDE...
ijistjournal
 
HINDI NAMED ENTITY RECOGNITION BY AGGREGATING RULE BASED HEURISTICS AND HIDDE...
ijistjournal
 
HANDLING UNKNOWN WORDS IN NAMED ENTITY RECOGNITION USING TRANSLITERATION
ijnlc
 
ISSUES AND CHALLENGES IN MARATHI NAMED ENTITY RECOGNITION
ijnlc
 
Named Entity RecognitionDatascience.pptx
Anandh798253
 
Named Entity Recognition from Online News
Bernardo Najlis
 
Reading Group 2013 (DERI NUIG)
Bianca Pereira
 
HIDDEN MARKOV MODEL BASED NAMED ENTITY RECOGNITION TOOL
ijfcstjournal
 
Ad

Recently uploaded (20)

PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PDF
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PDF
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
PDF
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
PPTX
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PDF
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PPTX
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
Ad

Introduction to Named Entity Recognition

  • 2. What is Named Entity Recognition (NER)? ● A common task of Natural Language Processing (NLP) ● Find and classify entities in text into predefined categories ● Popular categories are person names, organizations and locations ● Usages: machine translation, information retrieval, question-answering...
  • 3. Example Paris Hilton is an American singer that borned in New York 20 years ago. Or
  • 4. Another Example Danny bought a chocolate snack of Mars yesterday in Tel Aviv. Or
  • 5. How to solve Named Entity Recognition task? ● Statistical learning models: Maximum Entropy model, Hidden Markov Models. ● Machine Learning models: Support Vector Machines, Voted perceptron. ● Deep Learning models: Recurrent neural network (RNN), Long short term Memory (LSTM), Bidirectional LSTM ● Tokens as words or letters
  • 6. The Classes (labels) ● IO encoding (PER, LOC, OTHER) Alex is going to Los Angeles ● IOB encoding (B-PER, I-PER, B-LOC, I-LOC, OTHER) Alex is going to Los Angeles PER O O O LOC LOC B-PER O O O B-LOC I-LOC
  • 7. Traditional Machine Learning ● Feature extraction ○ Word length ○ Location in the sentence ○ Previous/next word ○ Previous word label ○ Linguistics ○ Substrings ○ Regular expressions matches ● Train/Test via “traditional” models (usually trees)
  • 9. Neural Networks & LSTM ● Feature extraction? ● Embeddings ● LSTM
  • 10. The evaluation method - F1 score ● β is chosen such that recall is considered β times as important as precision. ● precision is the percentage of named entities found by the learning system that are correct ● recall is the percentage of named entities present in the corpus that are found by the system.
  • 11. The evaluation problem First Bank Chicago announced an important message last week...
  • 12. The CoNLL-2003 Shared Task ● An academic conference held a competition to find a perfect method to detect and classify entities in a given text (english and german). ● Sixteen groups have participated in the competition. ● Each group received a training file, a development file, a test file and a large file with unannotated data (Reuters news stories and a German newspaper). ● They employed a wide variety of machine learning techniques as well as system combination. ● The performance in this task was measured by a variant of the F1 score.
  • 13. The CoNLL-2003 Shared Task - Results
  • 14. The CoNLL-2003 Shared Task - Progress
  • 15. References ● NLP Progress - Named Entity Recognition ● Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition ● Contextual String Embeddings for Sequence Labeling ● F1 Score ● Tel Aviv Meetup: Deep Learning for Named Entity Recognition - Kfir Bar

Editor's Notes

  • #4: Classify entity and not token: put attention that we classify Paris Hilton together as a one entity and not as two token because we want to hold the connection between the tokens.
  • #12: The measure behave a bit weird when there are boundary errors (which are common). This counts as both fp a fn, therefore select nothing would have been better. There are some other methods like MUC that give partial credit according to complex rules, but it also has its disadvantages.
  • #13: The categories are person names, organizations, locations and others. The shared task organizers were especially interested in approaches that made use of resources other than the supplied training data, for example gazetteers and unannotated data The learning methods were trained with the training data. The development data could be used for tuning the parameters of the learning methods.