Introduction to Named Entity Recognition

Named Entity Recognition (NER) is a common task in Natural Language Processing that aims to find and classify named entities in text, such as person names, organizations, and locations, into predefined categories. NER can be used for applications like machine translation, information retrieval, and question answering. Traditional approaches to NER involve feature extraction and training statistical or machine learning models on features, while current state-of-the-art methods use deep learning models like LSTMs combined with word embeddings. NER performance is typically evaluated using the F1 score, which balances precision and recall of named entity detection.

1. Named Entity Recognition. Tomer Lieber, Moti Goldklang. 5.5.2020

2. What is Named Entity Recognition (NER)? ● A common task of Natural Language Processing (NLP) ● Find and classify entities in text into predefined categories ● Popular categories are person names, organizations and locations ● Usages: machine translation, information retrieval, question-answering...

3. Example: Paris Hilton is an American singer who was born in New York 20 years ago.

4. Another example: Danny bought a Mars chocolate snack yesterday in Tel Aviv.

5. How to solve the Named Entity Recognition task? ● Statistical learning models: Maximum Entropy models, Hidden Markov Models. ● Machine learning models: Support Vector Machines, voted perceptron. ● Deep learning models: Recurrent Neural Network (RNN), Long Short-Term Memory (LSTM), bidirectional LSTM. ● Tokens can be words or letters.

6. The Classes (labels) ● IO encoding (PER, LOC, OTHER): Alex/PER is/O going/O to/O Los/LOC Angeles/LOC ● IOB encoding (B-PER, I-PER, B-LOC, I-LOC, OTHER): Alex/B-PER is/O going/O to/O Los/B-LOC Angeles/I-LOC
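The two encodings on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the presenters' code: the `encode` function and the `(start, end, label)` span format (token indices, end exclusive) are assumptions made for the example.

```python
from typing import List, Tuple

# Hypothetical span format: (start_token, end_token_exclusive, label).
Span = Tuple[int, int, str]


def encode(tokens: List[str], spans: List[Span], scheme: str = "IOB") -> List[str]:
    """Turn entity spans into one tag per token, using IO or IOB encoding."""
    tags = ["O"] * len(tokens)  # "O" marks tokens outside any entity (OTHER)
    for start, end, label in spans:
        for i in range(start, end):
            if scheme == "IO":
                tags[i] = label
            else:
                # IOB: the first token of an entity gets B-, the rest get I-
                prefix = "B" if i == start else "I"
                tags[i] = f"{prefix}-{label}"
    return tags


tokens = "Alex is going to Los Angeles".split()
spans = [(0, 1, "PER"), (4, 6, "LOC")]

io_tags = encode(tokens, spans, "IO")
iob_tags = encode(tokens, spans, "IOB")
print(list(zip(tokens, io_tags)))
print(list(zip(tokens, iob_tags)))
```

The practical difference is that IO encoding cannot separate two adjacent entities of the same type, while the B- prefix in IOB marks where each new entity starts.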

7. Traditional Machine Learning ● Feature extraction ○ Word length ○ Location in the sentence ○ Previous/next word ○ Previous word's label ○ Linguistic features ○ Substrings ○ Regular expression matches ● Train/test via “traditional” models (usually trees)
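As a rough sketch of what the feature extraction step might look like, the function below builds a feature dictionary for one token. The function name, the feature names, and the particular substrings and regular expressions are illustrative assumptions; the slide does not specify them. Linguistic features (for example part-of-speech tags) are left out because they need an external tagger.

```python
import re
from typing import Dict, List


def token_features(tokens: List[str], i: int, prev_label: str) -> Dict[str, object]:
    """Hand-crafted features for tokens[i]; feature names are hypothetical."""
    word = tokens[i]
    return {
        "word_length": len(word),
        "position": i,  # location in the sentence
        "prev_word": tokens[i - 1] if i > 0 else "<START>",
        "next_word": tokens[i + 1] if i < len(tokens) - 1 else "<END>",
        "prev_label": prev_label,  # label assigned to the previous token
        "prefix3": word[:3],  # substrings
        "suffix3": word[-3:],
        # regular expression matches
        "is_capitalized": bool(re.match(r"[A-Z][a-z]+$", word)),
        "has_digit": bool(re.search(r"\d", word)),
    }


tokens = "Danny bought a chocolate snack of Mars yesterday in Tel Aviv".split()
features = token_features(tokens, 6, prev_label="O")  # features for "Mars"
print(features)
```

A dictionary like this, one per token, can then be vectorized and passed to whichever traditional classifier is being trained.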

8. Word Embeddings

9. Neural Networks & LSTM ● Feature extraction? ● Embeddings ● LSTM

10. The evaluation method - F1 score ● In the general F-beta score, β is chosen such that recall is considered β times as important as precision; the F1 score is the case β = 1. ● Precision is the percentage of named entities found by the learning system that are correct. ● Recall is the percentage of named entities present in the corpus that are found by the system.
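The standard F-beta formula is F_β = (1 + β²) · precision · recall / (β² · precision + recall), which reduces to the harmonic mean of precision and recall when β = 1. The sketch below computes it at the entity level, counting a prediction as correct only if its span and label both match exactly. The `f_beta` function, the `(start, end, label)` entity format, and the sample data are assumptions made for illustration.

```python
from typing import Set, Tuple

# Hypothetical entity format: (start_token, end_token_exclusive, label).
Entity = Tuple[int, int, str]


def f_beta(gold: Set[Entity], predicted: Set[Entity], beta: float = 1.0):
    """Return (precision, recall, F-beta) using exact-match entity counting."""
    tp = len(gold & predicted)  # entities with the right span and label
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(gold) if gold else 0.0
    if precision == 0.0 and recall == 0.0:
        return precision, recall, 0.0
    b2 = beta ** 2
    score = (1 + b2) * precision * recall / (b2 * precision + recall)
    return precision, recall, score


# Made-up annotations: 3 gold entities, 4 predictions, 2 of them correct.
gold = {(0, 1, "PER"), (4, 6, "LOC"), (8, 9, "ORG")}
predicted = {(0, 1, "PER"), (4, 6, "LOC"), (10, 11, "LOC"), (12, 13, "PER")}

precision, recall, f1 = f_beta(gold, predicted)
_, _, f2 = f_beta(gold, predicted, beta=2.0)  # weights recall more heavily
print(precision, recall, f1, f2)
```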

11. The evaluation problem First Bank Chicago announced an important message last week...
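The problem with this sentence can be made concrete with a small count of true positives, false positives and false negatives under exact-match scoring. The slide does not state the gold annotation, so the sketch assumes that "First Bank Chicago" is a single ORG entity and that the system predicts only "Bank Chicago"; the `count_errors` helper and the span format are likewise hypothetical.

```python
from typing import Set, Tuple

# Hypothetical entity format: (start_token, end_token_exclusive, label).
Entity = Tuple[int, int, str]


def count_errors(gold: Set[Entity], predicted: Set[Entity]) -> Tuple[int, int, int]:
    """Return (true positives, false positives, false negatives), exact match only."""
    tp = len(gold & predicted)
    fp = len(predicted - gold)
    fn = len(gold - predicted)
    return tp, fp, fn


tokens = "First Bank Chicago announced an important message last week".split()

gold = {(0, 3, "ORG")}            # assumed gold entity: "First Bank Chicago"
boundary_error = {(1, 3, "ORG")}  # assumed system output: "Bank Chicago"
nothing = set()                   # system predicts no entity at all

with_boundary_error = count_errors(gold, boundary_error)
with_nothing = count_errors(gold, nothing)
print(with_boundary_error)
print(with_nothing)
```

Under these assumptions the near-miss is charged one false positive and one false negative, while predicting nothing is charged only the false negative, so the almost-correct system scores worse.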

12. The CoNLL-2003 Shared Task ● An academic conference held a competition to find the best method to detect and classify entities in a given text (English and German). ● Sixteen groups participated in the competition. ● Each group received a training file, a development file, a test file and a large file with unannotated data (Reuters news stories and a German newspaper). ● They employed a wide variety of machine learning techniques as well as system combination. ● The performance in this task was measured by a variant of the F1 score.

13. The CoNLL-2003 Shared Task - Results

14. The CoNLL-2003 Shared Task - Progress

15. References ● NLP Progress - Named Entity Recognition ● Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition ● Contextual String Embeddings for Sequence Labeling ● F1 Score ● Tel Aviv Meetup: Deep Learning for Named Entity Recognition - Kfir Bar

16. Questions?

Editor's Notes

#4: Classify entities, not tokens: note that we classify "Paris Hilton" together as one entity and not as two tokens, because we want to preserve the connection between the tokens.
#12: The measure behaves a bit strangely when there are boundary errors (which are common). A boundary error counts as both a false positive and a false negative, so selecting nothing would have scored better. There are other methods, such as MUC, that give partial credit according to complex rules, but they also have their disadvantages.
#13: The categories are person names, organizations, locations and others. The shared task organizers were especially interested in approaches that made use of resources other than the supplied training data, for example gazetteers and unannotated data. The learning methods were trained with the training data. The development data could be used for tuning the parameters of the learning methods.