
INTRODUCTION TO NATURAL LANGUAGE PROCESSING (NLP)

Overview and Key Concepts

Mrs. S. Arunakumari, AP/AI&DS
INTRODUCTION
What is NLP?
NLP is a field of AI that focuses on the interaction
between computers and humans through natural
language.

Importance of NLP
• Enhances human-computer interaction.
• Automates and improves information
extraction.
History of NLP
Early Beginnings
• Roots in computational linguistics.

Evolution Over the Decades
• Transition from rule-based to statistical models to deep learning.
Key Concepts in NLP
Tokenization
• Splitting text into words or sentences.
Stop Words
• Common words filtered out in text processing.
Stemming and Lemmatization
• Reducing words to their base or root form.
Part-of-Speech Tagging
• Identifying grammatical categories.
Named Entity Recognition (NER)
• Detecting proper names in text.
Syntactic Parsing
• Analyzing sentence structure.
Sentiment Analysis
• Determining emotional tone.
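The first two preprocessing steps above, tokenization and stop-word removal, can be sketched in plain Python. The regex and the stop-word list below are illustrative assumptions, not taken from any particular toolkit (real libraries such as NLTK and spaCy ship much larger stop-word lists):

```python
import re

# Hypothetical stop-word list for illustration; real toolkits ship larger ones.
STOP_WORDS = {"the", "a", "an", "is", "of", "and", "to", "in", "at"}

def tokenize(text):
    """Split text into lowercase word tokens using a simple regex."""
    return re.findall(r"[a-z']+", text.lower())

def remove_stop_words(tokens):
    """Filter out common function words before further processing."""
    return [t for t in tokens if t not in STOP_WORDS]

text = "The girl laughed at the monkey in the boat."
tokens = tokenize(text)
filtered = remove_stop_words(tokens)
print(tokens)
print(filtered)
```

Filtering stop words shrinks the token list to the content-bearing words, which is often the input to downstream tasks like sentiment analysis.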
NLP Techniques
Rule-Based Approaches
• Handcrafted linguistic rules.
Statistical Methods
• Probabilistic models.
Machine Learning
• Supervised and unsupervised learning.
Deep Learning
• Neural networks and embeddings.
NLP in the Real World
• Email platforms, such as Gmail and Outlook, use NLP extensively to provide a range of product features, such as spam classification, priority inbox, calendar event extraction, and auto-complete.
• Voice-based assistants, such as Apple Siri, Google Assistant, Microsoft Cortana, and Amazon Alexa, rely on a range of NLP techniques to interact with the user, understand user commands, and respond accordingly.
• Modern search engines, such as Google and Bing, which are the cornerstone of today's internet, use NLP heavily for various subtasks, such as query understanding, query expansion, question answering, information retrieval, and ranking and grouping of the results, to name a few.
• Machine translation services, such as Google Translate, Bing Microsoft Translator, and Amazon Translate, are increasingly used in today's world to solve a wide range of scenarios and business use cases.
• Organizations across verticals analyze their social media feeds to build a better and deeper understanding of the voice of their customers.
• NLP is widely used to solve diverse sets of use cases on e-commerce platforms like Amazon. These vary from extracting relevant information from product descriptions to understanding user reviews.
• Advances in NLP are being applied to solve use cases in domains such as healthcare, finance, and law.
• Companies such as Arria [1] are working to use NLP techniques to automatically generate reports for various domains, from weather forecasting to financial services.
• NLP forms the backbone of spelling- and grammar-correction tools, such as Grammarly and the spell checkers in Microsoft Word and Google Docs.
• Jeopardy! is a popular TV quiz show in which contestants are presented with clues in the form of answers and must phrase their responses in the form of questions.
• IBM built the Watson AI to compete with the show's top players. Watson won the first-place prize of a million dollars, beating the world champions. Watson was built using NLP techniques and is one of the examples of NLP systems winning a world competition.

• NLP is used in a range of learning and assessment tools and technologies, such as automated scoring in exams like the Graduate Record Examination (GRE), plagiarism detection (e.g., Turnitin), intelligent tutoring systems, and language learning apps like Duolingo.
• NLP is used to build large knowledge bases, such as the Google Knowledge Graph, which are useful in a range of applications like search and question answering.
NLP Tasks
• There is a collection of fundamental tasks that appear frequently across various NLP projects.
• Language modeling: This is the task of predicting what the next word in a sentence will be, based on the history of previous words.
• The goal of this task is to learn the probability of a sequence of words appearing in a given language. Language modeling is useful for building solutions to a wide variety of problems, such as speech recognition, optical character recognition, handwriting recognition, machine translation, and spelling correction.
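The idea of learning sequence probabilities can be illustrated with a minimal bigram language model. The toy corpus below and the maximum-likelihood estimate are illustrative assumptions; a real model would be trained on vastly more text and would smooth the counts:

```python
from collections import Counter

# Toy corpus for illustration; real models train on millions of sentences.
corpus = [
    "the cat sat on the mat",
    "the cat ate the fish",
    "the dog sat on the rug",
]

# Count unigram and bigram frequencies across the corpus.
unigrams = Counter()
bigrams = Counter()
for sentence in corpus:
    words = sentence.split()
    unigrams.update(words)
    bigrams.update(zip(words, words[1:]))

def bigram_prob(prev, word):
    """P(word | prev), estimated by maximum likelihood from the corpus."""
    if unigrams[prev] == 0:
        return 0.0
    return bigrams[(prev, word)] / unigrams[prev]

def predict_next(prev):
    """Return the most likely next word after `prev`."""
    candidates = {w: bigram_prob(prev, w) for w in unigrams}
    return max(candidates, key=candidates.get)

print(bigram_prob("the", "cat"))
print(predict_next("sat"))
```

This captures the core of language modeling, predicting the next word from history, using only a one-word history; neural language models extend the same objective to much longer contexts.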
• Language is a structured system of communication that involves complex combinations of its constituent components, such as characters, words, and sentences. Linguistics is the systematic study of language. In order to study NLP, it is important to understand some concepts from linguistics about how language is structured.
• We can think of human language as composed of four major building blocks: phonemes, morphemes and lexemes, syntax, and context. NLP applications need knowledge of these building blocks at different levels, starting from the basic sounds of language (phonemes) up to texts with meaningful expressions.
Phonemes:
• Phonemes are the smallest units of sound in a language. They may not have any meaning by themselves but can induce meaning when uttered in combination with other phonemes. For example, standard English has 44 phonemes, which are represented by either single letters or combinations of letters.
Morphemes and lexemes:
• A morpheme is the smallest unit of language that has a meaning. It is formed by a combination of phonemes. Not all morphemes are words, but all prefixes and suffixes are morphemes. For example, in the word "multimedia," "multi-" is not a word but a prefix that changes the meaning when put together with "media." "Multi-" is a morpheme.
• Lexemes are the structural variations of morphemes related to one another by meaning. For example, "run" and "running" belong to the same lexeme form. Morphological analysis, which analyzes the structure of words by studying their morphemes and lexemes, is a foundational block for many NLP tasks, such as tokenization, stemming, learning word embeddings, and part-of-speech tagging.
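A toy suffix-stripping stemmer gives a feel for how morphological analysis reduces "running" toward its lexeme. The rule list below is a made-up simplification; real systems use the Porter or Snowball algorithms (available, for example, in NLTK), which have many more rules and exception handling:

```python
# Hypothetical, ordered suffix rules; "ning" precedes "ing" so that
# "running" strips to "run" rather than "runn" in this toy scheme.
SUFFIXES = ["ning", "ing", "ed", "es", "s"]

def stem(word):
    """Strip the first matching suffix, keeping a stem of at least 3 characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: len(word) - len(suffix)]
    return word

for w in ["running", "jumped", "cats", "run"]:
    print(w, "->", stem(w))
```

Note how crude rule ordering stands in for real morphological knowledge; this is why stemming can produce non-words, while lemmatization uses a vocabulary to return proper base forms.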
Syntax:
• Syntax is a set of rules to construct grammatically correct sentences out of words and phrases in a language. Syntactic structure in linguistics is represented in many different ways. A common approach to representing sentences is a parse tree.
• Consider the sentences "The girl laughed at the monkey" and "The boat sailed up the river": both have a similar structure and hence a similar syntactic parse tree. In this representation, N stands for noun, V for verb, and P for preposition.
• A noun phrase is denoted by NP and a verb phrase by VP. The two noun phrases are "The girl" and "The boat," while the two verb phrases are "laughed at the monkey" and "sailed up the river."
• The syntactic structure is guided by a set of grammar rules for the language (e.g., a sentence comprises an NP and a VP), and this in turn guides some of the fundamental tasks of language processing, such as parsing. Parsing is the NLP task of constructing such trees automatically. Entity extraction and relation extraction are some of the NLP tasks that build on this knowledge of parsing.
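A parse tree like the one described can be encoded as nested tuples in Python. The NP and VP labels follow the text above; the auxiliary labels (S, Det, V, P, PP) are assumed conventions for this sketch:

```python
# Parse tree for "The girl laughed at the monkey", as nested (label, children)
# tuples. S = sentence, NP = noun phrase, VP = verb phrase,
# PP = prepositional phrase, Det = determiner, V = verb, P = preposition.
tree = (
    "S",
    ("NP", ("Det", "The"), ("N", "girl")),
    ("VP",
        ("V", "laughed"),
        ("PP",
            ("P", "at"),
            ("NP", ("Det", "the"), ("N", "monkey")))),
)

def leaves(node):
    """Collect the words at the leaves of the tree, left to right."""
    if isinstance(node, str):
        return [node]
    words = []
    for child in node[1:]:  # node[0] is the label; the rest are children
        words.extend(leaves(child))
    return words

sentence = " ".join(leaves(tree))
print(sentence)
```

Reading the leaves left to right recovers the original sentence, which is exactly the invariant a parser must respect when it builds such trees automatically.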
Context:
• Context is how various parts of a language come together to convey a particular meaning. Context includes long-term references, world knowledge, and common sense, along with the literal meaning of words and phrases. The meaning of a sentence can change based on the context, as words and phrases can sometimes have multiple meanings.
• Generally, context is composed of semantics and pragmatics. Semantics is the direct meaning of the words and sentences without external context. Pragmatics adds world knowledge and the external context of the conversation to enable us to infer implied meaning. Complex NLP tasks such as sarcasm detection, summarization, and topic modeling are some of the tasks that use context heavily.
• Linguistics is the study of language and hence is a vast area in itself; we have only introduced some basic ideas to illustrate the role of linguistic knowledge in NLP.
• Different tasks in NLP require varying degrees of knowledge about these building blocks of language.
Why Is NLP Challenging?
What makes NLP a challenging problem domain? The ambiguity
and creativity of human language are just two of the
characteristics that make NLP a demanding area to work in.
Ambiguity:
Ambiguity means uncertainty of meaning. Most human
languages are inherently ambiguous. Consider the following
sentence: “I made her duck.” This sentence has multiple
meanings. The first one is: I cooked a duck for her.
• The second meaning is: I made her bend down to avoid an
object. (There are other possible meanings, too; we’ll leave
them for the reader to think of.) Here, the ambiguity comes
from the use of the word “made.” Which of the two meanings
applies depends on the context in which the sentence appears.
• If the sentence appears in a story about a mother and a child, then the first meaning will probably apply. But if the sentence appears in a book about sports, then the second meaning will likely apply.
• When it comes to figurative language—i.e., idioms—the
ambiguity only increases. For example, “He is as good as John
Doe.” Try to answer, “How good is he?” The answer depends
on how good John Doe is.
• Another class of difficult examples comes from the Winograd Schema Challenge [5], named after Professor Terry Winograd of Stanford University. This schema has pairs of sentences that differ by only a few words, but the meaning of the sentences is often flipped because of this minor change.
• These examples are easily disambiguated by a human but are not solvable using most NLP techniques. Consider such a pair of sentences and the questions associated with them; with some thought, it should be apparent how the answer changes based on a single word variation.
• As another experiment, consider taking an off-the-shelf NLP system
like Google Translate and try various examples to see how such
ambiguities affect (or don’t affect) the output of the system.
• Common knowledge A key aspect of any human language is
“common knowledge.” It is the set of all facts that most
humans are aware of. In any conversation, it is assumed that
these facts are known, hence they’re not explicitly mentioned,
but they do have a bearing on the meaning of the sentence. For
example, consider two sentences: “man bit dog” and “dog bit
man.” We all know that the first sentence is unlikely to
happen, while the second one is very possible.
• Why do we say so? Because we all "know" that it is very unlikely that a human will bite a dog. Further, dogs are known to bite humans. This knowledge is required for us to say that the first sentence is unlikely to happen while the second one is possible. Note that this common knowledge was not mentioned in either sentence.
• Humans use common knowledge all the time to understand
and process any language. In the above example, the two
sentences are syntactically very similar, but a computer
would find it very difficult to differentiate between the two,
as it lacks the common knowledge humans have. One of the
key challenges in NLP is how to encode all the things that are
common knowledge to humans in a computational model.
• Creativity: Language is not just rule driven; there is also a creative aspect to it. Various styles, dialects, genres, and variations are used in any language. Poems are a great example of creativity in language. Making machines understand creativity is a hard problem not just in NLP, but in AI in general.
• Diversity across languages: For most languages in the world, there is no direct mapping between the vocabularies of any two languages. This makes porting an NLP solution from one language to another hard. A solution that works for one language might not work at all for another language. This means that one either builds a language-agnostic solution or builds separate solutions for each language. While the first is conceptually very hard, the second is laborious and time intensive. All these issues make NLP a challenging field to work in.
Challenges in NLP
Ambiguity
• Words with multiple meanings.
Context Understanding
• Capturing context in language.
Sarcasm and Irony
• Detecting non-literal meanings.
Multilingual Processing
• Handling multiple languages.
Recent Advances in NLP
Transformer Models (e.g., BERT, GPT)
• Self-attention mechanism.
Pre-trained Language Models
• Transfer learning.
Transfer Learning in NLP
• Fine-tuning pre-trained models.
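The self-attention mechanism at the heart of transformer models can be sketched with NumPy as scaled dot-product attention, softmax(QK^T / sqrt(d)) V. The dimensions and random weights below are arbitrary, chosen only for illustration:

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over the last axis."""
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention for a single head.

    X: (tokens, dim) input embeddings; Wq/Wk/Wv project them to
    queries, keys, and values.
    """
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)      # pairwise token-to-token scores
    weights = softmax(scores)          # each row is a distribution over tokens
    return weights @ V, weights

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))            # 4 tokens, 8-dim embeddings (arbitrary)
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
out, weights = self_attention(X, Wq, Wk, Wv)
print(out.shape, weights.shape)
```

Each output row is a weighted mix of all value vectors, which is how every token's representation can attend to every other token in the sequence; transformer models stack many such heads and layers.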
Tools and Libraries
NLTK
• Natural Language Toolkit.
SpaCy
• Industrial-strength NLP.
Hugging Face Transformers
• State-of-the-art models.
OpenNLP
• Apache's machine learning library.
Future of NLP
Trends and Predictions
• Continued integration of NLP in daily life.

Research Directions
• Ethical AI, bias reduction, multilingual
models.
