Unit-5-NLP (1)
Unit-5-NLP (1)
comes with NLP and it tries to find all linguistic expressions in a given text that refer to
the same real-world entity. This is how it works.
Suppose you have to find the pronouns in a sentence and replace them with relevant
nouns. Coreference resolution can be used to do that. It finds and groups the words which
refer to the same entities and replaces pronouns with noun phrases.
I Gave my book to Abdul because he said that he wants to write the Assignment.
• Text understanding
• Document summarization
• Information extraction
• Sentiment analysis
Anaphora: is the use of the expression whose interpretation depends upon another
expression in context (its antecedent)
Coreference Vs Anaphora:
Hobb’s algorithm is one of the several approaches for pronoun resolution. The algorithm
is mainly based on the syntactic parse tree of the sentences. Hobbs Algorithm is one of the
techniques used for Pronoun Resolu on.
So here, we have the syntactic parse tree of the two sentences as shown.
ti
The algorithm starts with the target pronoun and walks up the parse tree to the root node
‘S’. For each noun phrase or ‘S’ node that it finds, it does the breadth rst le to right
search of the node’s children to the left of the target. So in our example, the algorithm
starts with the parse tree of the sentence 2 and climbs up to the root node S2. Then it does
a breadth first search to find the noun phrase (NP). Here the algorithm, finds its first noun
phrase for noun ‘Jill’. [Source : https://medium.com/analytics-vidhya/hobbs-algorithm-pronoun-
resolution-7620aa1af538]
Binding theory states that: A re exive can refer to the subject of the most immediate
clause in which it appears, whereas a nonre exive cannot co-refer this subject. Words
such as himself, herself, themselves, etc. are known as reflexive.
Hence the algorithm now starts the search in the syntax tree of the previous sentence. And
hence the subject Jack in the sentence, Jack is an engineer, is explored before the object
engineer and finally Jack is the resolved referent for the pronoun him.
Discourse:
When we are dealing with Natural Language Processing, the provided language consists
of structured, collective, and consistent groups of sentences, which are termed discourse
in NLP. Discourse Analysis is extracting the meaning out of the corpus or text. Discourse
Analysis is very important in Natural language Processing and helps train the NLP model
better.
Coherence in terms of Discourse in NLP means making sense of the words or making
meaningful connections and correlations. There is a lot of connection between the
coherence and the discourse structure. The coherent rela on tells us that there is some
sort of connection present between the words.
Semantic Roles:
Natural Language Processing (NLP), a branch of artificial intelligence which studies the
ability of computers to interpret and “understand” the human language. It is an open
problem in computational linguistics concerned with identifying which sense of a word is
used in a sentence.
Dictionary-based or Knowledge-based Methods. As the name suggests, for
disambiguation, these methods primarily rely on dictionaries
Supervised Methods:
The context is represented as a set of “features” of the words. It includes the information
about the surrounding words also. Support vector machine and memory-based learning are
the most successful supervised learning approaches to WSD.
Semi-supervised Methods