Assignment Two

Morphological analysis and part-of-speech tagging are important natural language processing techniques. Morphological analysis involves identifying the morphemes that make up words to determine their grammatical properties. Part-of-speech tagging assigns tags like noun or verb to each word to identify its part of speech. Both are fundamental steps for applications such as machine translation, text-to-speech, and information extraction by providing information on word properties and structures.

Jimma University

Jimma Institute of Technology

Faculty of Computing
Department of Information Technology
MSc in Information Technology
Course Code: CMIT 6122
Course Title: Natural Language Processing
Given Work: Assignment Two

Submitted To: Getachew Mammo (PhD)

Prepared by: Zerihun Tadesse Gebre
ID No: RM 2903/12-0

Why Is Morphological Analysis Important?

Morphological analysis is the process of deriving grammatical information about a word from
the properties of the morphemes it contains. It is an integral part of larger natural
language processing projects such as text-to-speech synthesis, information extraction, and
machine translation. Morphology is the subdiscipline of linguistics that deals with the internal
structure of words. For example, consider the following set of English word pairs:

Verb Noun
Bake Baker
Eat Eater
Run Runner
Write Writer
In these word pairs we observe a systematic form-meaning correspondence: the presence of –er
in the words in the right column correlates with the meaning component ‘one who Vs’ where V
stands for the meaning of the corresponding verb in the left column.
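
The form-meaning correspondence above can be sketched as a small spelling rule. This is only an illustration for the four word pairs in the table: the function name agent_noun and the consonant-doubling heuristic are assumptions made for this sketch, and the rule will fail on many other English verbs.

```python
VOWELS = set("aeiou")

def agent_noun(verb: str) -> str:
    """Form the 'one who Vs' noun from a verb with a toy -er rule."""
    v = verb.lower()
    # Verbs ending in silent e only take -r (bake -> baker).
    if v.endswith("e"):
        return v + "r"
    # Short consonant-vowel-consonant endings double the last letter (run -> runner).
    is_cvc = (
        len(v) >= 3
        and v[-1] not in VOWELS and v[-1] not in "wxy"
        and v[-2] in VOWELS
        and v[-3] not in VOWELS
    )
    if is_cvc:
        return v + v[-1] + "er"
    return v + "er"

for verb in ["bake", "eat", "run", "write"]:
    print(verb, "->", agent_noun(verb))
```

Running the loop prints the four nouns from the right-hand column of the table.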

Morphological analysis also helps determine the part of speech of each word during syntactic
parsing and sentence analysis. Information about verbal inflection is especially relevant to
word order. Moreover, a single word may consist of two or more meaningful parts. These parts
are the smallest units of meaning, known as morphemes.

Morphology is concerned with the nature of words, which are built from morphemes. For example,
the word precancellation can be analyzed into three morphemes: the prefix pre-, the root cancel
(spelled cancell- before the suffix), and the suffix -ation. The meaning of a morpheme stays the
same across the words it appears in, so a reader can break an unfamiliar word into morphemes to
work out its meaning. For example, adding the suffix -ed to a verb conveys that the action of the
verb took place in the past. Morphemes that cannot be divided further and carry meaning by
themselves are called lexical morphemes (e.g. table, chair). Affixes such as -ed, -ing, -est,
-ly, and -ful that combine with a lexical morpheme are known as grammatical morphemes (e.g.
worked, consulting, smallest, likely, useful). Grammatical morphemes that occur only in
combination with another morpheme are called bound morphemes (e.g. -ed, -ing).
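
Breaking a word into morphemes can be sketched with a small affix list and root list. The lists PREFIXES, SUFFIXES, and ROOTS and the function segment are hypothetical and cover only the examples in this section. The sketch also assumes that precancellation is built on the root cancel with a doubled l before -ation.

```python
PREFIXES = ("pre", "un", "re")
SUFFIXES = ("ation", "ing", "est", "ful", "ed", "ly")
ROOTS = {"cancel", "work", "consult", "small", "like", "use"}

def root_candidates(stem):
    """Yield spellings of the stem to look up in ROOTS."""
    yield stem
    yield stem + "e"                      # silent e dropped before a suffix
    if len(stem) >= 2 and stem[-1] == stem[-2]:
        yield stem[:-1]                   # doubled consonant: cancell -> cancel

def segment(word):
    """Split a word into prefix, root, and suffix using the toy lists above."""
    w = word.lower()
    # Try a matching prefix first, then fall back to no prefix at all.
    prefix_options = [p for p in PREFIXES if w.startswith(p)] + [""]
    for prefix in prefix_options:
        rest = w[len(prefix):]
        for suffix in ("",) + SUFFIXES:
            if not rest.endswith(suffix):
                continue
            stem = rest[:len(rest) - len(suffix)]
            for root in root_candidates(stem):
                if root in ROOTS:
                    parts = [prefix + "-"] if prefix else []
                    parts.append(root)
                    if suffix:
                        parts.append("-" + suffix)
                    return parts
    return [w]  # unknown words are returned unsegmented

for word in ["precancellation", "worked", "smallest", "useful", "table"]:
    print(word, "->", segment(word))
```

A word that is not covered by the lists, such as table, comes back as a single unit, which matches the idea of a lexical morpheme that cannot be divided further.
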
A morphological analyzer and a morphological generator are two essential, basic tools for
building natural language processing applications. They supply information about the
morphosyntactic properties of the words they analyze or construct.

A morphological analyzer is a program that analyzes the morphology of an input word. It reads
the inflected surface form of each word in a text and provides its lexical form together with
grammatical information: for nouns, gender, number, and case; for verbs, tense, aspect, and
modality. Generation is the inverse process: given a root and its grammatical features, a
generator produces the corresponding word form.
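
The relationship between analysis and generation can be sketched with one shared rule table used in both directions. Everything here is an assumption made for illustration: the tiny LEXICON, the RULES table, and the function names analyze and generate. The sketch covers only English number and a few verb features, not gender, case, or modality.

```python
# Toy lexicon: root -> category.
LEXICON = {"cat": "noun", "dog": "noun", "walk": "verb", "play": "verb"}

# Each rule is (suffix, category, grammatical features).
RULES = [
    ("s", "noun", {"number": "plural"}),
    ("", "noun", {"number": "singular"}),
    ("ed", "verb", {"tense": "past"}),
    ("ing", "verb", {"aspect": "progressive"}),
    ("", "verb", {"tense": "present"}),
]

def analyze(surface):
    """Return every (root, category, features) reading of a surface form."""
    readings = []
    for suffix, category, features in RULES:
        if surface.endswith(suffix):
            root = surface[:len(surface) - len(suffix)]
            if LEXICON.get(root) == category:
                readings.append((root, category, features))
    return readings

def generate(root, category, features):
    """Inverse of analyze: build the surface form for a root and features."""
    if LEXICON.get(root) != category:
        raise ValueError(f"{root!r} is not a known {category}")
    for suffix, rule_category, rule_features in RULES:
        if rule_category == category and rule_features == features:
            return root + suffix
    raise ValueError("no rule for these features")

print(analyze("walked"))
print(generate("cat", "noun", {"number": "plural"}))
```

Feeding the output of analyze back into generate returns the original surface form, which is what "inverse process" means in practice.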

The analyzer includes a recognition engine and algorithms for identifying suffixes and finding
a stem within the input word. It takes a complete word form as its input and returns the
syntactic and morphological properties of that word. A morphological analyzer is composed of
three parts:

• A morpheme lexicon
• A set of rules governing the spelling and composition of morphologically complex words
• A decision algorithm

Why Is Part-of-Speech Tagging Important?

Part-of-speech tagging identifies a token’s functional role within a sentence and is a
fundamental early step in many NLP pipelines. It is the process of assigning a part of speech
to each word in a sentence.

Example

Word      Tag
Heat      verb (noun)
Water     noun (verb)
In        prep (noun, adv)
A         det (noun)
Large     adj (noun)
Vessel    noun

Part-of-speech tagging is the process of assigning a part-of-speech marker to each word in an
input text. The input to a tagging algorithm is a sequence of (tokenized) words and a tagset,
and the output is a sequence of tags, one per token. Tagging is a disambiguation task: words are
ambiguous, meaning they have more than one possible part of speech, and the goal is to find the
correct tag for the situation. For example, book can be a verb (book that flight) or a noun
(hand me that book). That can be a determiner (Does that flight serve dinner) or a
complementizer (I thought that your flight was earlier). The goal of POS tagging is to resolve
these ambiguities, choosing the proper tag for the context.
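
Tagging as disambiguation can be sketched with a very small rule-based tagger. This is not how production taggers work (they are typically statistical or neural). The POSSIBLE_TAGS dictionary, the PREFERRED tag pairs, and the assumption that a sentence-initial ambiguous word is an imperative verb are all invented for this sketch and only cover the two book examples.

```python
# Each known word lists its possible tags; ambiguous words have more than one.
POSSIBLE_TAGS = {
    "book": ["NOUN", "VERB"],
    "that": ["DET", "COMP"],
    "flight": ["NOUN"],
    "hand": ["VERB", "NOUN"],
    "me": ["PRON"],
}

# Hand-written (previous tag, current tag) pairs that the tagger prefers.
# "<S>" marks the start of the sentence.
PREFERRED = {("<S>", "VERB"), ("VERB", "DET"), ("PRON", "DET"), ("DET", "NOUN")}

def tag(tokens):
    """Greedily pick one tag per token using the previous tag as context."""
    tags = []
    for token in tokens:
        options = POSSIBLE_TAGS.get(token.lower(), ["NOUN"])
        previous = tags[-1] if tags else "<S>"
        choice = options[0]  # fall back to the first listed tag
        for option in options:
            if (previous, option) in PREFERRED:
                choice = option
                break
        tags.append(choice)
    return list(zip(tokens, tags))

print(tag(["book", "that", "flight"]))
print(tag(["hand", "me", "that", "book"]))
```

The input is a token sequence plus a tagset, and the output is one tag per token. The word book receives VERB in the first sentence and NOUN in the second because the preceding tag differs.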

Part-of-speech tagging may not by itself solve any particular NLP problem. It is, however, a
prerequisite that simplifies many different problems. Part-of-speech (hereafter POS) tags are
useful for building parse trees, which are used in building named entity recognizers (most
named entities are nouns) and in extracting relations between words. POS tagging is also
essential for building lemmatizers, which reduce a word to its root form.

Example

Let us consider a few applications of POS tagging in various NLP tasks.

Text to Speech Conversion

• They refuse to permit us to obtain the refuse permit.

The word refuse is used twice in this sentence with two different meanings. refUSE
(/rəˈfyo͞oz/) is a verb meaning “deny,” while REFuse (/ˈrefˌyo͞os/) is a noun meaning “trash”
(that is, they are not homophones). Thus, we need to know which word is being used in order to
pronounce the text correctly. (For this reason, text-to-speech systems usually perform POS
tagging.)
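
A minimal sketch of how a POS tag can select a pronunciation is shown here. The tags in the example sentence are assigned by hand rather than by a tagger, and the PRONUNCIATIONS table and pronounce function are hypothetical names that cover only the word refuse.

```python
# Pronunciation depends on the (word, tag) pair, not on the word alone.
PRONUNCIATIONS = {
    ("refuse", "VERB"): "rəˈfyo͞oz",
    ("refuse", "NOUN"): "ˈrefˌyo͞os",
}

def pronounce(tagged_tokens):
    """Map each (word, tag) pair to a pronunciation, or keep the word as is."""
    return [
        PRONUNCIATIONS.get((word.lower(), tag), word)
        for word, tag in tagged_tokens
    ]

# Hand-tagged version of the example sentence (tags are an assumption).
sentence = [
    ("They", "PRON"), ("refuse", "VERB"), ("to", "PART"), ("permit", "VERB"),
    ("us", "PRON"), ("to", "PART"), ("obtain", "VERB"), ("the", "DET"),
    ("refuse", "NOUN"), ("permit", "NOUN"),
]

print(pronounce(sentence))
```

Without the tag, a lookup keyed on the word alone could not tell the two occurrences of refuse apart.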

Word Sense Disambiguation

Word-sense disambiguation (WSD) is the task of identifying which sense of a word (that is,
which meaning) is used in a sentence when the word has multiple meanings. Words often occur in
different senses as different parts of speech. For example:

• She saw a bear.
• Your efforts will bear fruit.

The word bear has completely different senses in these two sentences and, more importantly, it
is a noun in the first and a verb in the second. Rudimentary word sense disambiguation is
therefore possible once words are tagged with their parts of speech.
