0% found this document useful (0 votes)

13 views

NLP_Lecture_9_and_10_Week_5

Dionysius Thrax's grammatical work established eight parts-of-speech that have influenced linguistic structures for over 2000 years, introducing key terms like syntax and clitic. Modern computational linguistics has expanded parts-of-speech tagsets for detailed analysis, employing various tagging algorithms and applications in language processing. The document also discusses the evolution of English word classes, the challenges in POS tagging, and the limitations of existing tagsets.

Uploaded by

Irfan Ul Haq

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views

NLP_Lecture_9_and_10_Week_5

Uploaded by

Irfan Ul Haq

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 10

1.

Introduction to Parts-of-Speech

Dionysius Thrax of Alexandria (c. 100 B.C.) wrote a grammatical sketch of Greek (a "techne¯")
that summarized linguistic knowledge of his era. This work significantly influenced modern
linguistic vocabulary, introducing terms such as:

 Syntax
 Diphthong
 Clitic
 Analogy

Thrax’s description of eight parts-of-speech—noun, verb, pronoun, preposition, adverb,

conjunction, participle, and article—became foundational for grammatical structures in Greek,
Latin, and most European languages for over 2000 years.

2. The Enduring Influence of Thrax’s Parts-of-Speech

 Earlier scholars like Aristotle and the Stoics had their own lists, but Thrax’s became the
standard.
 The tradition continued even into modern culture, as seen in Schoolhouse Rock (1973),
an educational TV series that taught grammar through music.
 Grammar Rock, a segment of Schoolhouse Rock, adhered to an eight-part classification,
albeit substituting adjective and interjection for participle and article, demonstrating the
continued importance of these categories.

3. Evolution of Parts-of-Speech Tagsets

Modern computational linguistics employs expanded tagsets for more precise classification:

 Penn Treebank (45 word classes)

 Brown Corpus (87 word classes)
 C7 Tagset (146 word classes)

These extended classifications allow for detailed linguistic analysis and computational
applications.

4. The Role of Parts-of-Speech in Language Processing

Parts-of-speech (POS), also known as word classes, morphological classes, or lexical tags,
provide valuable linguistic insights:

 Word Prediction: POS knowledge helps anticipate subsequent words, e.g., possessive
pronouns (my, your) are followed by nouns, whereas personal pronouns (I, you, he) are
typically followed by verbs.
 Speech Recognition: Knowing a word’s POS aids pronunciation; e.g., "content" is
pronounced as CONtent (noun) vs. conTENT (adjective).
 Stemming in Information Retrieval (IR): POS aids in selecting key terms for document
indexing and retrieval.
 Parsing and Disambiguation: POS tagging enhances parsing efficiency, aids in word-
sense disambiguation, and improves named entity recognition (e.g., detecting names,
dates, times).

5. Computational Methods for POS Tagging

Several algorithms have been developed for automatic POS tagging:

1. Rule-Based Tagging – Uses manually crafted linguistic rules.

2. HMM (Hidden Markov Model) Tagging – A probabilistic approach relying on
statistical models.
3. Transformation-Based Tagging – Applies transformation rules iteratively to refine
tagging accuracy.

6. Applications of POS Tagging

POS-tagged corpora have significant applications in:

 Linguistic Research – Studying grammatical constructions and usage frequencies.

 Speech Synthesis & Recognition – Improving pronunciation and recognition accuracy.
 Information Extraction – Identifying key entities in large text datasets.

Comprehensive Study Notes on English Word Classes

Introduction

English words are classified into various categories known as word classes or parts of speech.
These classifications are based on syntactic distribution and morphological properties rather than
purely semantic meaning. Word classes are broadly divided into open classes (which allow new
words to be added) and closed classes (which have fixed membership).

1. Open Classes

Open classes are dynamic and continually expand as new words are created or borrowed from
other languages. They include nouns, verbs, adjectives, and adverbs.

1.1 Nouns

Nouns typically name people, places, things, or abstract concepts. They can function as
subjects or objects in a sentence.
 Morphological Properties: Can take plural forms (goat → goats) and possessives
(IBM’s revenue).
 Syntactic Properties: Occur with determiners (a goat, the ship).

Types of Nouns:

1. Proper vs. Common Nouns

o Proper nouns (Regina, IBM) refer to specific entities and are capitalized.
o Common nouns (book, chair) refer to general items.
2. Count vs. Mass Nouns
o Count nouns (goat, apple) can be counted (one goat, two goats).
o Mass nouns (snow, water) cannot be counted (two snows is incorrect).

1.2 Verbs

Verbs describe actions, states, or processes.

 Morphological Forms: Include base form (eat), third-person singular (eats), past tense
(ate), past participle (eaten), and progressive form (eating).
 Syntactic Role: Often function as predicates in sentences.

Auxiliary Verbs

A subtype of verbs that assist the main verb by adding tense, aspect, mood, or voice.

 Examples: be, have, do, can, must, should.

 Copula Verb: The verb be connects subjects with predicates (She is a doctor).
 Modal Verbs: Express necessity or possibility (must, may, can).

1.3 Adjectives

Adjectives describe qualities or properties of nouns.

 Common Semantic Categories: Color (red, blue), Age (young, old), Value (good, bad).
 Syntactic Role: Often occur before nouns (a red car) or after copula verbs (the car is
red).

1.4 Adverbs

Adverbs modify verbs, adjectives, other adverbs, or entire sentences.

 Types:
o Locative Adverbs (home, here) indicate location.
o Degree Adverbs (very, extremely) indicate intensity.
o Manner Adverbs (slowly, carefully) describe how an action occurs.
o Temporal Adverbs (yesterday, soon) specify time.
2. Closed Classes

Closed classes contain a fixed number of words that rarely change over time. These include
prepositions, determiners, pronouns, conjunctions, auxiliary verbs, particles, numerals,
and interjections.

2.1 Prepositions

Prepositions occur before noun phrases and indicate spatial, temporal, or other relationships.

 Examples: on, under, at, from, with, before.

 Usage: She sat on the chair.

2.2 Determiners

Determiners introduce noun phrases and provide definiteness, quantity, or possession.

 Examples: a, an, the, this, that, my, your.

 Articles: English has three articles: a, an, and the.
o A and an are indefinite articles.
o The is a definite article.

2.3 Pronouns

Pronouns replace noun phrases and function as references to people, things, or ideas.

 Types:
o Personal Pronouns: (I, you, he, she, it, we, they)
o Possessive Pronouns: (my, your, his, her, its, our, their)
o Wh-Pronouns: (who, whom, what, which)

2.4 Conjunctions

Conjunctions connect words, phrases, or clauses.

 Coordinating Conjunctions: Join elements of equal status (and, but, or).

 Subordinating Conjunctions: Introduce dependent clauses (because, although, if).
 Complementizers: Special subordinating conjunctions that introduce noun clauses (that,
whether).

2.5 Particles

Particles resemble prepositions or adverbs but function as part of phrasal verbs.

 Examples: up, down, in, out, on.
 Usage in Phrasal Verbs:
o Turn down (reject)
o Find out (discover)

2.6 Numerals

Numerals indicate quantity or order.

 Examples: one, two, three, first, second, third.

2.7 Interjections

Interjections express emotions or exclamations.

 Examples: oh, ah, hey, alas, um, uh.

 Usage: Oh no! That was a mistake.

2.8 Negatives, Politeness Markers, and Greetings

 Negatives: no, not.

 Politeness Markers: please, thank you.
 Greetings: hello, goodbye.

Comprehensive Study Notes on Tagsets for English

Introduction
Tagging words with their appropriate part-of-speech (POS) is a fundamental task in natural
language processing (NLP). Different tagsets are used for this purpose, evolving from the
original Brown corpus tagset. This document explores major English tagsets, their applications,
and challenges in part-of-speech tagging.

1. Major Tagsets for English

1.1 The Brown Corpus Tagset

 Developed at Brown University in 1963-64.

 Consists of 87 tags.
 First applied to a 1-million-word corpus of 500 written texts.
 Initially tagged using the TAGGIT program, followed by manual correction.
1.2 The Penn Treebank Tagset

 Contains 45 tags.
 Used in corpora such as Brown Corpus, Wall Street Journal Corpus, and
Switchboard Corpus.
 Its small size makes it one of the most widely used tagsets.
 Example:
o (5.1) The/DT grand/JJ jury/NN commented/VBD on/IN a/DT number/NN of/IN
other/JJ topics/NNS ./.

1.3 The CLAWS C5 Tagset

 Contains 61 tags.
 Used in the British National Corpus (BNC).
 Developed by Lancaster UCREL’s CLAWS (Constituent Likelihood Automatic
Word-tagging System).

2. Examples of POS Tagging

2.1 POS Tagged Sentences (Penn Treebank)

 Existential There (EX) vs. Adverb (RB):

o (5.2) There/EX are/VBP 70/CD children/NNS there/RB
 Passive Construction:
o (5.3) Although/IN preliminary/JJ findings/NNS were/VBD reported/VBN
more/RBR than/IN a/DT year/NN ago/IN ,/, the/DT latest/JJS results/NNS
appear/VBP in/IN today/NN ’s/POS New/NNP England/NNP Journal/NNP of/IN
Medicine/NNP ,/.
 Proper Noun Segmentation:
o "New England Journal of Medicine" tagged as NNP for each noun.

3. Tagging Challenges
3.1 Overlap Between Prepositions (IN), Particles (RP), and Adverbs (RB)

Words like around can belong to different categories:

 Particle (RP): (5.4) Mrs./NNP Shaefer/NNP never/RB got/VBD around/RP to/TO

joining/VBG
 Preposition (IN): (5.5) All/DT we/PRP gotta/VBN do/VB is/VBZ go/VB around/IN
the/DT corner/NN
 Adverb (RB): (5.6) Chateau/NNP Petrus/NNP costs/VBZ around/RB 250/CD

3.2 Distinguishing Between Prepositions and Particles

 Particles can move:

o (5.7) She told off/RP her friends.
o (5.8) She told her friends off/RP.
 Prepositions cannot move:
o (5.9) She stepped off/IN the train.
o (5.10) *She stepped the train off/IN. (*Incorrect sentence)

3.3 Modifiers Preceding Nouns

 Common Nouns as Modifiers:

o (5.11) cotton/NN sweater/NN
 Hyphenated Adjectival Modifiers:
o (5.12) income-tax/JJ return/NN
 Proper Noun Modifiers:
o (5.13) the/DT Gramm-Rudman/NP Act/NP
 Common Nouns as Modifiers Instead of Adjectives:
o (5.14) Chinese/NN cooking/NN
o (5.15) Pacific/NN waters/NNS

3.4 Distinguishing Past Participles (VBN) from Adjectives (JJ)

 Past participle used in an eventive sense:

o (5.16) They were married/VBN by the Justice of the Peace yesterday at 5:00.
 Adjective expressing a property:
o (5.17) At the time, she was already married/JJ.

4. Limitations of the Penn Treebank Tagset

 Reduction from the original 87-tag Brown set.
 Loss of information about verb forms:
o Brown/C5 tagsets distinguish between did (VDD) and doing (VDG), whereas
Treebank does not.
 Merging Prepositions and Subordinating Conjunctions:
o Penn Treebank marks both as IN, while Brown/C5 differentiate them (CS for
conjunctions, IN for prepositions).
 Tagging Inconsistencies in Adverbial Nouns:
o Days of the week (Monday, Tuesday) → NNP.
o Other adverbial nouns (tomorrow, west, home) → Inconsistently tagged as NN or
RB.
Comprehensive Study Notes on Rule-Based Part-of-Speech Tagging

Introduction

Rule-based part-of-speech (POS) tagging is one of the earliest methods developed for assigning
POS tags to words in a text. The fundamental architecture follows a two-stage process:

1. Dictionary Lookup: Assigns all possible POS tags to each word.

2. Disambiguation Rules: Uses manually written linguistic rules to eliminate incorrect
tags.

This method, although initially developed in the 1960s, has been refined over time. One of the
most comprehensive rule-based tagging approaches is the Constraint Grammar (EngCG)
approach developed by Karlsson et al. (1995a).

EngCG Tagger

The EngCG tagger (Voutilainen, 1995, 1999) is a rule-based POS tagger that operates using:

 A lexicon-based approach derived from two-level morphology.

 A rule-based system to resolve ambiguities in tagging.

EngCG Lexicon

 The ENGTWOL lexicon contains about 56,000 English word stems.

 Each entry is associated with morphological and syntactic features.
 Words with multiple POS (e.g., “hit” as both a noun and a verb) are listed separately.

Example of Lexicon Entries (Fig. 5.11)

Each word in the lexicon is annotated with various features:

 SG: Singular noun.

 -SG3: Non-third-person-singular verb.
 ABSOLUTE: Adjective is non-comparative and non-superlative.
 NOMINATIVE: Non-genitive noun.
 PCP2: Past participle verb.
 PRE, CENTRAL, POST: Positions of determiners.
 NOINDEFDETERMINER: Restriction on determiners (e.g., “furniture” cannot take an
indefinite article).
 SV, SVO, SVOO: Verb subcategorization patterns.
Tagging Process

First Stage: Lexical Analysis

 Each word is processed using the two-level lexicon transducer to obtain all possible
POS tags.
 Example:
o Sentence: Pavlov had shown that salivation...
o Possible Tags:

Word Possible POS Tags

Pavlov N NOM SG PROPER
had HAVE V PAST VFIN SVO, HAVE PCP2 SVO
shown SHOW PCP2 SVOO SVO SV
that ADV, PRON DEM SG, DET CENTRAL DEM SG, CS
salivation N NOM SG

Second Stage: Constraint Application

 3,744 rules in EngCG-2 are used to eliminate incorrect tags.

 Example:
o The system selects HAVE V PAST instead of HAVE PCP2 for had.
o The complementizer (CS) tag is assigned to that.

Rule-Based Disambiguation

EngCG applies rules in a negative manner, meaning incorrect interpretations are removed.

Example: Adverbial-That Rule

This rule ensures that is tagged correctly based on its context.

Rule Logic:

 If that is followed by an adjective, adverb, or quantifier and a sentence boundary, it is

tagged as an adverb.
 Otherwise, the adverbial interpretation is removed.
 Additional conditions prevent misinterpretation of that after verbs like consider or
believe.

Example Sentences:

1. Correct Adverbial Tagging: It isn’t that odd.

2. Correct Complementizer Tagging: I consider that odd.

Another rule ensures that is tagged as a complementizer (CS) when:

 It follows a verb that requires a complement (believe, think, show).

 It precedes a noun phrase and a finite verb.

Enhancements in EngCG

 Probabilistic Constraints: Additional probability-based filtering.

 Syntactic Information Usage: Beyond basic POS tagging, EngCG incorporates syntax
rules.

For more details, refer to Karlsson et al. (1995b) and Voutilainen (1999).

DMV Cheat Sheet New Jersey Es Auto Premium
No ratings yet
DMV Cheat Sheet New Jersey Es Auto Premium
55 pages
1-Deploying Functional Grammar
No ratings yet
1-Deploying Functional Grammar
311 pages
VERBS (Transitive & Intransitive)
No ratings yet
VERBS (Transitive & Intransitive)
4 pages
Part of Speech Tagging (Chapter 5) : Adapted From Kathy Mccoy'S Presentation Downloaded From The Web, September 2010
No ratings yet
Part of Speech Tagging (Chapter 5) : Adapted From Kathy Mccoy'S Presentation Downloaded From The Web, September 2010
63 pages
Lec-5 POStagging
No ratings yet
Lec-5 POStagging
24 pages
Chapter Two Natural Language Processing
No ratings yet
Chapter Two Natural Language Processing
141 pages
Lecture_5_Part_Of_Speech_Tagging
No ratings yet
Lecture_5_Part_Of_Speech_Tagging
39 pages
Lect6 Pos
No ratings yet
Lect6 Pos
62 pages
POS Tagging 2.0
No ratings yet
POS Tagging 2.0
14 pages
NLP 3
No ratings yet
NLP 3
25 pages
Ilak Pos Tagging
No ratings yet
Ilak Pos Tagging
48 pages
lec04-2-PartOfSpeechTagging
No ratings yet
lec04-2-PartOfSpeechTagging
56 pages
Chapter Four - 2
No ratings yet
Chapter Four - 2
118 pages
IS 7118 Unit-5 POS Tagging
No ratings yet
IS 7118 Unit-5 POS Tagging
89 pages
Unit I - Linguistic Units
No ratings yet
Unit I - Linguistic Units
17 pages
17
No ratings yet
17
27 pages
Sequence Labeling For Parts of Speech and Named Entities: To Each Word A Warbling Note A Midsummer Night's Dream, V.I
No ratings yet
Sequence Labeling For Parts of Speech and Named Entities: To Each Word A Warbling Note A Midsummer Night's Dream, V.I
27 pages
08 Sequence Labelling
No ratings yet
08 Sequence Labelling
27 pages
Linguistics Essentials: Instructor: Rada Mihalcea Taught by J. Hajic at Johns Hopkins University
No ratings yet
Linguistics Essentials: Instructor: Rada Mihalcea Taught by J. Hajic at Johns Hopkins University
46 pages
nlp-unit-iii-notes
No ratings yet
nlp-unit-iii-notes
30 pages
Syntax Word Classes and Functions
No ratings yet
Syntax Word Classes and Functions
11 pages
Word Classes and Part-of-Speech (POS) Tagging: CS4705 Julia Hirschberg
No ratings yet
Word Classes and Part-of-Speech (POS) Tagging: CS4705 Julia Hirschberg
40 pages
Grammar Morphological Process
No ratings yet
Grammar Morphological Process
7 pages
Sequence Labeling For Parts of Speech and Named Entities: To Each Word A Warbling Note A Midsummer Night's Dream, V.I
No ratings yet
Sequence Labeling For Parts of Speech and Named Entities: To Each Word A Warbling Note A Midsummer Night's Dream, V.I
27 pages
Natural Language Processing: Dr. G. Bharadwaja Kumar
No ratings yet
Natural Language Processing: Dr. G. Bharadwaja Kumar
44 pages
Linguistics Essentials: Instructor: Rada Mihalcea Taught by J. Hajic at Johns Hopkins University
No ratings yet
Linguistics Essentials: Instructor: Rada Mihalcea Taught by J. Hajic at Johns Hopkins University
46 pages
ANTHONY
No ratings yet
ANTHONY
6 pages
Intro_to_Linguistics_Syntax_1_Overview_o
No ratings yet
Intro_to_Linguistics_Syntax_1_Overview_o
14 pages
NLP
No ratings yet
NLP
4 pages
Ch 2 edited
No ratings yet
Ch 2 edited
16 pages
Unit 4 Morfo I Class Ppt
No ratings yet
Unit 4 Morfo I Class Ppt
27 pages
Grammar of English
No ratings yet
Grammar of English
66 pages
Part-Of-Speech (POS) Tagging
No ratings yet
Part-Of-Speech (POS) Tagging
53 pages
How To Describe English Sentences
No ratings yet
How To Describe English Sentences
38 pages
ASASASA
No ratings yet
ASASASA
5 pages
2-Introduction to NLP_part2
No ratings yet
2-Introduction to NLP_part2
27 pages
Apznzaaczprqee1da4bjade7ul0meb Ap8tjou Feozcgqct6cpnh0z32ibu3faj 0wgfmnhp5p Eneunhaucakhow Bie9yhlaoqtsknu7yq0gfnxrzjd2mjuyrbnhadveb2wj7gjgcxpffbjgyxl4nzdqf5qeux-Lla2ggr5kg9w4bp8ev5hqrj7bwr3npwnp9gfmazwtau
No ratings yet
Apznzaaczprqee1da4bjade7ul0meb Ap8tjou Feozcgqct6cpnh0z32ibu3faj 0wgfmnhp5p Eneunhaucakhow Bie9yhlaoqtsknu7yq0gfnxrzjd2mjuyrbnhadveb2wj7gjgcxpffbjgyxl4nzdqf5qeux-Lla2ggr5kg9w4bp8ev5hqrj7bwr3npwnp9gfmazwtau
108 pages
Parts of Speech and Morphology - Phrase Structure - Semantics and Pragmatics
No ratings yet
Parts of Speech and Morphology - Phrase Structure - Semantics and Pragmatics
39 pages
5 Morphology Part1
No ratings yet
5 Morphology Part1
29 pages
Syntax Course 1 May 2025
No ratings yet
Syntax Course 1 May 2025
24 pages
Contranstive Analysis (Aprida Simbolon 17120272)
No ratings yet
Contranstive Analysis (Aprida Simbolon 17120272)
13 pages
Hmm
No ratings yet
Hmm
94 pages
Morphology (Session 2) : Immediate Constituents (Ics) Ii. Words (Definition & Classification) Iii. Word Formation
No ratings yet
Morphology (Session 2) : Immediate Constituents (Ics) Ii. Words (Definition & Classification) Iii. Word Formation
23 pages
Medical Slides (1403) - Final
No ratings yet
Medical Slides (1403) - Final
39 pages
Advanced English Grammar with Exercises
From Everand
Advanced English Grammar with Exercises
George Lyman Kittredge
2.5/5 (12)
Summary Grammar II
No ratings yet
Summary Grammar II
30 pages
S24 - LING 233 - Syntax - Ch.2
No ratings yet
S24 - LING 233 - Syntax - Ch.2
26 pages
Intro 7 Syntax
No ratings yet
Intro 7 Syntax
39 pages
Corpus ( Long Questions )
No ratings yet
Corpus ( Long Questions )
7 pages
Functional Conversant English Reviewer
No ratings yet
Functional Conversant English Reviewer
8 pages
Theoretical Grammar
No ratings yet
Theoretical Grammar
5 pages
Speech and Language Processing: SLP Chapter 5
No ratings yet
Speech and Language Processing: SLP Chapter 5
56 pages
Module 4: Lecture-15
No ratings yet
Module 4: Lecture-15
44 pages
Ia-1 NLP
No ratings yet
Ia-1 NLP
7 pages
Resumen Parcial 1 Grammar
No ratings yet
Resumen Parcial 1 Grammar
6 pages
Module 2 HMMppt
No ratings yet
Module 2 HMMppt
31 pages
Speech and Language Processing
No ratings yet
Speech and Language Processing
26 pages
English Word Classes
No ratings yet
English Word Classes
10 pages
Class 2 - SS 2015 - Syntax - Parts of Speech
No ratings yet
Class 2 - SS 2015 - Syntax - Parts of Speech
40 pages
Lecture 11. Syntax. Phraseological Units. the Problem of Its Classification.
No ratings yet
Lecture 11. Syntax. Phraseological Units. the Problem of Its Classification.
3 pages
handout 1&2
No ratings yet
handout 1&2
11 pages
Natural Language Processing
No ratings yet
Natural Language Processing
47 pages
Comprehensive English Grammar Guide: From Basics to Competitive Excellence
From Everand
Comprehensive English Grammar Guide: From Basics to Competitive Excellence
Ranjot Singh Chahal
No ratings yet
Introduction to NLP_first_week_lecture_2st
No ratings yet
Introduction to NLP_first_week_lecture_2st
4 pages
Basic Elements of Assembly Language
No ratings yet
Basic Elements of Assembly Language
2 pages
DSA Lab Manual(Merge Sort )
No ratings yet
DSA Lab Manual(Merge Sort )
3 pages
Banker's algorithm
No ratings yet
Banker's algorithm
5 pages
Basic Concepts Trees
No ratings yet
Basic Concepts Trees
48 pages
DSA Lab Manual(Counting sort)
No ratings yet
DSA Lab Manual(Counting sort)
4 pages
DSA Lab Manual(Shell Sort)
No ratings yet
DSA Lab Manual(Shell Sort)
3 pages
DSA Lab Manual(Heap Sort)
No ratings yet
DSA Lab Manual(Heap Sort)
3 pages
Data Structure and Algorithm_Assignment
No ratings yet
Data Structure and Algorithm_Assignment
2 pages
binary
No ratings yet
binary
8 pages
Exercise Questions
No ratings yet
Exercise Questions
1 page
Binary Search Trees
No ratings yet
Binary Search Trees
9 pages
Lab Manual_DSA
No ratings yet
Lab Manual_DSA
3 pages
AVL Trees
No ratings yet
AVL Trees
6 pages
DSA Lab Manual(Insertion Sort )
No ratings yet
DSA Lab Manual(Insertion Sort )
3 pages
Direct Indirect Double Object Pronouns
No ratings yet
Direct Indirect Double Object Pronouns
5 pages
Year 5 PlanIt Spelling Overview Pack Single
No ratings yet
Year 5 PlanIt Spelling Overview Pack Single
6 pages
Adverbs Intro
No ratings yet
Adverbs Intro
6 pages
New Practical Grammar of Ielts and TOEFL Asatideonline
100% (1)
New Practical Grammar of Ielts and TOEFL Asatideonline
94 pages
To From Through Round Along Across Past
No ratings yet
To From Through Round Along Across Past
10 pages
GR 12 Paper 1 Language Telematics 25 July 2023 Final
No ratings yet
GR 12 Paper 1 Language Telematics 25 July 2023 Final
43 pages
Contrastive Analysis
No ratings yet
Contrastive Analysis
5 pages
George J. Dann. First Lessons in Urdu
No ratings yet
George J. Dann. First Lessons in Urdu
166 pages
Not Only ... But Also... (Grammar and Exercises)
100% (14)
Not Only ... But Also... (Grammar and Exercises)
2 pages
Plurals 3
No ratings yet
Plurals 3
16 pages
Dewantoro Naufal - XIIA6 - SASING
No ratings yet
Dewantoro Naufal - XIIA6 - SASING
4 pages
Base Form Past Past Participle Spanish
No ratings yet
Base Form Past Past Participle Spanish
2 pages
Starlight 11 Unit 1.6 Grammar
No ratings yet
Starlight 11 Unit 1.6 Grammar
16 pages
7.2 Và 7.3. New
100% (1)
7.2 Và 7.3. New
35 pages
Grade 5 English Speaking Using Prepositions and Prepositional Phrases in Sentences
No ratings yet
Grade 5 English Speaking Using Prepositions and Prepositional Phrases in Sentences
5 pages
Tatsushi Motohashi - Mi Goho
No ratings yet
Tatsushi Motohashi - Mi Goho
16 pages
HW Possessives - August 13th
50% (2)
HW Possessives - August 13th
2 pages
Transitional Devices - Write Site - Athabasca University PDF
No ratings yet
Transitional Devices - Write Site - Athabasca University PDF
4 pages
Clase I - Pecado y Obediencia
No ratings yet
Clase I - Pecado y Obediencia
11 pages
GST 111 2020 PDF
No ratings yet
GST 111 2020 PDF
20 pages
Preposition: Place. Thus, Contrary To Other "Small Words", They Are Not An Element of Style, But Absolutely
No ratings yet
Preposition: Place. Thus, Contrary To Other "Small Words", They Are Not An Element of Style, But Absolutely
9 pages
Assignment for English (SME)
No ratings yet
Assignment for English (SME)
1 page
Grammar 3 (Complete Course)
No ratings yet
Grammar 3 (Complete Course)
26 pages
Conjunctions PDF
No ratings yet
Conjunctions PDF
11 pages
Mock Analysis Excel Simplicrack
No ratings yet
Mock Analysis Excel Simplicrack
14 pages
Coordinating Conjunctions (Fanboys)
No ratings yet
Coordinating Conjunctions (Fanboys)
6 pages
Basic German 2nd Edition Jolene Wochenske - Quickly download the ebook to read anytime, anywhere
100% (6)
Basic German 2nd Edition Jolene Wochenske - Quickly download the ebook to read anytime, anywhere
66 pages

NLP_Lecture_9_and_10_Week_5

Uploaded by

NLP_Lecture_9_and_10_Week_5

Uploaded by

1.

Thrax’s description of eight parts-of-speech—noun, verb, pronoun, preposition, adverb,

2. The Enduring Influence of Thrax’s Parts-of-Speech

3. Evolution of Parts-of-Speech Tagsets

 Penn Treebank (45 word classes)

4. The Role of Parts-of-Speech in Language Processing

5. Computational Methods for POS Tagging

Several algorithms have been developed for automatic POS tagging:

1. Rule-Based Tagging – Uses manually crafted linguistic rules.

6. Applications of POS Tagging

POS-tagged corpora have significant applications in:

 Linguistic Research – Studying grammatical constructions and usage frequencies.

Comprehensive Study Notes on English Word Classes

1. Proper vs. Common Nouns

Verbs describe actions, states, or processes.

 Examples: be, have, do, can, must, should.

Adjectives describe qualities or properties of nouns.

Adverbs modify verbs, adjectives, other adverbs, or entire sentences.

 Examples: on, under, at, from, with, before.

Determiners introduce noun phrases and provide definiteness, quantity, or possession.

 Examples: a, an, the, this, that, my, your.

Conjunctions connect words, phrases, or clauses.

 Coordinating Conjunctions: Join elements of equal status (and, but, or).

Particles resemble prepositions or adverbs but function as part of phrasal verbs.

Numerals indicate quantity or order.

 Examples: one, two, three, first, second, third.

Interjections express emotions or exclamations.

 Examples: oh, ah, hey, alas, um, uh.

2.8 Negatives, Politeness Markers, and Greetings

 Negatives: no, not.

Comprehensive Study Notes on Tagsets for English

1. Major Tagsets for English

 Developed at Brown University in 1963-64.

1.3 The CLAWS C5 Tagset

2. Examples of POS Tagging

 Existential There (EX) vs. Adverb (RB):

Words like around can belong to different categories:

 Particle (RP): (5.4) Mrs./NNP Shaefer/NNP never/RB got/VBD around/RP to/TO

3.2 Distinguishing Between Prepositions and Particles

 Particles can move:

3.3 Modifiers Preceding Nouns

 Common Nouns as Modifiers:

3.4 Distinguishing Past Participles (VBN) from Adjectives (JJ)

 Past participle used in an eventive sense:

4. Limitations of the Penn Treebank Tagset

1. Dictionary Lookup: Assigns all possible POS tags to each word.

 A lexicon-based approach derived from two-level morphology.

 The ENGTWOL lexicon contains about 56,000 English word stems.

Example of Lexicon Entries (Fig. 5.11)

Each word in the lexicon is annotated with various features:

 SG: Singular noun.

First Stage: Lexical Analysis

Word Possible POS Tags

Second Stage: Constraint Application

 3,744 rules in EngCG-2 are used to eliminate incorrect tags.

Example: Adverbial-That Rule

This rule ensures that is tagged correctly based on its context.

 If that is followed by an adjective, adverb, or quantifier and a sentence boundary, it is

1. Correct Adverbial Tagging: It isn’t that odd.

Another rule ensures that is tagged as a complementizer (CS) when:

 It follows a verb that requires a complement (believe, think, show).

 Probabilistic Constraints: Additional probability-based filtering.

You might also like