NLP Final
J.SUBBA RAMI REDDY 211211101118

The document describes implementing various natural language processing techniques, such as word analysis, word generation, morphology, n-grams, n-gram smoothing and POS tagging with hidden Markov models, using Python and NLTK. It provides the aim, algorithm and program code for each technique.

EXP:01 DATE: 11-1-2024

WORD ANALYSIS
AIM:
To implement word analysis using Python and NLTK.

ALGORITHM:
STEP1: Start the program
STEP2: Download and install Anaconda (Python 3.9)
STEP3: Click Environments
STEP4: Create a new environment named Tensorflow and click Create
STEP5: Change the package filter from Installed to Not installed
STEP6: Search for the tensorflow package and click Apply
STEP7: Repeat Step 6 for the keras and nltk packages
STEP8: Go to Home and select the Tensorflow environment
STEP9: Install Spyder and Jupyter Notebook, then launch
STEP10: Write and execute the string handling code
STEP11: Stop the program

PROGRAM:
# Length of a string counts characters, including spaces
print(len("what it is what it isnt"))

# Length of a list counts elements (here, words)
s = ["what", "it", "is", "what", "it", "isnt"]
print(len(s))

# sorted() returns a new sorted list and leaves the original unchanged
x = sorted(s)
print(s)
print(x)

# Lists can be concatenated with +
d = x + s
print(d)

OUTPUT:

RESULT:
Word analysis using Python and NLTK is verified and executed.
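As a complement to the string handling above, the same kind of word analysis can be done with NLTK itself. The following is a minimal sketch, reusing the sample sentence from the program, that tokenizes the text and counts word frequencies:

import nltk
nltk.download('punkt')   # tokenizer models; newer NLTK versions may also need 'punkt_tab'
from nltk import word_tokenize, FreqDist

# Split the text into word tokens and build a frequency distribution
tokens = word_tokenize("what it is what it isnt")
fdist = FreqDist(tokens)
print(fdist.most_common(3))   # the three most frequent words with their counts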

EXP:02 DATE: 18-1-2024

WORD GENERATION
AIM:
To implement word generation using Python and NLTK.

ALGORITHM:
STEP1: Start the program
STEP2: Download and install Anaconda (Python 3.9)
STEP3: Click Environments
STEP4: Create a new environment named Tensorflow and click Create
STEP5: Change the package filter from Installed to Not installed
STEP6: Search for the tensorflow package and click Apply
STEP7: Repeat Step 6 for the keras and nltk packages
STEP8: Go to Home and select the Tensorflow environment
STEP9: Install Spyder and Jupyter Notebook, then launch
STEP10: Write and execute the file handling code, which tokenizes each line of the file
STEP11: Stop the program

PROGRAM:
# Scan a source file and print every word ending in 'ing', with its length
for line in open("nlp.py"):
    for word in line.split():
        if word.endswith('ing'):
            print(word)
            print(len(word))

OUTPUT:

RESULT:
Word generation using Python and NLTK is verified and executed.
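The program above finds existing '-ing' words in a file; generating new word forms can be sketched by attaching suffixes to stems. The stems and suffix list below are illustrative examples, not part of the original program:

# Combine each stem with each suffix to generate surface word forms
stems = ["walk", "talk", "jump"]
suffixes = ["", "s", "ed", "ing"]
for stem in stems:
    for suffix in suffixes:
        print(stem + suffix)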

EXP:03 DATE: 1-2-2024

MORPHOLOGY
AIM:
To implement morphology using Python and NLTK.

ALGORITHM:
STEP1: Start the program
STEP2: Download and install Anaconda (Python 3.9)
STEP3: Click Environments
STEP4: Create a new environment named Tensorflow and click Create
STEP5: Change the package filter from Installed to Not installed
STEP6: Search for the tensorflow package and click Apply
STEP7: Repeat Step 6 for the keras and nltk packages
STEP8: Go to Home and select the Tensorflow environment
STEP9: Install Spyder and Jupyter Notebook, then launch
STEP10: Write and execute the morphology, stop word removal, synonym and stemming code
STEP11: Stop the program

PROGRAM:
CODE:
import re

# Sample text with digits run into the words
text = "The5biggestanimalsare1.Elephant,2Rhinoand3dinosaur"
text = text.lower()
print(text)

# Strip all digits from the text
result = re.sub(r'\d+', '', text)
print(result)

STOP WORD REMOVAL:
import re

def punctuations(raw_review):
    # Expand common English contractions
    text = raw_review
    text = text.replace("n't", 'not')
    text = text.replace("'s", 'is')
    text = text.replace("'re", 'are')
    text = text.replace("'ve", 'have')
    text = text.replace("'m", 'am')
    text = text.replace("'d", 'would')
    text = text.replace("'ll", 'will')
    # Crude repair of dropped g's: turns "doin"/"loosin" into "doing"/"loosing"
    # (note this substitution touches every occurrence of "in")
    text = text.replace("in", 'ing')
    # Keep letters only, dropping punctuation, digits and spaces
    letters_only = re.sub("[^a-zA-Z]", "", text)
    return letters_only

t = "Hows'smyteamdoin ,you'resupposedtobenotloosin"
p = punctuations(t)
print(p)

SYNONYM:
import nltk
nltk.download('omw-1.4')
nltk.download('wordnet')

from nltk.corpus import wordnet

# Collect every lemma name from every WordNet synset of 'Machine'
synonyms = []
for syn in wordnet.synsets('Machine'):
    for lemma in syn.lemmas():
        synonyms.append(lemma.name())
print(synonyms)

STEMMING:
from nltk.stem import PorterStemmer

# The Porter stemmer strips suffixes: 'eating' -> 'eat',
# but irregular forms such as 'ate' are left unchanged
stemmer = PorterStemmer()
print(stemmer.stem('eating'))
print(stemmer.stem('ate'))

OUTPUTS:

CODE:

STOP WORD REMOVAL:

SYNONYM:

STEMMING:

RESULT:
The Morphological Analysis Code of NLP is verified and executed.
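The Porter stemmer clips suffixes without consulting a dictionary, which is why 'ate' comes back unchanged. A companion sketch using NLTK's WordNet lemmatizer, which handles such irregular forms when told the part of speech (this assumes the wordnet data already downloaded in the synonym step):

from nltk.stem import WordNetLemmatizer

lemmatizer = WordNetLemmatizer()
# pos='v' tells the lemmatizer to treat the word as a verb
print(lemmatizer.lemmatize('eating', pos='v'))   # eat
print(lemmatizer.lemmatize('ate', pos='v'))      # eat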

EX.NO:04 DATE: 8-2-2024

N-GRAMS
AIM:
To implement N-Grams using Python and NLTK.

ALGORITHM:
STEP1: Start the program
STEP2: Download and install Anaconda (Python 3.9)
STEP3: Click Environments
STEP4: Create a new environment named Tensorflow and click Create
STEP5: Change the package filter from Installed to Not installed
STEP6: Search for the tensorflow package and click Apply
STEP7: Repeat Step 6 for the keras and nltk packages
STEP8: Go to Home and select the Tensorflow environment
STEP9: Install Spyder and Jupyter Notebook, then launch
STEP10: Write and execute the N-gram code
STEP11: Stop the program

PROGRAM:
from nltk.util import ngrams

# Two clauses joined into one sample string
s = ("Machine learning is an important part of AI "
     "and AI is going to become important for daily functioning")

# Split on spaces to get the token list
tokens = [token for token in s.split(" ")]

# Build the list of bigrams (n = 2)
output = list(ngrams(tokens, 2))
print(output)

OUTPUT:

RESULT:
The N-Grams code has been executed and verified using Python and NLTK.
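Changing the second argument of ngrams() in the program above yields higher-order n-grams. A short sketch, reusing the token list, that builds trigrams and counts how often each one occurs:

from collections import Counter
from nltk.util import ngrams

# Trigrams (n = 3) over the same token list
trigrams = list(ngrams(tokens, 3))
print(trigrams[:3])

# Frequency of each trigram in the sample
print(Counter(trigrams).most_common(2))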

EX.NO:05 DATE: 15-2-2024

N-GRAMS SMOOTHING
AIM:
To implement N-Grams Smoothing using Python and NLTK.

ALGORITHM:
STEP1: Start the program
STEP2: Download and install Anaconda (Python 3.9)
STEP3: Click Environments
STEP4: Create a new environment named Tensorflow and click Create
STEP5: Change the package filter from Installed to Not installed
STEP6: Search for the tensorflow package and click Apply
STEP7: Repeat Step 6 for the keras and nltk packages
STEP8: Go to Home and select the Tensorflow environment
STEP9: Install Spyder and Jupyter Notebook, then launch
STEP10: Write and execute the N-gram smoothing code
STEP11: Stop the program

PROGRAM:
from collections import Counter
from itertools import product

# Define corpus
corpus = "the quick brown fox jumps over the lazy dog"

# Create unigram counts
unigrams = Counter(corpus.split())

# Define function to compute n-grams
def get_ngrams(sentence, n):
    return [tuple(sentence[i:i+n]) for i in range(len(sentence)-n+1)]

# Create bigram counts
bigrams = Counter(get_ngrams(corpus.split(), 2))

# Define smoothing function
def add_k_smoothing(ngram_counts, k, n_1gram_counts):
    # Total number of observed n-grams
    total_ngrams = sum(ngram_counts.values())
    # Vocabulary size (number of distinct unigrams)
    vocabulary_size = len(n_1gram_counts)
    # Shared denominator for the smoothed probabilities
    denominator = total_ngrams + k * vocabulary_size
    # Smoothed probabilities for the observed n-grams
    probabilities = {}
    for ngram, count in ngram_counts.items():
        probabilities[ngram] = (count + k) / denominator
    # Handle unseen n-grams: every possible bigram over the vocabulary
    # that was never observed receives the floor probability k / denominator
    for ngram in set(product(n_1gram_counts, repeat=2)) - set(ngram_counts.keys()):
        probabilities[ngram] = k / denominator
    return probabilities

# Apply smoothing to bigrams with k = 1 (Laplace smoothing)
k = 1
bigram_probabilities = add_k_smoothing(bigrams, k, unigrams)

# Print results
for bigram, probability in bigram_probabilities.items():
    print(bigram, probability)

OUTPUT:

RESULT:
The N-Gram Smoothing code has been executed and verified using Python and NLTK.
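The function above smooths joint bigram counts with one shared denominator. Add-k smoothing is more commonly stated conditionally, as P(w2 | w1) = (C(w1 w2) + k) / (C(w1) + k * V). A minimal sketch of that variant, reusing the bigram and unigram counts built in the program:

# Conditional add-k: P(w2 | w1) = (C(w1, w2) + k) / (C(w1) + k * V)
def conditional_add_k(bigram_counts, unigram_counts, k):
    V = len(unigram_counts)
    return {
        (w1, w2): (bigram_counts[(w1, w2)] + k) / (unigram_counts[w1] + k * V)
        for w1 in unigram_counts
        for w2 in unigram_counts
    }

cond = conditional_add_k(bigrams, unigrams, k=1)
print(cond[("the", "quick")])   # a seen bigram
print(cond[("dog", "quick")])   # an unseen bigram still gets a small non-zero probability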

EX.NO:06 DATE: 22-2-2024

POS – TAGGING: HIDDEN MARKOV MODEL


AIM:
To implement POS-Tagging: Hidden Markov Model using Python and NLTK.

ALGORITHM:
STEP1: Start the program
STEP2: Download and install Anaconda (Python 3.9)
STEP3: Click Environments
STEP4: Create a new environment named Tensorflow and click Create
STEP5: Change the package filter from Installed to Not installed
STEP6: Search for the tensorflow package and click Apply
STEP7: Repeat Step 6 for the keras, nltk, pandas, numpy and scikit-learn packages
STEP8: Go to Home and select the Tensorflow environment
STEP9: Install Spyder and Jupyter Notebook, then launch
STEP10: Write and execute the POS tagging code using a Hidden Markov Model
STEP11: Stop the program

PROGRAM:
import nltk
import numpy as np
import pandas as pd
import random
from sklearn.model_selection import train_test_split
import pprint, time

# Download the Penn Treebank sample and the universal tagset mapping
nltk.download('treebank')
nltk.download('universal_tagset')

# Each item is one sentence: a list of (word, tag) pairs
nltk_data = list(nltk.corpus.treebank.tagged_sents(tagset='universal'))
print(nltk_data[:2])

OUTPUT:

RESULT:
The POS tagging code using a Hidden Markov Model has been executed and verified using Python and NLTK.
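The program above only loads and inspects the tagged sentences. As a sketch of the next step, the data can be split and NLTK's built-in HMM trainer fitted on the training portion; the 80/20 split and the sample sentence are illustrative choices, not from the original listing:

from nltk.tag import hmm

# Hold out 20% of the sentences for testing
train_set, test_set = train_test_split(nltk_data, train_size=0.80, random_state=42)

# Supervised training estimates transition and emission probabilities from counts
trainer = hmm.HiddenMarkovModelTrainer()
tagger = trainer.train_supervised(train_set)
print(tagger.tag("The quick brown fox jumps".split()))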

EX.NO:07 DATE: 29-2-2024

POS – TAGGING: VITERBI DECODING


AIM:
To implement POS-Tagging: Viterbi Decoding using Python and NLTK.

ALGORITHM:
STEP1: Start the program
STEP2: Download and install Anaconda (Python 3.9)
STEP3: Click Environments
STEP4: Create a new environment named Tensorflow and click Create
STEP5: Change the package filter from Installed to Not installed
STEP6: Search for the tensorflow package and click Apply
STEP7: Repeat Step 6 for the keras and nltk packages
STEP8: Go to Home and select the Tensorflow environment
STEP9: Install Spyder and Jupyter Notebook, then launch
STEP10: Write and execute the POS tagging code using Viterbi decoding
STEP11: Stop the program

PROGRAM:
import nltk
from nltk.corpus import brown
nltk.download('brown')

# Training data: the first 5000 tagged sentences of the Brown corpus
sentences = brown.tagged_sents()[:5000]

# Tag frequency distribution and tag-to-tag transition counts
# (note: these are raw counts, and tag_freq / len(sentences) below stands in
# for the emission probabilities, so the Viterbi scores are unnormalised)
tag_freq = nltk.FreqDist(tag for sentence in sentences for word, tag in sentence)
transition_prob = nltk.ConditionalFreqDist(
    (tag1, tag2) for sentence in sentences for (_, tag1), (_, tag2) in nltk.bigrams(sentence)
)

# Define Viterbi function
def viterbi(sentence, tag_freq, transition_prob):
    # Initialisation: score every tag for the first word
    v = [{}]
    for tag in tag_freq:
        v[0][tag] = {"prob": tag_freq[tag] / len(sentences), "prev": None}

    # Recursion: for each later position, keep the best previous tag
    for i in range(1, len(sentence)):
        v.append({})
        for tag in tag_freq:
            max_prob = max(
                v[i - 1][prev_tag]["prob"] * transition_prob[prev_tag][tag]
                * tag_freq[tag] / len(sentences)
                for prev_tag in tag_freq
            )
            for prev_tag in tag_freq:
                if (v[i - 1][prev_tag]["prob"] * transition_prob[prev_tag][tag]
                        * tag_freq[tag] / len(sentences) == max_prob):
                    v[i][tag] = {"prob": max_prob, "prev": prev_tag}
                    break

    # Termination: pick the most probable final tag
    max_prob = max(value["prob"] for value in v[-1].values())
    current_tag = None
    for tag, data in v[-1].items():
        if data["prob"] == max_prob:
            current_tag = tag
            break

    # Backtracking: follow the stored back-pointers to recover the tag sequence
    tags = [current_tag]
    for i in range(len(v) - 1, 0, -1):
        current_tag = v[i][current_tag]["prev"]
        tags.append(current_tag)
    tags.reverse()
    return list(zip(sentence, tags))

# Example usage
sentence = "The quick brown fox jumps over the lazy dog".split()
pos_tags = viterbi(sentence, tag_freq, transition_prob)
print(pos_tags)

OUTPUT :

RESULT:
The POS tagging code using Viterbi decoding has been executed and verified using Python and NLTK.

EX.NO:08 DATE: 7-3-2024

BUILDING POS TAGGER


AIM:
To build a POS tagger using Python and NLTK.

ALGORITHM:
STEP1: Start the program
STEP2: Download and install Anaconda (Python 3.9)
STEP3: Click Environments
STEP4: Create a new environment named Tensorflow and click Create
STEP5: Change the package filter from Installed to Not installed
STEP6: Search for the tensorflow package and click Apply
STEP7: Repeat Step 6 for the keras and nltk packages
STEP8: Go to Home and select the Tensorflow environment
STEP9: Install Spyder and Jupyter Notebook, then launch
STEP10: Write and execute the POS tagger code
STEP11: Stop the program

PROGRAM:
import nltk
nltk.download('averaged_perceptron_tagger')
nltk.download('punkt')

# Tokenize the sentence and tag it with the pre-trained perceptron tagger
text = nltk.word_tokenize("And now for Everything completely Same")
print(nltk.pos_tag(text))

OUTPUT :

RESULT:
Building POS Tagger code has been executed and verified using Python and NLTK.
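The program above applies NLTK's pre-trained perceptron tagger rather than building one. A hedged sketch of actually training a tagger from annotated data, here a unigram tagger over the treebank sample with a noun fallback for unseen words (the 3000-sentence cut-off is an arbitrary illustration):

import nltk
nltk.download('treebank')
from nltk.corpus import treebank
from nltk.tag import UnigramTagger, DefaultTagger

# Learn the most frequent tag for each word; fall back to 'NN' for unknown words
train_sents = treebank.tagged_sents()[:3000]
tagger = UnigramTagger(train_sents, backoff=DefaultTagger('NN'))
print(tagger.tag("And now for something completely different".split()))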

EX.NO:09 DATE: 14-3-2024

CHUNKING
AIM:
To implement chunking using Python and NLTK.

ALGORITHM:
STEP1: Start the program
STEP2: Download and install Anaconda (Python 3.9)
STEP3: Click Environments
STEP4: Create a new environment named Tensorflow and click Create
STEP5: Change the package filter from Installed to Not installed
STEP6: Search for the tensorflow package and click Apply
STEP7: Repeat Step 6 for the keras and nltk packages
STEP8: Go to Home and select the Tensorflow environment
STEP9: Install Spyder and Jupyter Notebook, then launch
STEP10: Write and execute the chunking code and print the result
STEP11: Stop the program

PROGRAM:
import nltk

# A hand-tagged sentence: a list of (word, POS tag) pairs
sentence = [("the", "DT"), ("little", "JJ"), ("yellow", "JJ"), ("dog", "NN"), ("barked", "VBD"),
            ("at", "IN"), ("the", "DT"), ("cat", "NN")]

# NP chunk rule: an optional determiner, any number of adjectives, then a noun
grammar = "NP: {<DT>?<JJ>*<NN>}"

cp = nltk.RegexpParser(grammar)
result = cp.parse(sentence)
print(result)
result.draw()   # opens a window displaying the chunk tree

OUTPUT :

RESULT:
The chunking code has been executed and verified using Python and NLTK.
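Once parsed, the chunk tree can also be traversed programmatically. A minimal sketch that pulls just the NP chunks out of the result produced above:

# Iterate over the parse tree and print only the NP chunks
for subtree in result.subtrees(filter=lambda t: t.label() == "NP"):
    print(" ".join(word for word, tag in subtree.leaves()))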

EX.NO:10 DATE: 14-3-2024

BUILDING CHUNKERS
AIM:
To build chunkers using Python and NLTK.

ALGORITHM:
STEP1: Start the program
STEP2: Download and install Anaconda (Python 3.9)
STEP3: Click Environments
STEP4: Create a new environment named Tensorflow and click Create
STEP5: Change the package filter from Installed to Not installed
STEP6: Search for the tensorflow package and click Apply
STEP7: Repeat Step 6 for the keras and nltk packages
STEP8: Go to Home and select the Tensorflow environment
STEP9: Install Spyder and Jupyter Notebook, then launch
STEP10: Write and execute the chunker building code and print the result
STEP11: Stop the program

PROGRAM:
import nltk
from nltk.chunk import RegexpParser

# Define chunking pattern
chunking_pattern = r"""
NP: {<DT>?<JJ>*<NN>}   # noun phrase
    {<NNP>+}           # proper noun phrase
"""

# Tokenize and POS tag the text
text = "John saw the big brown bear in the forest"
tokens = nltk.word_tokenize(text)
pos_tags = nltk.pos_tag(tokens)

# Apply chunking pattern to the POS tagged text
chunk_parser = RegexpParser(chunking_pattern)
chunks = chunk_parser.parse(pos_tags)

# Print the extracted chunks
print(chunks)

OUTPUT:

RESULT:
Thus, Building Chunkers code has been executed and verified using Python and NLTK.
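The pattern above is hand-written; chunkers can also be learned from annotated data. A hedged sketch of the classic NLTK-book approach, training a unigram chunker on the CoNLL-2000 corpus (the class below follows that book example and is not part of the original program):

import nltk
nltk.download('conll2000')
from nltk.corpus import conll2000
from nltk.chunk.util import tree2conlltags, conlltags2tree

class UnigramChunker(nltk.ChunkParserI):
    def __init__(self, train_sents):
        # Learn the most likely IOB chunk tag for each POS tag
        train_data = [[(pos, chunk) for word, pos, chunk in tree2conlltags(sent)]
                      for sent in train_sents]
        self.tagger = nltk.UnigramTagger(train_data)

    def parse(self, sentence):
        # sentence is a list of (word, POS) pairs, as produced by nltk.pos_tag
        tags = self.tagger.tag([pos for word, pos in sentence])
        conlltags = [(word, pos, chunk) for (word, pos), (_, chunk) in zip(sentence, tags)]
        return conlltags2tree(conlltags)

train_sents = conll2000.chunked_sents('train.txt', chunk_types=['NP'])
chunker = UnigramChunker(train_sents)
print(chunker.parse(pos_tags))   # reuses the tagged tokens from the program above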
