
DELHI TECHNOLOGICAL UNIVERSITY
SE-316
NATURAL LANGUAGE PROCESSING

Department of Software Engineering
Delhi Technological University
Bawana Road, Delhi-110042

Submitted by:
Prashant Tiwari
Roll Number: 2K20/IT/103
Batch: IT-B

Submitted to:
Dr. Divyashikha Sethia

Department of Software Engineering
Delhi Technological University
INDEX

S. No.  Experiment                                                      Date

1.      Import nltk and download the 'stopwords' and 'punkt'            13-01-2023
        packages.
2.      Import spacy and load the language model.                       13-01-2023
3.      WAP in Python to tokenize a given text.                         20-01-2023
4.      WAP in Python to get the sentences of a text document.          03-03-2023
5.      WAP in Python to tokenize text with stopwords as delimiters.    03-02-2023
6.      WAP in Python to add custom stop words in spaCy.                03-02-2023
7.      WAP to remove punctuation, perform stemming, lemmatize a        24-02-2023
        given text and extract usernames from emails.
8.      WAP to do spell correction, extract all nouns, pronouns and     07-03-2023
        verbs in a given text.
9.      WAP to find similarity between two words and classify a text    31-03-2023
        as positive/negative sentiment.
EXPERIMENT - 1
AIM : Import nltk and download the ‘stopwords’ and ‘punkt’
packages

CODE :
import nltk

# Download the stopword lists and the 'punkt' tokenizer models
# used by word_tokenize and sent_tokenize in later experiments.
nltk.download('stopwords')
nltk.download('punkt')
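A quick sanity check (a minimal sketch, not part of the original lab) that both resources load correctly:

from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize

# 'stopwords' provides the word lists; 'punkt' backs word_tokenize.
print(stopwords.words('english')[:10])
print(word_tokenize("NLTK is ready."))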

OUTPUT :
EXPERIMENT - 2
AIM : Import spacy and load the language model

CODE :
import spacy

# Small English pipeline and multilingual NER model, respectively.
nlp_eng = spacy.load('en_core_web_sm')
nlp_multi = spacy.load('xx_ent_wiki_sm')
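Both models must be installed before spacy.load() can find them; a minimal setup sketch using spaCy's own download helper:

import spacy.cli

# One-time downloads; equivalent to `python -m spacy download <model>`.
spacy.cli.download('en_core_web_sm')
spacy.cli.download('xx_ent_wiki_sm')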

OUTPUT :
EXPERIMENT - 3
AIM : WAP in Python to tokenize a given text

CODE :
from nltk import word_tokenize

text = ("Last week, the University of Cambridge shared its own research "
        "that shows if everyone wears a mask outside home, dreaded "
        "‘second wave’ of the pandemic can be avoided.")

# Break the text into word-level tokens and print one per line.
tokens = word_tokenize(text)
for t in tokens:
    print(t)
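For comparison, a minimal sketch of the same tokenization with spaCy (assuming the en_core_web_sm model from Experiment 2 is installed):

import spacy

nlp = spacy.load('en_core_web_sm')
doc = nlp("Last week, the University of Cambridge shared its own research.")
# Tokenization runs as the first step of the pipeline; each Token
# keeps its text and position in the document.
for token in doc:
    print(token.text)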

OUTPUT :
EXPERIMENT - 4
AIM : WAP in Python to get the sentences of a text document.

CODE :
# Read the document and split it into sentences on full stops.
with open('04.txt') as file:
    input_text = file.read()

sentences = input_text.split('.')
for sentence in sentences:
    print(sentence, '\n')
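Splitting on '.' breaks on abbreviations such as "Dr." and discards the full stops themselves; a more robust sketch using NLTK's sentence tokenizer (requires the 'punkt' package from Experiment 1; '04.txt' is the same assumed input file):

from nltk.tokenize import sent_tokenize

with open('04.txt') as file:
    input_text = file.read()

# punkt handles abbreviations and keeps terminal punctuation.
for sentence in sent_tokenize(input_text):
    print(sentence, '\n')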

OUTPUT :
EXPERIMENT - 5
AIM : WAP in Python to tokenize text with stopwords as
delimiters.

CODE :
text = ("Walter was feeling anxious. He was diagnosed today. He probably "
        "is the best person I know.")

stop_words_and_delims = ['was', 'is', 'the', '.', ',', '-', '!', '?']

# Replace every stopword/delimiter with a sentinel token, then split on it.
for r in stop_words_and_delims:
    text = text.replace(r, 'DELIM')

words = [t.strip() for t in text.split('DELIM')]

# Drop the empty strings left by adjacent delimiters.
words_filtered = [w for w in words if w]
for word in words_filtered:
    print(word)
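Note that str.replace matches substrings, so a stopword like 'is' would also be cut out of a word such as 'island'. A hedged regex alternative that splits only on whole-word matches:

import re

text = ("Walter was feeling anxious. He was diagnosed today. He probably "
        "is the best person I know.")
# \b restricts the stopwords to whole-word matches; the character class
# covers the punctuation delimiters.
pattern = r'\b(?:was|is|the)\b|[.,\-!?]'
words = [w.strip() for w in re.split(pattern, text) if w.strip()]
print(words)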

OUTPUT :
EXPERIMENT - 6
AIM : WAP in Python to add custom stop words in spaCy.

CODE :
import spacy

nlp = spacy.load('en_core_web_sm')

custom_stop_words = ['was', 'is', 'the', 'JUNK', 'NIL', 'of', 'more',
                     '.', ',', '-', '!', '?', 'a']

# Mark each custom word as a stopword in the pipeline's vocabulary.
for word in custom_stop_words:
    nlp.vocab[word].is_stop = True

doc = nlp("Jonas was a JUNK great guy NIL Adam was evil NIL Martha JUNK "
          "was more of a fool")
for token in doc:
    if not token.is_stop:
        print(token.text, end=" ")
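Setting is_stop on vocabulary entries affects only the loaded pipeline; an alternative sketch (assuming spaCy v3) registers the words on the language defaults instead, so they also apply to pipelines created afterwards:

# Hedged alternative: extend the default stopword set for the language.
for word in custom_stop_words:
    nlp.Defaults.stop_words.add(word)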

OUTPUT :
EXPERIMENT - 7
AIM : WAP to remove punctuation, perform stemming,
lemmatize a given text and extract usernames from emails

CODE :
# Characters to strip from the input string (note the escaped backslash).
punctuations = '''!()-[]{};:'"\\,<>./?@#$%^&*_~'''

string = "Jonas!!! great \\guy <> Adam --evil [Martha] ;;fool() ."

# Keep only the characters that are not punctuation.
ans = ""
for char in string:
    if char not in punctuations:
        ans += char

print(ans)
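An equivalent sketch using str.translate, which avoids the character-by-character loop (an alternative, not the lab's prescribed method):

# str.maketrans with a third argument builds a deletion table.
print(string.translate(str.maketrans('', '', punctuations)))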

from nltk.stem import PorterStemmer
from nltk.tokenize import word_tokenize

text = ("Dancing is an art. Students should be taught dance as a subject "
        "in schools. I danced in many of my school functions. Some people "
        "are always hesitating to dance.")

# Stem every token with the Porter stemmer and rebuild the string.
stemmer = PorterStemmer()
tokens = word_tokenize(text)
ans = ""
for token in tokens:
    ans += stemmer.stem(token) + " "
print(ans)

from nltk.corpus import wordnet
from nltk.tokenize import word_tokenize
from nltk.stem.wordnet import WordNetLemmatizer

# Requires nltk.download('wordnet') in addition to 'punkt'.
lemmatizer = WordNetLemmatizer()
text = ("Dancing is an art. Students should be taught dance as a subject "
        "in schools. I danced in many of my school functions. Some people "
        "are always hesitating to dance.")

# Lemmatize each token as a verb (wordnet.VERB) and rebuild the string.
ans = ""
tokens = word_tokenize(text)
for token in tokens:
    ans += lemmatizer.lemmatize(token, wordnet.VERB) + " "
print(ans)

from nltk.tokenize import word_tokenize

text = ("The new registrations are [email protected], [email protected]. "
        "If you find any disruptions, kindly contact [email protected] "
        "or [email protected]")

# word_tokenize separates '@' into its own token, so the token just
# before each '@' is the username.
text_list = word_tokenize(text)
usernames = []
for i in range(len(text_list)):
    if text_list[i] == "@":
        usernames.append(text_list[i - 1])
print(usernames)
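The token-based approach depends on how word_tokenize happens to split addresses; a hedged regex alternative that captures the username directly:

import re

# [\w.+-]+ before the '@' is an assumed username pattern, not a full
# RFC 5322 address parser.
usernames = re.findall(r'([\w.+-]+)@[\w.-]+', text)
print(usernames)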

OUTPUT :
EXPERIMENT - 8
AIM : WAP to do spell correction, extract all nouns, pronouns
and verbs in a given text

CODE :
from textblob import TextBlob

# Misspelled input; TextBlob.correct() returns a spell-corrected blob.
text = "He is a gret person. He beleives in bod"
textb = TextBlob(text)
correct_text = textb.correct()
print(correct_text)

from nltk import word_tokenize, pos_tag

text = ("James works at Microsoft. She lives in manchester and likes to "
        "play the flute")
tokens = word_tokenize(text)
parts_of_speech = pos_tag(tokens)

# Keep tokens tagged as common (NN) or proper (NNP) nouns.
nouns = [word for word, tag in parts_of_speech if tag in ("NN", "NNP")]
for noun in nouns:
    print(noun)
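The aim also asks for pronouns, which the filter above skips; a minimal sketch reusing the same tags and selecting PRP (personal) and PRP$ (possessive) pronouns:

pronouns = [word for word, tag in parts_of_speech if tag in ("PRP", "PRP$")]
for pronoun in pronouns:
    print(pronoun)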

from nltk import pos_tag, word_tokenize

text = ("I may bake a cake for my birthday. The talk will introduce "
        "readers to the use of baking")

words = word_tokenize(text)
# Tag the whole sentence once instead of re-tagging inside the loop.
tags = pos_tag(words)

# Pair each base-form verb (VB) with the word before it, e.g. "may bake".
verb_phrases = []
for i in range(1, len(words)):
    if tags[i][1] == 'VB':
        verb_phrases.append(words[i - 1] + ' ' + words[i])

for phrase in verb_phrases:
    print(phrase)

OUTPUT :
EXPERIMENT - 9
AIM : WAP to find similarity between two words and classify a
text as positive/negative sentiment

CODE :
import spacy

# en_core_web_md ships word vectors, which token.similarity() needs;
# install it first with: python -m spacy download en_core_web_md
nlp = spacy.load('en_core_web_md')

words = "amazing terrible excellent"
tokens = nlp(words)
token1, token2, token3 = tokens[0], tokens[1], tokens[2]

print(f"Similarity between {token1} and {token2}:",
      token1.similarity(token2))
print(f"Similarity between {token1} and {token3}:",
      token1.similarity(token3))

from textblob import TextBlob

# sentiment returns (polarity, subjectivity); polarity > 0 leans positive.
text = "It was a very pleasant day"
print(TextBlob(text).sentiment)
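TextBlob only reports the raw scores; a minimal classification sketch on top of it (the 0.0 cut-off is an assumption, not part of TextBlob):

polarity = TextBlob(text).sentiment.polarity
# Polarity ranges from -1.0 (most negative) to +1.0 (most positive).
label = "positive" if polarity >= 0 else "negative"
print(label)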

OUTPUT :
