
exp-2 nlp

The document describes various programs for generating text using different natural language processing techniques, including random word selection, bigrams, trigrams, Markov chains, and the GPT-2 model. Each program demonstrates how to generate words or sentences based on a given input or starting word. Outputs from these programs include randomly selected words and generated sentences that reflect the structure of the input text.

Program:

import random

# Pick one word at random from a small vocabulary
vocabulary = ["natural", "language", "preprocessing", "machine",
              "learning", "artificial", "intelligence"]
random_word = random.choice(vocabulary)
print("Random Word:", random_word)

Output:

Random Word: natural

Program:

from nltk.util import bigrams
from nltk.probability import FreqDist
from random import choices

text = "natural language processing is fascinating and language is powerful"
tokens = text.split()

# Build a frequency distribution over adjacent word pairs
bigrams_list = list(bigrams(tokens))
bigram_fd = FreqDist(bigrams_list)

def generate_next_word(last_word, bigram_fd):
    # Collect every bigram that starts with the given word
    possible_bigrams = [pair for pair in bigram_fd if pair[0] == last_word]
    if not possible_bigrams:
        return None
    # Sample the next word, weighted by bigram frequency
    words, weights = zip(*[(pair[1], bigram_fd[pair]) for pair in possible_bigrams])
    return choices(words, weights=weights)[0]

start_word = "language"
generated_word = generate_next_word(start_word, bigram_fd)
print("Next Word for '{}': {}".format(start_word, generated_word))

Output:

Next Word for 'language': is
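The bigram sampler returns a single next word; calling it repeatedly walks the chain and produces a whole sentence. A minimal stdlib-only sketch of the same idea, using zip and Counter in place of nltk (the generate_sentence helper and length parameter here are illustrative additions, not part of the original program):

```python
from collections import Counter
from random import choices

text = "natural language processing is fascinating and language is powerful"
tokens = text.split()

# Count adjacent word pairs: equivalent to nltk's bigrams + FreqDist
bigram_counts = Counter(zip(tokens, tokens[1:]))

def generate_next_word(last_word, bigram_counts):
    # Candidate bigrams starting with last_word, with their counts
    candidates = [(pair, n) for pair, n in bigram_counts.items() if pair[0] == last_word]
    if not candidates:
        return None
    words = [pair[1] for pair, _ in candidates]
    weights = [n for _, n in candidates]
    return choices(words, weights=weights)[0]

def generate_sentence(start_word, bigram_counts, length=8):
    sentence = [start_word]
    while len(sentence) < length:
        next_word = generate_next_word(sentence[-1], bigram_counts)
        if next_word is None:  # dead end: no bigram starts with this word
            break
        sentence.append(next_word)
    return " ".join(sentence)

print(generate_sentence("language", bigram_counts))
```

Because sampling is weighted and random, the sentence differs between runs; it always ends early if it reaches "powerful", the only word with no observed successor.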

Program:

from nltk.util import trigrams
from nltk import FreqDist
from random import choices

text = "natural language processing is fascinating and language is powerful"
tokens = text.split()

# Frequency distribution over word triples
trigrams_list = list(trigrams(tokens))
trigram_fd = FreqDist(trigrams_list)

def generate_sentence(trigram_fd, start_words, length=10):
    sentence = list(start_words)
    for _ in range(length - len(start_words)):
        # Trigrams whose first two words match the last two generated words
        possible_trigrams = [trigram for trigram in trigram_fd
                             if trigram[:2] == tuple(sentence[-2:])]
        if not possible_trigrams:
            break
        # Sample the third word, weighted by trigram frequency
        words, weights = zip(*[(trigram[2], trigram_fd[trigram])
                               for trigram in possible_trigrams])
        next_word = choices(words, weights=weights)[0]
        sentence.append(next_word)
    return " ".join(sentence)

start_words = ("language", "is")
generated_sentence = generate_sentence(trigram_fd, start_words)
print("Generated Sentence:", generated_sentence)

Output:

Generated Sentence: language is powerful

Program:

from collections import defaultdict
import random

text = "natural language processing is fascinating and language is powerful."
tokens = text.split()

# Map each word to the list of words observed after it
markov_chain = defaultdict(list)
for i in range(len(tokens) - 1):
    markov_chain[tokens[i]].append(tokens[i + 1])

def generate_text(markov_chain, start_word, length=10):
    current_word = start_word
    result = [current_word]
    for _ in range(length - 1):
        next_words = markov_chain.get(current_word)
        if not next_words:  # no known successor: stop early
            break
        current_word = random.choice(next_words)
        result.append(current_word)
    return " ".join(result)

start_word = "language"
generated_text = generate_text(markov_chain, start_word)
print("Generated Text:", generated_text)

Output:

Generated Text: language is fascinating and language processing is powerful.
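The defaultdict built above is the entire model: each word maps to the list of words observed after it, with repeats preserving frequency. Reconstructing the table for the sample sentence makes the structure concrete (a stdlib-only sketch; the zip-based pairing is an equivalent rewrite of the index loop):

```python
from collections import defaultdict

text = "natural language processing is fascinating and language is powerful."
tokens = text.split()

# Same transition table as the program above, built by pairing
# each word with its successor
markov_chain = defaultdict(list)
for current, following in zip(tokens, tokens[1:]):
    markov_chain[current].append(following)

for word, followers in markov_chain.items():
    print(word, "->", followers)
# language -> ['processing', 'is']    (two observed successors)
# is -> ['fascinating', 'powerful.']  (the chain's branch points)
```

Words with a single successor ("natural", "processing") make the walk deterministic at that step; "language" and "is" are where the generated text can diverge between runs.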

Program:

from transformers import GPT2LMHeadModel, GPT2Tokenizer

model_name = "gpt2"
model = GPT2LMHeadModel.from_pretrained(model_name)
tokenizer = GPT2Tokenizer.from_pretrained(model_name)

prompt = "Natural Language Processing"
inputs = tokenizer.encode(prompt, return_tensors="pt")

# Greedy decoding (the default); repetitive output like the one
# below is typical without sampling
outputs = model.generate(inputs, max_length=50, num_return_sequences=1)
generated_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
print("Generated Text:", generated_text)

Output:

Generated Text: Natural Language Processing (LISP) is a new approach to processing and
processing data in a language. It is a new approach to processing and processing data in
a language. It is a new approach to processing and processing data in a language.
