
CSIT366-Lab File

The document contains a summary of 10 experiments conducted using Python: 1. Create line, bar, and histogram charts using Matplotlib library. 2. Create an n*k matrix to represent a linear function mapping k-dimensional vectors to n-dimensional vectors. 3. Import a dataset from Kaggle and perform tasks like viewing columns and dimensions. 4. Perform text analysis operations like tokenization, frequency distribution, stopword removal using NLTK. 5. Generate random words using HTTP requests and from a text file. 6. Explore using morphing to transform images. 7. Generate n-grams from text using a specified number of words. 8

Uploaded by

shivangiimishraa1819


INDEX

NO   TOPIC                                                                DATE         SIGN
1    To create Line chart, Bar chart and Histogram                        05/07/2023
2    Create a n * k matrix to represent a linear function that maps       12/07/2023
     k-dimensional vectors to n-dimensional vectors.
3    Import a dataset from Kaggle and perform different tasks on it       26/07/2023
4    Write a program to perform different text analysis operations        02/08/2023
     using NLTK
5    Write a program to generate a random word using: HTTP, text file     09/08/2023
6    Morphing
7    Generate N-grams                                                     06/09/2023
8    POS Tagging: Hidden Markov Model                                     04/10/2023
     POS Tagging: Viterbi Decoding
9    Finding K-nearest neighbor                                           11/10/2023
10   Sentiment Analysis                                                   11/10/2023

EXPERIMENT 1
AIM: To create Line chart, Bar chart and Histogram
SOFTWARE USED: Python, matplotlib library
CODE:

Line Graph

A line chart represents data points connected by straight lines. It is useful for showing trends or
changes over time or continuous data. The x-axis typically represents the independent variable,
while the y-axis represents the dependent variable.

import matplotlib.pyplot as plt

x = [1,2,3,4,5]
y = [2,4,6,8,10]

plt.plot(x,y)

plt.xlabel('X-AXIS')
plt.ylabel('Y-AXIS')
plt.title('Line Chart')

plt.show()
Bar Chart

A bar chart displays categorical data using rectangular bars of varying lengths. Each bar
represents a category, and the length of the bar corresponds to the value or frequency of that
category. Bar charts are suitable for comparing values between different categories.

import matplotlib.pyplot as plt

# One mark per subject: plt.bar needs x and y to have the same length
x = ['Maths','English','Hindi','Science','GK']
y = [80, 90, 81, 45, 67]

plt.bar(x,y)

plt.xlabel('Subjects')
plt.ylabel('Marks')
plt.title('Bar Chart')

plt.show()
Histogram

A histogram visualizes the distribution of numerical data. It consists of adjacent rectangular bins
representing intervals or ranges of values, while the height of each bin represents the frequency
or count of values falling within that range. Histograms help analyze the shape, spread, and
central tendency of the data.

data = [1,1,2,3,4,4,4,5,5,6,7,7,7,8,9]

plt.hist(data, bins=10)

plt.xlabel('Value')
plt.ylabel('Frequency')
plt.title('Histogram')

plt.show()
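
The binning described above can be sketched without any plotting library. This is only an illustration of equal-width binning, not a reproduction of how matplotlib computes its bins; the helper name `histogram_counts` is hypothetical, and floating-point rounding at bin edges may differ from a library implementation.

```python
def histogram_counts(values, bins):
    """Count how many values fall into each of `bins` equal-width intervals."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    if width == 0:
        width = 1  # all values identical: everything lands in the first bin
    counts = [0] * bins
    for v in values:
        # The last bin is closed on the right, so the maximum value lands in it.
        index = min(int((v - lo) / width), bins - 1)
        counts[index] += 1
    edges = [lo + i * width for i in range(bins + 1)]
    return counts, edges


data = [1, 1, 2, 3, 4, 4, 4, 5, 5, 6, 7, 7, 7, 8, 9]
counts, edges = histogram_counts(data, bins=10)

# Text rendering: one row per bin, one '#' per value in that bin.
for left, right, count in zip(edges, edges[1:], counts):
    print(f"{left:.1f} to {right:.1f}: {'#' * count}")
```

Each row corresponds to one bar of the plotted histogram: the interval is the bar's position on the x-axis and the number of `#` marks is its height.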
EXPERIMENT 2
AIM: Create an n*k matrix to represent a linear function that maps k-dimensional vectors to n-dimensional vectors.
SOFTWARE USED: Python (random module), NumPy library
CODE:
import random
import numpy as np

def create_matrix(n,k):
    matrix=[]
    for i in range(n):
        row=[]
        for j in range(k):
            row.append(random.random())
        matrix.append(row)
    return matrix

if __name__ == "__main__":
    matrix = create_matrix(3,2)
    print(matrix)
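
To show why an n*k matrix represents a map from k-dimensional to n-dimensional vectors, the sketch below multiplies such a matrix by a vector and checks the additivity property of a linear function. The helper `apply_matrix` and the sample vectors `u` and `v` are assumptions added for illustration; they are not part of the original listing.

```python
import random


def create_matrix(n, k):
    """Return an n x k matrix (list of n rows, each of length k) of random floats."""
    return [[random.random() for _ in range(k)] for _ in range(n)]


def apply_matrix(matrix, vector):
    """Multiply an n x k matrix by a k-dimensional vector, giving an n-dimensional vector."""
    if any(len(row) != len(vector) for row in matrix):
        raise ValueError("vector length must equal the number of matrix columns")
    return [sum(a * x for a, x in zip(row, vector)) for row in matrix]


matrix = create_matrix(3, 2)   # maps 2-dimensional input to 3-dimensional output
u = [1.0, 2.0]                 # illustrative input vectors
v = [0.5, -1.0]

print(len(apply_matrix(matrix, u)))  # 3 components in the result

# Additivity check: f(u + v) should equal f(u) + f(v) up to rounding error.
lhs = apply_matrix(matrix, [a + b for a, b in zip(u, v)])
rhs = [a + b for a, b in zip(apply_matrix(matrix, u), apply_matrix(matrix, v))]
print(all(abs(p - q) < 1e-9 for p, q in zip(lhs, rhs)))
```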
EXPERIMENT 3
AIM: Import a dataset from Kaggle and perform different tasks on it.
SOFTWARE USED: Python, pandas library, Kaggle dataset
CODE:
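import pandas as pd

# Load the CSV file downloaded from Kaggle
df = pd.read_csv("./exp_3/data2.csv")
print(df["User ID"])   # a single column
print(df.head())       # first five rows
print(df.shape)        # (number of rows, number of columns)
df.info()              # prints column types and non-null counts itself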
EXPERIMENT 4

AIM: Write a program to perform different text analysis operations using NLTK
SOFTWARE USED: Python, matplotlib, nltk library
CODE:

import nltk
from nltk.tokenize import sent_tokenize
from nltk.tokenize import word_tokenize
from nltk.probability import FreqDist
import matplotlib.pyplot as plt
from nltk.corpus import stopwords

nltk.download("punkt")

text = """Hello Dear Students, how are you doing today? Today we will study natural language concepts,
and implement the same on Python platform. Everyone has to write the program and make practical file
too"""

tokenized_sent = sent_tokenize(text)
print(tokenized_sent)

tokenized_word = word_tokenize(text)
print(tokenized_word)

fdist = FreqDist(tokenized_word)
print(fdist.most_common(2))

# Frequency Distribution Plot

fdist.plot(30, cumulative=False)
plt.show()

nltk.download("stopwords")

stop_words = set(stopwords.words("english"))

print(stop_words)

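# Keep only the tokens that are not stop words.
# The NLTK stop-word list is lowercase, so compare in lowercase.
filtered_tokens = []
for w in tokenized_word:
    if w.lower() not in stop_words:
        filtered_tokens.append(w)

print("Tokenized Words:", tokenized_word)

print("Filtered Tokens:", filtered_tokens)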
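
The same three steps (tokenization, frequency counting, stop-word removal) can be sketched with the standard library alone, which may help when checking what the NLTK calls do. The regular-expression tokenizer and the tiny `STOP_WORDS` set below are assumptions made for illustration; they are far cruder than NLTK's tokenizer and stop-word corpus.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word set (not the NLTK list).
STOP_WORDS = {"how", "are", "you", "we", "will", "the", "and", "on", "to", "has", "same"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[A-Za-z]+", text.lower())


text = ("Hello Dear Students, how are you doing today? "
        "Today we will study natural language concepts.")

tokens = tokenize(text)
freq = Counter(tokens)            # plays the role of FreqDist here
print(freq.most_common(1))        # [('today', 2)]

filtered = [t for t in tokens if t not in STOP_WORDS]
print(filtered)
```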
EXPERIMENT 5
AIM: Write a program to generate a random word using:
1) HTTP Request
2) A text file
SOFTWARE USED: Python
CODE:

1) HTTP Request

import random
import string

def generate_random_word(length):
    # Build a word by picking `length` random lowercase letters
    letters = string.ascii_lowercase
    return "".join(random.choice(letters) for _ in range(length))

def main():
    try:
        word_length = int(input("Enter the desired word length: "))
        num_words = int(input("Enter the number of words you need: "))

        if word_length <= 0 or num_words <= 0:
            print("Word length and number of words should be positive integers")
            return

        generated_words = [generate_random_word(word_length) for _ in range(num_words)]

        print("\nGenerated words: ")
        for word in generated_words:
            print(word)

    except ValueError:
        print(
            "Please enter valid positive integers for word length and number of words."
        )

if __name__ == "__main__":
    main()
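
The listing above builds words from random letters locally. As a minimal sketch of the HTTP route named in the aim, the code below assumes a plain-text word list (one word per line) is served at some URL. The function names `fetch_words` and `random_word_from_url`, the `opener` parameter, and the example URL in the comment are all hypothetical; no particular word-list service is implied.

```python
import random
from urllib.request import urlopen


def fetch_words(url, opener=urlopen):
    """Download a newline-separated word list and return the non-empty lines."""
    with opener(url) as response:
        text = response.read().decode("utf-8")
    return [line.strip() for line in text.splitlines() if line.strip()]


def random_word_from_url(url, opener=urlopen):
    """Pick one word at random from the list served at `url`."""
    words = fetch_words(url, opener)
    if not words:
        raise ValueError("the word list is empty")
    return random.choice(words)


# Usage (placeholder URL, replace with a real word-list address):
# print(random_word_from_url("https://example.com/words.txt"))
```

The `opener` parameter exists only so the function can be exercised without network access, for example by passing a function that returns an in-memory byte stream.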

2) A text file

import random

def get_list_of_words(path):
    with open(path, "r", encoding="utf-8") as f:
        return f.read().splitlines()

words = get_list_of_words("/content/wordGenerate.txt")

print(words)
random_word = random.choice(words)

print(random_word)
EXPERIMENT 6
AIM: To explore how to use morphing to transform one image into another.
SOFTWARE USED: Python
CODE:
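
No code is recorded for this experiment. As a minimal sketch only, the block below shows a cross-dissolve, which is the pixel-blending half of a morph; a full morph would also warp the geometry of both images, and that step is not attempted here. Images are assumed to be small grayscale grids stored as nested lists, and the function name `cross_dissolve` and the sample grids are hypothetical.

```python
def cross_dissolve(image_a, image_b, t):
    """Blend two equally sized grayscale images; t=0 gives image_a, t=1 gives image_b."""
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must be between 0 and 1")
    if len(image_a) != len(image_b) or any(
        len(row_a) != len(row_b) for row_a, row_b in zip(image_a, image_b)
    ):
        raise ValueError("images must have the same dimensions")
    return [
        [(1 - t) * pa + t * pb for pa, pb in zip(row_a, row_b)]
        for row_a, row_b in zip(image_a, image_b)
    ]


# Two illustrative 2x2 grayscale "images" (0 = black, 255 = white).
source = [[0, 0], [255, 255]]
target = [[255, 0], [0, 255]]

# Five frames moving from the source image to the target image.
frames = [cross_dissolve(source, target, step / 4) for step in range(5)]
for frame in frames:
    print(frame)
```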

EXPERIMENT 7
AIM: To generate n-grams from a given text, where an n-gram is a sequence of 'n' words.
SOFTWARE USED: Python
CODE:
def generate_ngrams(text, WordsToCombine):
    words = text.split()
    output = []
    for i in range(len(words) - WordsToCombine + 1):
        output.append(words[i:i + WordsToCombine])
    return output

# Example usage:
text = 'this is a very good class to teach and interact'
WordsToCombine = 3

ngrams = generate_ngrams(text, WordsToCombine)

print(ngrams)
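
A common next step with n-grams is counting how often each one occurs. The sketch below is a variation on the function above that returns tuples, so the n-grams can be used as keys in a `Counter`; the sample sentence is an assumption chosen because it contains a repeated bigram.

```python
from collections import Counter


def generate_ngrams(text, n):
    """Return the n-grams of `text` as tuples of n consecutive words."""
    words = text.split()
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


text = "to be or not to be"   # illustrative sentence with a repeated bigram

bigrams = generate_ngrams(text, 2)
counts = Counter(bigrams)
print(counts.most_common(1))  # [(('to', 'be'), 2)]

# Joining each tuple gives the n-grams back as readable strings.
trigram_strings = [" ".join(gram) for gram in generate_ngrams(text, 3)]
print(trigram_strings)
```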
EXPERIMENT 8
AIM: To understand and implement Part-of-Speech (POS) tagging using a Hidden Markov
Model and to implement Viterbi decoding for POS tagging.
SOFTWARE USED: Python, Jupyter
CODE:

HIDDEN MARKOV MODEL

# Import libraries
import nltk
from nltk.tag import hmm
from nltk.corpus import treebank
nltk.download('treebank')

# Train the HMM POS tagger

train_data = treebank.tagged_sents()[:3000] # Training data
tagger = hmm.HiddenMarkovModelTrainer().train(train_data)

# POS tagging
test_sentence = "This is a sample sentence for POS tagging."
tokens = nltk.word_tokenize(test_sentence)
pos_tags = tagger.tag(tokens)

# Display the POS tags

print(pos_tags)
VITERBI DECODING

# Import libraries
import nltk
from nltk.tag import hmm
from nltk.corpus import treebank

# Train the HMM POS tagger

train_data = treebank.tagged_sents()[:3000] # Training data
tagger = hmm.HiddenMarkovModelTrainer().train(train_data)

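# Viterbi decoding for POS tagging.
# NLTK's HMM tagger already decodes with the Viterbi algorithm inside tag(),
# so this wrapper only tokenizes the sentence and delegates to it.

def viterbi(sentence, tagger):
    tokens = nltk.word_tokenize(sentence)
    viterbi_path = tagger.tag(tokens)
    return viterbi_path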

# Test the Viterbi POS tagging

test_sentence = "This is a test sentence for Viterbi decoding."
viterbi_tags = viterbi(test_sentence, tagger)
print(viterbi_tags)
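
To make the decoding step itself visible, the following is a minimal sketch of the Viterbi algorithm on a toy HMM. The three tags and every probability in `START_P`, `TRANS_P`, and `EMIT_P` are made-up values for illustration, not estimates from the treebank corpus, and the function is named `viterbi_decode` here to keep it distinct from the wrapper above.

```python
def viterbi_decode(observations, states, start_p, trans_p, emit_p):
    """Return the most probable (word, tag) sequence under the given HMM."""
    if not observations:
        return []
    # best[t][s]: probability of the best tag path that ends in state s at position t
    best = [{s: start_p[s] * emit_p[s].get(observations[0], 0.0) for s in states}]
    back = [{}]  # back[t][s]: previous state on that best path
    for t in range(1, len(observations)):
        best.append({})
        back.append({})
        for s in states:
            prev, prob = max(
                ((p, best[t - 1][p] * trans_p[p][s]) for p in states),
                key=lambda pair: pair[1],
            )
            best[t][s] = prob * emit_p[s].get(observations[t], 0.0)
            back[t][s] = prev
    # Follow the back-pointers from the best final state.
    path = [max(states, key=lambda s: best[-1][s])]
    for t in range(len(observations) - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return list(zip(observations, path))


# Toy model: all numbers below are illustrative assumptions.
STATES = ["DET", "NOUN", "VERB"]
START_P = {"DET": 0.6, "NOUN": 0.3, "VERB": 0.1}
TRANS_P = {
    "DET": {"DET": 0.05, "NOUN": 0.9, "VERB": 0.05},
    "NOUN": {"DET": 0.1, "NOUN": 0.3, "VERB": 0.6},
    "VERB": {"DET": 0.5, "NOUN": 0.4, "VERB": 0.1},
}
EMIT_P = {
    "DET": {"the": 0.9, "a": 0.1},
    "NOUN": {"dog": 0.5, "walk": 0.2, "park": 0.3},
    "VERB": {"walk": 0.7, "barks": 0.3},
}

print(viterbi_decode(["the", "dog", "barks"], STATES, START_P, TRANS_P, EMIT_P))
# [('the', 'DET'), ('dog', 'NOUN'), ('barks', 'VERB')]
```

Multiplying raw probabilities is fine for a three-word example; for long sentences a practical implementation would typically sum log-probabilities to avoid underflow.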
EXPERIMENT 9
AIM: To implement K-nearest neighbor (KNN) classification for data points.
SOFTWARE USED: Python, Jupyter
CODE:

# Import necessary libraries

from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
import matplotlib.pyplot as plt # Import matplotlib

# Load the Iris dataset

data = load_iris()

# Split the data into features (X) and labels (y)

X = data.data
y = data.target

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Create and train the KNN classifier

k = 3 # Number of neighbors
knn_classifier = KNeighborsClassifier(n_neighbors=k)
knn_classifier.fit(X_train, y_train)

# Make predictions on the test set

y_pred = knn_classifier.predict(X_test)

# Calculate the accuracy of the model

accuracy = accuracy_score(y_test, y_pred)
print("Accuracy:", accuracy)

# Visualize the Iris dataset

plt.figure(figsize=(8, 6))
plt.scatter(X[:, 0], X[:, 1], c=y, cmap=plt.cm.Set1, edgecolor='k')
plt.xlabel('Sepal Length (cm)')
plt.ylabel('Sepal Width (cm)')
plt.title('Iris Dataset (Sepal Length vs. Sepal Width)')
plt.show()
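
What the classifier does at prediction time can be sketched in a few lines of plain Python: measure the distance from the query to every training point, take the k closest, and return the majority label. The function name `knn_predict` and the toy points below are assumptions for illustration; scikit-learn's `KNeighborsClassifier` uses more efficient neighbor search internally and may break ties differently.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest training points."""
    # Pair each training label with its Euclidean distance to the query, nearest first.
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Toy 2-D data: two well-separated clusters (illustrative values).
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
labels = ["a", "a", "a", "b", "b", "b"]

print(knn_predict(points, labels, (1.1, 1.0)))  # a
print(knn_predict(points, labels, (5.1, 5.0)))  # b
```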
EXPERIMENT 10
AIM: To perform sentiment analysis on text data using a pre-trained model.
SOFTWARE USED: Python, Jupyter
CODE:

# Import libraries
import transformers
from transformers import pipeline

# Load a pre-trained sentiment analysis model

sentiment_analyzer = pipeline("sentiment-analysis")

# Analyze sentiment of text

text = "I love this product! It's amazing."
sentiment = sentiment_analyzer(text)

# Display sentiment
print(sentiment)
