NLP Previous Sem

The document outlines the examination structure for the 6th Semester B. Tech CSE Semester End Examination in May 2024, focusing on Natural Language Processing. It includes detailed instructions for answering questions from various units, covering topics such as NLP applications, Python programming for text analysis, regular expressions, and supervised classification. Each unit presents multiple questions, allowing students to demonstrate their understanding of NLP concepts and techniques.

Uploaded by

Amann Adil


SRN

6th Semester B. Tech CSE Semester End Examination MAY 2024

Course Title: Natural Language Processing
Course Code: B20EFS613 - 22221607
Time: 3 Hours Max. Marks: 100
Note:
1. Answer ONE FULL question from each unit.
2. Verify and ensure that question paper is completely printed before answering the question paper.
3. Any queries/discrepancies regarding the question paper, must be brought to the notice of the invigilator
4. Students must check the course title and course code before answering the question paper
UNIT – I Marks
1. a) Natural language processing is an advanced technique that has led to many 7
advancements. Identify and explain the applications of natural language
processing in the real world.
b) i. Let text1 = 'The cat is on the mat' 8
What is the difference between the following two lines? Which one will give a
larger value? Will this be the case for other texts?
>>> sorted(set([w.lower() for w in text1]))
>>> sorted([w.lower() for w in set(text1)])
ii. Build Python code that finds all the words occurring at least four times in Brown
Corpus.
c) Develop the expressions for finding all words in some text file that meet the following 10
conditions. The result should be in the form of a list of words: ['word1', 'word2', ...].
i. Ending in ize
ii. Containing the letter z
iii. Containing the sequence of letters pt
iv. All lowercase letters except for an initial capital (i.e., titlecase)
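For the word-pattern conditions in question 1 c), one possible sketch uses Python's re module over a list of words. The word list below is a hypothetical stand-in for the tokens that would be read from a text file.

```python
import re

# Hypothetical sample words; in practice these would be read from a text file.
words = ["Realize", "organize", "size", "zebra", "accept",
         "Script", "apt", "London", "NLP", "cat"]

# i. Words ending in "ize"
ending_in_ize = [w for w in words if re.search(r"ize$", w)]

# ii. Words containing the letter "z"
containing_z = [w for w in words if re.search(r"z", w)]

# iii. Words containing the letter sequence "pt"
containing_pt = [w for w in words if re.search(r"pt", w)]

# iv. Titlecase words: one initial capital followed only by lowercase letters
titlecase = [w for w in words if re.search(r"^[A-Z][a-z]*$", w)]

print(ending_in_ize)
print(containing_z)
print(containing_pt)
print(titlecase)
```

Each result is a plain list of words, which is the output form the question asks for.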
OR
2. a) Develop and interpret suitable Python code to perform the following basic 8
operations of NLP on a text file, with an example.
i. concordance()
ii. tokenize()
iii. similar()
iv. common_contexts()
b) Suppose a text file contains more than 5000 words narrating a story of ‘Sherlock 8
Holmes’. Define lexical diversity, then develop and interpret a Python
program that computes the lexical diversity of the text to assess its complexity, with
an example.
c) Identify any five common nouns in the English Literature and examine the holonym- 9
meronym relations for these nouns by outlining a step-by-step procedure of
examination.
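For question 2 b), lexical diversity is commonly taken as the ratio of distinct word types to total word tokens. The sketch below assumes that definition and a simple regular-expression tokenizer. The sample sentence is a hypothetical stand-in for the story file.

```python
import re

def lexical_diversity(text):
    """Return the ratio of distinct word types to total word tokens."""
    # Simple tokenizer (an assumption): lowercase runs of letters and apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)

# Hypothetical short sample; a real run would read the story from a file.
sample = "the cat sat on the mat and the dog sat on the rug"
score = lexical_diversity(sample)
print(round(score, 3))
```

A score near 1 means few words are repeated. A lower score means the same words recur often.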
UNIT – II
3. a) A text file contains a large collection of words. Apply the following regular expressions in 10
Python code and describe the set of strings matched by each:
i. [a-zA-Z]+
ii. [A-Z][a-z]*
iii. ^m+i+n+e
iv. [^ghi] [mno] [jlk] [def]

b) Multilingual text combines many languages, each character having a unique Unicode code point. With a 10
neat diagram of Unicode decoding and encoding, illustrate how NLP handles multilingual
text.
c) Compare Stemming and lemmatization operations in NLP 5
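For question 3 a), a small sketch can test the first three patterns against a hypothetical sample list. It uses re.fullmatch, so a string counts only when the whole string fits the pattern. That is an assumption about how "matched" is meant. The fourth pattern is left out because the spaces in it as printed may not be intended.

```python
import re

# The first three patterns from the question, with descriptive labels.
patterns = {
    "alphabetic": r"[a-zA-Z]+",      # one or more letters of either case
    "capitalized": r"[A-Z][a-z]*",   # one capital, then zero or more lowercase
    "mine_like": r"^m+i+n+e",        # repeated m, i, n, then a single e
}

# Hypothetical sample strings.
samples = ["Hello", "hello", "NLP", "mine", "mmiinne", "miner", "A", "42"]

# Keep the samples that each pattern matches in full.
matched = {
    name: [s for s in samples if re.fullmatch(pat, s)]
    for name, pat in patterns.items()
}

for name, hits in matched.items():
    print(name, hits)
```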
OR
4. a) Develop and interpret Python code that scrapes a favorite web page and extracts some text 10
from it. For example, access a weather site and pull out the top temperature of the city
from the HTML document.
url: http://www.accuweather.com/en/us/charlottesville-va/22902/weather-forecast/331243
b) With an illustrative block diagram and the corresponding Python code, substantiate the 10
process of NLP pipeline to build the vocabulary in NLP.
c) Define Text normalization. Explain the techniques of text normalization with a suitable 5
python code.
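For question 4 c), a minimal text normalization sketch is given here. It covers only three common steps: case folding, punctuation removal, and whitespace collapsing. Stemming and lemmatization are also normalization techniques but are not shown. The function name normalize is hypothetical.

```python
import re
import string

def normalize(text):
    """Lowercase, strip ASCII punctuation, and collapse whitespace."""
    text = text.lower()                                        # case folding
    text = text.translate(str.maketrans("", "", string.punctuation))
    text = re.sub(r"\s+", " ", text).strip()                   # tidy whitespace
    return text

normalized = normalize("  Hello,   World! NLP is FUN.  ")
print(normalized)
```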
UNIT – III
5. a) Linguists use morphological, syntactic, and semantic clues to determine the category of a 10
word. Explain the following terms with a suitable example.
i. Morphological Clues
ii. Syntactic Clues
iii. Semantic Clues
iv. New Words
b) Construct two dictionaries, Student (consisting of the student’s SRN and marks) and 5
NewEntryStudents, and add some entries to each. Now issue the command
Student.update(NewEntryStudents). What did this do? What might it be useful for?
c) Train a bigram tagger with no backoff tagger, and run it on some of the training data. 10
Next, run it on some new data. What happens to the performance of the tagger? Why?
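For question 5 b), the behavior of dict.update can be observed directly. The SRN keys and marks below are hypothetical values chosen for illustration.

```python
# Hypothetical SRN -> marks entries.
student = {"SRN001": 72, "SRN002": 85}
new_entry_students = {"SRN002": 90, "SRN003": 64}

# update() modifies student in place: keys already present are overwritten,
# and keys that are new are added.
student.update(new_entry_students)

print(student)
```

This makes update useful for merging a batch of new or corrected records into an existing table.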
OR
6. a) Words can be grouped into classes, such as nouns, verbs, adjectives, and adverbs. 10
Inspect these classes and explain what is meant by lexical categories.
b) Tokenize and tag the following sentence: “They wind back the clock, while we chase 10
after the wind.” Analyze which different pronunciations and parts of speech are
involved.
c) Construct a dictionary e, to represent a single lexical entry for some word of your choice. 5
Define keys such as headword, part-of-speech, sense, and example, and assign them
suitable values.
UNIT – IV
7. a) A decision tree is constructed by partitioning the training samples into successive 10
subsets. Illustrate five different algorithms that have been developed to efficiently construct
an accurate decision tree.
b) “Generative models are strictly more powerful than conditional models.” Justify the 10
statement with suitable examples.
c) In maximum entropy models, the term “features” often refers to joint-features. 5
Analyze how joint-features are connected to maximum entropy.
OR

8. a) The synonyms strong and powerful pattern differently (try combining them with chip 10
and sales). What features are relevant in this distinction? Build a classifier that predicts
when each word should be used.
b) Suppose you wanted to automatically generate a prose description of a scene, and 10
already had a word to uniquely describe each entity, such as the book, and simply
wanted to decide whether to use in or on in relating various items, e.g., the book is in
the cupboard versus the book is on the shelf. Analyze this issue by looking at corpus data
and writing programs as needed. Consider the following examples:
i. in the car versus on the train
ii. in town versus on campus
iii. in the picture versus on the screen
iv. in Macbeth versus on Letterman

c) Consider one of the language technologies mentioned in this section, such as word sense 5
disambiguation, semantic role labelling, question answering, machine translation, named
entity detection. Identify what type and quantity of annotated data is required for
developing such systems. Why do you think a large amount of data is required?

***

SRN

6th Semester B.Tech CSE Semester End Examination May 2024

Course Title: Natural Language Processing
Course Code: B21ET0601 - 22121602
Time: 3 Hours Max. Marks: 100
Note:
1. Answer ONE FULL question from each unit.
2. Verify and ensure that question paper is completely printed before answering the question paper.
3. Any queries/discrepancies regarding the question paper, must be brought to the notice of the invigilator
4. Students must check the course title and course code before answering the question paper
UNIT – I Marks
1. a) You are working on a project to analyze a large dataset of customer reviews for a range 10
of products and are asked to extract meaningful insights about common customer
complaints, product features that are praised, and overall sentiment towards different
products. Explain the following NLTK functions for searching text, with examples on the
Brown Corpus: concordance(), similar(), common_contexts().
b) A corpus is a large collection of linguistic data used to perform NLP operations. Explain 10
the following corpora, with a Python code snippet for accessing
each.
i. Gutenberg Corpus
ii. Web and Chat Text
iii. Brown Corpus
iv. Reuters Corpus
v. Inaugural Address Corpus
c) Demonstrate a program to print the 50 most frequent trigrams (3 adjacent words) of a 5
text, omitting trigrams that contain stopwords.
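For question 1 c), one possible sketch counts trigrams with collections.Counter. The stopword set here is a small hypothetical list, not NLTK's stopword corpus, and the sample text is made up for illustration.

```python
import re
from collections import Counter

# Small hypothetical stopword set; a fuller list would be used in practice.
STOPWORDS = {"the", "a", "an", "is", "on", "of", "and", "in", "to"}

def top_trigrams(text, n=50):
    """Return the n most frequent trigrams that contain no stopword."""
    tokens = re.findall(r"[a-z]+", text.lower())
    trigrams = zip(tokens, tokens[1:], tokens[2:])   # 3 adjacent words
    kept = (t for t in trigrams if not any(w in STOPWORDS for w in t))
    return Counter(kept).most_common(n)

sample = "big red dog chased big red dog near big red barn"
result = top_trigrams(sample)
print(result[:3])
```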
OR
2. a) Explain conditional frequency in detail. Write a program to find the word “is” in the 10
‘news’ genre of the Brown Corpus.

b) NLP models like ChatGPT3 are very good at natural language understanding, which is why 10
they respond so well to different types of prompts in various languages.
Explain the different technologies involved in automatic natural language understanding.
c) Compare and contrast Stemming and Lemmatization operations in NLP 5
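For question 2 a), a conditional frequency distribution counts events separately under each condition, for example words counted per genre. The sketch below shows the idea with the standard library and hypothetical (genre, word) pairs. With NLTK installed, nltk.ConditionalFreqDist built over the Brown Corpus plays the same role.

```python
from collections import Counter, defaultdict

# Hypothetical (genre, word) pairs standing in for Brown Corpus data.
pairs = [
    ("news", "is"), ("news", "the"), ("news", "is"),
    ("romance", "is"), ("romance", "love"),
]

# One frequency counter per condition (genre).
cfd = defaultdict(Counter)
for genre, word in pairs:
    cfd[genre][word.lower()] += 1

# Frequency of "is" under the condition "news".
news_is = cfd["news"]["is"]
print(news_is)
```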
UNIT – II
3. a) Regular expressions are a powerful tool for pattern matching in NLP. Explain the basic 10
meta-characters used in regular expressions, including wildcards, ranges, and closures
with examples of each meta-character and describe how they can be used in pattern
matching tasks.
b) Build regular expressions to check 10
i. whether the string str = "Data Science" starts with a given pattern
ii. whether whitespace has been removed from a string that had whitespace at
its beginning and end.
c) Differentiate between lists, strings and tuples. 5
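For question 3 b), a brief sketch with the re module is given here. The pattern "Data" and the padded string are assumptions chosen for illustration.

```python
import re

text = "Data Science"

# i. re.match anchors at the start of the string, so it tests "starts with".
starts_with_data = re.match(r"Data", text) is not None
starts_with_science = re.match(r"Science", text) is not None

# ii. Remove leading and trailing whitespace, then confirm none remains.
padded = "   Data Science  \n"
stripped = re.sub(r"^\s+|\s+$", "", padded)
no_edge_whitespace = re.search(r"^\s|\s$", stripped) is None

print(starts_with_data, starts_with_science)
print(repr(stripped), no_edge_whitespace)
```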
OR
4. a) You are developing a text editor that includes a feature to detect and highlight specific 10
grammatical constructs in text. Build the regular expressions to match the following
classes of strings:
i. Strings containing any one of the determiners - a, an, and the.
ii. An arithmetic expression using integers, addition, and multiplication,
such as 2*3+8.
b) Explain segmentation with derivative and evaluate function. 10
c) Our programs often need to deal with different languages, and different character sets. 5
Explain what you understand by the Unicode.
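For question 4 a), one possible pair of patterns is sketched below. It assumes that determiners must appear as whole words, and that an arithmetic expression contains at least one operator between unsigned integers with no spaces. The names DETERMINER and ARITHMETIC are hypothetical.

```python
import re

# i. Whole-word determiners "a", "an", or "the", in any letter case.
DETERMINER = re.compile(r"\b(?:a|an|the)\b", re.IGNORECASE)

# ii. Integers joined by + or *, such as 2*3+8.
ARITHMETIC = re.compile(r"\d+(?:[+*]\d+)+")

has_det = bool(DETERMINER.search("She saw an owl"))
no_det = bool(DETERMINER.search("Another theme park"))
valid_expr = bool(ARITHMETIC.fullmatch("2*3+8"))
invalid_expr = bool(ARITHMETIC.fullmatch("2*+3"))

print(has_det, no_det, valid_expr, invalid_expr)
```

The word boundaries stop the determiner pattern from firing inside longer words such as "Another" or "theme".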
UNIT – III
5. a) The POS is identified using the word, its meaning and the context in which the word is 10
used. Explain POS tagging and illustrate reading and POS tagging of a tagged corpus
with Python code using the NLTK library. (Use the Brown Corpus)
b) A Python dictionary is an efficient way of storing data as key-value pairs. Explain 10
how the mapping of words to tags is done using a dictionary. Explain and develop a
default dictionary with list values, with an example program.
c) Develop a python program to find the POS of given sentences. 5
Sent= "The quick brown fox jumps over the lazy dog"
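For question 5 b), a default dictionary with list values can map each word to the tags it has been seen with. The tagged pairs below are hypothetical. In practice they would come from a tagged corpus.

```python
from collections import defaultdict

# Hypothetical (word, tag) pairs standing in for a tagged corpus.
tagged = [
    ("the", "DET"), ("wind", "NOUN"), ("wind", "VERB"),
    ("back", "ADV"), ("the", "DET"),
]

# Missing keys start with an empty list, so no existence check is needed.
word_tags = defaultdict(list)
for word, tag in tagged:
    if tag not in word_tags[word]:
        word_tags[word].append(tag)

print(dict(word_tags))
```

Looking up a word that was never seen returns an empty list instead of raising KeyError.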
OR
6. a) With a neat diagram, explain the process of general N-gram tagging using the NLTK 10
library’s built-in taggers. Why should data be split into training and test portions?
b) With suitable python code explain the automatic tagging with the evaluate function. 10
c) Explain the universal part of speech tag set. 5
UNIT – IV
7. a) Supervised classification classifies inputs based on labeled data. With a neat 10
diagram, explain the working principle of supervised classification of text.
b) Develop a python NLP program for Movie review using NLTK library’s Naïve Bayes 10
classifier and ‘names’ corpus.
c) Explain the confusion matrix for the bigram tagger. 5
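For question 7 c), a confusion matrix tabulates how often each gold tag was assigned each predicted tag. The sketch below builds one from hypothetical tag sequences. They are not the output of a real bigram tagger. NLTK also provides nltk.ConfusionMatrix for the same purpose.

```python
from collections import Counter

# Hypothetical gold-standard tags and tagger output for six tokens.
gold = ["DET", "NOUN", "VERB", "NOUN", "ADJ", "NOUN"]
predicted = ["DET", "NOUN", "NOUN", "NOUN", "NOUN", "VERB"]

# Each (gold, predicted) pair is one cell of the confusion matrix.
confusion = Counter(zip(gold, predicted))

# Diagonal cells (gold == predicted) are the correct decisions.
correct = sum(c for (g, p), c in confusion.items() if g == p)
accuracy = correct / len(gold)

for (g, p), count in sorted(confusion.items()):
    print(f"gold={g:<5} predicted={p:<5} count={count}")
print(accuracy)
```

Off-diagonal cells, such as a VERB tagged as NOUN, show which tags are confused with each other.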
OR
8. a) Develop a python program for POS tagging using Decision Tree Classifier in NLTK. Use 10
tagged Brown corpus for training.
b) In general, one text may convey the same meaning as a second text. Briefly describe recognizing 10
textual entailment with an example.
c) A decision tree is a supervised classification method for classifying the input. With a suitable diagram, 5
explain the decision tree for classification.

***
