Natural Language Processing Question Bank
Natural Language Processing Question Bank
1. Explain the difference between stemming and lemmatization. When would you choose one
over the other in text preprocessing?
2. Describe how stop words affect the results of text analysis. Provide an example of when
removing stop words might not be advisable.
3. What is the purpose of a term-document matrix, and how is it different from a
document-term matrix?
4. How does TF-IDF help in identifying the importance of words in a document? Provide a
real-world application.
5. Give an example of a sentence, and perform part-of-speech tagging for each word. Explain
the importance of this process in NLP.
1. Discuss the ethical considerations associated with web scraping. How can businesses ensure
responsible data extraction?
2. Explain the key steps involved in web scraping for real-time data extraction. Provide an
example of a website and the data you might extract from it.
Sentiment Analysis:
1. How does the Afinn algorithm work in sentiment analysis? What are its limitations?
2. Differentiate between sentiment polarity and subjectivity in text analysis. Provide examples
3. Create a visualization that represents sentiment analysis results for a set of Amazon
customer reviews. Interpret the visualization.
Topic Modelling:
1. What is the main objective of topic modelling? Explain Latent Dirichlet Allocation (LDA) as a
topic modelling technique.
2. Suppose you have a collection of news articles. How would you use topic modeling to group
similar articles together? Provide a step-by-step process.
1. Describe the role of Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM)
networks in text generation. Provide an example.
2. How can machine learning models be used to classify text into predefined categories?
Explain the concept of feature engineering in text classification.
1
Natural Language Processing
Prof. Jayesh Jain
Applying NLP to Real-World Business Problems:
1. Imagine you're working for a hotel chain. How would you apply NLP techniques to improve
customer reviews analysis and enhance customer experience?
2. Analyse a hypothetical e-commerce use case and propose an NLP-driven solution to reduce
customer churn and increase sales.
Chatbots:
Text Preprocessing:
1. In what scenarios might text normalization techniques such as lowercase conversion and
punctuation removal be necessary during text preprocessing?
2. Explain the concept of stemming and provide an example of how stemming might affect the
meaning of words in a sentence.
1. Describe the key differences between the Count Vectorization and TF-IDF approaches for text
representation. When would you prefer one over the other?
2. How can you handle rare or unique words in TF-IDF vectorization? Why is this important in
practice?
1. Discuss the role of syntactic parsing in NLP. Provide an example of a sentence and
demonstrate how you would represent its structure using grammar rules.
2. Explain the importance of part-of-speech tagging in machine translation systems. How does
it impact translation accuracy?
WordNet-based Similarity:
1. Describe how WordNet can be used to calculate semantic similarity between words. Provide
an example of two words and their semantic similarity score.
2. Discuss the limitations of WordNet-based similarity measures in handling polysemy and
context-dependent word meanings.
2
Natural Language Processing
Prof. Jayesh Jain
Visualization in NLP:
1. Explain the significance of data visualization in NLP. Provide an example of a complex NLP
dataset and describe how visualization can aid in understanding the data.
2. How can sentiment analysis results be visualized in a way that conveys not only sentiment
polarity but also subjectivity and objectivity levels?
Text Classification:
1. Suppose you're building a spam email classifier. Describe the process of feature selection and
the choice of a machine learning algorithm to achieve high accuracy.
2. In a customer support chatbot context, how can text classification be applied to route
customer queries to the appropriate departments or agents? Explain the steps involved.
3
Natural Language Processing
Prof. Jayesh Jain