0% found this document useful (0 votes)
89 views

Data Science 1st Sessional

This document is an examination for a Data Science course held at COMSATS University Islamabad Sahiwal Campus. It contains 3 questions assessing various data science concepts. Question 1 asks about data visualization, association mining, and how NLTK is helpful for data analysis. Question 2 asks about data pre-processing steps, the difference between stemming and lemmatization, and writing code for stop word removal in Python. Question 3 asks about the role of data cleaning, whether Python or R is better for text analytics and justifying the answer, and which Python library is best for sentiment analysis with justification. Students are instructed to upload a PDF of their handwritten answer sheet with their roll number and name as the file name.

Uploaded by

Ali Adnan Asghar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
89 views

Data Science 1st Sessional

This document is an examination for a Data Science course held at COMSATS University Islamabad Sahiwal Campus. It contains 3 questions assessing various data science concepts. Question 1 asks about data visualization, association mining, and how NLTK is helpful for data analysis. Question 2 asks about data pre-processing steps, the difference between stemming and lemmatization, and writing code for stop word removal in Python. Question 3 asks about the role of data cleaning, whether Python or R is better for text analytics and justifying the answer, and which Python library is best for sentiment analysis with justification. Students are instructed to upload a PDF of their handwritten answer sheet with their roll number and name as the file name.

Uploaded by

Ali Adnan Asghar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 1

COMSATS University Islamabad

Sahiwal Campus
(Computer Science)

Sessional-I Examination Spring-2021


Course Title: Data Science Course Code: CSC461 Credit Hours: 3
Course Instructor: Dr.Waheed Ramay Programme Name: BSI,BSE
Semester: Batch: Section: Date: 02-04-2021
Time Allowed: 60 mints Maximum Marks:
Student’s Name: Reg. No. CUI/ /SWL
Important Instructions / Guidelines:
Read the question paper carefully and answer the questions according to their statements.
Mobile phones are not allowed. Calculators must not have any data/equations etc. in their memory.

Q.1

1. What is Data visualization?


2. What is association mining?
3. What is NLTK. how it is helpful in data analysis?
4. Differentiate between classification and clustering?

Q.2
1. Describe the Data Pre-Processing steps.
2. What is the difference b/w stemming and lemmatization?
3. Write a code for stop word removal in python. Explain with example.

Q.3.
1. How does data cleaning play a vital role in the analysis?
2. Python or R – Which one would you prefer for text analytics? Justify your answer.
3. Which python library is best for sentiment analysis? Justify your answer.

Note
Upload hand written answer sheet in pdf format
File name must be your roll number and name e.g., SP18-BSE-010 Name
Write your roll number and name at all pages.

You might also like