0% found this document useful (0 votes)
5 views

Data Science Revision Sheet Question Bank (1)

The document consists of revision questions and exercises related to Data Science, covering topics such as data footprints, data recovery plans, and the DIKW model. It includes one-mark, two-mark, three-mark, and five-mark questions to assess understanding of concepts like data types, algorithms, and the impact of data in daily life. Additionally, it addresses practical applications of data analysis and the importance of protecting personal data.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views

Data Science Revision Sheet Question Bank (1)

The document consists of revision questions and exercises related to Data Science, covering topics such as data footprints, data recovery plans, and the DIKW model. It includes one-mark, two-mark, three-mark, and five-mark questions to assess understanding of concepts like data types, algorithms, and the impact of data in daily life. Additionally, it addresses practical applications of data analysis and the importance of protecting personal data.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Data Science

Chapter 1 Revision Sheet 1


One Mark Questions
1) Data and information are the same
a) Yes
b) No
2) Social media platforms are responsible for creating data footprints
a) Yes
b) No
3) There is no risk of losing data
a) Yes
b) No
4) Websites and mobile apps use our search history to provide personalized offers
a) Yes
b) No
5) Which of the following is not in DIKW Model?
a) Data
b) Information
c) Security
d) Knowledge
6) Should you keep a data recovery plan?
a) Yes
b) No
7) What is your data footprint?
a) The data trail left by you when you surf the internet
b) The time you spend on your computer
c) The number of electronics you buy in a year
d) The number of apps you have on your Mobile
8) How long is your data footprint visible?
a) It depends on the websites you visit
b) The data footprint wipes clean after a year
c) It creates a permanent record
d) The record expires after a month
9) Who can use or see data from your data footprint?
a) It is visible to professionals, but they need special access to go through the data
b) No one can access data from your digital footprint
c) Only the police have access to the information on your data footprint
d) Your data footprint is potentially visible to anyone
10) You regret posting a particular picture and want to take it down. Is it possible, and how would you do that?
a) It is a little tricky but can be done by asking a professional to do it. Then no one can see the photo.
b) You can delete the picture by clicking on the delete button. Then no one can see the photo anymore
c) Only the police can delete a picture uploaded by you
d) A photo can be deleted from your account, but someone might have already saved it or copied it.
11) How can you improve your data footprint?
a) It is best not to post anything if you want to stay safe
b) It is not necessary to improve your data footprint.
c) Check your social media accounts' privacy settings to make sure you share your posts with people you trust and
know.
d) Share your personal details with a good friend or family member so they can help you stay safe online
Two Marks Questions
1. What is Data & Information?
2. What is Data Science?
3. What are data footprints?
4. Discuss any two methods to protect your digital footprint.
OR How
to Protect your digital footprint?
5. Why should you keep a data recovery plan?

Three Marks Questions


1. Explain the difference between data and information?
2. Give three examples of how data impacts our daily lives.
3. What are the different types of data loss?
4. What are the Digital footprint examples?
5. What is personal data, and how can you keep your data safe online?
OR
Explain personal data? What precautions should be taken to protect our data?
6. Explain the DIKW model. Explain the model with some suitable example.
7. How do online streaming platforms use data?

Five Marks Questions


1. Give few examples of how data impacts our daily lives.
2. What are data footprints? What are the different types of data footprints?
3. How to protect your privacy online as business and personal use converge.
4. What is Data Loss? Explain the different types of data loss?

Data Science
Chapter 2 Standard Questions
One Mark Questions
1. A school named ABC has recorded the total marks of every student in the
class. This an example of: a. Qualitative data
b. Quantitative data
c. Both qualitative and quantitative data
d. None of the above
2. A food delivery app has asked for your feedback on the quality of the food. You have written two paragraphs
to describe the food. This is an example of: a. Qualitative data
b. Quantitative data
c. Both qualitative and quantitative data
d. None of the above
3. You need to predict what the temperature will be for next Friday. Which algorithm will you use?
a) Clustering
b) Regression
c) Anomaly detection
d) Binary classification
4. You need to predict if your car tyre will last for the next 1000 km. Which algorithm will you use?
a) Clustering
b) Regression
c) Anomaly detection
d) Binary classification
5. Which of the following are the benefits of Big data processing?
a) Business can utilize outside intelligence while making decisions
b) Improved customer service
c) Better optimal efficiency
d) All of the above
6. The analysis of large amounts of data to see what patterns or other
useful information can be found is known as a) Data Analysis
b) Information Analytics
c) Big data Analytics
d) Data Analytics
7. Big data analysis does the following except
a) Collects data
b) Spreads data
c) Organizes data
d) Analyzes data
8. Primary data for the research process be collected through
a) Experiment
b) Survey
c) Both a and b
d) None of the above
9. The advantage of secondary data are low cost, speed, availability, and flexibility
a) True
b) False
10. The method of getting primary data by watch people is called
a) Survey
b) Informative
c) Observational
d) Experimental
11. Secondary data are ____________to primary data.
a) Survey
b) opposite
c) Observe
d) same

Two Marks Questions


1. List any four sources of data collection with examples?
2. What are categorical variables? Give some examples.
OR What
are the Examples of categorical data?
3. How is Big Data used in social media?
4. Based on the type of data, explain five data questions.
5. Explain the primary sources Data.
6. What is the multivariate and univariate data? Give any one example of each.
7. Based on the type of data, explain any two algorithms for handling the data.
8. Explain
(i) Reinforcement Learning
(ii) Clustering
9. Three coins were tossed 30 times simultaneously. Each time the number of heads occurring was noted
down as follows:
Prepare a table for the data given above.

Three Marks Questions


1. What is the difference between multivariate and univariate data? Give some examples.
2. What are the primary characteristics of Big Data?
3. What are categorical variables? Give any three examples.
4. Based on the type of data, explain any three data questions with examples.
5. Make a list of all the data which are needed to be kept safe so that you do not lose it.
6. What is big data? How data science helps marketer to use big data in social media?
7. Explain Three V’s of Big Data with suitable examples?
8. Explain
(i) Reinforcement Learning
(ii) Clustering
(iii) Binary Classification
9. The blood groups of 30 students of Class VIII are recorded as follows:
A, B, O, O, AB, O, A, O, B, A, O, B, A, O, O, A, AB, O, A, A, O, O, AB, B, A, O, B, A, B, O.
Represent this data in the form of a table. Which is the most common, and which is the rarest, blood
group among these students?
10. The distance (in km) of 40 engineers from their residence to their place of work were found as follows:

Construct a grouped table with class size 5 for the data given above taking the first interval as 0 – 5 (5 not
included). What main features do you observe from this tabular representation?
11. The mean monthly relative humidity over the year in New Delhi (Delhi NCR), India is given in the below
graph:
Find
(i) On average, _____________ Month is with __________% the most humid.
(ii) On average, _____________ Month is with __________ % the least humid month.
(iii) The average annual percentage of humidity is: __________%.

Five Marks Questions


1. Give a few examples of how you can lose data and why it is essential to have a data recovery plan.
2. What are categorical variables? Explain with examples.
3. Given below are the seats won by different political parties in the polling outcome of a state assembly
elections:
(i) Draw a bar graph to represent the polling results.
(ii) Which political party won the maximum number of seats?
4. The value of π up to 50 decimal places is given below:
3.141592653589793238476264338327950288419716939937510
(i) Make a frequency distribution of the digits from 0 to 9 after the decimal point. (ii) What are
the most and the least frequently occurring digits?
The relative humidity (in %) of a certain city for a month of 30 days was as follows:

(i) Construct a grouped frequency distribution table with classes 84 – 86, 86 – 88, etc.
(ii) Which month or season do you think this data is about?
(iii) What is the range of this data?

6. The heights of 50 students, measured to the nearest centimetres, have been found to be as follows:

Represent the data given above by a grouped frequency distribution table, taking the class intervals as a.
160 – 165, 165 – 170, etc.
(ii) What can you conclude about their heights from the table?

7. Thirty children were asked about the number of hours they watched TV programmes in the previous
week. The results were found as follows:

(i) Make a grouped frequency distribution table for this data, taking class width 5 and one of the class
intervals as 5 – 10.
(ii) How many children watched television for 15 or more hours a week.

8. The heights of 50 students, measured to the nearest centimetres, have been found to be as follows:

(i) Represent the data given above by a grouped frequency distribution table, taking the class intervals
as 160 – 165, 165 – 170, etc.
(ii) What can you conclude about their heights from the table?
9. A study was conducted to find out the concentration of sulphur dioxide in the air in parts per million (ppm)
of a certain city. The data obtained for 30 days is as follows:

(i) Make a grouped frequency distribution table for this data with class intervals as 0.00 – 0.04, 0.04 –
0.08 and so on.
(ii) For how many days, was the concentration of sulphur dioxide more than 0.11 parts per million?
10. The following data on the number of girls (to the nearest ten) per thousand boys in different sections of
Indian society is given below.

(i) Represent the above information by a bar graph.


(ii) discuss what conclusions can be arrived at from the graph

11. A survey conducted by an organisation for the cause of illness and death among the women between the
ages 15 – 44 (in years) worldwide, found the following figures (in %):

(i) Represent the information given above graphically.


(ii) Which condition is the major cause of women’s ill health and death worldwide?
(iii) Try to find out, with the help of your teacher, any two factors which play a major role in the cause in
(ii) above being the major cause.

You might also like