Chapter 7 discusses the concept of big data, highlighting its three V's: Volume, Velocity, and Variety. It explores characteristics, challenges, advantages, and types of big data, providing examples from various industries such as healthcare, finance, and agriculture. Additionally, it introduces Hadoop as a key technology for processing large datasets.
Chapter 7 Big Data
7.1 Introduction

1. What is big data?
a) A collection of large data sets
b) A collection of small data sets
c) A collection of unstructured data sets
d) A collection of structured data sets
Answer: a

2. What is the volume of data that qualifies as big data?
a) 1 GB
b) 10 GB
c) 100 GB
d) 1 TB
Answer: d

3. What is the variety of data types that make up big data?
a) Only structured data
b) Only unstructured data
c) Structured and unstructured data
d) Semi-structured data only
Answer: c

4. What is the velocity of big data?
a) The speed at which data is collected
b) The speed at which data is analyzed
c) The speed at which data is stored
d) The speed at which data is accessed
Answer: a

5. What is the veracity of big data?
a) The trustworthiness of the data
b) The accuracy of the data
c) The completeness of the data
d) The timeliness of the data
Answer: a

6. What is the value proposition of big data?
a) Increased efficiency and reduced costs
b) Increased security and privacy
c) Increased data redundancy and backup
d) Increased data storage capacity
Answer: a

7. What are the three V's of big data?
a) Volume, velocity, and value
b) Variety, veracity, and value
c) Velocity, veracity, and variety
d) Volume, velocity, and variety
Answer: d

8. Which of the following is an example of structured data?
a) Tweets
b) Emails
c) Images
d) Sales transactions
Answer: d

9. Which of the following is an example of unstructured data?
a) Sensor data
b) Customer reviews
c) Financial transactions
d) Inventory records
Answer: b

10. Which technology is commonly used for storing and processing big data?
a) SQL databases
b) Relational databases
c) NoSQL databases
d) Mainframe computers
Answer: c

7.2 Characteristics of Big Data

1. Which of the following best describes the volume characteristic of Big Data?
a. Big Data refers to a specific amount of data that is larger than traditional data.
b. Big Data refers to data that is too complex for traditional processing methods.
c. Big Data refers to data that is too large for traditional processing methods.
d. Big Data refers to data that is too unstructured for traditional processing methods.
Answer: c. Big Data refers to data that is too large for traditional processing methods.

2. Which of the following best describes the velocity characteristic of Big Data?
a. Big Data refers to data that is too complex for traditional processing methods.
b. Big Data refers to data that is too large for traditional processing methods.
c. Big Data refers to data that is generated at a high speed and in large quantities.
d. Big Data refers to data that is stored in various formats.
Answer: c. Big Data refers to data that is generated at a high speed and in large quantities.

3. Which of the following best describes the variety characteristic of Big Data?
a. Big Data refers to data that is too complex for traditional processing methods.
b. Big Data refers to data that is too large for traditional processing methods.
c. Big Data refers to data that is generated at a high speed and in large quantities.
d. Big Data refers to data that comes in many different forms and formats.
Answer: d. Big Data refers to data that comes in many different forms and formats.

4. Which of the following best describes the veracity characteristic of Big Data?
a. Big Data refers to data that is too complex for traditional processing methods.
b. Big Data refers to data that is too large for traditional processing methods.
c. Big Data refers to data that is generated at a high speed and in large quantities.
d. Big Data refers to data that is uncertain, incomplete, or unreliable.
Answer: d. Big Data refers to data that is uncertain, incomplete, or unreliable.

5. What is the most commonly cited characteristic of big data?
a. Velocity
b. Variety
c. Volume
d. Veracity
Answer: c. Volume

6. Which characteristic refers to the speed at which data is generated and processed?
a. Velocity
b. Variety
c. Volume
d. Veracity
Answer: a. Velocity

7. Which characteristic refers to the different types of data that can be generated and collected?
a. Velocity
b. Variety
c. Volume
d. Veracity
Answer: b. Variety

8. Which characteristic refers to the sheer amount of data that can be generated and collected?
a. Velocity
b. Variety
c. Volume
d. Veracity
Answer: c. Volume

9. Which characteristic refers to the accuracy and reliability of data?
a. Velocity
b. Variety
c. Volume
d. Veracity
Answer: d. Veracity

10. Which characteristic refers to the ability to handle both structured and unstructured data?
a. Velocity
b. Variety
c. Volume
d. Veracity
Answer: b. Variety

11. Which characteristic refers to the ability to analyze data in real-time?
a. Velocity
b. Variety
c. Volume
d. Veracity
Answer: a. Velocity

12. Which characteristic refers to the ability to store and manage data across multiple platforms?
a. Velocity
b. Variety
c. Volume
d. Veracity
Answer: b. Variety

13. Which characteristic refers to the ability to capture data from various sources?
a. Velocity
b. Variety
c. Volume
d. Veracity
Answer: b. Variety

14. Which characteristic refers to the ability to make data-driven decisions?
a. Velocity
b. Variety
c. Volume
d. Veracity
Answer: d. Veracity

7.3 Challenges of Big Data

1. What is the definition of big data?
a) Any data that is larger than 1 TB
b) Any data that is difficult to manage
c) Any data that is too complex for traditional data processing systems
d) Any data that is generated by social media platforms
Answer: c

2. Which of the following is NOT a challenge of big data?
a) Data storage
b) Data processing
c) Data analysis
d) Data security
Answer: d

3. Which of the following is a common challenge when it comes to storing big data?
a) Ensuring data quality
b) Ensuring data security
c) Ensuring data availability
d) Ensuring data privacy
Answer: b

4. What is the biggest challenge when it comes to processing big data?
a) Finding patterns in the data
b) Speed of processing
c) Memory management
d) Data cleaning
Answer: b

5. Which of the following is a challenge when it comes to analyzing big data?
a) Understanding the data
b) Visualizing the data
c) Finding correlations
d) All of the above
Answer: d

6. Which of the following is NOT a method for dealing with big data?
a) Data sampling
b) Data cleansing
c) Data visualization
d) Data compression
Answer: c

7. Which of the following is a challenge when it comes to managing big data?
a) Ensuring data quality
b) Ensuring data privacy
c) Ensuring data availability
d) All of the above
Answer: d

8. What is the biggest challenge when it comes to sharing big data?
a) Ensuring data privacy
b) Ensuring data security
c) Ensuring data quality
d) Ensuring data availability
Answer: a

9. What is the biggest challenge when it comes to using big data for decision-making?
a) Understanding the data
b) Finding patterns in the data
c) Ensuring data quality
d) Ensuring data security
Answer: a

10. Which of the following is a potential ethical challenge when it comes to big data?
a) Bias in the data
b) Data quality
c) Data privacy
d) Data security
Answer: a

7.4 Advantages of Big Data

1. What is one advantage of big data?
a) Reduced storage costs
b) Limited data sources
c) Inefficient data processing
d) Lack of scalability
Answer: a) Reduced storage costs

2. How can big data benefit businesses?
a) It can help them identify new customer segments
b) It can make their products less competitive
c) It can increase inefficiencies in their operations
d) It can hinder innovation
Answer: a) It can help them identify new customer segments

3. What is one advantage of using big data in healthcare?
a) It can improve patient outcomes
b) It can reduce the number of healthcare providers needed
c) It can increase healthcare costs
d) It can decrease accessibility to healthcare
Answer: a) It can improve patient outcomes

4. How can big data help in the field of marketing?
a) It can help marketers understand customer behavior
b) It can make marketing campaigns less targeted
c) It can make marketing campaigns more expensive
d) It can limit the data available for analysis
Answer: a) It can help marketers understand customer behavior

5. How can big data benefit the field of education?
a) It can improve student outcomes
b) It can decrease teacher effectiveness
c) It can increase education costs
d) It can decrease access to education
Answer: a) It can improve student outcomes

6. What is one advantage of using big data in the field of finance?
a) It can help identify fraudulent transactions
b) It can increase financial instability
c) It can decrease financial transparency
d) It can limit financial innovation
Answer: a) It can help identify fraudulent transactions

7. What is one advantage of using big data in the field of transportation?
a) It can improve traffic management
b) It can increase transportation costs
c) It can limit transportation options
d) It can decrease transportation efficiency
Answer: a) It can improve traffic management

8. How can big data benefit the field of agriculture?
a) It can improve crop yields
b) It can decrease food production
c) It can increase the cost of farming
d) It can limit the use of technology in farming
Answer: a) It can improve crop yields

9. What is one advantage of using big data in the field of sports?
a) It can improve athletic performance
b) It can decrease fan engagement
c) It can increase sports injuries
d) It can limit the use of technology in sports
Answer: a) It can improve athletic performance

10. How can big data benefit the field of cybersecurity?
a) It can help identify and prevent cyber attacks
b) It can increase the likelihood of cyber attacks
c) It can decrease the need for cybersecurity professionals
d) It can limit the use of technology in cybersecurity
Answer: a) It can help identify and prevent cyber attacks

7.5 Types of Big Data

1. Which type of big data refers to the vast amounts of structured data generated by organizations, such as customer transactions and financial records?
a) Social media data
b) Machine-generated data
c) Enterprise data
d) Sensor data
Answer: c) Enterprise data

2. Which type of big data refers to the unstructured data generated by individuals on social media platforms?
a) Social media data
b) Machine-generated data
c) Enterprise data
d) Sensor data
Answer: a) Social media data

3. Which type of big data refers to the data generated by IoT devices, such as sensors and cameras?
a) Social media data
b) Machine-generated data
c) Enterprise data
d) Sensor data
Answer: d) Sensor data

4. Which type of big data refers to the data generated by automated systems, such as logs and clickstreams?
a) Social media data
b) Machine-generated data
c) Enterprise data
d) Sensor data
Answer: b) Machine-generated data

5. Which type of big data refers to the data generated by scientific instruments, such as telescopes and microscopes?
a) Social media data
b) Machine-generated data
c) Enterprise data
d) Scientific data
Answer: d) Scientific data

6. Which type of big data refers to the data generated by analyzing other types of big data?
a) Social media data
b) Machine-generated data
c) Enterprise data
d) Derived data
Answer: d) Derived data

7. Which type of big data refers to the data generated by tracking the movement of people or objects, such as GPS data?
a) Social media data
b) Machine-generated data
c) Enterprise data
d) Location data
Answer: d) Location data

8. Which type of big data refers to the data generated by analyzing human behavior, such as browsing history and search queries?
a) Social media data
b) Machine-generated data
c) Enterprise data
d) Behavioral data
Answer: d) Behavioral data

9. Which type of big data refers to the data generated by analyzing text, such as emails and customer feedback?
a) Social media data
b) Machine-generated data
c) Enterprise data
d) Text data
Answer: d) Text data

10. Which type of big data refers to the data generated by multimedia, such as images and videos?
a) Social media data
b) Machine-generated data
c) Enterprise data
d) Multimedia data
Answer: d) Multimedia data

7.6 Examples of Big Data

1. Which of the following is an example of big data?
a) An Excel spreadsheet with 100 rows and 10 columns
b) A database with 1 million rows and 10 columns
c) A Word document with 10 pages
d) A PowerPoint presentation with 20 slides
Answer: b) A database with 1 million rows and 10 columns

2. Which of the following industries is NOT an example of big data usage?
a) Healthcare
b) Finance
c) Retail
d) Agriculture
Answer: d) Agriculture

3. Which of the following is NOT an example of a big data tool?
a) Hadoop
b) Spark
c) Excel
d) Cassandra
Answer: c) Excel

4. Which of the following is an example of big data in healthcare?
a) Patient records in a small clinic
b) Electronic health records for a large hospital system
c) A single medical chart for a patient
d) A prescription record for a single medication
Answer: b) Electronic health records for a large hospital system

5. Which of the following is an example of big data in finance?
a) A personal budget spreadsheet
b) Credit card statements for a single person
c) Stock market data for an entire year
d) Bank account information for a single person
Answer: c) Stock market data for an entire year

6. Which of the following is an example of big data in retail?
a) Sales records for a single store
b) Inventory data for a small boutique
c) Customer transaction data for a large chain store
d) Product descriptions for a single item
Answer: c) Customer transaction data for a large chain store

7. Which of the following is an example of big data in transportation?
a) A list of flight schedules for a single airport
b) A log of a single truck driver's routes
c) GPS data for a fleet of delivery trucks
d) Train schedules for a single route
Answer: c) GPS data for a fleet of delivery trucks

8. Which of the following is an example of big data in education?
a) A teacher's gradebook for a single class
b) A student's transcript
c) Assessment data for an entire school district
d) A syllabus for a single course
Answer: c) Assessment data for an entire school district

7.7 Introduction to Hadoop

1. What is Hadoop?
a) A programming language
b) An operating system
c) A distributed computing platform
d) A database management system
Answer: c) A distributed computing platform

2. What is the primary objective of Hadoop?
a) To process large amounts of data quickly
b) To store data in a structured format
c) To perform complex calculations on data
d) To generate reports and visualizations
Answer: a) To process large amounts of data quickly

3. Which programming language is commonly used for writing Hadoop applications?
a) Java
b) Python
c) C++
d) Ruby
Answer: a) Java

4. What is the main component of Hadoop that manages the storage and processing of data?
a) NameNode
b) DataNode
c) ResourceManager
d) NodeManager
Answer: a) NameNode

5. What is the Hadoop Distributed File System (HDFS)?
a) A database management system
b) A file storage system used by Hadoop
c) A programming language used for Hadoop applications
d) A data processing tool in Hadoop
Answer: b) A file storage system used by Hadoop

6. Which component of Hadoop is responsible for scheduling and managing jobs?
a) ResourceManager
b) NodeManager
c) MapReduce
d) HBase
Answer: a) ResourceManager

7. Which Hadoop component is used for parallel processing of large datasets?
a) MapReduce
b) Pig
c) Hive
d) Spark
Answer: a) MapReduce

8. What is the purpose of Hadoop streaming?
a) To process streaming data in real-time
b) To stream video content using Hadoop
c) To allow non-Java programs to interact with Hadoop
d) To manage Hadoop clusters in real-time
Answer: c) To allow non-Java programs to interact with Hadoop

9. Which Hadoop ecosystem tool is used for real-time data processing?
a) Hive
b) Pig
c) Flume
d) Storm
Answer: d) Storm

10. Which Hadoop component is used for column-oriented storage and processing?
a) HBase
b) Hive
c) Pig
d) Spark
Answer: a) HBase
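
Worked example (MapReduce, Section 7.7): the questions on MapReduce and Hadoop streaming may be easier to remember with a small illustration. The sketch below is not Hadoop code; it is a plain-Python imitation of the map, shuffle, and reduce steps running in one process, using word counting as the task. The function names (map_phase, shuffle, reduce_phase, word_count) and the sample sentences are illustrative assumptions, not part of any Hadoop API.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle step: group all emitted values under their key."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce step: combine the values of one key into a single total."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence (hypothetical helper)."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Illustrative input only; a real job would read files stored in HDFS.
    sample = ["big data needs big tools", "hadoop processes big data"]
    print(word_count(sample))
```

On a real cluster the map and reduce steps run in parallel on many machines and the framework performs the shuffle. With Hadoop streaming, the mapper and reducer would instead be separate programs that read lines from standard input and write key/value lines to standard output, which is how non-Java programs take part in a job.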