0% found this document useful (0 votes)
4 views

Lect 1

The document consists of a series of questions and answers related to Big Data, NoSQL databases, and their characteristics, including types of databases, data models, and processing paradigms. It covers key concepts such as the 4Vs of Big Data, ACID properties, and the differences between various NoSQL database types. The document serves as a quiz or study guide for understanding fundamental aspects of data management and database systems.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views

Lect 1

The document consists of a series of questions and answers related to Big Data, NoSQL databases, and their characteristics, including types of databases, data models, and processing paradigms. It covers key concepts such as the 4Vs of Big Data, ACID properties, and the differences between various NoSQL database types. The document serves as a quiz or study guide for understanding fundamental aspects of data management and database systems.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 20

1.

**Which of the following is NOT a characteristic of Big Data according to Gartner's


definition?**
a) High Volume
b) High Velocity
c) High Veracity
d) High Optimization
2. **Which of the following is NOT a common source of Big Data?**
a) Social Media
b) Mobile Devices
c) Traditional Databases
d) Sensor Technology
3. **Which of the following is one of the 4Vs of Big Data?**
a) Vision
b) Variety
c) Virtualization
d) Validation
4. **The characteristic of Big Data related to data generation speed is:**
a) Volume
b) Variety
c) Velocity
d) Veracity
5. **In Big Data, 'Veracity' refers to:**
a) The speed at which data is processed
b) The complexity of the data
c) The uncertainty of data
d) The structure of data
6. **What does ACID stand for in the context of relational database transactions?**
a) Atomicity, Consistency, Isolation, Durability
b) Availability, Consistency, Integrity, Durability
c) Availability, Capacity, Isolation, Durability
d) Accuracy, Consistency, Integrity, Durability
7. **What is the primary goal of database normalization in relational databases?**
a) Increase the speed of transactions
b) Diminish data redundancy and prevent anomalies
c) Optimize the database for analytics
d) Increase data storage capacity
8. **Which of the following is a feature of a relational database model?**
a) Flat File System
b) Graph Structure
c) Table with Rows and Columns
d) Unstructured Data Storage
9. **Which of the following is an example of a relational database management
system (RDBMS)?**
a) MongoDB
b) Microsoft SQL Server
c) Apache Hadoop
d) Cassandra
10. **What type of data is considered ‘structured’ in the context of Big Data?**
a) XML documents
b) Social media posts
c) Multimedia files
d) Relational database tables
11. **What does OLAP stand for?**
a) Online Logical Application Processing
b) Offline Analytical Processing
c) Online Analytical Processing
d) Offline Logical Access Processing
12. **In the context of database transactions, what does ‘Isolation’ ensure?**
a) That uncommitted changes from one transaction are visible to other
transactions
b) That transactions are run in parallel without interference
c) That partial execution is allowed
d) That transaction logs are retained permanently
13. **Which technology is primarily associated with the need for new data
processing methods for Big Data?**
a) Hadoop and MapReduce
b) SQL and ACID
c) Relational Database Models
d) Normalization and Transactions
14. **Which of the following is a key technology trend leading to the rise of Big
Data?**
a) Decrease in online activity
b) Distributed File Systems
c) Strong dependence on small data
d) Less use of machine learning
15. **What does NoSQL stand for in the context of modern databases?**
a) No Structured Query Language
b) Not Only SQL
c) Not SQL-Compliant
d) Non-Transactional SQL
16. **Which of the following is a typical characteristic of NoSQL databases?**
a) Schema-Free
b) Strict ACID Compliance
c) Small Scale Processing
d) Strong Consistency
17. **Which processing paradigm is typically associated with Big Data analytics?**
a) OLTP
b) OLAP
c) RTAP
d) All of the above
18. **Which of the following is a modern challenge faced by relational databases
that Big Data and NoSQL systems aim to address?**
a) Handling small amounts of structured data
b) Efficiently managing exponential data growth
c) Processing transactions on slow hard drives
d) Maintaining strong consistency across multiple data updates
19. **In which type of processing does the system process data in real-time as it
arrives?**
a) Batch Processing
b) Streaming Data Processing
c) File Processing
d) On-Demand Processing
20. **Which of the following is NOT a feature commonly associated with Big Data
technologies?**
a) Horizontal Scalability
b) Schema-Free Data Models
c) Complex ACID Transactions
d) Distributed File Systems

### Answers:
1. **d)** High Optimization
2. **c)** Traditional Databases
3. **b)** Variety
4. **c)** Velocity
5. **c)** The uncertainty of data
6. **a)** Atomicity, Consistency, Isolation, Durability
7. **b)** Diminish data redundancy and prevent anomalies
8. **c)** Table with Rows and Columns
9. **b)** Microsoft SQL Server
10. **d)** Relational database tables
11. **c)** Online Analytical Processing
12. **b)** That transactions are run in parallel without interference
13. **a)** Hadoop and MapReduce
14. **b)** Distributed File Systems
15. **b)** Not Only SQL
16. **a)** Schema-Free
17. **d)** All of the above
18. **b)** Efficiently managing exponential data growth
19. **b)** Streaming Data Processing
20. **c)** Complex ACID Transactions

1. **Which of the following is the simplest NoSQL database type?**


- a) Document Stores
- b) Key-Value Stores
- c) Wide Column Stores
- d) Graph Databases
2. **What type of data structure is used in Key-Value stores?**
- a) Trees
- b) Tables
- c) Hash Tables
- d) Graphs
3. **In Key-Value stores, what is the database system’s view of the value part?**
- a) Human-readable format
- b) Binary object, black box
- c) Structured JSON
- d) Linked records
4. **Key-Value stores are most suitable for which use case?**
- a) Managing relationships among entities
- b) Session data and user profiles
- c) Complex queries across multiple datasets
- d) Storing graphs
5. **Which of the following is *not* a good use case for Key-Value stores?**
- a) Session data
- b) Shopping carts
- c) Set operations involving multiple key-value pairs
- d) User preferences
6. **Which of the following is an example of a Key-Value store?**
- a) MongoDB
- b) Redis
- c) CouchDB
- d) Neo4j
7. **Which data model do Document Stores use?**
- a) Graphs
- b) Key-Value pairs
- c) Hierarchical tree structures (e.g., JSON, XML)
- d) Tables with rows and columns
8. **What is the main difference between Document Stores and Key-Value Stores?**
- a) Document Stores can examine the value part, Key-Value Stores cannot
- b) Document Stores use relational schemas
- c) Document Stores are not scalable
- d) Document Stores are slower than Key-Value Stores
9. **Which of the following is a suitable use case for Document Stores?**
- a) Shopping carts
- b) Social network graphs
- c) Event logging and CMS
- d) Distributed file systems
10. **When should Document Stores *not* be used?**
- a) When the document structure changes frequently
- b) When storing e-commerce application data
- c) For web analytics
- d) For content management systems
11. **Which of the following is an example of a Document Store?**
- a) Redis
- b) CouchDB
- c) Apache Cassandra
- d) Neo4j
12. **Which data model do Wide Column Stores use?**
- a) Key-Value pairs
- b) Graphs
- c) Tables with rows and columns (Column family)
- d) JSON documents
13. **In a Wide Column Store, what is a ‘row’?**
- a) A set of key-value pairs
- b) A collection of columns
- c) A node in a graph
- d) A unique primary key
14. **Which of the following is a suitable use case for Wide Column Stores?**
- a) ACID transactions
- b) Event logging and content management
- c) Social network analysis
- d) Complex queries with joins and aggregates
15. **Which of the following is *not* a suitable use case for Wide Column Stores?**
- a) When ACID transactions are required
- b) Event logging
- c) Content management systems
- d) Blogs
16. **Which of the following is an example of a Wide Column Store?**
- a) Neo4j
- b) Apache HBase
- c) MongoDB
- d) CouchDB
17. **Which of the following is *not* true about Wide Column Stores?**
- a) They allow the use of column families
- b) They support complex queries involving joins
- c) They allow variable sets of columns in different rows
- d) They are suitable for flat structured data with similar schema
18. **In which type of NoSQL database are nodes and relationships used to
represent entities and their connections?**
- a) Document Stores
- b) Key-Value Stores
- c) Wide Column Stores
- d) Graph Databases
19. **Which of the following is a suitable use case for Graph Databases?**
- a) Session management
- b) Routing and location-based services
- c) Shopping carts
- d) Event logging
20. **Which of the following is an example of a Graph Database?**
- a) Amazon DynamoDB
- b) Neo4j
- c) CouchDB
- d) Apache Cassandra

1. **b) Key-Value Stores**


2. **c) Hash Tables**
3. **b) Binary object, black box**
4. **b) Session data and user profiles**
5. **c) Set operations involving multiple key-value pairs**
6. **b) Redis**
7. **c) Hierarchical tree structures (e.g., JSON, XML)**
8. **a) Document Stores can examine the value part, Key-Value Stores cannot**
9. **c) Event logging and CMS**
10. **a) When the document structure changes frequently**
11. **b) CouchDB**
12. **c) Tables with rows and columns (Column family)**
13. **b) A collection of columns**
14. **b) Event logging and content management**
15. **a) When ACID transactions are required**
16. **b) Apache HBase**
17. **b) They support complex queries involving joins**
18. **d) Graph Databases**
19. **b) Routing and location-based services**
20. **b) Neo4j**
---

1. **Which of the following best defines Big Data?**


A) High volume, high velocity, and high variety data
B) Data that can only be processed by relational databases
C) Data that does not require processing
D) Small structured data sets
3. **What is the data model used in Native XML Databases?**
A) Relational
B) XML documents
C) RDF triples
D) Key-value pairs
4. **Which of the following is NOT a query language for Native XML Databases?**
A) XPath
B) XQuery
C) SQL
D) XSLT
5. **What are the three components of RDF triples?**
A) Subject, Predicate, Object
B) Node, Vertex, Edge
C) Key, Value, Attribute
D) Collection, Document, Key
6. **Which of the following is a representative of RDF Stores?**
A) Sedna
B) Apache Jena
C) BaseX
D) Tamino
7. **Which of the following databases is an example of a multi-model database?**
A) MongoDB
B) MarkLogic
C) MySQL
D) PostgreSQL
8. **Which of the following is a feature of NoSQL databases?**
A) Strong consistency
B) Schemalessness
C) ACID properties
D) Relational structure
9. **In NoSQL databases, what does the term 'aggregate' refer to?**
A) A small piece of data
B) A unit of related data treated as one unit
C) A query language
D) A normalization form
10. **What is the traditional approach to scaling databases?**
A) Sharding
B) Scaling-out
C) Scaling-up
D) Data distribution
11. **What does sharding refer to in NoSQL databases?**
A) Merging multiple databases
B) Distributing data into separate groups
C) Deleting unwanted data
D) Creating indexes for faster retrieval
12. **What is the main advantage of eventual consistency in NoSQL databases?**
A) Data is consistent at all times
B) Higher performance due to relaxed consistency
C) Data is never replicated
D) No trade-offs are needed
13. **Which of the following is NOT a type of NoSQL database?**
A) Document Store
B) Graph Database
C) Relational Database
D) Wide Column Store
14. **What is a key advantage of using NoSQL databases over relational databases?
**
A) Easy to scale horizontally
B) Strict enforcement of schemas
C) Strong support for ACID transactions
D) Simple SQL queries
15. **Which of the following describes the 'Veracity' characteristic of Big Data?**
A) The speed at which data is generated
B) The uncertainty of data
C) The volume of data
D) The variety of data types
16. **What is polyglot persistence?**
A) Using a single data store for all applications
B) Using different data stores for different requirements
C) Storing data in only one format
D) Storing data with strict schema enforcement
17. **What is the key feature of aggregate-ignorant systems?**
A) They treat related data as one unit
B) They do not follow a specific aggregate structure
C) They distribute data horizontally
D) They use BASE properties
18. **Which of the following is a challenge of NoSQL databases?**
A) Easy installation and maintenance
B) Strong support for business intelligence
C) Low number of NoSQL experts available
D) High availability of NoSQL experts
19. **Which feature is common in traditional relational databases but relaxed in
NoSQL databases?**
A) Schemaless design
B) ACID properties
C) Horizontal scaling
D) Open-source software
20. **Which of the following is an advantage of using NoSQL databases?**
A) Vertical scaling
B) Automated processes
C) High cost of servers
D) Strict consistency enforcement

1. **A**
3. **B**
4. **C**
5. **A**
6. **B**
7. **B**
8. **B**
9. **B**
10. **C**
11. **B**
12. **B**
13. **C**
14. **A**
15. **B**
16. **B**
17. **B**
18. **C**
19. **B**
20. **B**

2. What is the goal of NoSQL databases with respect to data?


a) To enforce strong consistency
b) To respect the real-world nature of data
c) To limit data scalability
d) To reduce data redundancy
3. In NoSQL databases, what is an aggregate?
a) A single table of unrelated data
b) A collection of related data treated as a unit
c) A complex query structure
d) A database replication method
4. Which of the following is an example of an aggregate in key-value stores?
a) Row of a column family
b) Document
c) Value part of key-value pairs
d) Graph
5. Aggregate-ignorant systems include:
a) Key-value
b) Document
c) Graph
d) Wide-column
6. Which statement is true regarding aggregate-oriented systems?
a) They are primarily used in relational databases
b) Atomic operations can span across multiple aggregates
c) Only a single aggregate is handled at a time during operations
d) They are not scalable
7. Traditional relational databases typically use which scaling approach?
a) Horizontal scaling
b) Vertical scaling
c) Elastic scaling
d) Sharding
8. Which NoSQL scaling technique involves distributing data across multiple hosts?
a) Scaling-up
b) Scaling-down
c) Scaling-out
d) Aggregation
9. What is the process of splitting database data into separate groups called?
a) Sharding
b) Aggregation
c) Replication
d) Indexing
10. What is a key feature of automated processes in NoSQL databases?
a) They require highly trained administrators
b) They support strong consistency
c) They offer automatic recovery and distribution
d) They require manual tuning for optimization
11. Which of the following properties is associated with relaxed consistency in
NoSQL databases?
a) ACID
b) BASE
c) Strong consistency
d) Transactional consistency
12. What is a trade-off of eventual consistency in NoSQL databases?
a) Stronger data recovery
b) Improved transactional consistency
c) Possible delays in consistency
d) Lower data distribution
13. What is a key feature of schemalessness in NoSQL databases?
a) Strictly enforced database schema
b) Flexible or missing schema
c) High structural overhead during changes
d) Implicit schema at the database level
14. Which of the following is a consequence ( ‫)النتائج المترطبه علي‬of schemalessness in
NoSQL?
a) Uniform data enforcement
b) Increased overhead for structural changes
c) Flexibility to deal with non-uniform data
d) Schema is always defined at the database level
15. NoSQL databases are often available as:
a) Proprietary software only
b) Only community versions
c) Both community and enterprise versions
d) None of the above
16. Which of the following is often true for NoSQL application interfaces?
a) They are stateful
b) They are stateless
c) They require complex programming
d) They enforce strict consistency
17. Which of the following is an advantage of NoSQL databases?
a) Lack of automated maintenance
b) Horizontal data distribution among hosts
c) Inflexible data models
d) High costs for server infrastructure
18. NoSQL databases are most suitable for handling:
a) Low-volume data
b) High-volume data that cannot be handled by RDBMS
c) Complex transactional queries
d) Small-scale, uniform datasets
19. Which of the following is a challenge for NoSQL databases?
a) Highly mature systems with complete feature sets
b) Limited expertise available in the market
c) Full support for business intelligence and analytics
d) High support for ad-hoc querying
20. Which of the following is NOT a current challenge of NoSQL databases?
a) Installation and maintenance complexity
b) Lack of support for analytics
c) Horizontal scaling difficulties
d) Limited number of NoSQL experts available

2. b) To respect the real-world nature of data


3. b) A collection of related data treated as a unit
4. c) Value part of key-value pairs
5. c) Graph
6. c) Only a single aggregate is handled at a time during operations
7. b) Vertical scaling
8. c) Scaling-out
9. a) Sharding
10. c) They offer automatic recovery and distribution
11. b) BASE
12. c) Possible delays in consistency
13. b) Flexible or missing schema
14. c) Flexibility to deal with non-uniform data
15. c) Both community and enterprise versions
16. b) They are stateless
17. b) Horizontal data distribution among hosts
18. b) High-volume data that cannot be handled by RDBMS
19. b) Limited expertise available in the market
20. c) Horizontal scaling difficulties

1. Big Data is typically characterized by the "3Vs": Volume, Velocity, and Veracity.
2. NoSQL databases were initially developed as an alternative to traditional
relational databases for handling large-scale data needs.
3. Key-value stores are optimal for applications that require complex querying and
relationships between data entries.
4. Relational databases follow the ACID model, ensuring consistency and durability
of transactions.
5. NoSQL is an abbreviation for "No SQL allowed" in database management.
6. In NoSQL databases, "aggregate structure" refers to treating multiple related data
pieces as a single unit for manipulation and consistency.
7. MongoDB is an example of a key-value store in NoSQL databases.
8. Graph databases are ideal for applications involving social networks and location-
based services.
9. Document stores organize data in a format similar to JSON or XML.
10. Relational databases are especially well-suited for unpredictable exponential
data growth and high-speed data processing.
11. Hadoop is an example of a NoSQL database system.
12. RDF (Resource Description Framework) is a data model used in graph
databases.
13. In Big Data applications, "Veracity" refers to the speed of data generation and
processing.
14. The "ACID" properties of transactions stand for Atomicity, Consistency, Isolation,
and Durability.
15. Key-value stores are generally suited for scenarios where data needs to be
accessed by keys only, without complex querying.
16. Wide column stores are often referred to as column family or column-oriented
databases.
17. A major goal of NoSQL databases is to maintain strong consistency across
distributed data.
18. Neo4j is a popular example of a graph database.
19. The term "Big Data" has a standard definition accepted by all data researchers
and practitioners.
20. OLAP (Online Analytical Processing) is often associated with real-time data
processing in database applications.

1. **False** – Big Data is characterized by the "3Vs": Volume, Velocity, and


**Variety** (not Veracity).
2. **True** – NoSQL databases were created to handle large-scale data needs
beyond traditional relational databases.
3. **False** – Key-value stores are not ideal for complex querying; they’re best for
simple key-based access.
4. **True** – Relational databases follow the ACID model for transaction consistency
and durability.
5. **False** – "NoSQL" stands for "Not Only SQL," not "No SQL allowed."
6. **True** – In NoSQL, aggregate structure often refers to grouping related data for
consistency.
7. **False** – MongoDB is a document store, not a key-value store.
8. **True** – Graph databases are well-suited for social networks and location-based
applications.
9. **True** – Document stores format data similar to JSON or XML.
10. **False** – Relational databases are generally less suited for exponential growth
and high-speed processing compared to NoSQL options.
11. **False** – Hadoop is a Big Data processing framework, not a NoSQL database.
12. **True** – RDF is a data model often used in graph databases.
13. **False** – Veracity refers to data quality and trustworthiness, not speed.
14. **True** – ACID stands for Atomicity, Consistency, Isolation, and Durability.
15. **True** – Key-value stores are designed for key-based access without complex
querying.
16. **True** – Wide column stores are sometimes called column family or column-
oriented databases.
17. **False** – NoSQL databases often prioritize availability and partition tolerance
over strong consistency.
18. **True** – Neo4j is a popular example of a graph database.
19. **False** – Big Data lacks a universally agreed-upon definition.
20. **False** – OLAP is for analytical processing, not real-time data processing,
which is often OLTP (Online Transaction Processing).

Here is a revised version of the questions without answers:

5. Which of the following is a common database management system?


- a) MongoDB
- b) Oracle
- c) SQLite
- d) All of the above
7. Which of the following is a use case for document stores?
- a) User profiles
- b) Shopping carts
- c) Event logging
- d) None of the above
9. What is SPARQL used for?
- a) Querying SQL databases
- b) Querying RDF data
- c) Organizing key-value pairs
- d) None of the above
10. Which of the following constitutes a property graph?
- a) Tables and rows
- b) Triple format
- c) Nodes and relationships
- d) Key-value pairs
15. Which NoSQL type is typically used for complex relationships?
- a) Key-value stores
- b) Graph databases
- c) Document stores
- d) Wide column stores
19. What does the shift in data processing from batch to real-time streaming
signify?
- a) Change in data volume
- b) Evolution of data velocity
- c) Increase in data variety
- d) None of the above

5. **d) All of the above**


7. **c) Event logging**
9. **b) Querying RDF data**
10. **c) Nodes and relationships**
15. **b) Graph databases**
19. **b) Evolution of data velocity**

You might also like