0% found this document useful (0 votes)
16 views

Lecture 18

The document discusses several key concepts related to big data and analytics: 1) It describes the ACID properties that ensure transactions in a relational database management system are atomic, consistent, isolated, and durable. 2) It then contrasts the ACID properties with the BASE model that guarantees availability, allows soft states, and achieves eventual consistency as used in NoSQL databases. 3) It provides definitions for big data, analytics, NoSQL databases, and examples of popular NoSQL databases like Redis and MongoDB.
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views

Lecture 18

The document discusses several key concepts related to big data and analytics: 1) It describes the ACID properties that ensure transactions in a relational database management system are atomic, consistent, isolated, and durable. 2) It then contrasts the ACID properties with the BASE model that guarantees availability, allows soft states, and achieves eventual consistency as used in NoSQL databases. 3) It provides definitions for big data, analytics, NoSQL databases, and examples of popular NoSQL databases like Redis and MongoDB.
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 11

Big Data and Analytics

Chapter 11
ACID
• When processing transactions, a DBMS must ensure that the transactions
follow four well-accepted properties, called the ACID properties:
• Atomicity — An “all or nothing” approach. If any statement in the
transaction fails, the entire transaction is rolled back.
• Consistency — The transaction must meet all protocols defined by the
system. No half completed transactions.
• Isolation — No transaction has access to any other transaction that is in
an intermediate or unfinished state. Each transaction is independent.
• Durability — Ensures that once a transaction commits to the database, it
is preserved through the use of backups and transaction logs.
BASE Properties
• The ‌NoSQL relies upon a softer model known as the BASE
model. BASE (Basically Available, Soft state, Eventual
consistency).
• Basically Available: Guarantees the availability of the data .
There will be a response to any request (can be failure too).
• Soft state: The state of the system could change over time.
• Eventual consistency: The system will eventually become
consistent once it stops receiving input.
Big Data
• Data that exist in very large volumes and many different varieties
(data types) and that need to be processed at a very high velocity
(speed)
Analytics
• Systematic analysis and interpretation of data—typically using
mathematical, statistical, and computational tools—to improve our
understanding of a real-world domain.
NoSQL
• A category of recently introduced data storage and retrieval
technologies that are not based on the relational model.
Classification of NoSQl Database
Management Systems
NoSQL Examples
• REDIS REDIS is the most popular key-value store NoSQL database management system. As most of the
others discussed in this section, Redis is an open source product and, according to db-engines.com, by far
the most widely used key-value store. Its keys can include various complex data structures (including
strings, hashes, lists, sets, and sorted sets) in addition to simple numeric values. In addition, REDIS makes
it possible to perform various atomic operations on the key types, extending the generic key-value store
feature set previously discussed. Many highly popular Web properties use Redis, the reputation of which
is largely based on its high performance, enabled by its support for in-memory operations.

• MONGODB The clear leader in the document store category (and also the most popular NoSQL
database management system in general) is MONGODB, also an open source product. MongoDB
offers a broader range of capabilities than Redis and is not as strongly focused solely on performance.
MONGODB offers versatile indexing, high availability through automated replication, a query
mechanism, its own file system for storing large objects, and automatic sharding for distributing the
processing load between multiple servers. It does not, however, support joins or transactions. Instead
of JSON, MongoDB uses BSON as its storage format. BSON is a binary JSON-like structure that is
designed to be easy and quick to traverse and fast to encode and decode.
Online analytical processing (OLAP)
• The use of a set of graphical tools that provides users with
multidimensional views of their data and allows them to analyze the
data using simple windowing techniques.

You might also like