Bigtable - A Distributed Storage System For Structured Data
Bigtable is a distributed storage system developed by Google to store structured data at scale. It is used by more than
sixty Google products, such as Personalized Search, Google Analytics, and Orkut, for their storage needs.
Although Bigtable serves as the storage system for many products, its interface differs from that of a typical database in
the sense that it does not provide a full relational data model.
Various components of Bigtable:
Data Model & API : Bigtable's data model resembles a multidimensional sorted map indexed by three attributes:
a) Row keys in Bigtable are arbitrary strings, and every read or write under a single row key is atomic, regardless of the
number of columns involved, which makes it easier for clients to reason about concurrent updates to the same row. Data is
maintained in lexicographic order by row key, and each table is dynamically partitioned into contiguous row ranges called tablets.
b) Column keys are grouped into sets called column families, which are the basic unit of access control. A column family
must be created before data can be stored under any column key in that family. Column keys are named using the syntax
family:qualifier
c) Timestamp: Each cell in Bigtable can hold multiple versions of the same data, indexed by timestamp and stored in
decreasing timestamp order, so that the most recent version is read first.
Putting the three indexes together, the Bigtable map has the signature (row:string, column:string, time:int64) → string.
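To make the shape of this map concrete, here is a small, purely illustrative Python sketch of the (row, column, timestamp) → value mapping. The class and method names are hypothetical (Bigtable's real client API is in C++), and the row and column values are loosely modeled on the paper's webtable example.

```python
# Toy in-memory model of the (row:string, column:string, time:int64) -> string map.
from collections import defaultdict

class ToyBigtableMap:
    def __init__(self):
        # (row, "family:qualifier") -> list of (timestamp, value), newest first
        self._cells = defaultdict(list)

    def write(self, row: str, column: str, timestamp: int, value: str) -> None:
        versions = self._cells[(row, column)]
        versions.append((timestamp, value))
        versions.sort(reverse=True)  # keep versions in decreasing timestamp order

    def read_latest(self, row: str, column: str):
        versions = self._cells.get((row, column))
        return versions[0][1] if versions else None

    def scan_rows(self, start_row: str, end_row: str):
        # Rows are kept in lexicographic order; a contiguous row range like
        # this one is what a tablet holds.
        for (row, column), versions in sorted(self._cells.items()):
            if start_row <= row < end_row:
                yield row, column, versions[0]

t = ToyBigtableMap()
t.write("com.cnn.www", "anchor:cnnsi.com", 9, "CNN")
t.write("com.cnn.www", "contents:", 5, "<html>...")
print(t.read_latest("com.cnn.www", "anchor:cnnsi.com"))  # -> CNN
```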
Bigtable provides an API for creating and deleting tables and column families, as well as for changing cluster, table, and
column-family metadata. The API also includes abstractions for reading and writing data. Additionally, the paper notes that
wrappers are built on top of Bigtable so that it can serve as both an input source and an output target for MapReduce,
enabling large-scale computations.
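Since the paper's API examples are written in C++, the following Python sketch only gestures at the shape of a single-row mutation abstraction; RowMutation and apply_mutation here are illustrative stand-ins, not Bigtable's actual API, and the example ties back to the per-row atomicity described above.

```python
class RowMutation:
    """Collects several changes to one row so they can be applied together."""
    def __init__(self, row: str):
        self.row = row
        self.ops = []

    def set(self, column: str, value: str) -> None:
        self.ops.append(("set", column, value))

    def delete(self, column: str) -> None:
        self.ops.append(("delete", column, None))

def apply_mutation(table: dict, mutation: RowMutation) -> None:
    # All operations in one RowMutation take effect together for that row,
    # mirroring the per-row atomicity guarantee described earlier.
    row_data = table.setdefault(mutation.row, {})
    for op, column, value in mutation.ops:
        if op == "set":
            row_data[column] = value
        else:
            row_data.pop(column, None)

table = {}
m = RowMutation("com.cnn.www")
m.set("anchor:www.c-span.org", "CNN")
m.delete("anchor:www.abc.com")
apply_mutation(table, m)
```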
Technologies that power Bigtable: Bigtable leverages several core Google technologies, including GFS for log and data
storage, and Google's cluster management system for resource management and monitoring. It uses the SSTable file
format for storage and relies on Chubby, a distributed locking service based on Paxos, for leader election, tablet server
discovery, and failure detection. Chubby is critical to Bigtable's availability: if Chubby becomes unavailable for an
extended period, Bigtable becomes unavailable as well.
Underlying implementation :
Bigtable has three major components: a client library that is linked into every client, a single master server, and many
tablet servers.
Tablet servers can be added or removed dynamically to accommodate changes in workload. The master assigns tablets
to tablet servers, detects the addition and expiration of tablet servers, balances tablet-server load, garbage-collects files
in GFS, and handles schema changes such as table and column family creation. Each tablet server manages a set of
tablets, handles read and write requests to the tablets it has loaded, and splits tablets that have grown too large. Clients
communicate directly with tablet servers for reads and writes and do not rely on the master for tablet location information,
which keeps the master lightly loaded and highly available. A Bigtable cluster stores a number of tables; each table starts
as a single tablet and splits into more tablets as the data grows.
Tablet Location: Bigtable uses a three-level hierarchy to store tablet location information. The three levels are:
1) The first level is a file stored in Chubby that contains the location of the root tablet.
2) The root tablet contains the locations of all tablets in a special METADATA table. The root tablet is the first tablet in the
METADATA table and is treated specially: it is never split, regardless of size, which guarantees that the tablet location
hierarchy never has more than three levels.
3) Each METADATA tablet contains the locations of a set of user tablets.
The client library caches tablet locations. If a client does not know a tablet's location, or discovers that its cached location
is incorrect, it moves back up the hierarchy one level at a time until it finds the correct location, so it rarely needs to re-read
all three levels. Stale cache entries are discovered only upon a miss, i.e., when a request sent to the cached location fails
to find the tablet.
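The Python sketch below walks through this lookup under heavy simplifications: plain dictionaries stand in for the Chubby file, the root tablet, and the METADATA tablets, and all server and tablet names are made up for illustration.

```python
CHUBBY_FILE = "root-tablet @ tabletserver-A"            # level 1: Chubby file
ROOT_TABLET = {"metadata-tablet-1": "tabletserver-B"}   # level 2: root tablet
METADATA_TABLETS = {                                    # level 3: METADATA tablets
    "metadata-tablet-1": {"usertable/rows[a-m)": "tabletserver-C"},
}

class TabletLocator:
    def __init__(self):
        self._cache = {}  # user tablet -> tablet server location

    def locate(self, user_tablet: str) -> str:
        # Fast path: a cached location. A stale entry is noticed only when a
        # request to that server misses; the client then invalidates it and
        # walks back up the hierarchy.
        if user_tablet in self._cache:
            return self._cache[user_tablet]
        root_server = CHUBBY_FILE.split(" @ ")[1]   # level 1: read the Chubby file
        for meta_tablet in ROOT_TABLET:             # level 2: root tablet (on root_server)
            server = METADATA_TABLETS[meta_tablet].get(user_tablet)  # level 3
            if server is not None:
                self._cache[user_tablet] = server
                return server
        raise KeyError(user_tablet)

    def invalidate(self, user_tablet: str) -> None:
        # Called after a request to a cached location misses.
        self._cache.pop(user_tablet, None)

locator = TabletLocator()
print(locator.locate("usertable/rows[a-m)"))   # -> tabletserver-C
```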
Tablet Assignment : The master keeps track of the set of live tablet servers, of which tablets are assigned to which
servers, and of which tablets are currently unassigned. Chubby is used to track tablet servers: when a tablet server starts,
it creates and acquires an exclusive lock on a uniquely named file in a specific Chubby directory. The master monitors this
directory to discover newly added tablet servers. A tablet server can serve its assigned tablets only as long as it holds its
lock in that Chubby directory.
The master periodically asks each tablet server for the status of its lock. If a tablet server reports that it has lost its lock, or
if the master cannot reach the server after several attempts, the master tries to acquire an exclusive lock on the server's
Chubby file. If it succeeds, Chubby is alive and the problem lies with the tablet server, so the master deletes the server's
file to ensure the server can never serve its tablets again if it comes back online. Once the file is deleted, the master
moves the tablets previously assigned to that server into the set of unassigned tablets.
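A rough sketch of this failure-detection step follows, using a tiny in-memory stand-in for Chubby; the lock operations shown are assumptions made for illustration and do not mirror the real Chubby client API.

```python
class FakeChubby:
    """In-memory stand-in for Chubby's lock files (illustrative only)."""
    def __init__(self):
        self.locks = {}   # file path -> current lock holder

    def try_acquire(self, path: str, owner: str) -> bool:
        if path in self.locks and self.locks[path] != owner:
            return False      # someone else still holds the lock
        self.locks[path] = owner
        return True

    def delete(self, path: str) -> None:
        self.locks.pop(path, None)

def handle_unresponsive_server(chubby, server_file, assignments, unassigned):
    """Called when a tablet server reports losing its lock or stops responding."""
    # If the master cannot take the lock, Chubby itself may be the problem,
    # so nothing can safely be concluded about the tablet server.
    if not chubby.try_acquire(server_file, owner="master"):
        return
    # Chubby is reachable and the lock was free, so the tablet server is at
    # fault. Delete its file so it can never reacquire the lock and serve
    # stale data, then mark its tablets as unassigned.
    chubby.delete(server_file)
    unassigned.extend(assignments.pop(server_file, []))

chubby = FakeChubby()
assignments = {"servers/ts-A": ["tablet-1", "tablet-2"]}
unassigned = []
chubby.locks["servers/ts-A"] = "ts-A"                    # server still holds its lock
handle_unresponsive_server(chubby, "servers/ts-A", assignments, unassigned)
print(unassigned)                                        # -> []
del chubby.locks["servers/ts-A"]                         # server's Chubby session expires
handle_unresponsive_server(chubby, "servers/ts-A", assignments, unassigned)
print(unassigned)                                        # -> ['tablet-1', 'tablet-2']
```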
If the master loses its Chubby session, it kills itself so a new master can take over. When a new master starts, it first
acquires the unique master lock in Chubby, which prevents multiple masters from running simultaneously. It then scans
the servers directory in Chubby to find live tablet servers and contacts each one to learn which tablets it already serves.
Finally, it scans the METADATA table to learn the full set of tablets; any tablet not already assigned to a server is added
to the set of unassigned tablets, making it eligible for future assignment.
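The condensed sketch below illustrates this startup scan, again with plain in-memory stand-ins for the Chubby servers directory, the tablet servers, and the METADATA table; all names are invented for the example.

```python
def bootstrap_master(chubby_servers_dir, ask_server_for_tablets, metadata_tablets):
    """Returns (assignments, unassigned) after a new master takes over."""
    assignments = {}
    assigned = set()
    # 1) Discover live tablet servers and ask each which tablets it serves.
    for server in chubby_servers_dir:
        tablets = ask_server_for_tablets(server)
        assignments[server] = tablets
        assigned.update(tablets)
    # 2) Scan the METADATA table for the full set of tablets; anything not
    #    already assigned becomes eligible for assignment.
    unassigned = [t for t in metadata_tablets if t not in assigned]
    return assignments, unassigned

# Example with toy data:
servers = ["tabletserver-A", "tabletserver-B"]
live_tablets = {"tabletserver-A": ["t1", "t2"], "tabletserver-B": ["t3"]}
assignments, unassigned = bootstrap_master(
    servers, lambda s: live_tablets[s], metadata_tablets=["t1", "t2", "t3", "t4"])
print(unassigned)   # -> ['t4']
```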
The set of tablets changes only when a tablet is created, deleted, merged, or split. The master initiates all of these except
splits, so it can keep track of the set of tablets. Splits are initiated by a tablet server, which notifies the master once the
split has been committed. If that notification is lost (because either the master or the tablet server failed), the master still
learns of the new tablet lazily: when it later asks a tablet server to load the now-split tablet, the server notices that the
METADATA entry covers only part of it and reports the split to the master.
Tablet Serving : The persistent state of a tablet is stored in GFS: updates are committed to a commit log that stores redo
records, following a typical LSM-tree architecture. Recent updates are kept in a sorted in-memory buffer called a
memtable; once the memtable reaches a threshold size, it is frozen and written to GFS as an immutable file known as an
SSTable. To recover a tablet, a tablet server reads its metadata from the METADATA table, which lists the SSTables that
make up the tablet along with a set of redo points into the commit log. The server reads the SSTable indices into memory
and reconstructs the memtable by replaying the updates committed since the redo points.
To process a write, the tablet server first checks that the request is well-formed and that the sender is authorized, then
appends a redo record to the commit log (using group commit to improve throughput for many small mutations), and finally
inserts the mutation into the memtable. Reads go through the same well-formedness and authorization checks; a valid
read is then executed over a merged view of the memtable and the SSTables. Because both are lexicographically sorted,
this merged view can be formed efficiently.
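A compact, toy-sized sketch of this write and read path follows; the commit log, memtable, and SSTable stand-ins are plain Python structures, and the flush threshold and names are arbitrary choices for illustration.

```python
class ToyTablet:
    def __init__(self, memtable_limit=4):
        self.commit_log = []      # redo records (stand-in for the GFS commit log)
        self.memtable = {}        # recent writes (a real memtable is kept sorted)
        self.sstables = []        # immutable, sorted SSTable stand-ins, newest first
        self.memtable_limit = memtable_limit

    def write(self, key, value):
        self.commit_log.append((key, value))   # 1) append a redo record
        self.memtable[key] = value             # 2) insert into the memtable
        if len(self.memtable) >= self.memtable_limit:
            self._minor_compaction()

    def _minor_compaction(self):
        # Freeze the memtable into an immutable, sorted SSTable-like object.
        frozen = dict(sorted(self.memtable.items()))
        self.sstables.insert(0, frozen)
        self.memtable = {}

    def read(self, key):
        # Reads see a merged view: the memtable first, then SSTables newest-first.
        if key in self.memtable:
            return self.memtable[key]
        for sstable in self.sstables:
            if key in sstable:
                return sstable[key]
        return None
```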
Compactions: Every memtable flush writes out a new SSTable, so over time a tablet accumulates many of them. This
creates two problems: a read may have to consult many SSTables, increasing latency, and recovering a tablet becomes
slower because there are more files to process. To keep the number of SSTables bounded, a merging compaction
periodically reads the contents of a few SSTables and the memtable and writes them out as a single new SSTable; the
input SSTables can be discarded as soon as the compaction finishes, keeping both reads and recovery fast.
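Continuing the toy sketch above, a merging compaction might look like the following, where the newest value for each key wins and a single sorted output replaces the inputs.

```python
def merging_compaction(sstables_newest_first):
    merged = {}
    # Iterate oldest to newest so newer values overwrite older ones.
    for sstable in reversed(sstables_newest_first):
        merged.update(sstable)
    return dict(sorted(merged.items()))   # one new sorted SSTable

old = [{"b": "2", "c": "3"}, {"a": "1", "b": "old"}]   # newest first
print(merging_compaction(old))   # -> {'a': '1', 'b': '2', 'c': '3'}
```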
Improvements in the original design
1) Locality groups: Clients can group column families that are typically accessed together into a locality group. A separate
SSTable is generated for each locality group in each tablet, so segregating column families that are not usually read
together results in more efficient reads. In addition, a locality group can be declared in-memory; its SSTables are then
lazily loaded into the tablet server's memory, after which reads in that group avoid disk access entirely. This is ideal for
small pieces of data that are read frequently.
2) Compression: Clients control whether the SSTables for a locality group are compressed and, if so, which compression
format is used. Skipping compression for a group can reduce read latency, since its SSTable blocks do not need to be
decompressed when they are read into memory.
3) Caching for read performance: Tablet servers use two levels of caching. The higher-level Scan Cache stores the
key-value pairs returned by the SSTable interface and helps applications that read the same data repeatedly. The
lower-level Block Cache stores SSTable blocks read from GFS and helps applications that read data close to data they
recently read, such as sequential scans over a range of keys.
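A minimal sketch of such a two-level cache follows; the LRU policy, capacities, and function names are assumptions made purely for illustration.

```python
from collections import OrderedDict

class LRUCache:
    def __init__(self, capacity):
        self.capacity = capacity
        self.items = OrderedDict()

    def get(self, key):
        if key not in self.items:
            return None
        self.items.move_to_end(key)
        return self.items[key]

    def put(self, key, value):
        self.items[key] = value
        self.items.move_to_end(key)
        if len(self.items) > self.capacity:
            self.items.popitem(last=False)

scan_cache = LRUCache(capacity=1024)    # higher level: key -> value pairs
block_cache = LRUCache(capacity=256)    # lower level: SSTable blocks

def read(key, load_block, block_for_key):
    value = scan_cache.get(key)
    if value is not None:
        return value                     # repeated reads of the same key
    block_id = block_for_key(key)
    block = block_cache.get(block_id)
    if block is None:
        block = load_block(block_id)     # stand-in for reading a block from GFS
        block_cache.put(block_id, block) # nearby keys will now hit this block
    value = block[key]
    scan_cache.put(key, value)
    return value

blocks = {"block-0": {"rowA": "1", "rowB": "2"}}
print(read("rowA", load_block=blocks.get, block_for_key=lambda k: "block-0"))  # GFS read
print(read("rowB", load_block=blocks.get, block_for_key=lambda k: "block-0"))  # Block Cache hit
```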
4) Bloom filters: A read may have to consult every SSTable that makes up a tablet, so a lookup for a row or column that
does not exist can trigger many wasted disk reads. To avoid this, clients can request that Bloom filters be created for the
SSTables in a particular locality group. A Bloom filter lets the server ask whether an SSTable might contain any data for a
given row/column pair, so most lookups for non-existent rows or columns never touch disk.
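The small Bloom-filter sketch below shows why a lookup for an absent row/column pair can skip an SSTable entirely; the bit-array size and hashing scheme are arbitrary choices, not Bigtable's.

```python
import hashlib

class BloomFilter:
    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0

    def _positions(self, key: str):
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode()).hexdigest()
            yield int(digest, 16) % self.num_bits

    def add(self, key: str) -> None:
        for pos in self._positions(key):
            self.bits |= (1 << pos)

    def might_contain(self, key: str) -> bool:
        # False means the SSTable definitely has no data for this key; True
        # means it might (false positives are possible, false negatives are not).
        return all(self.bits & (1 << pos) for pos in self._positions(key))

f = BloomFilter()
f.add("com.cnn.www/anchor:cnnsi.com")
print(f.might_contain("com.cnn.www/anchor:cnnsi.com"))  # True
print(f.might_contain("com.example/contents:"))         # almost certainly False
```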
5) Commit-log implementation: Keeping a separate commit log per tablet would cause a very large number of files to be
written concurrently to GFS, resulting in many disk seeks. Instead, all mutations from a tablet server's tablets are
appended to a single commit log. This complicates recovery, because the log of a failed server mixes mutations for many
tablets that are being reassigned to many different servers. To avoid having every recovering server read the full log, the
log entries are first sorted by the key (table, row name, log sequence number), which makes the mutations for any
particular tablet contiguous and allows them to be read efficiently with one seek followed by a sequential read.
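A tiny illustration of the sorting step appears below; the record format is a made-up stand-in for the real log entries.

```python
log = [
    ("tableA", "row9", 3, "mutation-3"),
    ("tableB", "row1", 1, "mutation-1"),
    ("tableA", "row2", 2, "mutation-2"),
    ("tableA", "row2", 4, "mutation-4"),
]

# Sorting by (table, row name, log sequence number) groups each tablet's
# mutations together while preserving their original order within a row.
for table, row, seq, mutation in sorted(log):
    print(table, row, seq, mutation)
```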
6) Exploiting immutability: The fact that SSTables are immutable simplifies the system considerably. Reads from
SSTables require no synchronization, since the files never change once written. The only mutable data structure
accessed by both reads and writes is the memtable; to reduce contention there, each memtable row is copy-on-write,
which allows reads and writes to proceed in parallel. Immutability also helps during tablet splits: instead of generating a
new set of SSTables for each child tablet, the children simply share the SSTables of the parent tablet.