Google File System
Lalit Kumar
M.Tech Final Year
Computer Science & Engineering Dept.
KEC Dwarahat, Almora
Overview
 Introduction To GFS
 Architecture
 Data Flow
 System Interactions
 Master Operations
 Metadata Management
 Garbage Collection
 Fault Tolerance
 Latest Advancement
 Drawbacks
 Conclusion
 References
Introduction
 More than 15,000 commodity-class PCs.
 Multiple clusters distributed worldwide.
 Thousands of queries served per second.
 One query reads hundreds of MB of data.
 One query consumes tens of billions of CPU cycles.
 Google stores dozens of copies of the entire Web!
Conclusion: Google needs a large, distributed, highly fault-tolerant file system.
Architecture
A GFS cluster consists of a single master and multiple chunkservers and is accessed by multiple clients.
Figure 1: GFS Architecture
Source: Howard Gobioff, “The GFS” Presented at SOSP 2003
Master
 Manages namespace/metadata.
 Manages chunk creation, replication, placement.
 Performs snapshot operations to create a duplicate of a file or directory tree.
 Performs checkpointing and logging of changes to metadata.
Chunkservers
 On startup or failure recovery, each chunkserver reports its chunks to the master.
 Periodically, it reports a subset of its chunks to the master (to detect chunks that are no longer needed).
Metadata
 Types of metadata: file and chunk namespaces, the mapping from files to chunks, and the locations of each chunk's replicas.
 Metadata is kept in memory, which makes it easy and efficient for the master to scan periodically.
 Periodic scanning is used to implement chunk garbage collection, re-replication, and chunk migration.
Data Flow
 Data is pushed linearly along a carefully picked chain of chunkservers, pipelined over TCP connections.
 Once a chunkserver receives some data, it starts forwarding immediately to the next chunkserver.
 Each machine forwards the data to the closest machine in the network topology that has not yet received it.
Figure 2: Data Flow in chunkservers
Source: http://research.google.com/archive/gfs-sosp2003.pdf
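The chain selection described on this slide can be sketched as a greedy nearest-neighbour walk. This is only an illustration: the function names, the machine names, and the hop-count table are hypothetical. The timing helper uses the ideal-case estimate given in the GFS paper, B/T + R*L, for pushing B bytes to R replicas with link throughput T and per-hop latency L.

```python
from typing import Dict, List, Tuple


def pipeline_order(source: str, targets: List[str],
                   distance: Dict[Tuple[str, str], int]) -> List[str]:
    """Greedy chain: each hop forwards to the closest machine not yet served."""
    def dist(a: str, b: str) -> int:
        # The (hypothetical) distance table is symmetric; accept either key order.
        return distance[(a, b)] if (a, b) in distance else distance[(b, a)]

    chain: List[str] = []
    current = source
    remaining = list(targets)
    while remaining:
        nearest = min(remaining, key=lambda m: dist(current, m))
        chain.append(nearest)
        remaining.remove(nearest)
        current = nearest
    return chain


def ideal_transfer_seconds(num_bytes: float, replicas: int,
                           throughput_bytes_per_s: float,
                           hop_latency_s: float) -> float:
    """Ideal pipelined transfer time: B/T + R*L."""
    return num_bytes / throughput_bytes_per_s + replicas * hop_latency_s


# Hypothetical topology: hop counts between a client and three chunkservers.
hops = {
    ("client", "A"): 1, ("client", "B"): 3, ("client", "C"): 5,
    ("A", "B"): 1, ("A", "C"): 4, ("B", "C"): 1,
}
print(pipeline_order("client", ["C", "B", "A"], hops))
```

Because forwarding starts as soon as data arrives, the replica count adds only latency terms to the estimate, not extra full transfers.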
System Interactions
Read Algorithm
1. Application originates the read request.
2. GFS client translates the request from (filename, byte range) -> (filename, chunk index) and sends it to the master.
3. Master responds with the chunk handle and replica locations (i.e. the chunkservers where the replicas are stored).
4. Client picks a location and sends the (chunk handle, byte range) request to that location.
5. Chunkserver sends the requested data to the client.
6. Client forwards the data to the application.
Figure 3: Block diagram for Read operation
Source: Howard Gobioff, “The GFS” Presented at SOSP 2003
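Step 2 of the read algorithm is plain integer arithmetic. The sketch below assumes the fixed 64 MB chunk size mentioned in the Drawbacks slide; the function name and the tuple layout are illustrative, not part of any GFS client API.

```python
CHUNK_SIZE = 64 * 1024 * 1024  # 64 MB chunks, as stated elsewhere in the deck


def to_chunk_requests(offset: int, length: int):
    """Split a byte range into (chunk_index, offset_in_chunk, nbytes) pieces."""
    requests = []
    end = offset + length
    while offset < end:
        index = offset // CHUNK_SIZE          # which chunk holds this byte
        start = offset % CHUNK_SIZE           # position inside that chunk
        nbytes = min(CHUNK_SIZE - start, end - offset)
        requests.append((index, start, nbytes))
        offset += nbytes
    return requests


# A 10-byte read that straddles the boundary between chunk 0 and chunk 1.
print(to_chunk_requests(CHUNK_SIZE - 4, 10))
```

A range that crosses a chunk boundary turns into one request per chunk, and each request can go to a different chunkserver.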
Write Algorithm
1. Application originates the write request.
2. GFS client translates the request from (filename, data) -> (filename, chunk index) and sends it to the master.
3. Master responds with the chunk handle and the (primary + secondary) replica locations.
4. Client pushes the write data to all locations. The data is stored in the chunkservers' internal buffers.
5. Client sends the write command to the primary.
6. Primary determines a serial order for the data instances stored in its buffer and writes the instances in that order to the chunk.
7. Primary sends the serial order to the secondaries and tells them to perform the write.
8. Secondaries respond to the primary, and the primary responds back to the client.
Figure 4: Block Diagram for Write operation
Source: Howard Gobioff, “The GFS” Presented at SOSP 2003
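Steps 4 to 8 can be modelled in a few lines. This is a toy sketch under simplifying assumptions: the class and method names are invented, each write is treated as an append to an in-memory chunk, and there is no networking, lease handling, or failure handling.

```python
class Replica:
    def __init__(self):
        self.buffer = {}          # data_id -> bytes pushed by the client (step 4)
        self.chunk = bytearray()  # stand-in for the on-disk chunk

    def push(self, data_id, data):
        self.buffer[data_id] = data

    def apply(self, order):
        # Apply buffered data in exactly the order chosen by the primary.
        for data_id in order:
            self.chunk += self.buffer.pop(data_id)


class Primary(Replica):
    def __init__(self, secondaries):
        super().__init__()
        self.secondaries = secondaries
        self.pending = []         # write commands in arrival order (step 5)

    def write(self, data_id):
        self.pending.append(data_id)

    def commit(self):
        order = list(self.pending)     # step 6: pick the serial order
        self.pending.clear()
        self.apply(order)              # step 6: write locally in that order
        for secondary in self.secondaries:
            secondary.apply(order)     # step 7: same order on every secondary
        return order                   # step 8: reply to the client


s1, s2 = Replica(), Replica()
primary = Primary([s1, s2])
for replica in (primary, s1, s2):
    replica.push("a", b"AA")
    replica.push("b", b"BB")
primary.write("b")   # the write command for "b" reaches the primary first
primary.write("a")
order = primary.commit()
```

The data push and the write command are separate steps, so the order in which data arrives in the buffers does not decide the final order; the primary does.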
Master Operation
1. Namespace Management and Locking
 GFS maps each full pathname to its metadata in a lookup table.
 Each master operation acquires a set of locks.
 The locking scheme allows concurrent mutations in the same directory.
 Locks are acquired in a consistent total order to prevent deadlock.
2. Replica Placement
3. Chunk Creation
4. Re-Replication
5. Balancing
1. Namespace Management & Locking
 Each master operation acquires a set of locks before it runs.
 To operate on /dir1/dir2/dir3/leaf, it first needs the following locks:
– Read-lock on /dir1
– Read-lock on /dir1/dir2
– Read-lock on /dir1/dir2/dir3
– Read-lock or write-lock on /dir1/dir2/dir3/leaf
 File creation doesn't require a write-lock on the parent directory; a read-lock on the directory name is sufficient to protect the parent directory from being deleted, renamed, or snapshotted.
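The lock set for a pathname can be computed mechanically. In this sketch the function names are hypothetical. The ordering rule (by depth in the namespace tree, then lexicographically within a level) is the one the GFS paper describes for acquiring locks in a consistent total order.

```python
def locks_for(path: str, write_leaf: bool):
    """Read-locks on every ancestor, plus a read- or write-lock on the leaf."""
    parts = [p for p in path.split("/") if p]
    locks = []
    for i in range(1, len(parts)):
        locks.append(("/" + "/".join(parts[:i]), "read"))
    locks.append(("/" + "/".join(parts), "write" if write_leaf else "read"))
    return locks


def acquisition_order(locks):
    """Sort by depth in the namespace tree, then by name within a level."""
    return sorted(locks, key=lambda lock: (lock[0].count("/"), lock[0]))


for name, mode in acquisition_order(locks_for("/dir1/dir2/dir3/leaf", True)):
    print(mode, name)
```

Two file creations in the same directory each take only a read-lock on the directory and a write-lock on their own file name, which is why they can proceed concurrently.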
2. Chunk Creation
 The master considers several factors:
 Place new replicas on chunkservers with below-average disk space utilization.
 Limit the number of “recent” creations on each chunkserver.
 Spread the replicas of a chunk across racks.
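The three factors can be combined into a simple filter-and-pick routine. This is a sketch only: the `Server` fields, the `max_recent` threshold, and the tie-breaking are assumptions made for illustration, and "below average" is implemented as "at or below average" so that a uniformly loaded cluster still yields candidates.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Server:
    name: str
    rack: str
    disk_used: float        # fraction of disk space in use, 0.0 to 1.0
    recent_creations: int   # chunks created on this server recently


def choose_servers(servers: List[Server], replicas: int = 3,
                   max_recent: int = 2) -> List[Server]:
    average = sum(s.disk_used for s in servers) / len(servers)
    candidates = sorted(
        (s for s in servers
         if s.disk_used <= average and s.recent_creations < max_recent),
        key=lambda s: s.disk_used,
    )
    chosen: List[Server] = []
    racks = set()
    # First pass: at most one replica per rack, emptiest servers first.
    for s in candidates:
        if len(chosen) == replicas:
            break
        if s.rack not in racks:
            chosen.append(s)
            racks.add(s.rack)
    # Second pass: if racks ran out, fill up with the remaining candidates.
    for s in candidates:
        if len(chosen) == replicas:
            break
        if s not in chosen:
            chosen.append(s)
    return chosen


cluster = [
    Server("a", "rack1", 0.20, 0),
    Server("b", "rack1", 0.30, 0),
    Server("c", "rack2", 0.35, 0),
    Server("d", "rack2", 0.90, 0),
    Server("e", "rack3", 0.10, 5),
]
picked = [s.name for s in choose_servers(cluster, replicas=2)]
```

Here server `e` is the emptiest but is skipped because of its many recent creations, and `b` is skipped in favour of `c` so that the two replicas land on different racks.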
3. Re-replication
 The master re-replicates a chunk as soon as the number of available replicas falls below a user-specified goal. This can happen:
 When a chunkserver becomes unavailable.
 When a chunkserver reports a corrupted chunk.
 When the replication goal is increased.
 Re-replication placement follows the same policy as chunk creation.
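A minimal sketch of how the master might find and order the chunks that need re-replication. The function name and the dictionary inputs are hypothetical. Prioritising the chunks that are furthest below their goal follows the GFS paper, which names this as one of several priority factors.

```python
def rereplication_queue(live_replicas: dict, goals: dict) -> list:
    """Return chunk handles below their replication goal, most urgent first."""
    needy = [
        (goals[chunk] - count, chunk)
        for chunk, count in live_replicas.items()
        if count < goals[chunk]
    ]
    # Largest shortfall first; chunk handle breaks ties deterministically.
    needy.sort(key=lambda item: (-item[0], item[1]))
    return [chunk for _shortfall, chunk in needy]


live = {"c1": 3, "c2": 1, "c3": 2}
goal = {"c1": 3, "c2": 3, "c3": 3}
print(rereplication_queue(live, goal))
```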
4. Balancing
 The master rebalances replicas periodically for better disk space utilization and load balancing.
 The master gradually fills up a new chunkserver rather than instantly swamping it with new chunks (and the heavy write traffic that comes with them!).
Metadata Management
 The master stores three major types of metadata:
 File and chunk namespaces
 Mapping from files to chunks
 Locations of each chunk’s replicas
 All metadata is kept in the master’s memory.
Figure 5: Logical structure of metadata
Source: Naushad UzZaman, “Survey on Google File System”, CSC 456, 2007
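Keeping all metadata in memory is workable because the per-chunk cost is small. The arithmetic below assumes the upper bound given in the GFS paper (less than 64 bytes of metadata per 64 MB chunk); that figure is not on the slide, and the function name is illustrative.

```python
import math

CHUNK_SIZE = 64 * 1024 * 1024      # 64 MB per chunk
METADATA_BYTES_PER_CHUNK = 64      # upper bound quoted in the GFS paper


def chunk_metadata_bytes(total_data_bytes: int) -> int:
    """Upper bound on master memory needed for chunk metadata."""
    chunks = math.ceil(total_data_bytes / CHUNK_SIZE)
    return chunks * METADATA_BYTES_PER_CHUNK


one_pib = 2 ** 50
print(chunk_metadata_bytes(one_pib) / 2 ** 30, "GiB of chunk metadata per PiB stored")
```

The same arithmetic shows the weakness behind the master-memory limitation: the bound is per chunk, so many small files, each occupying its own chunk, use master memory out of proportion to the data they hold.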
Garbage Collection
 Storage is reclaimed lazily by garbage collection (GC).
 A deleted file is first renamed to a hidden name.
 Hidden files are removed if they are more than three days old.
 When a hidden file is removed, its in-memory metadata is erased.
 The master regularly scans the chunk namespace to identify orphaned chunks and erases their metadata.
 Chunkservers periodically report the chunks they have, and the master replies with the identity of all chunks that are no longer present in the master's metadata. The chunkserver is free to delete its replicas of such chunks.
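The lazy deletion flow can be sketched with a toy in-memory master. The class, the hidden-name prefix, and the use of plain integer timestamps are all assumptions for illustration; only the three-day grace period and the report-and-reply exchange come from the slide.

```python
THREE_DAYS = 3 * 24 * 3600  # grace period in seconds


class Master:
    def __init__(self):
        self.files = {}    # file name -> list of chunk handles
        self.hidden = {}   # hidden name -> (deletion time, chunk handles)

    def delete(self, name, now):
        # Deletion is only a rename to a hidden name; no storage is freed yet.
        self.hidden[".deleted." + name] = (now, self.files.pop(name))

    def scan_namespace(self, now):
        # Drop hidden files older than the grace period, orphaning their chunks.
        for hidden_name, (deleted_at, _chunks) in list(self.hidden.items()):
            if now - deleted_at > THREE_DAYS:
                del self.hidden[hidden_name]

    def known_chunks(self):
        known = set()
        for chunks in self.files.values():
            known.update(chunks)
        for _deleted_at, chunks in self.hidden.values():
            known.update(chunks)
        return known

    def orphans(self, reported):
        # Reply to a chunkserver report: chunks the master no longer knows.
        return set(reported) - self.known_chunks()


master = Master()
master.files = {"f": ["c1", "c2"], "g": ["c3"]}
master.delete("f", now=0)
before = master.orphans(["c1", "c2", "c3"])
master.scan_namespace(now=THREE_DAYS + 1)
after = master.orphans(["c1", "c2", "c3"])
```

Until the scan runs, the deleted file's chunks are still known to the master, so an accidental deletion could be undone by renaming the hidden file back.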
Fault Tolerance
 High availability:
 Fast recovery.
 Chunk replication.
 Master Replication
 Data Integrity:
 Chunkserver uses checksumming.
 Each chunk is broken up into 64 KB blocks, each with its own checksum.
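Per-block checksumming can be illustrated with the standard library. The slide does not say which checksum algorithm is used, so CRC-32 from `zlib` is an assumption here, and the function names are hypothetical.

```python
import zlib

BLOCK_SIZE = 64 * 1024  # 64 KB checksum blocks


def block_checksums(chunk: bytes):
    """One CRC-32 per 64 KB block of the chunk (the last block may be short)."""
    return [zlib.crc32(chunk[i:i + BLOCK_SIZE])
            for i in range(0, len(chunk), BLOCK_SIZE)]


def corrupted_blocks(chunk: bytes, stored_checksums):
    """Indices of blocks whose current checksum differs from the stored one."""
    current = block_checksums(chunk)
    return [i for i, (now, then) in enumerate(zip(current, stored_checksums))
            if now != then]


data = bytes(BLOCK_SIZE * 2 + 10)      # two full blocks plus a short tail
stored = block_checksums(data)
damaged = bytearray(data)
damaged[BLOCK_SIZE + 5] ^= 0xFF        # flip one byte inside block 1
print(corrupted_blocks(bytes(damaged), stored))
```

Checksumming in small blocks means a read only has to verify the blocks it touches, and a mismatch pinpoints the damaged region rather than condemning the whole chunk without explanation.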
Latest Advancement
1. Gmail- An easily configurable email service with 15GB of web space.
2. Blogger- A free web-based service that helps consumers publish on the
web without writing code or installing software.
3. Google- “Next generation corporate s/w” A smaller version of the Google
software, modified for private use.
Drawbacks
 A small file has only a few chunks, perhaps just one. The chunkservers storing those chunks can become hot spots if many clients request the same file.
 Internal fragmentation.
 With many small files, the master's involvement increases and it can become a bottleneck. Having a single master node can become an issue.
 Master memory is a limitation.
 Performance may degrade as the number of writers and random writes grows.
 No reasoning is provided for the choice of the standard chunk size (64 MB).
Conclusion
GFS meets Google's storage requirements:
 Incremental growth.
 Regular checks for component failure.
 Data optimization from special operations.
 Simple architecture.
 Fault tolerance.
References
[1] Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung. The Google File System. ACM SIGOPS Operating Systems Review, 37(5), 2003.
[2] Sean Quinlan and Kirk McKusick. GFS: Evolution on Fast-forward. Communications of the ACM, Vol. 53, 2010.
[3] Thomas Anderson, Michael Dahlin, Jeanna Neefe, David Patterson, Drew Roselli, and Randolph Wang. Serverless network file systems. In Proceedings of the 15th ACM Symposium on Operating System Principles, pages 109–126, Copper Mountain Resort, Colorado, December 1995.
[4] Luis-Felipe Cabrera and Darrell D. E. Long. Swift: Using distributed disk striping to provide high I/O data rates. Computer Systems, 4(4):405–436, 1991.
[5] InterMezzo. http://www.inter-mezzo.org, 2003.
Thank You….