Speculative Execution in A Distributed File System: E. B. Nightingale P. M. Chen J. Flint

Speculative execution in distributed file systems allows clients to predictably speculate on the outcomes of operations and execute speculatively to reduce latency. The system tracks causal dependencies to ensure correctness. Evaluation shows SpecNFS is significantly faster than NFS for common workloads like Apache building, with performance degrading little as speculation failures increase. Speculation enables safe, consistent yet fast distributed file systems.

Uploaded by

sushmsn

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

60 views

Speculative Execution in A Distributed File System: E. B. Nightingale P. M. Chen J. Flint

Uploaded by

sushmsn

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 30

SPECULATIVE EXECUTION IN

A DISTRIBUTED FILE SYSTEM

E. B. Nightingale
P. M. Chen
J. Flint
University of Michigan
Motivation
• Distributed file systems are often much slower than
local file systems
– Due to synchronous operations required for
cache coherence and data safety
– Even true for file systems that weaken
consistency and safety guarantees
• Close-to-open consistency for AFS and most
versions of NFS
A better solution
• Most of these synchronous operations have
predictable outcomes
– We can bet on the outcome and let the client
process go forward (speculation)
• Make operation asynchronous
– Must take before that a checkpoint of the process
• Can restart operation if speculation failed
Why it works
1. Clients can correctly predict the outcome of many
operations
• Few concurrent accesses to files
2. Time to take a lightweight checkpoint is often less
than network round-trip time
• 52 ms for a small process thanks to
copy-on-write
3. Most clients have free cycles
Speculator
• File system controls when speculations start,
succeed and fail
• Speculator provides a mechanism to ensure
correct execution of speculative code
• No application changes are required
• Speculative state is never visible from the
outside
Correctness rules (I)
• A process that executes in speculative mode
cannot externalize output
– Speculator blocks the process
• Speculator tracks causal dependencies between
kernel objects
– Kernel objects modified by a speculative
process will be put in a speculative state
Correctness rules (II)
• Speculator tracks causal dependencies between
processes
– Processes receiving a message or a signal
from a speculative process will be
checkpointed and become speculative
• In case of doubt, Speculator will block the
execution of the speculative process
An example: conventional NFS
An example: conventional NFS
• Linux 2.4.21 NFSv3 implements close to open
consistency
– At close time, client sends to server:
1. Asynchronous write calls with the
modified data
2. A synchronous commit call once it
has received replies for all write calls
An example: SpecNFS
An example: SpecNFS
• All calls are non-blocking but force the calling
process to become speculative
• If a call returns an unexpected result, the calling
process is rolled back to its checkpoint and the
call is executed again
– A new speculation starts
Speculation interface
• Three new system calls:
– Create_speculation():
• Returns unique spec_id and a list of
previous speculations on which the
speculation depends
– Commit_speculation(spec_id)
– Fail_speculation(spec_id)
Implementing checkpoints
• Checkpoints are implemented through
copy-on-write fork
– Speculator also saves the state of any open
file descriptor and copies all pending signals
• Forked child is not placed on the ready queue
– It just waits
• If speculation fails, forked child assumes the
identity of the failed parent
New kernel structures
• Speculation structure:
– Created during create_speculation()
– Tracks the set of kernel objects that depend
on the speculation
• Undo log:
– Associated with each kernel object that has a
speculative state
– Ordered list of speculative modifications
Sharing checkpoints
• Letting successive speculations share the same
checkpoint reduces the speculation overhead
• Two limitations
– Speculator limits the amount of rollback work
by not letting speculation share a checkpoint
that is more than 500 ms old
– Cannot let a speculation share a checkpoint
with a previous speculation that changes state
of file system
Correctness invariants
1. Speculative state should never be visible to the
user or to any external device
– Speculator prevents all speculative
processes from externalizing output to any
interface
2. A process should never view speculative state
unless it is already speculatively dependent
upon that state.
Invariant implementations (I)
• First Implementation:
Block speculative processes whenever they try
to perform a system call
– Always correct
– Limits the amount of work that can be done by
a process in a speculative state
Invariant implementations (II)
• Second Implementation:
Allow speculative processes to perform systems
calls that
– Do not modify state
• “Read-only” calls such as getpid()
– Only modify state that is private to the calling
process
• It will be rolled back if speculation fails
Invariant implementations (III)
• Third Implementation:
Allow speculative processes to perform
operations on files in speculative file systems
– With VFS, can have multiple file systems on
the same machine
• Typically NFS plus FFS or ext3
• Must check type of file system
– Have a special bit in superblock
Multiprocess speculation (I)
• Whenever a speculative process P participates
in interprocess communication with a process Q
• Process Q must become speculatively
dependent on the speculative state of
process P and get checkpointed
Multiprocess speculation (II)
• Whenever a speculative process P modifies an
object X
• Object X must become speculatively
dependent on the speculative state of
process P and get an undo list

You are not responsible for the

implementation details
Performance: PostMark benchmark
Performance: PostMark benchmark
• SpecNFS is
– 2.5 times faster than NFS with no latency
between client and server
– 41 times faster than NFS with a 30ms round-trip
time delay between client and server
• A version of BlueFS providing single-copy
semantics is 49 times faster than NFS with same
30ms round-trip time delay
Performance: Apache benchmark
Performance: Apache benchmark
• Building Apache server from a tarred file
• SpecNFS is
– 2 times faster than NFS with no latency
between client and server
– 14 times faster than NFS with a 30ms round-
trip time delay between client and serve
– Always better than BlueFS and Coda
Performance: impact of rollbacks
Performance: impact of rollbacks
• Repeated Apache benchmark marking a
varying fraction of the files out-of-date
– Will result in speculation failures
– Percentage of out-of-date files has little
impact on SpecNFS performance
Performance: other
Performance: other
• Impact of group commits and sharing state
– Mostly affects Blue FS
• When speculative processes cannot
propagate their state, Blue FS performs
worse than NFS with no latency between
client and server
• Impact magnified at 30ms latency
Conclusion
• Speculation enables the development of
distributed file systems that are
– Safe
– Consistent
– Fast
• Generic kernel support for speculative execution
and causal dependency tracking could have
many other applications

DevOps Resume 71
No ratings yet
DevOps Resume 71
3 pages
CJ720 GPS Tracker Command List - Sheet1
50% (2)
CJ720 GPS Tracker Command List - Sheet1
1 page
Show Answer
No ratings yet
Show Answer
23 pages
Os Notes
No ratings yet
Os Notes
4 pages
Chapter 16 Confinement Problem
No ratings yet
Chapter 16 Confinement Problem
8 pages
Lecture 06
No ratings yet
Lecture 06
16 pages
Week 1 Talking Points
No ratings yet
Week 1 Talking Points
18 pages
OS Chapter 2-0
No ratings yet
OS Chapter 2-0
42 pages
Unit 2
No ratings yet
Unit 2
54 pages
Intro to DS Chapter 5
No ratings yet
Intro to DS Chapter 5
76 pages
OS IAE 1(B)
No ratings yet
OS IAE 1(B)
9 pages
Unit 2 - Process Management
No ratings yet
Unit 2 - Process Management
118 pages
Chapter-02-scheduling
No ratings yet
Chapter-02-scheduling
23 pages
Operating Systems: Chapter Three
No ratings yet
Operating Systems: Chapter Three
32 pages
Consistency
No ratings yet
Consistency
23 pages
Operating Systems Operating Systems: Learning Outcomes
No ratings yet
Operating Systems Operating Systems: Learning Outcomes
10 pages
Threads in Operating System
No ratings yet
Threads in Operating System
103 pages
Chapter 5 - Auditing Switches, Routers, and Firewalls
No ratings yet
Chapter 5 - Auditing Switches, Routers, and Firewalls
26 pages
Unit - Ii: Linux
No ratings yet
Unit - Ii: Linux
32 pages
Process Concept Process Scheduling Operations On Processes Cooperating Processes Interprocess Communication Communication in Client-Server Systems
No ratings yet
Process Concept Process Scheduling Operations On Processes Cooperating Processes Interprocess Communication Communication in Client-Server Systems
38 pages
Chapter 2 Process Management
No ratings yet
Chapter 2 Process Management
71 pages
Speculative Execution in A Distributed File System: Ed Nightingale Peter Chen Jason Flinn University of Michigan
No ratings yet
Speculative Execution in A Distributed File System: Ed Nightingale Peter Chen Jason Flinn University of Michigan
29 pages
Chap 5+
No ratings yet
Chap 5+
25 pages
Cs330 IIT Kanpur
No ratings yet
Cs330 IIT Kanpur
17 pages
UNIT-2
No ratings yet
UNIT-2
29 pages
Solution JUNEJULY 2018
No ratings yet
Solution JUNEJULY 2018
15 pages
code_migration
No ratings yet
code_migration
46 pages
An Intrusion-Tolerant and Self-Recoverable Network Service System Using A Security Enhanced Chip Multiprocessor
No ratings yet
An Intrusion-Tolerant and Self-Recoverable Network Service System Using A Security Enhanced Chip Multiprocessor
18 pages
6910 OS Revesion Important Topics
No ratings yet
6910 OS Revesion Important Topics
55 pages
Threads & Semaphore
No ratings yet
Threads & Semaphore
24 pages
Unit 2 Process & Thread
No ratings yet
Unit 2 Process & Thread
43 pages
lecture8-DistributedSystem
No ratings yet
lecture8-DistributedSystem
27 pages
9-Database System Architecture
No ratings yet
9-Database System Architecture
37 pages
itec2210
No ratings yet
itec2210
2 pages
Gfs Google File System 13331
No ratings yet
Gfs Google File System 13331
28 pages
CSC 504 - Lecture Series - Scheduling in Multiprocessor Systems
No ratings yet
CSC 504 - Lecture Series - Scheduling in Multiprocessor Systems
15 pages
System Call Programming & Debugging: Week 7
No ratings yet
System Call Programming & Debugging: Week 7
18 pages
Technical Question Bank Operating Systems
No ratings yet
Technical Question Bank Operating Systems
17 pages
Elg 6171 Scheduling Presentation
No ratings yet
Elg 6171 Scheduling Presentation
20 pages
Unit-2 Os 2024
No ratings yet
Unit-2 Os 2024
47 pages
Process Scheduling and Switching
No ratings yet
Process Scheduling and Switching
31 pages
LabManual-2
No ratings yet
LabManual-2
11 pages
3 Processes: 3.1 Threads
No ratings yet
3 Processes: 3.1 Threads
11 pages
Module 4: Processes: Operating System Concepts
No ratings yet
Module 4: Processes: Operating System Concepts
34 pages
CH 4
No ratings yet
CH 4
38 pages
Performance Concepts
No ratings yet
Performance Concepts
35 pages
Os Unit 2
No ratings yet
Os Unit 2
277 pages
Chapter - 1
No ratings yet
Chapter - 1
11 pages
Plecement Prep Whole
No ratings yet
Plecement Prep Whole
169 pages
Part 2 Computer Systems: System Configuration and Methods (Text No. 1 Chapter 5)
No ratings yet
Part 2 Computer Systems: System Configuration and Methods (Text No. 1 Chapter 5)
73 pages
RTOS Material
No ratings yet
RTOS Material
21 pages
File System Design For and NSF File Server Appliance: Dave Hitz, James Lau, and Michael Malcolm
No ratings yet
File System Design For and NSF File Server Appliance: Dave Hitz, James Lau, and Michael Malcolm
26 pages
Intro To DS Chapter 2
No ratings yet
Intro To DS Chapter 2
29 pages
_semester-5_2018_november_linux-system-administration-cbcs
No ratings yet
_semester-5_2018_november_linux-system-administration-cbcs
37 pages
CO4 CHAP 9 - (13-24) - Practice
No ratings yet
CO4 CHAP 9 - (13-24) - Practice
28 pages
Chapter-2 Processes and Threads in DS
No ratings yet
Chapter-2 Processes and Threads in DS
54 pages
Unit-VI: Advance Tools and Technologies (And Problem Solving in The OS)
No ratings yet
Unit-VI: Advance Tools and Technologies (And Problem Solving in The OS)
76 pages
Deadlock in Distributed Enviornment
0% (1)
Deadlock in Distributed Enviornment
31 pages
Otd Yair
No ratings yet
Otd Yair
50 pages
UNIT 2 OS_ppt
No ratings yet
UNIT 2 OS_ppt
56 pages
Embedded Systems Overview: - RTOS/EOS Design Concept
No ratings yet
Embedded Systems Overview: - RTOS/EOS Design Concept
19 pages
Advanced Penetration Testing for Highly-Secured Environments: The Ultimate Security Guide
From Everand
Advanced Penetration Testing for Highly-Secured Environments: The Ultimate Security Guide
Allen Lee
4.5/5 (6)
Kubernetes Made Easy
From Everand
Kubernetes Made Easy
Pankaj Joshi
No ratings yet
A Case For End System Multicast
No ratings yet
A Case For End System Multicast
19 pages
Green Inertia - Firm Presentation-April 2011-1
No ratings yet
Green Inertia - Firm Presentation-April 2011-1
18 pages
Torrent Clustering
No ratings yet
Torrent Clustering
12 pages
Chord: A Scalable Peer-To-Peer Lookup Protocol For Internet Applications
No ratings yet
Chord: A Scalable Peer-To-Peer Lookup Protocol For Internet Applications
25 pages
A Case For End System Multicast: Yang-Hua Chu, Sanjay Rao and Hui Zhang Carnegie Mellon University
No ratings yet
A Case For End System Multicast: Yang-Hua Chu, Sanjay Rao and Hui Zhang Carnegie Mellon University
27 pages
Narada
No ratings yet
Narada
12 pages
Congestion Avoidance and Control: V. Jacobson
No ratings yet
Congestion Avoidance and Control: V. Jacobson
17 pages
CSC: Principles of Computer Networks: Demultiplexing
No ratings yet
CSC: Principles of Computer Networks: Demultiplexing
11 pages
Chord: A Scalable Peer-to-Peer Lookup Protocol For Internet Applications
No ratings yet
Chord: A Scalable Peer-to-Peer Lookup Protocol For Internet Applications
40 pages
Rarest First and Choke Algorithms Are Enough: Arnaud LEGOUT
No ratings yet
Rarest First and Choke Algorithms Are Enough: Arnaud LEGOUT
29 pages
Core-Stateless Fair Queueing: A Scalable Architecture To Approximate Fair Bandwidth Allocations in High Speed Networks
No ratings yet
Core-Stateless Fair Queueing: A Scalable Architecture To Approximate Fair Bandwidth Allocations in High Speed Networks
56 pages
Clustering and Sharing Incentives in Bittorrent Systems
No ratings yet
Clustering and Sharing Incentives in Bittorrent Systems
23 pages
A Multifaceted Approach To Understanding The Botnet Phenomenon
No ratings yet
A Multifaceted Approach To Understanding The Botnet Phenomenon
27 pages
Chord: A Scalable Peer-to-Peer Lookup Service For Internet Applications
No ratings yet
Chord: A Scalable Peer-to-Peer Lookup Service For Internet Applications
33 pages
Domain Name System: DNS
No ratings yet
Domain Name System: DNS
16 pages
BGP Convergence
No ratings yet
BGP Convergence
24 pages
8 RouterSupport
No ratings yet
8 RouterSupport
36 pages
Congestion Control For High Bandwidth-Delay Product Networks
No ratings yet
Congestion Control For High Bandwidth-Delay Product Networks
14 pages
Internet Architecture: CPS 214 (Nick Feamster) January 14, 2008
No ratings yet
Internet Architecture: CPS 214 (Nick Feamster) January 14, 2008
31 pages
Greening of The Internet
No ratings yet
Greening of The Internet
14 pages
Reducing Network Energy Consumption Via Sleeping and Rate-Adaptation
No ratings yet
Reducing Network Energy Consumption Via Sleeping and Rate-Adaptation
14 pages
Resilient Overlay Networks: David Andersen, Hari Balakrishnan, Frans Kaashoek, and Robert Morris
No ratings yet
Resilient Overlay Networks: David Andersen, Hari Balakrishnan, Frans Kaashoek, and Robert Morris
15 pages
Multi Cast
No ratings yet
Multi Cast
38 pages
Timer Interaction in Route Flap Damping
No ratings yet
Timer Interaction in Route Flap Damping
11 pages
Delayed Internet Routing Convergence: Craig Labovitz Abha Ahuja, Abhijit Bose Farnam Jahanian
No ratings yet
Delayed Internet Routing Convergence: Craig Labovitz Abha Ahuja, Abhijit Bose Farnam Jahanian
13 pages
Kendriya Vidyalaya Sangathan Regional Office, Jabalpur Region
No ratings yet
Kendriya Vidyalaya Sangathan Regional Office, Jabalpur Region
24 pages
Analysing the requirements of Operational Engineer
No ratings yet
Analysing the requirements of Operational Engineer
2 pages
Iptables - A FORWARD - I Eth0 - J ACCEPT Exit 0
No ratings yet
Iptables - A FORWARD - I Eth0 - J ACCEPT Exit 0
2 pages
ServiceNow Virtual Agent - ServiceNow TechMinds - Medium
No ratings yet
ServiceNow Virtual Agent - ServiceNow TechMinds - Medium
10 pages
CN Model Question Paper
No ratings yet
CN Model Question Paper
4 pages
Design/Execution Steps: Web Technology Lab Manual
No ratings yet
Design/Execution Steps: Web Technology Lab Manual
3 pages
Case Study On Using Ai'S and Chatbots 7. Charter Communications: 500% ROI in Six Months
No ratings yet
Case Study On Using Ai'S and Chatbots 7. Charter Communications: 500% ROI in Six Months
10 pages
History Homework Takeaway Menu
100% (1)
History Homework Takeaway Menu
8 pages
29793
No ratings yet
29793
55 pages
Christine M. Taoc 12-Gas01: Activity 1: Social Media Scenario
No ratings yet
Christine M. Taoc 12-Gas01: Activity 1: Social Media Scenario
1 page
Edexcel Modular Homework Book Higher 2 Answers
100% (1)
Edexcel Modular Homework Book Higher 2 Answers
4 pages
Aunt Dee
No ratings yet
Aunt Dee
2 pages
Canon iRC2880 3880 Reference Guide
No ratings yet
Canon iRC2880 3880 Reference Guide
532 pages
UD38502B - Network Camera - User Manual - H11 V5.9.10 - H8 5.8.60 - 20240807
No ratings yet
UD38502B - Network Camera - User Manual - H11 V5.9.10 - H8 5.8.60 - 20240807
206 pages
SEO Executive Job Description - Ajavu Tech House
No ratings yet
SEO Executive Job Description - Ajavu Tech House
2 pages
TallyPrime May'24 Set 1
No ratings yet
TallyPrime May'24 Set 1
2 pages
HALFYEARLY_MARKINGSCHEME
No ratings yet
HALFYEARLY_MARKINGSCHEME
6 pages
List of Geographic Information Systems Software
No ratings yet
List of Geographic Information Systems Software
8 pages
Free Homework Checklist Template
100% (1)
Free Homework Checklist Template
4 pages
CCURE9000 SWH-TECHREF-nID-000035512 LT en
No ratings yet
CCURE9000 SWH-TECHREF-nID-000035512 LT en
4 pages
Fisher Hudak: Business Administra - On Major, Minor: Leadership
No ratings yet
Fisher Hudak: Business Administra - On Major, Minor: Leadership
1 page
MIL - Module 15 16
No ratings yet
MIL - Module 15 16
25 pages
SMA_Module_1
No ratings yet
SMA_Module_1
50 pages
Arun Resume
No ratings yet
Arun Resume
5 pages
Socio-Cultural Perspectives On Cancel Culture (Full Manuscript)
No ratings yet
Socio-Cultural Perspectives On Cancel Culture (Full Manuscript)
135 pages
Techstreet Enterprise User Guide PDF
No ratings yet
Techstreet Enterprise User Guide PDF
13 pages
GCP Cloud Security
No ratings yet
GCP Cloud Security
12 pages

Speculative Execution in A Distributed File System: E. B. Nightingale P. M. Chen J. Flint

Uploaded by

Speculative Execution in A Distributed File System: E. B. Nightingale P. M. Chen J. Flint

Uploaded by

SPECULATIVE EXECUTION IN

A DISTRIBUTED FILE SYSTEM

You are not responsible for the

You might also like