
CS516 Distributed Operating System Unit II

UNIT II : DISTRIBUTED COMPUTING


Introduction - Distributed Systems - Hardware and Software concepts - Design issues;
Communication in Distributed systems: Layered protocols - ATM networks - Client Server model -
Remote Procedure Calls

ADVANTAGES OF DISTRIBUTED OPERATING SYSTEM


Advantages over Centralized Systems

Economics               Microprocessors offer a better price/performance than mainframes

Speed                   A distributed system may have more total computing power than a mainframe

Inherent distribution   Some applications involve spatially separated machines

Reliability             If one machine crashes, the system as a whole can still survive

Incremental growth      Computing power can be added in small increments

Advantages over independent PCs

Data sharing       Allow many users access to a common database (Ex: airline reservation)

Device sharing     Allow many users to share expensive peripherals like color laser printers, phototypesetters and massive archival storage devices like optical jukeboxes

Communication      Make human-to-human communication easier (Ex: email)

Flexibility        Spread the workload over the available machines in the most cost-effective way


HARDWARE CONCEPTS OF DISTRIBUTED OS

Based on the number of instruction and data streams


 SISD
o A computer with a single instruction stream and a single data stream is called SISD.
o All traditional uniprocessor computers (i.e., those having only one CPU) fall in this
category, from personal computers to large mainframes
 SIMD
o single instruction stream, multiple data stream.
o Array processors with one instruction unit that fetches an instruction, and then
commands many data units to carry it out in parallel, each with its own data.
 MISD
o multiple instruction stream, single data stream
 MIMD
o a group of independent computers, each with its own program counter, program, and data.
o all distributed systems are MIMD
MIMD Groups
 Multiprocessors
o have shared memory
o single virtual address space that is shared by all CPUs
 Multicomputers
o every machine has its own private memory
Based on the architecture of the interconnection network
 Bus
o single network, backplane, bus, cable, or other medium that connects all the machines
o Example: Cable television
 Switched
o individual wires from machine to machine, with many different wiring patterns
o Messages move along the wires, with an explicit switching decision made at each step
to route the message along one of the outgoing wires. Example: PSTN


Based on their communication


 Tightly coupled
o Delay experienced when a message is sent from one computer to another is short, and
the data rate is high
o used more as parallel systems
o two CPU chips on the same PCB and connected by wires etched onto the board
o multiprocessors tend to be more tightly coupled because they can exchange data at
memory speeds
 Loosely coupled
o The intermachine message delay is large and the data rate is low
o used as distributed systems
o two computers connected by a 2400 bit/sec modem over the telephone system
o some fiber optic based multicomputers can also work at memory speeds
Bus Based Multiprocessors
 Consists of some number of CPUs all connected to a common bus, along with a memory
module
 Coherent shared memory: a value written by one CPU is immediately visible to all other CPUs
 Cache memory: placed between CPU and bus to speed up performance; holds recently accessed words; effectiveness is measured by the hit rate
 Write-through cache: writes go to memory as well as to the cache, so memory always stays up to date
 Snoopy cache
o All caches constantly monitor the bus; when a cache sees a write to a word it holds, it either removes that entry or updates it with the new value

Switched Multiprocessor
To build a multiprocessor with more than 64 CPUs, a different interconnection method is needed


Crossbar switch
 Divide the memory up into modules and then connect them to the CPUs with a crossbar
switch
 Many CPUs can be accessing memory at the same time
 We need n x n crosspoint switches with n CPUs and n memories
Omega network
 Built from 2x2 switches
 Switches can be set in nanoseconds or less
 Every CPU can access every memory
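
As a rough worked comparison of the two interconnects (assuming n is a power of two and 2x2 switching elements):
Crossbar: n x n = n^2 crosspoints; for n = 1024, that is 1024^2 = 1,048,576 crosspoints.
Omega network: log2(n) stages of n/2 switches each, i.e. (n/2) log2(n) switches; for n = 1024, that is 512 x 10 = 5,120 switches in 10 stages.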
 NUMA (Non-Uniform Memory Access)
o Access to local memory is fast, access to remote memory is slower
o Requires complex algorithms for placing software (code and data) so that most accesses are local

Bus Based Multicomputers


 Less traffic; need not be a high speed backplane bus
 Lower speed LAN (10-100 Mbps compared to 300 Mbps & above for a backplane bus)

Switched Multicomputers
Each CPU has direct and exclusive access to its own, private memory

Grid Topology
Best suited for problems that have an inherent two-dimensional nature, like graph theory or vision (Ex: robot eyes, analysing photographs)

Hypercube
 n-dimensional cube
 Each vertex is a CPU
 Each edge is a connection between two CPUs
 The corresponding vertices in each of the two cubes are connected


SOFTWARE CONCEPTS OF DISTRIBUTED OS


 Network Operating Systems
 True Distributed Systems
 Multiprocessor Timesharing Systems

System       Description                                                               Main goal

DOS          Tightly-coupled OS for multiprocessors and homogeneous multicomputers     Hide and manage hardware resources

NOS          Loosely-coupled OS for heterogeneous multicomputers (LAN and WAN)         Offer local services to remote clients

Middleware   Additional layer atop a NOS implementing general-purpose services         Provide distribution transparency

Network Operating Systems


 In this model, each user has a workstation for his exclusive use.
 It may or may not have a hard disk.
 It definitely has its own operating system.
 All commands are normally run locally, right on the workstation.
 It is possible for a user to log into another workstation remotely
 Workstations can import or mount file systems located on file servers, augmenting their local file systems with those on the servers
True Distributed Systems
 Tightly-coupled software on the same loosely-coupled (i.e., multicomputer) hardware.
 Users should not have to be aware of the existence of multiple CPUs in the system
 There must be a single, global inter-process communication mechanism so that any
process can talk to any other process
 Global protection scheme
 Process management must also be the same
Multiprocessor Timesharing Systems
 Tightly-coupled software on tightly-coupled hardware
 Existence of a single run queue: a list of all the processes in the system that are logically
unblocked and ready to run


Comparison of three different ways of organizing n CPUs

Item                                              Network OS     Distributed OS    Multiprocessor OS

Does it look like a virtual uniprocessor?         No             Yes               Yes

Do all machines have to run the same OS?          No             Yes               Yes

How many copies of the OS are there?              N              N                 1

How is communication achieved?                    Shared files   Messages          Shared memory

Are agreed-upon network protocols required?       Yes            Yes               No

Is there a single run queue?                      No             No                Yes

Does file sharing have well-defined semantics?    Usually no     Yes               Yes

DESIGN ISSUES IN DISTRIBUTED OPERATING SYSTEMS


 Transparency
 Flexibility
 Reliability
 Performance
 Scalability

Transparency: LMR-CP
 how to achieve the single-system image
 how do the system designers fool everyone into thinking that the collection of machines
is simply an old-fashioned timesharing system?
Types of Transparency

Location The users cannot tell where resources are located

Migration Resources can move at will without changing their names

Replication The users cannot tell how many copies exist

Concurrency Multiple users can share resources automatically

Parallelism Activities can happen in parallel without users knowing

Location transparency refers to the fact that in a true distributed system, users cannot tell
where hardware and software resources such as CPUs, printers, files, and data bases are located.
The name of the resource must not secretly encode the location of the resource, so names like
machine1:prog.c or /machine1/prog.c are not acceptable
Migration transparency means that resources must be free to move from one location to
another without having their names change
Since a path like /work/news does not reveal the location of the server, it is location transparent.
With replication transparency, the operating system is free to make additional copies of files and
other resources on its own without the users noticing


Flexibility
Flexibility in a distributed operating system is enhanced through its modular characteristics
and by providing a richer set of higher-level services.
Monolithic: Kernel that provides most services itself
Microkernel: Kernel should provide as little as possible, with the bulk of the operating system
services available from user-level servers.

Reliability
If a machine goes down, some other machine takes over the job

 Availability
o Fraction of time that the system is usable
 Security
o Files and other resources must be protected from unauthorized usage
 Fault Tolerance
Performance
It should not be appreciably worse than running the same application on a single processor
 Response time
 Throughput
 System utilization
 Network capacity consumed
 Grain size of all computations
o Fine-grained parallelism
 Jobs that interact highly with one another
o Coarse-grained parallelism
 Jobs that involve large computations, low interaction rates, and little data
 Fault tolerance
Scalability
Potential bottlenecks that designers should try to avoid in very large distributed systems

Centralized components A single mail server for all users

Centralized tables A single on-line telephone book

Centralized algorithms Doing routing based on complete information

Only decentralized algorithms should be used with the following characteristics


1. No machine has complete information about the system state.
2. Machines make decisions based only on local information.
3. Failure of one machine does not ruin the algorithm.
4. There is no implicit assumption that a global clock exists


LAYERED PROTOCOL IN COMMUNICATION

 The OSI model is designed to allow open systems to communicate.


 An open system is one that is prepared to communicate with any other open system by
using standard rules that govern the format, contents, and meaning of the messages sent
and received.
 These rules are formalized as protocols.
 A protocol is an agreement between the communicating parties on how communication is
to proceed.
 The collection of protocols used in a particular system is called a protocol suite or protocol
stack


Physical Layer
 Concerned with transmitting the 0s and 1s.
 How many volts to use for 0 and 1, how many bits per second can be sent, and whether
transmission can take place in both directions simultaneously
 The size and shape of the network connector (plug), the number of pins and their
meaning
 Deals with standardizing the electrical, mechanical, and signaling interfaces
 Many physical layer standards exist, such as the RS-232-C standard for serial communication lines.

The Data Link Layer


 Groups the bits into units, called frames, and sees that each frame is correctly received
 Frames are assigned sequence numbers (in the header) and appended with a checksum
 If the checksum fails, the receiver asks the sender to retransmit the frame
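
A minimal sketch of the checksum idea in C; the frame layout and the simple additive checksum here are illustrative assumptions, not any particular link protocol:

```c
#include <stdint.h>
#include <stdio.h>
#include <string.h>

/* Illustrative frame: sequence number, length, payload, checksum. */
struct frame {
    uint16_t seq;
    uint16_t len;
    uint8_t  payload[256];
    uint16_t checksum;
};

/* Simple additive checksum over header and payload (illustrative only). */
static uint16_t frame_checksum(const struct frame *f)
{
    uint32_t sum = f->seq + f->len;
    for (uint16_t i = 0; i < f->len; i++)
        sum += f->payload[i];
    return (uint16_t)(sum & 0xFFFF);
}

int main(void)
{
    struct frame f = { .seq = 7 };
    const char *msg = "hello";
    f.len = (uint16_t)strlen(msg);
    memcpy(f.payload, msg, f.len);
    f.checksum = frame_checksum(&f);          /* sender side */

    /* Receiver side: recompute and compare. */
    if (frame_checksum(&f) == f.checksum)
        printf("frame %u accepted\n", f.seq);
    else
        printf("frame %u damaged, ask sender to retransmit\n", f.seq);
    return 0;
}
```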

The Network Layer


 Routing is the primary task of the network layer
 The shortest route is not always the best route
 Two network-layer protocols are in widespread use:
o Connection-oriented - X.25
o Connectionless - IP

The Transport Layer


 Provides a reliable connection
 The transport layer breaks a message into pieces small enough to fit in a single packet,
assigns each one a sequence number, and then sends them all
 The ISO transport protocol has five variants, known as TP0 through TP4
 On the Internet, the Transmission Control Protocol (TCP), roughly comparable to TP4, and the
connectionless User Datagram Protocol (UDP) play this role

The Session Layer


 It provides dialog control, to keep track of which party is currently talking, and it provides
synchronization facilities.
 The latter are useful to allow users to insert checkpoints into long transfers, so that in the
event of a crash it is only necessary to go back to the last checkpoint, rather than all the
way back to the beginning.

The Presentation Layer


 Concerned with the meaning of the bits.
 Most messages consist of structured information such as people's names, addresses, etc.
 In the presentation layer it is possible to define records containing fields like these and
then have the sender notify the receiver that a message contains a particular record in a
certain format.
 This makes it easier for machines with different internal representations to communicate.

The Application Layer


 The application layer is really just a collection of miscellaneous protocols for common
activities such as electronic mail, file transfer, and connecting remote terminals to
computers over a network.
 Examples: X.400 electronic mail protocol and the X.500 directory server.


ATM NETWORKS
 The ATM model is that a sender first establishes a
connection (i.e., a virtual circuit) to the receiver or
receivers.
 During connection establishment, a route is determined
from the sender to the receiver(s) and routing
information is stored in the switches along the way.
 Using this connection, packets can be sent, but they are
chopped up by the hardware into small, fixed-sized units
called cells.
 The cells for a given virtual circuit all follow the path
stored in the switches.
 When the connection is no longer needed, it is released
and the routing information purged from the switches.
 An advantage of cell switching is that it can handle both point-to-point and multicast communication efficiently
 Fixed-size cells allow rapid switching

The ATM Physical Layer


 An ATM adaptor board plugged into a computer can put out a stream of cells onto a wire
or fiber.
 The transmission stream must be continuous.
 When there are no data to be sent, empty cells are transmitted, which means that in the
physical layer, ATM is really synchronous, not asynchronous.
 Within a virtual circuit, however, it is asynchronous.
 The adaptor board can also use SONET (Synchronous Optical NETwork) in the physical
layer

The ATM Layer


 53-byte cell containing a 48-byte data field
 ATM cells will span SONET frames
 Two separate levels of synchronization are thus needed:
o one to detect the start of a SONET frame
o one to detect the start of the first full ATM cell within the SONET payload.
 A standard for packing ATM cells into SONET frames exists, and the entire layer can be
done in hardware.

The ATM Adaptation Layer


The adaptation layer provides a mechanism that allows a computer to send a packet, have the ATM
hardware break it into cells and transmit them, and have them reassembled at the other end,
generating one interrupt per packet, not per cell.


Four classes of traffic:


1. Constant bit rate traffic (for audio and video).
2. Variable bit rate traffic but with bounded delay.
3. Connection-oriented data traffic.
4. Connectionless data traffic

 Simple and Efficient Adaptation Layer (SEAL)


o It uses only one bit in the ATM header, one of the bits in the Payload type field.
This bit is normally 0, but is set to 1 in the last cell of a packet.
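
The sketch below illustrates the adaptation-layer idea in software terms, assuming a simplified cell with a single end-of-packet flag rather than the real ATM header: a packet is cut into 48-byte payloads, and only the final cell has the flag set so the receiver knows where the packet ends.

```c
#include <stdint.h>
#include <stdio.h>
#include <string.h>

#define CELL_PAYLOAD 48   /* ATM cells carry a 48-byte data field */

struct cell {
    uint8_t last;                   /* simplified "end of packet" bit */
    uint8_t data[CELL_PAYLOAD];
};

/* Split a packet into cells; only the final cell has last = 1. */
static int segment(const uint8_t *pkt, size_t len, struct cell *cells, int max)
{
    int n = 0;
    size_t off = 0;
    while (off < len && n < max) {
        size_t chunk = len - off < CELL_PAYLOAD ? len - off : CELL_PAYLOAD;
        memset(&cells[n], 0, sizeof cells[n]);
        memcpy(cells[n].data, pkt + off, chunk);
        off += chunk;
        cells[n].last = (off == len);
        n++;
    }
    return n;
}

int main(void)
{
    uint8_t packet[120];            /* example packet: 120 bytes -> 3 cells */
    memset(packet, 0xAB, sizeof packet);

    struct cell cells[16];
    int n = segment(packet, sizeof packet, cells, 16);
    for (int i = 0; i < n; i++)
        printf("cell %d: last=%d\n", i, cells[i].last);
    return 0;
}
```
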
ATM Switching
 ATM networks are built up of copper or optical cables and switches.
 Cells originating at any of the computers attached to the system can be switched to
any of the other computers by traversing one or more switches.
 In the example configuration, each switch has four ports, each used for both input and output.


CLIENT SERVER MODEL


 Structure the operating system as a group of co-operating processes called servers
 Servers offer services to users, called clients
 Client and server machines normally run the same microkernel
 A machine may run a single process or it may run multiple clients, multiple servers or a
mixture of the two
 This model is based on a simple, connectionless request / reply protocol
Advantage
 Simplicity
 Efficiency - Protocol stack is shorter
 Communication services can be reduced to two system calls, such as send(dest, &mptr) and
receive(addr, &mptr); a minimal sketch of such an exchange follows below
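
As a concrete illustration of the connectionless request/reply style (not the textbook kernel primitives themselves), this sketch sends one UDP request and blocks for one reply; the server address, port, and request text are made-up values.

```c
#include <arpa/inet.h>
#include <netinet/in.h>
#include <stdio.h>
#include <string.h>
#include <sys/socket.h>
#include <unistd.h>

int main(void)
{
    /* "send(dest, &mptr)" becomes sendto(); "receive(addr, &mptr)" becomes recvfrom(). */
    int s = socket(AF_INET, SOCK_DGRAM, 0);
    if (s < 0) { perror("socket"); return 1; }

    struct sockaddr_in server = {0};
    server.sin_family = AF_INET;
    server.sin_port = htons(5000);                 /* assumed server port */
    inet_pton(AF_INET, "127.0.0.1", &server.sin_addr);

    const char *request = "READ file1 0 1024";     /* illustrative request */
    sendto(s, request, strlen(request), 0,
           (struct sockaddr *)&server, sizeof server);

    char reply[1024];
    ssize_t n = recvfrom(s, reply, sizeof reply - 1, 0, NULL, NULL);  /* blocks until a reply arrives */
    if (n >= 0) {
        reply[n] = '\0';
        printf("reply: %s\n", reply);
    }
    close(s);
    return 0;
}
```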

Four design issues for the communication primitives and solutions

Types of addressing
Machine.process addressing
 Hardwire machine.number into client code
 machine.process addresses, such as 243.0 or 199.0
 Variant: machine.local-id
o local-id: randomly chosen 16-bit or 32-bit integer (or the next one in a sequence)
o Berkeley Unix uses this method
 Disadv: Not transparent - Client knows where the server is

Process addressing with broadcasting


 Processes pick random addresses; locate them by broadcasting
 sender can broadcast a special locate packet containing the address of the destination
process
 All the kernels check to see if the address is theirs
 If so, that kernel sends back a "here I am" message giving its network address (machine number)
 The sending kernel uses this address, and caches it to avoid broadcasting the next time


Address lookup via a name server


 Put ASCII server names in clients; look them up at run time
 To avoid the extra load on the system in broadcasting, a special mapping server called
name server is used
 The name server returns the address of the server
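
A minimal sketch of the table such a name server might keep, mapping ASCII service names to network addresses; the entries and the lookup interface are assumptions for illustration.

```c
#include <stdio.h>
#include <string.h>

struct name_entry {
    const char *name;       /* ASCII service name the client uses */
    const char *address;    /* network address of the server */
};

/* Illustrative name-server table. */
static const struct name_entry table[] = {
    { "file_server",  "192.0.2.10:6000" },
    { "print_server", "192.0.2.11:6001" },
};

/* Look a service name up; returns NULL if unknown. */
static const char *lookup(const char *name)
{
    for (size_t i = 0; i < sizeof table / sizeof table[0]; i++)
        if (strcmp(table[i].name, name) == 0)
            return table[i].address;
    return NULL;
}

int main(void)
{
    const char *addr = lookup("file_server");
    printf("file_server is at %s\n", addr ? addr : "(not registered)");
    return 0;
}
```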

Blocking versus Nonblocking Primitives


Buffered versus Unbuffered Primitives

Reliable versus Unreliable Primitives

Summary of the four design issues:

Item          Option 1                                     Option 2                                              Option 3

Addressing    Machine number                               Sparse process addresses                              ASCII names looked up via server

Blocking      Blocking primitives                          Nonblocking with copy to kernel                       Nonblocking with interrupt

Buffering     Unbuffered, discarding unexpected messages   Unbuffered, temporarily keeping unexpected messages   Mailboxes

Reliability   Unreliable                                   Request-Ack-Reply-Ack                                 Request-Reply-Ack


Implementing the Client-Server Model


Packet types used in Client-Server protocols
Code Packet Type From To Description

REQ Request Client Server The client wants service

REP Reply Server Client Reply from the server to the client

ACK Ack Either Other The previous packet arrived

AYA Are You Alive? Client Server Probe to see if the server has crashed

IAA I Am Alive Server Client The server has not crashed

TA Try Again Server Client The server has no room

AU Address Unknown Server Client No process is using this address

Various types of packet sequences


 Straight request/reply with no acknowledgement
 Each message is acknowledged individually
 Reply acting as the ack
 Nervous client checking to see if the server is still there


REMOTE PROCEDURE CALL (RPC)


 Allowing programs to call procedures located on other machines
 When a process on machine A calls a procedure on machine B,
the calling process on A is suspended
 Execution of the called procedure takes place on B
 Information can be transported from the caller to the callee in the parameters and can
come back in the procedure result
 No message passing or I/O at all is visible to the programmer

Basic RPC Operation


 The idea behind RPC is to make a remote procedure call look as much as possible like a
local one
 RPC achieves its transparency in a way analogous to an ordinary (local) library call
o When read is a remote procedure, a different version of read called client stub is
put into the library
o This stub packs the parameters into a message and asks the kernel to send the
message to the server
o Following the call to send, the client stub calls receive, blocking itself until the reply
comes back
o When the message arrives at the server, the kernel passes it up to a server stub
that is bound with the actual server
o The server stub unpacks the parameters from the message and then calls the
server procedure in the usual way
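
A compressed sketch of the stub machinery, with the network replaced by a direct call so it stays self-contained: the client stub marshals the arguments of a hypothetical remote add(), the server stub unmarshals them, calls the real procedure, and returns the result. All names are illustrative.

```c
#include <stdio.h>
#include <string.h>

/* Message exchanged between client stub and server stub (illustrative). */
struct message {
    char name[16];   /* which remote procedure */
    int  arg1, arg2; /* marshaled parameters   */
    int  result;     /* marshaled return value */
};

/* The real procedure, living on the "server". */
static int add(int a, int b) { return a + b; }

/* Server stub: unpack parameters, call the real procedure, pack the result. */
static void server_stub(struct message *m)
{
    if (strcmp(m->name, "add") == 0)
        m->result = add(m->arg1, m->arg2);
}

/* Client stub: looks like a local add(), but packs a message and "sends" it.
 * Here the send/receive pair is simulated by calling server_stub() directly. */
static int add_remote(int a, int b)
{
    struct message m = { .arg1 = a, .arg2 = b };
    strcpy(m.name, "add");
    server_stub(&m);           /* in a real RPC this crosses the network */
    return m.result;
}

int main(void)
{
    printf("2 + 3 = %d\n", add_remote(2, 3));  /* caller sees an ordinary call */
    return 0;
}
```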


Steps involved in a remote procedure call


1. The client procedure calls the client stub in the normal way
2. The client stub builds a message and traps to the kernel
3. The kernel sends the message to the remote kernel
4. The remote kernel gives the message to the server stub
5. The server stub unpacks the parameters and calls the server
6. The server does the work and returns the result to the stub
7. The server stub packs it in a message and traps to the kernel
8. The remote kernel sends the message to the client's kernel
9. The client's kernel gives the message to the client stub
10. The stub unpacks the result and returns to the client
The net effect of all these steps is to convert the local call made by the client procedure (to the
client stub) into a local call to the server procedure, without either client or server being aware
of the intermediate steps.

Parameter Passing
 Parameter marshaling
o Packing parameters into a message
o This model works fine as long as the client and server machines are identical and
all the parameters and results are scalar types such as integers, characters …
o In a large distributed system, multiple machine types are present. Each machine
has its own representation of numbers, characters and other data items like
EBCDIC code versus ASCII, representation of integers (1s complement vs 2s
complement), floating point numbers, byte order (little endian vs big endian)
o One solution is to define a network standard or canonical form for data types
o All senders convert their internal representation to this canonical form while marshaling (see the byte-order sketch after this list)
o Somewhat inefficient, since two conversions are needed even when sender and receiver use the same internal representation
 An alternative to canonical marshaling is for the client to use its own native format and to
indicate in the first byte of the message which format this is
o Ex: a little endian client builds a little endian message, and the server checks the first
byte to see which format was used
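
A small sketch of canonical-form marshaling for a single integer, using network byte order as the agreed external representation: the sender converts with htonl() and the receiver converts back with ntohl(), so little endian and big endian machines interoperate.

```c
#include <arpa/inet.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

int main(void)
{
    uint32_t value = 0x12345678;
    uint8_t  buffer[4];

    /* Marshal: convert host representation to the canonical (network) form. */
    uint32_t wire = htonl(value);
    memcpy(buffer, &wire, sizeof wire);

    /* Unmarshal on the receiving side: canonical form back to host form. */
    uint32_t received;
    memcpy(&received, buffer, sizeof received);
    received = ntohl(received);

    printf("sent 0x%08x, received 0x%08x\n", value, received);
    return 0;
}
```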

Dynamic Binding
 The server registers with the binder during initialization and exports its interface
 A handle is used to locate the server
 The client sends a message to the binder asking to import the server interface
 If a suitable interface exists, the binder returns the server handle. Otherwise, the RPC fails


The Binder Interface


Call         Input                               Output

Register     Name, version, handle, unique id

Deregister   Name, version, unique id

Lookup       Name, version                       Handle, unique id
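
A toy in-memory binder along the lines of the interface above: servers register a (name, version, handle) tuple and clients look it up by name and version; the data structures and return conventions are assumptions.

```c
#include <stdio.h>
#include <string.h>

#define MAX_SERVICES 8

struct binding {
    char name[32];
    int  version;
    int  handle;     /* opaque handle used to reach the server */
    int  in_use;
};

static struct binding bindings[MAX_SERVICES];

/* Register: record a server's exported interface; returns 0 on success. */
static int binder_register(const char *name, int version, int handle)
{
    for (int i = 0; i < MAX_SERVICES; i++) {
        if (!bindings[i].in_use) {
            strncpy(bindings[i].name, name, sizeof bindings[i].name - 1);
            bindings[i].version = version;
            bindings[i].handle  = handle;
            bindings[i].in_use  = 1;
            return 0;
        }
    }
    return -1;   /* table full */
}

/* Lookup: import an interface; returns the handle or -1 if the RPC must fail. */
static int binder_lookup(const char *name, int version)
{
    for (int i = 0; i < MAX_SERVICES; i++)
        if (bindings[i].in_use && bindings[i].version == version &&
            strcmp(bindings[i].name, name) == 0)
            return bindings[i].handle;
    return -1;
}

int main(void)
{
    binder_register("file_server", 2, 42);     /* server initialization */
    int h = binder_lookup("file_server", 2);   /* client import */
    printf("handle = %d\n", h);
    return 0;
}
```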

RPC SEMANTICS IN THE PRESENCE OF FAILURES

Five classes of failures


1. The client is unable to locate the server
2. The request message from the client to the server is lost
3. The reply message from the server to the client is lost
4. The server crashes after receiving a request
5. The client crashes after sending a request

1) Client Cannot Locate the Server


 Client cannot locate a suitable server
 The server might be down
 The server evolves and a new version of the interface is installed with new stubs
Solutions
 Return a special error code such as -1; not always reliable, since -1 can also be a
legitimate result (Ex: sum(7, -8))
 Raise an exception (Java), call a special procedure (Ada), use signal handlers (C), etc.
o Having to write an exception or signal handler destroys the transparency

2) Lost Request Messages


 The kernel starts a timer when sending the request.
 If the timer expires before a reply or acknowledgement comes back, the kernel sends the
message again.

3) Lost Reply messages


 If no reply is forthcoming within a reasonable period, just send the request once more.
 A request which has no side effects and can be executed as often as necessary without
any harm being done is said to be idempotent (Ex: asking for the first 1024 bytes of a
file). Request such as transferring money is not idempotent
 Try to structure all requests in an idempotent way
 Another method is to have the client's kernel assign each request a sequence number, so the
server's kernel can tell a retransmission from an original request (a sketch follows below)
 Yet another is to have a bit in the message header that distinguishes initial requests from
retransmissions
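
A sketch of the server side of the sequence-number scheme: a request whose number has already been seen from that client is treated as a retransmission and is not re-executed. The bookkeeping (one client, in memory) is deliberately simplified.

```c
#include <stdio.h>

/* Last sequence number seen from the (single, illustrative) client. */
static int last_seq = -1;

/* Returns 1 if the request should be executed, 0 if it is a retransmission. */
static int accept_request(int seq)
{
    if (seq == last_seq)
        return 0;          /* duplicate: resend cached reply instead of redoing work */
    last_seq = seq;
    return 1;
}

int main(void)
{
    int incoming[] = { 1, 2, 2, 3 };   /* request 2 arrives twice */
    for (int i = 0; i < 4; i++)
        printf("request %d: %s\n", incoming[i],
               accept_request(incoming[i]) ? "execute" : "duplicate, ignore");
    return 0;
}
```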

4) Server Crashes
 At least once semantics
o Wait until the server reboots (or rebind to a new server) and try the operation again
o Guarantees that the RPC has been carried out at least one time, but possibly more
 At most once semantics
o Give up immediately and report back failure
o Guarantees that the RPC has been carried out at most one time, but possibly not at all
 Guarantee nothing
o The client gets no help and no promises; the RPC may have been carried out anywhere from
zero to many times
 What is really wanted is exactly once semantics, but none of these schemes achieves it in general


5) Client Crashes
 Client sends a request to a server to do some work and crashes before the server replies
 A computation that is still active but whose parent is no longer waiting for the result is called an orphan
 They waste CPU cycles
Solutions
 Extermination
o A log is kept on a medium that could survive the crash
o The log is examined after a reboot to detect the orphans
o Orphan is explicitly killed off
 Reincarnation
o When a client reboots, it broadcasts a message to all machines declaring the start
of a new epoch.
o When such a broadcast comes in, all remote computations are killed
 Gentle reincarnation
o When an epoch broadcast comes in, each machine checks to see if it has any
remote computations
o If so, tries to locate their owner.
o Only if the owner cannot be found is the computation killed.
 Expiration
o Each RPC is given a standard amount of time T to do the job
o If it can not finish, it must explicitly ask for another quantum

RPC IMPLEMENTATION ISSUES


 RPC Protocol
o Connection Oriented or Connectionless
o Whether to use a general purpose protocol (like IP & UDP) or one specifically
designed for RPC
 Acknowledgements
o Should individual packets be acknowledged or not
o Stop-and-wait protocol
 The client sends packet 0 with the first 1K
 Waits for an ack from the server
 Then sends the second 1K
o Blast protocol
 The client sends all the packets as fast as it can
 The server decides whether to process them or do nothing
 It can perform selective repeat by asking for specific packets to be retransmitted


 Critical Path
o The sequence of instructions that is executed on every RPC is called the critical
path
o Copying: the number of times a message must be copied between kernel and user address spaces
o This depends on the hardware, the software, and the type of call

 Timer Management
o The amount of machine time that goes into managing the timers
o Algorithms that operate by periodically making a sequential pass through a table
are called sweep algorithms
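
A minimal sketch of a sweep algorithm: outstanding RPC timers sit in a table, and a periodic pass compares each deadline against the current time and retransmits whatever has expired; the table layout and the re-arm policy are illustrative.

```c
#include <stdio.h>
#include <time.h>

#define MAX_TIMERS 4

struct timer {
    int    active;
    time_t deadline;    /* absolute expiry time */
    int    rpc_id;      /* which outstanding RPC this timer protects */
};

static struct timer timers[MAX_TIMERS];

/* One sweep: walk the whole table and fire anything that has expired. */
static void sweep(time_t now)
{
    for (int i = 0; i < MAX_TIMERS; i++) {
        if (timers[i].active && timers[i].deadline <= now) {
            printf("RPC %d timed out, retransmitting\n", timers[i].rpc_id);
            timers[i].deadline = now + 2;   /* re-arm with a fresh timeout */
        }
    }
}

int main(void)
{
    time_t now = time(NULL);
    timers[0] = (struct timer){ .active = 1, .deadline = now - 1, .rpc_id = 7 };
    timers[1] = (struct timer){ .active = 1, .deadline = now + 5, .rpc_id = 8 };

    sweep(now);    /* called periodically, e.g. from the clock handler */
    return 0;
}
```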

GROUP COMMUNICATION – OUT OF SYLLABUS


 A group is a collection of processes that act together in some system or user-specified way
 When a message is sent to the group itself, all members of the group receive it
 It is a form of one-to-many communication (one sender, many receivers)


Group communication implementation


 Multicasting
o Create a special network address to which multiple machines can listen (for example,
indicated by setting one of the high-order bits to 1)
o When a packet is sent to one of these addresses, it is automatically delivered to all
machines listening to the address (see the sketch after this list)
 Broadcasting
o packets containing a certain address (e.g., 0) are delivered to all machines
o software must check to see if the packet is intended for it else the packet is discarded
 Unicasting
o The sender transmits separate packets to each of the members of the group
o For a group with n members, n packets are required
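
For the multicasting case, this sketch shows a receiver joining an IP multicast group with the standard socket option IP_ADD_MEMBERSHIP, after which the kernel delivers every packet sent to that group address; the group address and port are arbitrary example values.

```c
#include <arpa/inet.h>
#include <netinet/in.h>
#include <stdio.h>
#include <string.h>
#include <sys/socket.h>
#include <unistd.h>

int main(void)
{
    int s = socket(AF_INET, SOCK_DGRAM, 0);
    if (s < 0) { perror("socket"); return 1; }

    /* Bind to the port the group communicates on. */
    struct sockaddr_in addr = {0};
    addr.sin_family = AF_INET;
    addr.sin_addr.s_addr = htonl(INADDR_ANY);
    addr.sin_port = htons(7000);                            /* example port */
    if (bind(s, (struct sockaddr *)&addr, sizeof addr) < 0) {
        perror("bind"); return 1;
    }

    /* Join the multicast group: from now on the kernel delivers group packets. */
    struct ip_mreq mreq;
    inet_pton(AF_INET, "239.1.2.3", &mreq.imr_multiaddr);   /* example group */
    mreq.imr_interface.s_addr = htonl(INADDR_ANY);
    if (setsockopt(s, IPPROTO_IP, IP_ADD_MEMBERSHIP, &mreq, sizeof mreq) < 0) {
        perror("IP_ADD_MEMBERSHIP"); return 1;
    }

    char buf[512];
    ssize_t n = recvfrom(s, buf, sizeof buf - 1, 0, NULL, NULL);
    if (n >= 0) { buf[n] = '\0'; printf("group message: %s\n", buf); }
    close(s);
    return 0;
}
```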

Design Issues in Group Communication


 Closed Groups versus Open Groups
 Peer Groups versus Hierarchical Groups
 Group Membership
 Group Addressing
 Send and Receive Primitives
 Atomicity
 Message Ordering
 Scalability

Closed Groups versus Open Groups


 Closed groups
o only the members of the group can send to the group.
o Outsiders cannot send messages to the group as a whole, although they may be able
to send messages to individual members.
o typically used for parallel processing
 Open groups
o any process in the system can send to any group.


Peer Groups versus Hierarchical Groups


 Peer groups
o all the processes are equal.
o No one is boss and all decisions are made collectively.
o Adv: symmetric and has no single point of failure
o Disadv: decision making is more complicated. To decide anything, a vote has to be
taken, incurring some delay and overhead
 Hierarchical groups
o some kind of hierarchy exists.
o one process is the coordinator and all the others are workers

Group Membership
 Some method is needed for creating and deleting groups, as well as for allowing processes to join and leave groups
 Managing group membership can be done in two ways
 Group server
o The group server can then maintain a complete data base of all the groups and their
exact membership.
o This method is straightforward, efficient, and easy to implement
o If the group server crashes, group management ceases to exist
 Distributed
o an outsider can send a message to all group members announcing its presence.
o To leave a group, a member just sends a goodbye message to everyone.
o When a member crashes, it leaves the group without a polite announcement, and the other members have to discover this
o Leaving and joining have to be synchronized with the messages being sent
o If so many machines go down that the group can no longer function at all, some protocol is needed to rebuild the group

Group Addressing
Three ways of addressing

 give each group a unique address


 require the sender to provide an explicit list of all destinations (e.g., IP addresses)
 predicate addressing
o each message is sent to all members of the group
o contains a predicate (Boolean expression) to be evaluated
o If the predicate evaluates to TRUE, the message is accepted.


o If it evaluates to FALSE, the message is discarded.

Send and Receive Primitives


 send, receive
 RPC
o the client sends one message to the server and gets back one answer.
 Group communication
o n different replies
 Buffered / unbuffered, blocking or non-blocking, reliable or not reliable
 group_send, group_receive

Atomicity
 when a message is sent to a group, it will either arrive correctly at all members of the group,
or at none of them.
 all-or-nothing delivery is called atomicity or atomic broadcast
 When any process sends a message to the group, it does not have to worry about what to do
if some of them do not get it.
 One way to ensure that every destination receives every message is to require each destination
to send back an acknowledgement upon message receipt.
 The sender starts out by sending a message to all members of the group.
 Timers are set and retransmissions sent where necessary.
 When a process receives a message, if it has not yet seen this particular message, it, too,
sends the message to all members of the group, resulting in a reliable delivery
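
A sketch of that rebroadcast-on-first-receipt rule, with group members simulated as array entries and no real network: each member remembers the message ids it has already seen and forwards a message to everyone only the first time it sees it.

```c
#include <stdio.h>

#define MEMBERS      4
#define MAX_MSG_IDS 16

/* seen[m][id] != 0 means member m has already processed message id. */
static int seen[MEMBERS][MAX_MSG_IDS];

static void deliver(int member, int msg_id);

/* Send message msg_id from one member to every other group member. */
static void send_to_group(int sender, int msg_id)
{
    for (int m = 0; m < MEMBERS; m++)
        if (m != sender)
            deliver(m, msg_id);
}

/* On first receipt, a member re-sends the message to the whole group,
 * so the message survives even if the original sender crashes mid-send. */
static void deliver(int member, int msg_id)
{
    if (seen[member][msg_id])
        return;                       /* duplicate: already handled */
    seen[member][msg_id] = 1;
    printf("member %d got message %d\n", member, msg_id);
    send_to_group(member, msg_id);    /* forward to everyone else */
}

int main(void)
{
    seen[0][5] = 1;          /* member 0 originates message 5 */
    send_to_group(0, 5);
    return 0;
}
```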

Message Ordering


 global time ordering


o delivers all messages in the exact order in which they were sent
 consistent time ordering
o if two messages, say A and B, are sent close together in time, the system picks one of
them as being "first" and delivers it to all group members, followed by the other.

Overlapping Groups
although there is a global time ordering within each group, there is not necessarily any
coordination among multiple groups

Scalability

 Many algorithms work fine as long as all the groups only have a few members, but what
happens when there are tens, hundreds, or even thousands of members per group?
 Or thousands of groups? Also, what happens when the system is so large that it no longer fits
on a single LAN, so multiple LANs and gateways are required?
 And what happens when the groups are spread over several continents?

Comments & Feedback

Thanks to my family members who supported me while I spent hours and hours to prepare this.
Your feedback is welcome at [email protected]
