
CS 134: Operating Systems
Multiprocessing

2013-05-17
1 / 13
Overview

Multiprocessing Designs
OS Implications
Programming Models
Other Issues
2 / 13
Multiprocessing Designs
SIMD and MIMD

Multiple CPUs come in several flavors:

SIMD: Single Instruction, Multiple Data
- Also called a vector processor
- Sample instruction: a[i] = b[i] + c[i] for i in a small range (e.g., 0-3)
- Canonical example: GPUs

MIMD: Multiple Instruction, Multiple Data
- I.e., 2 or more (semi-)independent CPUs

We won't talk further about SIMD; from an OS point of view it's just another CPU.
3 / 13
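As a concrete illustration of the sample instruction above, here is a minimal sketch of the same a[i] = b[i] + c[i] computation written with x86 SSE intrinsics; the function name and the four-element size are just for illustration, and a compiler with SSE support is assumed.

    #include <xmmintrin.h>   /* SSE intrinsics */

    /* a[i] = b[i] + c[i] for i in 0-3: one SIMD add does four scalar adds. */
    void add4(float a[4], const float b[4], const float c[4])
    {
        __m128 vb = _mm_loadu_ps(b);     /* load b[0..3] into one 128-bit register */
        __m128 vc = _mm_loadu_ps(c);     /* load c[0..3] */
        __m128 va = _mm_add_ps(vb, vc);  /* a single ADDPS instruction */
        _mm_storeu_ps(a, va);            /* store the four sums */
    }

From the OS's perspective nothing special happens here: the vector registers are just more per-CPU state to save and restore on a context switch.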
Multiprocessing Designs
MIMD Approaches

MIMD can be:
- Several chips or cores, (semi-)private memories, able to access each other's memory (NUMA: Non-Uniform Memory Access)
- Several chips or cores, one memory (SMP: Symmetric Multiprocessing)
- Several boxes (possibly each SMP or NUMA) connected by a network (distributed system)
4 / 13
OS Implications
NUMA Issues

NUMA means processes access local memory faster
⇒ Allocate process memory on the local CPU
⇒ Processes should have "CPU affinity"
5 / 13
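On Linux, one way an application can act on the first arrow is libnuma; the sketch below is an assumption about typical use (a 1 MiB buffer, linking with -lnuma), not something from the lecture.

    #include <numa.h>    /* libnuma: link with -lnuma */
    #include <stdio.h>

    int main(void)
    {
        if (numa_available() < 0) {
            fprintf(stderr, "NUMA not supported on this system\n");
            return 1;
        }

        /* Allocate on the node the calling thread is currently running on,
         * so later accesses are local (fast) rather than remote (slow). */
        size_t len = 1 << 20;
        void *buf = numa_alloc_local(len);
        if (buf == NULL)
            return 1;

        /* ... use buf ... */

        numa_free(buf, len);
        return 0;
    }

The second arrow, CPU affinity, is usually handled with sched_setaffinity or pthread_setaffinity_np; see the sketch under SMP Scheduling below.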
OS Implications
SMP Issues

SMPs still have caches

This introduces cache coherency problems:
- Processor 0 uses compare-and-swap to set a lock nonzero
- The write goes into its local cache for speed
- Processor 1 reads the lock from its own cache and sees it's still zero...

Cure: hardware coherency guarantees
...but spinlocks now have super-high costs
- May be better to do a thread switch

A thread switch is high cost, but may be cheaper than a spinlock.
6 / 13
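To make the coherency cost concrete, here is a minimal test-and-test-and-set spinlock sketch in C11 atomics; the type and function names are invented for illustration. Every failed compare-and-swap bounces the lock's cache line between processors, so the inner loop waits with plain reads that can be satisfied from the local cache.

    #include <stdatomic.h>

    typedef struct { atomic_int locked; } spinlock_t;   /* 0 = free, 1 = held */

    static void spin_lock(spinlock_t *l)
    {
        for (;;) {
            int expected = 0;
            /* Compare-and-swap: set the lock to 1 only if it is currently 0. */
            if (atomic_compare_exchange_weak(&l->locked, &expected, 1))
                return;                       /* got it */

            /* Lock is held: wait with ordinary loads so we mostly hit our own
             * cached copy instead of generating more coherency traffic.
             * A kernel might decide here that a thread switch is cheaper. */
            while (atomic_load(&l->locked) != 0)
                ;
        }
    }

    static void spin_unlock(spinlock_t *l)
    {
        atomic_store(&l->locked, 0);   /* the write invalidates other caches */
    }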
OS Implications
SMP Scheduling

Threads are often related
- Schedule independently or together?
- Completely independent: job completion time is set by the slowest thread
- Together: some CPUs may be wasted waiting for events
- Always good to keep thread x on the same CPU (because its cache is already filled)
7 / 13
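Keeping "thread x on the same CPU" is exactly what a per-thread affinity mask expresses. A minimal sketch using the Linux-specific pthread_setaffinity_np (the helper name and the choice of pinning the calling thread are illustrative):

    #define _GNU_SOURCE
    #include <pthread.h>
    #include <sched.h>

    /* Pin the calling thread to one CPU so the scheduler keeps it where its
     * cache is already filled.  Returns 0 on success. */
    int pin_self_to_cpu(int cpu)
    {
        cpu_set_t mask;
        CPU_ZERO(&mask);
        CPU_SET(cpu, &mask);
        return pthread_setaffinity_np(pthread_self(), sizeof(mask), &mask);
    }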
OS Implications
Distributed Systems

Many ways to communicate

Most important modern approach is... the Internet!

Communicating with skinny wires introduces new problems:
- Can't move a process to another machine (or must work hard)
- Locking becomes really hard
- Programming multiprocessor systems is much harder
- ...and what if the network connection goes down?
8 / 13
Programming Models
RPC

Programming is hard, so we need abstractions that simplify things

Remote Procedure Call (RPC) makes a distant system look like a normal function:
1. Marshal arguments (i.e., pack up and serialize)
2. Send procedure ID and arguments to the remote system
3. Wait for the response
4. Deserialize the return value

Class Exercise
What are the advantages and disadvantages?
9 / 13
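A toy client-side stub makes the four steps concrete. This is only a sketch under assumptions: the procedure ID, the wire format (three 32-bit words), and the helpers send_all/recv_all are invented, and error handling is omitted.

    #include <stdint.h>
    #include <stddef.h>
    #include <arpa/inet.h>   /* htonl / ntohl */

    /* Assumed helpers (not shown): send or receive exactly len bytes. */
    int send_all(int sock, const void *buf, size_t len);
    int recv_all(int sock, void *buf, size_t len);

    #define PROC_ADD 1       /* made-up procedure ID */

    /* To the caller this looks like a normal function; the work happens remotely. */
    int32_t remote_add(int sock, int32_t a, int32_t b)
    {
        /* 1. Marshal: pack procedure ID and arguments in network byte order. */
        uint32_t msg[3] = { htonl(PROC_ADD), htonl((uint32_t)a), htonl((uint32_t)b) };

        /* 2. Send procedure ID and arguments to the remote system. */
        send_all(sock, msg, sizeof msg);

        /* 3. Wait for the response. */
        uint32_t reply;
        recv_all(sock, &reply, sizeof reply);

        /* 4. Deserialize the return value. */
        return (int32_t)ntohl(reply);
    }

The server side is the mirror image: read the request, dispatch on the procedure ID, compute, and marshal the result back.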
Programming Models
DSM

RPC is nice, but limits parallelism
SMPs can do cool things because memory is shared

So why not simulate shared memory across the network?

Teeny problem: hard to make it work fast*

* "Hard" is a gross understatement.
10 / 13
Other Issues
Load Balancing

Suppose you have servers A, B, C, and D
A and B are currently overloaded, C and D underloaded
A notices the situation and sends excess work to C and D
Simultaneously, B does the same! Now C and D are overloaded

Result can be thrashing

Common solution: have one front-end machine whose sole job is allocating load to others

Random assignment works surprisingly well.
11 / 13
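A minimal sketch of the front-end idea, using the "random assignment" policy mentioned in the notes; the backend names and the pick_backend helper are invented for illustration.

    #include <stdio.h>
    #include <stdlib.h>
    #include <time.h>

    static const char *backends[] = { "A", "B", "C", "D" };
    enum { NBACKENDS = 4 };

    /* Front-end policy: pick a backend uniformly at random.  Because no server
     * reacts to load on its own, the thrashing scenario above (A and B both
     * dumping work on C and D at once) cannot arise. */
    static const char *pick_backend(void)
    {
        return backends[rand() % NBACKENDS];
    }

    int main(void)
    {
        srand((unsigned)time(NULL));
        for (int request = 0; request < 8; request++)
            printf("request %d -> server %s\n", request, pick_backend());
        return 0;
    }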
Other Issues
How Does Google Work?

Well, it's a secret...

But basically they use the front-end approach

Obvious problem: one front end can't handle millions of requests per second even if it does almost nothing

Solution: DNS Round Robin tricks you into picking one of many dozens of front ends (roughly at random) to talk to
12 / 13
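From a client's point of view, DNS round robin simply means one name resolves to several addresses, and different lookups see them in different orders. A minimal sketch of such a lookup on a POSIX system (IPv4 only, minimal error handling):

    #include <stdio.h>
    #include <string.h>
    #include <netdb.h>
    #include <arpa/inet.h>

    int main(void)
    {
        struct addrinfo hints, *res, *p;
        memset(&hints, 0, sizeof hints);
        hints.ai_family   = AF_INET;       /* A records only */
        hints.ai_socktype = SOCK_STREAM;

        if (getaddrinfo("www.google.com", "80", &hints, &res) != 0)
            return 1;

        /* Walk the list of front-end addresses the resolver handed back;
         * a client normally just connects to the first one. */
        for (p = res; p != NULL; p = p->ai_next) {
            char buf[INET_ADDRSTRLEN];
            struct sockaddr_in *sa = (struct sockaddr_in *)p->ai_addr;
            inet_ntop(AF_INET, &sa->sin_addr, buf, sizeof buf);
            printf("www.google.com has address %s\n", buf);
        }
        freeaddrinfo(res);
        return 0;
    }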
Other Issues
Example of Google's DNS tricks

These commands were run within 15 seconds of each other:

bow:2:877> host www.google.com
www.google.com has address 74.125.224.241
www.google.com has address 74.125.224.242
www.google.com has address 74.125.224.243
www.google.com has address 74.125.224.244
www.google.com has address 74.125.224.240

bow:2:878> ssh lever.cs.ucla.edu host www.google.com
www.google.com has address 74.125.239.19
www.google.com has address 74.125.239.20
www.google.com has address 74.125.239.17
www.google.com has address 74.125.239.18
www.google.com has address 74.125.239.16
13 / 13
