0% found this document useful (0 votes)

65 views

Lecture12 PDF

This document summarizes key aspects of cache coherence protocols discussed in the lecture, including: 1) It describes the MSI protocol and its states (Modified, Shared, Invalid), state transitions, and bus transactions used for coherence. 2) It compares invalidation-based protocols, where writes invalidate other caches, versus update-based protocols, where writes update other caches. Invalidation is typically used due to lower overhead. 3) It introduces the MESI protocol which adds an Exclusive state to optimize transitions between Invalid and Modified states, reducing bus transactions in some cases. 4) It discusses issues that can arise with coherence protocols like determining responsibility for flushing a cache line and handling different sharing patterns

Uploaded by

kalyan

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

65 views

Lecture12 PDF

Uploaded by

kalyan

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Objectives_template

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Lecture 12: Cache Coherence Protocols

The Lecture Contains:

Stores

Invalidation vs. Update

Which One is Better?

MSI Protocol

State Transition

M to S, or M to I?

MSI Example

MESI Protocol

MESI Example

MOESI Protocol

MOSI Protocol

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_1.htm[6/14/2012 11:57:33 AM]

Objectives_template

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Lecture 12: Cache Coherence Protocols

Stores

Look at stores a little more closely

There are three situations at the time a store issues: the line is not in the cache, the
line is in the cache in S state, the line is in the cache in one of M, E and O states
If the line is in I state, the store generates a read-exclusive request on the bus and gets
the line in M state
If the line is in S or O state, that means the processor only has read permission for that
line; the store generates an upgrade request on the bus and the upgrade
acknowledgment gives it the write permission (this is a data-less transaction)
If the line is in M or E state, no bus transaction is generated; the cache already has
write permission for the line (this is the case of a write hit; previous two are write
misses)

Invalidation vs. Update

Two main classes of protocols:

Invalidation-based and update-based
Dictates what action should be taken on a write
Invalidation-based protocols invalidate sharers when a write miss (upgrade or readX )
appears on the bus
Update-based protocols update the sharer caches with new value on a write: requires
write transactions (carrying just the modified bytes) on the bus even on write hits (not
very attractive with writeback caches)
Advantage of update-based protocols: sharers continue to hit in the cache while in
invalidation-based protocols sharers will miss next time they try to access the line
Advantage of invalidation-based protocols: only write misses go on bus (suited for
writeback caches) and subsequent stores to the same line are cache hits

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_2.htm[6/14/2012 11:57:33 AM]

Objectives_template

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Lecture 12: Cache Coherence Protocols

Which One is Better?

Difficult to answer
Depends on program behavior and hardware cost
When is update-based protocol good?
What sharing pattern? (large-scale producer/consumer)
Otherwise it would just waste bus bandwidth doing useless updates
When is invalidation-protocol good?
Sequence of multiple writes to a cache line
Saves intermediate write transactions
Also think about the overhead of initiating small updates for every write in update protocols
Invalidation-based protocols are much more popular
Some systems support both or maybe some hybrid based on dynamic sharing pattern of
a cache line

MSI Protocol

Forms the foundation of invalidation-based writeback protocols

Assumes only three supported cache line states: I, S, and M
There may be multiple processors caching a line in S state
There must be exactly one processor caching a line in M state and it is the owner of the
line
If none of the caches have the line, memory must have the most up-to-date copy of the
line
Processor requests to cache: PrRd , PrWr
Bus transactions: BusRd , BusRdX , BusUpgr , BusWB

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_3.htm[6/14/2012 11:57:33 AM]

Objectives_template

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Lecture 12: Cache Coherence Protocols

State Transition

MSI Protocol

Few things to note

Flush operation essentially launches the line on the bus
Processor with the cache line in M state is responsible for flushing the line on bus
whenever there is a BusRd or BusRdX transaction generated by some other processor
On BusRd the line transitions from M to S, but not M to I. Why? Also at this point both
the requester and memory pick up the line from the bus; the requester puts the line in
its cache in S state while memory writes the line back. Why does memory need to write
back?
On BusRdX the line transitions from M to I and this time memory does not need to pick
up the line from bus. Only the requester picks up the line and puts it in M state in its
cache. Why?

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_4.htm[6/14/2012 11:57:33 AM]

Objectives_template

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Lecture 12: Cache Coherence Protocols

M to S, or M to I?

BusRd takes a cache line in M state to S state

The assumption here is that the processor will read it soon, so save a cache miss by
going to S
May not be good if the sharing pattern is migratory : P0 reads and writes
cache line A, then P1 reads and writes cache line A, then P2…
For migratory patterns it makes sense to go to I state so that a future invalidation is
saved
But for bus-based SMPs it does not matter much because an upgrade transaction will
be launched anyway by the next writer, unless there is special hardware support to
avoid that: how?
The big problem is that the sharing pattern for a cache line may change dynamically:
adaptive protocols are good and are supported by Sequent Symmetry and MIT Alewife

MSI Example

Take the following example

P0 reads x, P1 reads x, P1 writes x, P0 reads x, P2 reads x, P3 writes x
Assume the state of the cache line containing the address of x is I in all processors

P0 generates BusRd , memory provides line, P0 puts line in S state

P1 generates BusRd , memory provides line, P1 puts line in S state
P1 generates BusUpgr , P0 snoops and invalidates line, memory does not respond, P1 sets
state of line to M
P0 generates BusRd , P1 flushes line and goes to S state, P0 puts line in S state, memory
writes back
P2 generates BusRd , memory provides line, P2 puts line in S state
P3 generates BusRdX , P0, P1, P2 snoop and invalidate, memory provides line, P3 puts line in
cache in M state

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_5.htm[6/14/2012 11:57:33 AM]

Objectives_template

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Lecture 12: Cache Coherence Protocols

MESI Protocol

The most popular invalidation-based protocol e.g., appears in Intel Xeon MP

Why need E state?
The MSI protocol requires two transactions to go from I to M even if there is no
intervening requests for the line: BusRd followed by BusUpgr
We can save one transaction by having memory controller respond to the first BusRd
with E state if there is no other sharer in the system
How to know if there is no other sharer? Needs a dedicated control wire that gets
asserted by a sharer (wired OR)
Processor can write to a line in E state silently and take it to M state

State Transition

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_6.htm[6/14/2012 11:57:33 AM]

Objectives_template

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Lecture 12: Cache Coherence Protocols

MESI Protocol

If a cache line is in M state definitely the processor with the line is responsible for flushing it
on the next BusRd or BusRdX transaction
If a line is not in M state who is responsible?
Memory or other caches in S or E state?
Original Illinois MESI protocol assumed cache-to-cache transfer i.e. any processor in E
or S state is responsible for flushing the line
However, it requires some expensive hardware, namely, if multiple processors are
caching the line in S state who flushes it? Also, memory needs to wait to know if it
should source the line
Without cache-to-cache sharing memory always sources the line unless it is in M state

MESI Example

Take the following example

P0 reads x, P0 writes x, P1 reads x, P1 writes x, …

P0 generates BusRd , memory provides line, P0 puts line in cache in E state

P0 does write silently, goes to M state
P1 generates BusRd , P0 provides line, P1 puts line in cache in S state, P0 transitions
to S state
Rest is identical to MSI

Consider this example: P0 reads x, P1 reads x, …

P0 generates BusRd , memory provides line, P0 puts line in cache in E state

P1 generates BusRd , memory provides line, P1 puts line in cache in S state, P0
transitions to S state (no cache-to-cache sharing)
Rest is same as MSI

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_7.htm[6/14/2012 11:57:33 AM]

Objectives_template

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Lecture 12: Cache Coherence Protocols

MOESI Protocol

Some SMPs implement MOESI today e.g., AMD Athlon MP and the IBM servers
Why is the O state needed?
O state is very similar to E state with four differences: 1. If a cache line is in O state in
some cache, that cache is responsible for sourcing the line to the next requester; 2. The
memory may not have the most up-to-date copy of the line (this implies 1); 3. Eviction
of a line in O state generates a BusWB ; 4. Write to a line in O state must generate a
bus transaction
When a line transitions from M to S it is necessary to write the line back to memory
For a migratory sharing pattern (frequent in database workloads) this leads to a series
of writebacks to memory
These writebacks just keep the memory banks busy and consumes memory bandwidth
Take the following example
P0 reads x, P0 writes x, P1 reads x, P1 writes x, P2 reads x, P2 writes x, …
Thus at the time of a BusRd response the memory will write the line back: one
writeback per processor handover
O state aims at eliminating all these writebacks by transitioning from M to O instead of
M to S on a BusRd /Flush
Subsequent BusRd requests are replied by the owner holding the line in O state
The line is written back only when the owner evicts it: one single writeback

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_8.htm[6/14/2012 11:57:33 AM]

Objectives_template

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Lecture 12: Cache Coherence Protocols

MOESI Protocol

State transitions pertaining to O state

I to O: not possible (or maybe; see below)
E to O or S to O: not possible
M to O: on a BusRd /Flush (but no memory writeback )
O to I: on CacheEvict / BusWB or { BusRdX,BusUpgr }/Flush
O to S: not possible (or maybe; next slide)
O to E: not possible (or maybe if silent eviction not allowed) '
O to M: on PrWr / BusUpgr
At most one cache can have a line in O state at any point in time
Two main design choices for MOESI
Consider the example P0 reads x, P0 writes x, P1 reads x, P2 reads x, P3 reads x, …
When P1 launches BusRd , P0 sources the line and now the protocol has two options:
1. The line in P0 goes to O and the line in P1 is filled in state S; 2. The line in P0 goes
to S and the line in P1 is filled in state O i.e. P1 inherits ownership from P0
For bus-based SMPs the two choices will yield roughly the same performance
For DSM multiprocessors we will revisit this issue if time permits
According to the second choice, when P2 generates a BusRd request, P1 sources the
line and transitions from O to S; P2 becomes the new owner

MOSI Protocol

Some SMPs do not support the E state

In many cases it is not helpful, only complicates the protocol
MOSI allows a compact state encoding in 2 bits
Sun WildFire uses MOSI protocol

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_9.htm[6/14/2012 11:57:34 AM]

PCIe Training PDF
83% (6)
PCIe Training PDF
133 pages
Physics2A CheatSheet
No ratings yet
Physics2A CheatSheet
1 page
Consistency vs. Coherence: Example: Two Processors Are Synchronizing On A Variable Called
No ratings yet
Consistency vs. Coherence: Example: Two Processors Are Synchronizing On A Variable Called
12 pages
Shared Memory Architecture Concepts and Performance Issues: Outline
No ratings yet
Shared Memory Architecture Concepts and Performance Issues: Outline
7 pages
Simulation of A Split Transaction Bus: Abstract
No ratings yet
Simulation of A Split Transaction Bus: Abstract
9 pages
The Lecture Contains:: Lecture 15: Memory Consistency Models and Case Studies of Multi-Core
No ratings yet
The Lecture Contains:: Lecture 15: Memory Consistency Models and Case Studies of Multi-Core
9 pages
Cache Coherence: Write-Invalidate Snooping Protocol For Write-Back
No ratings yet
Cache Coherence: Write-Invalidate Snooping Protocol For Write-Back
21 pages
Pci
No ratings yet
Pci
6 pages
Cache Coherency
No ratings yet
Cache Coherency
19 pages
Cache Coherence Protocols: Evaluation Using A Multiprocessor Simulation Model
No ratings yet
Cache Coherence Protocols: Evaluation Using A Multiprocessor Simulation Model
26 pages
18bce2429 Da 2 Cao
No ratings yet
18bce2429 Da 2 Cao
13 pages
VLSI Design: Memory and Data Path
No ratings yet
VLSI Design: Memory and Data Path
22 pages
VII. Cache Coherence. Interconnection Networks (1) : March 16, 2009
No ratings yet
VII. Cache Coherence. Interconnection Networks (1) : March 16, 2009
42 pages
Cache Coherence: From Wikipedia, The Free Encyclopedia
No ratings yet
Cache Coherence: From Wikipedia, The Free Encyclopedia
8 pages
Computer Architecture Assignment 3 (ARCH)
No ratings yet
Computer Architecture Assignment 3 (ARCH)
9 pages
Data Rather Than The Old Data That Was Replicated Into Their Caches. Memory Coherence: A Read Shall Return The Value of The Latest Write As
No ratings yet
Data Rather Than The Old Data That Was Replicated Into Their Caches. Memory Coherence: A Read Shall Return The Value of The Latest Write As
5 pages
Assignment 1
No ratings yet
Assignment 1
4 pages
Whoops!: A Clustered Web Cache For DSM Systems Using Memory Mapped Networks
No ratings yet
Whoops!: A Clustered Web Cache For DSM Systems Using Memory Mapped Networks
6 pages
Multiprocessor Architectures and Programming
No ratings yet
Multiprocessor Architectures and Programming
89 pages
MB - Master - User Guide PDF
No ratings yet
MB - Master - User Guide PDF
21 pages
Exploiting Loop-Level Parallelism For Simd Arrays Using: Openmp
No ratings yet
Exploiting Loop-Level Parallelism For Simd Arrays Using: Openmp
12 pages
Abstract
No ratings yet
Abstract
23 pages
Multiprocessor Simulators LV
No ratings yet
Multiprocessor Simulators LV
18 pages
Modbus Tutorial For NI
No ratings yet
Modbus Tutorial For NI
6 pages
Distributed Shared Memory
No ratings yet
Distributed Shared Memory
23 pages
Qos Criteria in Ieee 802.16 Collision Resolution Protocol: Abdelillah Karouit and Abdelkrim Haqiq Luis Orozco Barbosa
No ratings yet
Qos Criteria in Ieee 802.16 Collision Resolution Protocol: Abdelillah Karouit and Abdelkrim Haqiq Luis Orozco Barbosa
7 pages
Muge - Snoop Based Multiprocessor Design
No ratings yet
Muge - Snoop Based Multiprocessor Design
32 pages
Unit I_CDA_DrManojY_to students
No ratings yet
Unit I_CDA_DrManojY_to students
131 pages
Ieee Wsi95
No ratings yet
Ieee Wsi95
10 pages
Using Modbus With Mach3
No ratings yet
Using Modbus With Mach3
14 pages
MB - Master - User Guide
No ratings yet
MB - Master - User Guide
16 pages
Brain Dump JNO-360: Sections
No ratings yet
Brain Dump JNO-360: Sections
260 pages
Shared-Memory Multiprocessors - Symmetric Multiprocessing Hardware
No ratings yet
Shared-Memory Multiprocessors - Symmetric Multiprocessing Hardware
7 pages
Multiprocessor Architecture and Programming
No ratings yet
Multiprocessor Architecture and Programming
20 pages
NI Modbus PDF
No ratings yet
NI Modbus PDF
6 pages
OSB SOA Admin Questions
No ratings yet
OSB SOA Admin Questions
4 pages
Multi Core
No ratings yet
Multi Core
7 pages
Parallel Architecture
No ratings yet
Parallel Architecture
33 pages
Content Beyond Syllabus PDF
No ratings yet
Content Beyond Syllabus PDF
7 pages
Multi Processors and Thread Level Parallelism
No ratings yet
Multi Processors and Thread Level Parallelism
74 pages
Introduction and Overview of Pentium Series
No ratings yet
Introduction and Overview of Pentium Series
3 pages
The Lecture Contains:: Module 9: Addendum To Module 6: Shared Memory Multiprocessors
No ratings yet
The Lecture Contains:: Module 9: Addendum To Module 6: Shared Memory Multiprocessors
8 pages
Introduction To Modbus - National Instruments
No ratings yet
Introduction To Modbus - National Instruments
7 pages
Cosmic Cube Inspired by Distributed Computing
No ratings yet
Cosmic Cube Inspired by Distributed Computing
18 pages
2021-MESI Protocol For Multicore Processors Based On FPGA
No ratings yet
2021-MESI Protocol For Multicore Processors Based On FPGA
10 pages
The Tera Computer System
No ratings yet
The Tera Computer System
6 pages
Notes 02
No ratings yet
Notes 02
9 pages
3217
No ratings yet
3217
11 pages
MESI Protocol
No ratings yet
MESI Protocol
9 pages
Provably Good Multicore Cache Performance For Divide-and-Conquer Algorithms
No ratings yet
Provably Good Multicore Cache Performance For Divide-and-Conquer Algorithms
10 pages
ECE657
No ratings yet
ECE657
15 pages
Introduction To Modbus - National Instruments
No ratings yet
Introduction To Modbus - National Instruments
6 pages
Spanning Tree Protocols: STP, RSTP, and MSTP: Feature Overview and Configuration Guide
No ratings yet
Spanning Tree Protocols: STP, RSTP, and MSTP: Feature Overview and Configuration Guide
21 pages
Tesla Project Presentation
No ratings yet
Tesla Project Presentation
20 pages
Taxonomy of Parallel Computing Paradigms
No ratings yet
Taxonomy of Parallel Computing Paradigms
9 pages
Moving From Amba Ahb To Axi Bus in Soc Designs: A Comparative Study
No ratings yet
Moving From Amba Ahb To Axi Bus in Soc Designs: A Comparative Study
4 pages
CSCI 8150 Advanced Computer Architecture
100% (2)
CSCI 8150 Advanced Computer Architecture
46 pages
load-balancing-in-data-center
No ratings yet
load-balancing-in-data-center
10 pages
Concurrency and Multithreading in C: POSIX Threads and Synchronization
From Everand
Concurrency and Multithreading in C: POSIX Threads and Synchronization
Larry Jones
No ratings yet
Mastering Concurrency and Multithreading in C++: Unlock the Secrets of Expert-Level Skills
From Everand
Mastering Concurrency and Multithreading in C++: Unlock the Secrets of Expert-Level Skills
Larry Jones
No ratings yet
Storage Area Networks For Dummies
From Everand
Storage Area Networks For Dummies
Christopher Poelker
3.5/5 (2)
6 Creative Strategy
No ratings yet
6 Creative Strategy
13 pages
Freight Charges or Discount Mandatory For Particular Vendor. - SAP Blogs
No ratings yet
Freight Charges or Discount Mandatory For Particular Vendor. - SAP Blogs
15 pages
Projection in ED
No ratings yet
Projection in ED
10 pages
Application Programming Interface For Windows: Standard ECMA-234
No ratings yet
Application Programming Interface For Windows: Standard ECMA-234
190 pages
Physics and Robotics
No ratings yet
Physics and Robotics
9 pages
Focus On 2.5.3 PDF
No ratings yet
Focus On 2.5.3 PDF
10 pages
Systems Theory of Business Intelligence
100% (7)
Systems Theory of Business Intelligence
5 pages
BS en ISO 4375 - 2014 - Hydrometry
No ratings yet
BS en ISO 4375 - 2014 - Hydrometry
2 pages
Global Green Dictatorship
No ratings yet
Global Green Dictatorship
32 pages
Lesson 4 Numeros Fechas, Hora, Años PDF
No ratings yet
Lesson 4 Numeros Fechas, Hora, Años PDF
15 pages
Colonialism Middle East PDF
No ratings yet
Colonialism Middle East PDF
2 pages
101 Word Transformation Sentences 1
100% (1)
101 Word Transformation Sentences 1
12 pages
Cooling by Underground Earth Tubes
No ratings yet
Cooling by Underground Earth Tubes
4 pages
R Maps
No ratings yet
R Maps
36 pages
Present and Future in The Use of micro-CT Scanner 3D Analysis For The Study of Dental and Root Canal Morphology
No ratings yet
Present and Future in The Use of micro-CT Scanner 3D Analysis For The Study of Dental and Root Canal Morphology
9 pages
CPT - A Synthesis of Highway Practice (NCHRP, 2007)
100% (2)
CPT - A Synthesis of Highway Practice (NCHRP, 2007)
126 pages
PURSUIT Newsletter No. 71, Third Quarter 1985 - Ivan T. Sanderson
100% (1)
PURSUIT Newsletter No. 71, Third Quarter 1985 - Ivan T. Sanderson
52 pages
R1. Theoretical Framework Self-Assessment Rubric For The First Draft
No ratings yet
R1. Theoretical Framework Self-Assessment Rubric For The First Draft
3 pages
Mitakshara and Dayabhaga Schools of Hind
No ratings yet
Mitakshara and Dayabhaga Schools of Hind
4 pages
T test
No ratings yet
T test
17 pages
Algebra. Equations. Solving Linear Equations C PDF
No ratings yet
Algebra. Equations. Solving Linear Equations C PDF
1 page
Subject - Verb - Agreement ALL TCS Questions
No ratings yet
Subject - Verb - Agreement ALL TCS Questions
16 pages
Everything You Wish To Know About Memristors But Are Afraid To Ask
No ratings yet
Everything You Wish To Know About Memristors But Are Afraid To Ask
50 pages
Add Maths Project Work 2020 Question
No ratings yet
Add Maths Project Work 2020 Question
4 pages
Heterogeneous Wireless Access in Large Mesh Networks
No ratings yet
Heterogeneous Wireless Access in Large Mesh Networks
10 pages
Family Case Study-Abstract
No ratings yet
Family Case Study-Abstract
3 pages
Judgement
100% (1)
Judgement
119 pages
SSPC - Pa 2
No ratings yet
SSPC - Pa 2
11 pages
Java Ring
100% (1)
Java Ring
31 pages

Lecture12 PDF

Uploaded by

Lecture12 PDF

Uploaded by

Objectives_template

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

The Lecture Contains:

Invalidation vs. Update

Which One is Better?

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_1.htm[6/14/2012 11:57:33 AM]

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Look at stores a little more closely

Invalidation vs. Update

Two main classes of protocols:

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_2.htm[6/14/2012 11:57:33 AM]

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Which One is Better?

Forms the foundation of invalidation-based writeback protocols

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_3.htm[6/14/2012 11:57:33 AM]

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Few things to note

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_4.htm[6/14/2012 11:57:33 AM]

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

BusRd takes a cache line in M state to S state

Take the following example

P0 generates BusRd , memory provides line, P0 puts line in S state

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_5.htm[6/14/2012 11:57:33 AM]

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

The most popular invalidation-based protocol e.g., appears in Intel Xeon MP

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_6.htm[6/14/2012 11:57:33 AM]

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

Take the following example

P0 generates BusRd , memory provides line, P0 puts line in cache in E state

Consider this example: P0 reads x, P1 reads x, …

P0 generates BusRd , memory provides line, P0 puts line in cache in E state

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_7.htm[6/14/2012 11:57:33 AM]

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_8.htm[6/14/2012 11:57:33 AM]

Module 6: Shared Memory Multiprocessors: Consistency and Coherence

State transitions pertaining to O state

Some SMPs do not support the E state

file:///D|/...audhary,%20Dr.%20Sanjeev%20K%20Aggrwal%20&%20Dr.%20Rajat%20Moona/Multi-core_Architecture/lecture12/12_9.htm[6/14/2012 11:57:34 AM]

You might also like