0% found this document useful (0 votes)

12 views

Lec 4b

Uploaded by

medo.losy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views

Lec 4b

Uploaded by

medo.losy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

COMPUTER ORGANIZATION AND DESIGN

5th
Edition
The Hardware/Software Interface

Chapter 5
Large and Fast:
Exploiting Memory
Hierarchy (cont.)
Caching Example Block size = 16 bytes,
4 blocks in cache.
Request 0 164 83 192 10 90 175 673 168 59
(byte addr. in (00101
decimal) 0,0100)

Block addr. 000000 001010 000101 001100 000000 000101 001010 101010 001010 000011
(binary) 0 10 5 12 0 5 10 42 10 3

Index 00 10 01 00 00 01 10 10 10 11
(direct-map)
Cache 0000 0000 0000 0011 0000 0000 0000 0000 0000 0000
Set 0

Cache - - 0001 0001 0001 0001 0001 0001 0001 0001

Set 1
Cache - 0010 0010 0010 0010 0010 0010 1010 0010 0010
Set 2
Cache - - - - - - - - - 0000
Set 3
Hit/Miss M M M M M H H M M M

Miss type CM CM CM CM CF - - CM CF CM

Chapter 5 — Large and Fast: Exploiting Memory Hierarchy — 2

Block size = 16 bytes,
Caching Example 4 blocks in cache.

Request 0 164 83 192 10 90 175 673 168 59

(byte addr. in (00101
decimal) 0,0100)

Block addr. 000000 001010 000101 001100 000000 000101 001010 101010 001010 000011
(binary) 0 10 5 12 0 5 10 42 10 3

Index 0 0 1 0 0 1 0 0 0 1
(2-way cache)

Cache 00000 00101 00101 00110 00000 00000 00101 10101 00101 00101
Set 0 - 00000 00000 00101 00110 00110 00000 00101 10101 10101

Cache - - 00010 00010 00010 00010 00010 00010 00010 00001

Set 1 - - - - - - - 00010

Hit/Miss M M M M M H M M H M

Miss type CM CM CM CM CF - CF CM - CM

Chapter 5 — Large and Fast: Exploiting Memory Hierarchy — 3

number keys: Instructions

Radix sort
Quick (Instr/key)
800
Radix (Instr/key)
700

600

500

400

300

200 Quick
sort Instructions/key
100

0
1000 10000 100000 1000000 1E+07

Job size in keys

number keys: Instrs & Time

Radix sort
Quick (Instr/key)
800
Radix (Instr/key)
700 Quick (Clocks/key)
600 Radix (clocks/key)
Time
500

400

300
Quick
200
sort
100
Instructions

0
1000 10000 100000 1000000 1E+07

Job size in keys

number keys: Cache misses
5 Quick(miss/key)
Radix sort Radix(miss/key)
4

3
Cache misses
2

1
Quick
0 sort
1000 10000 100000 1000000 10000000

Job size in keys

Interactions with Software
 Misses depend on Inst./item

memory access
patterns
 Algorithm behavior Clock cycles/item
 Compiler

optimization for
memory access
Cache miss/item
More misses

Chapter 5 — Large and Fast: Exploiting Memory Hierarchy — 7

Naïve Matrix Multiply
Number of slow memory references on unblocked matrix
multiply
m = n3 to read each column of B n times
+ n2 to read each row of A once
+ n2 to read and write each element of C once
= n3 + 2n2

C(i,j) C(i,j) A(i,:)

B(:,j)
= + *

Lec20.8
Blocked Matrix Multiply
Consider A,B,C to be N-by-N matrices of b-by-b subblocks where
b=n / N is called the block size
for i = 1 to N
for j = 1 to N
{read block C(i,j) into fast memory}
for k = 1 to N
{read block A(i,k) into fast memory}
{read block B(k,j) into fast memory}
C(i,j) = C(i,j) + A(i,k) * B(k,j) {do a matrix multiply on
blocks} {write block C(i,j) back to slow memory}

C(i,j) C(i,j) A(i,k)

= + * B(k,j)

9
Lec20.9
Blocked Matrix Multiply
m is amount memory traffic between slow and fast memory
matrix has nxn elements, and NxN blocks each of size bxb

m = Nn2 B: N2 blocks of size b2 are read N times (N3 b2 = N3 * (n/N)2 = N*n2)

+ N*n2 A: same as B
+ n2 read and write each block of C once
= (2N + 1) * n2 =o(n3/b)

So we can improve performance by increasing the blocksize b

01/19/2012 CS267 - Lecture 2

Blocked Matrix Multiply
Only this portion in cache

Unoptimized Blocked

Chapter 5 — Large and Fast: Exploiting Memory Hierarchy — 11

Ansa Tutorials
No ratings yet
Ansa Tutorials
181 pages
Chapter 05
No ratings yet
Chapter 05
113 pages
Chapter 05 Computer Organization and Design, Fifth Edition: The Hardware/Software Interface (The Morgan Kaufmann Series in Computer Architecture and Design) 5th Edition
75% (4)
Chapter 05 Computer Organization and Design, Fifth Edition: The Hardware/Software Interface (The Morgan Kaufmann Series in Computer Architecture and Design) 5th Edition
105 pages
04 - Large and Fast Exploiting Memory Hierarchy
No ratings yet
04 - Large and Fast Exploiting Memory Hierarchy
92 pages
help2
No ratings yet
help2
102 pages
Lec 2
No ratings yet
Lec 2
26 pages
Chapter_05 9wY
No ratings yet
Chapter_05 9wY
136 pages
Large and Fast: Exploiting Memory Hierarchy
No ratings yet
Large and Fast: Exploiting Memory Hierarchy
24 pages
Chapter 5 Large and Fast Exploiting Memory Hierarchy
No ratings yet
Chapter 5 Large and Fast Exploiting Memory Hierarchy
101 pages
Large and Fast: Exploiting Memory Hierarchy: Omputer Rganization and Esign
No ratings yet
Large and Fast: Exploiting Memory Hierarchy: Omputer Rganization and Esign
87 pages
3. Lecture 19 Basics of Cache
No ratings yet
3. Lecture 19 Basics of Cache
23 pages
Lecture 9 - The Memory Hierarchy
No ratings yet
Lecture 9 - The Memory Hierarchy
25 pages
Chapter 5: Large and Fast Exploiting Memory Hierarchy Notes
No ratings yet
Chapter 5: Large and Fast Exploiting Memory Hierarchy Notes
16 pages
Large and Fast: Exploiting Memory Hierarchy: The Hardware/Software Interface
No ratings yet
Large and Fast: Exploiting Memory Hierarchy: The Hardware/Software Interface
33 pages
Chapter 5 Large and Fast Exploiting Memory Hierarchy
No ratings yet
Chapter 5 Large and Fast Exploiting Memory Hierarchy
96 pages
Chapter_05
No ratings yet
Chapter_05
52 pages
Chapter 3 Large and Fast
No ratings yet
Chapter 3 Large and Fast
86 pages
Lecture-17 CH-05 1
No ratings yet
Lecture-17 CH-05 1
21 pages
Chapter 05
No ratings yet
Chapter 05
105 pages
Memory
No ratings yet
Memory
12 pages
Supplemental Material On Cache From ECE-341 Memory
No ratings yet
Supplemental Material On Cache From ECE-341 Memory
79 pages
Chap 6
No ratings yet
Chap 6
48 pages
Large and Fast: Exploiting Memory Hierarchy: Computer Organization and Design
No ratings yet
Large and Fast: Exploiting Memory Hierarchy: Computer Organization and Design
107 pages
Week6 Memory Part2
No ratings yet
Week6 Memory Part2
23 pages
CH 4.ppt Type I
No ratings yet
CH 4.ppt Type I
60 pages
Chapter5 PDF
No ratings yet
Chapter5 PDF
95 pages
BiD 05
No ratings yet
BiD 05
97 pages
13_Large and Fast Exploiting Memory Hierarchy Final
No ratings yet
13_Large and Fast Exploiting Memory Hierarchy Final
118 pages
Computer Architecture: Memory Hierarchy Design
No ratings yet
Computer Architecture: Memory Hierarchy Design
60 pages
04_Cache Memory
No ratings yet
04_Cache Memory
61 pages
L - 3-AssociativeMapping - Virtual Memory
No ratings yet
L - 3-AssociativeMapping - Virtual Memory
52 pages
Large and Fast: Exploiting Memory Hierarchy
No ratings yet
Large and Fast: Exploiting Memory Hierarchy
48 pages
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
No ratings yet
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
53 pages
Unit 5 1 Cache Performance V 2
No ratings yet
Unit 5 1 Cache Performance V 2
29 pages
04 - Cache Memory PDF
No ratings yet
04 - Cache Memory PDF
71 pages
Computer Org and Arch: R.Magesh
No ratings yet
Computer Org and Arch: R.Magesh
48 pages
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
No ratings yet
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
53 pages
Comp Arch Lect5
No ratings yet
Comp Arch Lect5
26 pages
CAO - Lecutre7 Cache Memory
100% (1)
CAO - Lecutre7 Cache Memory
39 pages
Computer Organization and Architecture: Cache Memory
100% (1)
Computer Organization and Architecture: Cache Memory
57 pages
cache_memory
No ratings yet
cache_memory
51 pages
Cache Mapping
100% (1)
Cache Mapping
44 pages
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
51 pages
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
No ratings yet
Characteristics Location Capacity Unit of Transfer Access Method Performance Physical Type Physical Characteristics Organisation
53 pages
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 7th Edition Cache Memory
57 pages
Associative Mapping
No ratings yet
Associative Mapping
65 pages
William Stallings Computer Organization and Architecture 6th Edition Cache Memory
No ratings yet
William Stallings Computer Organization and Architecture 6th Edition Cache Memory
54 pages
CH10 - Memory Hierarchy
No ratings yet
CH10 - Memory Hierarchy
106 pages
ch5-1
No ratings yet
ch5-1
44 pages
Cache Organization (Direct Mapping)
No ratings yet
Cache Organization (Direct Mapping)
40 pages
Cache Memory
No ratings yet
Cache Memory
57 pages
CH05 COA11e
No ratings yet
CH05 COA11e
43 pages
Cache Memory-Direct Mapping
0% (1)
Cache Memory-Direct Mapping
30 pages
Class11 Cache
No ratings yet
Class11 Cache
41 pages
Cache Memory
No ratings yet
Cache Memory
61 pages
Cache + Associations Ch-4
No ratings yet
Cache + Associations Ch-4
52 pages
04 - Cache Memory
No ratings yet
04 - Cache Memory
79 pages
03-Chap4-Cache Memory Mapping
No ratings yet
03-Chap4-Cache Memory Mapping
24 pages
Computer Memory Organization: Elephants Don't Forget But Do Computers?
No ratings yet
Computer Memory Organization: Elephants Don't Forget But Do Computers?
9 pages
Neo Geo Architecture: Architecture of Consoles: A Practical Analysis, #23
From Everand
Neo Geo Architecture: Architecture of Consoles: A Practical Analysis, #23
Rodrigo Copetti
No ratings yet
Mega Drive Architecture: Architecture of Consoles: A Practical Analysis, #3
From Everand
Mega Drive Architecture: Architecture of Consoles: A Practical Analysis, #3
Rodrigo Copetti
No ratings yet
Khadijah Group Seminar Banking
No ratings yet
Khadijah Group Seminar Banking
14 pages
Ubd Instructional Unit Design
100% (2)
Ubd Instructional Unit Design
15 pages
32-21-02-610
No ratings yet
32-21-02-610
3 pages
Unit 5 Notes
No ratings yet
Unit 5 Notes
80 pages
Chang 2014
No ratings yet
Chang 2014
18 pages
Automech Report PDF
No ratings yet
Automech Report PDF
18 pages
List PDF
No ratings yet
List PDF
68 pages
ReleaseNotes 2024.2.0
No ratings yet
ReleaseNotes 2024.2.0
51 pages
Olbations Chapter 1
No ratings yet
Olbations Chapter 1
28 pages
Role of Research
No ratings yet
Role of Research
5 pages
Semi Detailed Lesson Plan Math
No ratings yet
Semi Detailed Lesson Plan Math
5 pages
GHQ 13 4
No ratings yet
GHQ 13 4
8 pages
Job Order Costing: Perhitungan Harga Pokok Produksi Menggunakan Metode (Studi Kasus Pada Usaha Konveksi "Mowin Concept")
No ratings yet
Job Order Costing: Perhitungan Harga Pokok Produksi Menggunakan Metode (Studi Kasus Pada Usaha Konveksi "Mowin Concept")
12 pages
OROFLEX 10 Layflat Hose
No ratings yet
OROFLEX 10 Layflat Hose
3 pages
Lawful Money Is Equitable Title To Labor-Credit Asset
100% (6)
Lawful Money Is Equitable Title To Labor-Credit Asset
2 pages
Operating Systems
No ratings yet
Operating Systems
130 pages
Epsc 121 Notes (Revised) - 1-1
No ratings yet
Epsc 121 Notes (Revised) - 1-1
36 pages
Assessment and Evaluation in Mathematics
No ratings yet
Assessment and Evaluation in Mathematics
95 pages
Final International Marketing
No ratings yet
Final International Marketing
55 pages
Manifold Design Calculations
No ratings yet
Manifold Design Calculations
5 pages
ELLN Re Skilling Proposal 1
No ratings yet
ELLN Re Skilling Proposal 1
18 pages
Client Complaint Form (Final)
No ratings yet
Client Complaint Form (Final)
4 pages
F - High Density1 POLYETHYLENE (HDPE) PIPES AND FITTINGS
No ratings yet
F - High Density1 POLYETHYLENE (HDPE) PIPES AND FITTINGS
12 pages
Ashok Stambh
No ratings yet
Ashok Stambh
10 pages
I. Lesson Plan Overview and Description
No ratings yet
I. Lesson Plan Overview and Description
5 pages
1996 Financial Statements
No ratings yet
1996 Financial Statements
634 pages
Бизнес этикет
No ratings yet
Бизнес этикет
14 pages
AI Unit-5
No ratings yet
AI Unit-5
57 pages
Word List: Who We Are
No ratings yet
Word List: Who We Are
32 pages

Lec 4b

Uploaded by

Lec 4b

Uploaded by

COMPUTER ORGANIZATION AND DESIGN

Cache - - 0001 0001 0001 0001 0001 0001 0001 0001

Chapter 5 — Large and Fast: Exploiting Memory Hierarchy — 2

Request 0 164 83 192 10 90 175 673 168 59

Cache - - 00010 00010 00010 00010 00010 00010 00010 00001

Chapter 5 — Large and Fast: Exploiting Memory Hierarchy — 3

Job size in keys

Job size in keys

Job size in keys

Chapter 5 — Large and Fast: Exploiting Memory Hierarchy — 7

C(i,j) C(i,j) A(i,:)

C(i,j) C(i,j) A(i,k)

m = N*n2 B: N2 blocks of size b2 are read N times (N3 * b2 = N3 * (n/N)2 = N*n2)

So we can improve performance by increasing the blocksize b

01/19/2012 CS267 - Lecture 2

Chapter 5 — Large and Fast: Exploiting Memory Hierarchy — 11

You might also like

m = Nn2 B: N2 blocks of size b2 are read N times (N3 b2 = N3 * (n/N)2 = N*n2)