
Lecture-1-02.01.2025

CS528 is a course on High Performance Computing covering topics such as parallel processing concepts, memory hierarchy designs, cache optimization techniques, and GPU architectures. The course includes assessments like class tests and mid-semester exams, with a total weightage of 100%. Additional information includes course-related communication details and a focus on memory systems and their classifications.


CS528: HIGH PERFORMANCE COMPUTING

Monday: 17:00 – 17:55
Tuesday: 16:00 – 16:55
Wednesday: 15:00 – 15:55
Thursday: 14:00 – 14:55
Room No: 5G2

CS528 High Performance Computing 3-0-0-6
Parallel Processing Concepts; Levels and model of parallelism: instruction, transaction, task, thread, memory,
function, data flow models, demand-driven computation

Memory Hierarchy Designs: Cache Recap, Virtual Memory Review, Address Translation

Cache Optimization Techniques: Improving Hit Time, Reducing Miss Penalty, Miss Rate Reduction Techniques,
Software and Hardware Prefetching Techniques to Reduce Miss Penalty

Instruction Level Parallelism: Basics, Dependences and Hazards, Dynamic Scheduling, Branch Prediction,
Hardware Speculation

Data level Parallelism: VLIW, SIMD, Data Alignment and Reordering

Thread Level Parallelism: Software and Hardware Multithreading, Block Multithreading, Interleaved
Multithreading and Simultaneous Multithreading

Memory Centric Computing: Processing Near Memory, Emerging Memory Technology, Flash Memory, Solid State
Drives

GPU: Architectures and Programming


Texts
◦ 1. J. L. Hennessy and D. A. Patterson, Computer Architecture: A Quantitative Approach, 5th Edition, Morgan Kaufmann, 2012.
Assessments

Assessment       Marks   Weightage
Class Test – 1   20      10%
Mid-Semester     30      30%
Class Test – 2   20      10%
End-Semester     50      50%

The assessment scheme is tentative; any changes will be announced.


Additional info
◦ Prefix the subject of course-related emails with "CS528:"
- Email ID: [email protected]
A computer has four main structural components:
◦ The CPU – processing of the data
◦ Main Memory – storage of the data
◦ Input/Output (I/O) – movement of the data
◦ System Interconnection – controlling communication among the components

Inside the CPU:
◦ Arithmetic and Logic Unit (ALU) – performs the data-processing functions of the computer
◦ Control Unit – takes data, sends it for processing, and sends the result to the output
◦ Registers – contain the data used during execution
Motherboard
Memory System

◦ A memory system is a hierarchy of storage devices with different capacities, costs, and access times.
◦ CPU registers hold the most frequently used data.
◦ Small, fast cache memories near the CPU act as staging areas for a subset of the data and instructions stored in the relatively slow main memory.
◦ The main memory stages data stored on large, slow disks, which in turn often serve as staging areas for data stored on the disks or tapes of other machines connected by networks.
Memory System
◦ Memory is one of the most important functional units of a
computer.
– Used to store both instructions and data.
– Stores as bits (0’s and 1’s), usually organized in terms of
bytes.
◦ How are the data stored in memory accessed?
◦ Every memory location has a unique address.
◦ A memory is said to be byte addressable if every byte of data has a unique
address.
◦ Some memory systems are word addressable (every addressed location consists of multiple bytes, say, 32 bits or 4 bytes).
Processor–Memory Connection

◦ Address bus provides the address of the memory location to be accessed.
◦ Unidirectional.
◦ Data bus transfers the data read from memory, or data to be written into memory.
◦ Bidirectional.
◦ Control bus provides various control and timing signals.
Memory Module

◦ Maximum number of memory locations = 2^n (for n address bits)
◦ Number of bits stored in every addressable location = m
◦ Signals
◦ RD/WR' (Read = 1, Write = 0)
◦ CS' = 0 enables the chip; otherwise the data bus is in the high-impedance state
Classification of Memory Systems
◦ Volatile v/s Non-volatile
◦ Volatile – Example- CMOS static/dynamic memory
◦ Non-volatile – Example – ROM, Magnetic Disk, CD/DVD, SSD, Flash Drive, Resistive
Memory
◦ Random access v/s Direct/Sequential access
◦ Random access – RAM and ROM – the read/write time is independent of the memory location being accessed
◦ Sequential access – magnetic tape – data is accessed sequentially in a particular order
◦ Direct or semi-random access – magnetic disk – access can be made directly to a track, after which access is sequential
◦ Read-only versus Read/Write
◦ Read-Only Memory – ROM, PROM, EPROM, EEPROM
◦ RAM – SRAM (data retained as long as power is on), DRAM (needs periodic refresh; stores bits in tiny capacitors)
Access Time, Latency and Bandwidth
◦Terminologies used to measure speed of the memory system.
◦ Memory Access Time: Time between initiation of an operation (Read
or Write) and completion of that operation.
◦ Latency: Initial delay from the initiation of an operation to the time
the first data is available.
◦ Bandwidth: Maximum speed of data transfer in bytes per second.
◦In modern memory organizations, every read request reads a
block of words into some high-speed registers (LATENCY),
from where data are supplied to the processor one by one
(ACCESS TIME).
Design Issues of the Memory System

◦ The most important issue is to bridge the processor–memory gap, which has been widening with every passing year.
◦ Advancements in memory technology are unable to cope with the faster advancements in processor technology.
Overcoming Design Issues

Using cache – increases the effective speed of the memory system
• A fast memory (possibly organized in several levels) that sits between the processor and main memory.
• Faster than main memory and relatively small.
• Frequently accessed data and instructions are stored here.
• Cache memory makes use of the fast SRAM technology.

Using virtual memory – increases the effective size of the memory system
• A technique used by the operating system to provide the illusion of a very large memory to the processor.
• Program and data are actually stored on secondary memory, which is much larger.
• Parts of the program and data are transferred from secondary memory to main memory only when needed.
Thank you
