
Unit-III

Instruction-Level Parallelism:
Concepts and Challenges
Introduction
• All processors since about 1985 have used pipelining to
overlap the execution of instructions and improve
performance.
• This potential overlap among instructions is called
instruction-level parallelism (ILP), because the instructions
can be evaluated in parallel.
• There are two largely separable approaches to exploiting ILP:
(1) an approach that relies on hardware to help discover and exploit the
parallelism dynamically, and
(2) an approach that relies on software technology to find parallelism
statically at compile time.
• Processors using the dynamic, hardware-based approach, including all recent Intel and many ARM processors, dominate in the desktop and server markets.
• The value of the CPI (cycles per instruction) for a
pipelined processor is the sum of the base CPI and all
contributions from stalls:
Pipeline CPI = Ideal pipeline CPI + Structural stalls +
Data hazard stalls + Control stalls

• The ideal pipeline CPI is a measure of the maximum performance attainable by the implementation.
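• As a quick worked example (the stall magnitudes here are hypothetical, chosen only for illustration):
Pipeline CPI = 1.0 + 0.10 + 0.25 + 0.15 = 1.50
so structural, data hazard, and control stalls together raise the CPI 50% above the ideal pipeline CPI of 1.0.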
Data Dependences and Hazards
• Determining how one instruction depends on another is critical
to determining how much parallelism exists in a program and
how that parallelism can be exploited.
• In particular, to exploit instruction-level parallelism, we must
determine which instructions can be executed in parallel.
Case (i): If two instructions are parallel, they can execute simultaneously in a pipeline of arbitrary depth without causing any stalls, assuming the pipeline has sufficient resources (and thus no structural hazards exist).
Case (ii): If two instructions are dependent, they are not parallel and must be executed in order, although they may often be partially overlapped.
• The key in both cases is to determine whether an instruction is dependent on another instruction.
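• For illustration, a minimal RISC-V sketch (registers chosen arbitrarily): the first two instructions share no registers, so they are parallel; the third reads the results of both, so it is dependent and must follow them.
add x5,x1,x2 //no registers shared with the sub below
sub x6,x3,x4 //can execute in parallel with the add
mul x7,x5,x6 //reads x5 and x6: dependent on both instructions above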
Data Dependences
• There are three different types of dependences:
1. data dependences (also called true data dependences),
2. name dependences, and
3. control dependences.
• An instruction j is data-dependent on instruction i if either of the following holds:
– Instruction i produces a result that may be used by instruction j.
– Instruction j is data-dependent on instruction k, and instruction k is data-dependent on instruction i (i.e., the dependence is transitive through a chain of dependences).
(i) Data Dependences
• For example, consider the following RISC-V code sequence that increments a vector of values in memory (starting at 0(x1) and with the last element at 8(x2)) by a scalar in register f2.
Loop: fld f0,0(x1) //f0=array element
fadd.d f4,f0,f2 //add scalar in f2
fsd f4,0(x1) //store result
addi x1,x1,-8 //decrement pointer 8 bytes
bne x1,x2,Loop //branch if x1 != x2
The data dependences in this code sequence involve both floating-point data:
Loop: fld f0,0(x1) //f0=array element
fadd.d f4,f0,f2 //add scalar in f2
fsd f4,0(x1) //store result
and integer data:
addi x1,x1,-8 //decrement pointer 8 bytes (per doubleword)
bne x1,x2,Loop //branch if x1 != x2
• A data dependence conveys three things:
(1) the possibility of a hazard,
(2) the order in which results must be calculated, and
(3) an upper bound on how much parallelism can
possibly be exploited
(ii) Name Dependences
• A name dependence occurs when two instructions use the same register or memory location, called a name, but there is no flow of data between the instructions associated with that name.
• There are two types of name dependences between an
instruction i that precedes instruction j in program order:
– An antidependence between instruction i and instruction j
occurs when instruction j writes a register or memory location
that instruction i reads. The original ordering must be preserved
to ensure that i reads the correct value. In the following
example, there is an antidependence between fsd and addi on
register x1.
fsd f4,0(x1) //store result
addi x1,x1,-8 //decrement pointer 8 bytes
– An output dependence occurs when instruction i and instruction
j write the same register or memory location. The ordering
between the instructions must be preserved to ensure that the
value finally written corresponds to instruction j.
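In the following sketch (the instruction pairing is hypothetical, chosen for illustration), there is an output dependence between fdiv.d and fadd.d on register f4:
fdiv.d f4,f0,f2 //instruction i writes f4
fadd.d f4,f6,f8 //instruction j also writes f4; j's value must be the final one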
Data Hazards
• Consider two instructions i and j, with i preceding j in program order. The possible data hazards are the following (short RISC-V sketches of each appear after this list):
RAW (read after write):
o j tries to read a source before i writes it, so j incorrectly gets the old value.
o This hazard is the most common type and corresponds to a true data
dependence.
WAW (write after write):
o j tries to write an operand before it is written by i.
o The writes end up being performed in the wrong order, leaving the value
written by i rather than the value written by j in the destination. This hazard
corresponds to an output dependence.
WAR (write after read):
o j tries to write a destination before it is read by i, so i incorrectly gets the new
value. This hazard arises from an antidependence (or name dependence).
o WAR hazards cannot occur in most static issue pipelines, because such pipelines read all operands early and write results late in the pipeline.
o A WAR hazard occurs either when there are some instructions that write
results early in the instruction pipeline and other instructions that read a
source late in the pipeline, or when instructions are reordered.
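For illustration, here are minimal RISC-V sketches of each hazard; the RAW and WAR pairs reuse instructions from the loop above, while the WAW pair is a hypothetical pairing:
fld f0,0(x1) //i writes f0
fadd.d f4,f0,f2 //j reads f0: RAW (true data dependence)
fsd f4,0(x1) //i reads x1
addi x1,x1,-8 //j writes x1: WAR (antidependence)
fld f0,0(x1) //i writes f0
fadd.d f0,f6,f8 //j also writes f0: WAW (output dependence)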
(iii) Control Dependences
• The last type of dependence is a control dependence. A control dependence
determines the ordering of an instruction, i, with respect to a branch instruction so
that instruction i is executed in correct program order and only when it should be.
if p1
{ S1; };
if p2
{ S2; }
S1 is control-dependent on p1, and
S2 is control-dependent on p2 but not on p1.
In general, two constraints are imposed by control dependences:
1. An instruction that is control-dependent on a branch cannot be moved before the branch so that its execution is no longer controlled by the branch. For example, we cannot take an instruction from the then portion of an if statement and move it before the if statement (see the sketch after this list).
2. An instruction that is not control-dependent on a branch cannot be moved after the branch so that its execution is controlled by the branch. For example, we cannot take a statement before the if statement and move it into the then portion.
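A minimal RISC-V sketch of the first constraint (the label and registers are hypothetical, chosen for illustration):
beq x1,x0,skip //implements "if p1": branch around S1 when p1 is false
ld x5,0(x6) //S1: control-dependent on the branch; hoisting it above
//the beq could, for example, fault when p1 is false
skip: addi x7,x7,1 //not control-dependent on the branch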
What Is Instruction-Level Parallelism?
• Basic Block: The amount of parallelism available within a basic
block—a straight-line code sequence with no branches in
except to the entry and no branches out except at the exit—is
quite small.
• The simplest and most common way to increase the ILP is to
exploit parallelism among iterations of a loop. This type of
parallelism is often called loop-level parallelism.
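• For example, every iteration of the vector-increment loop shown earlier updates a different array element, so the iterations are independent of one another. A minimal sketch of two consecutive iterations unrolled (the second iteration's registers are chosen arbitrarily):
fld f0,0(x1) //iteration k
fadd.d f4,f0,f2
fsd f4,0(x1)
fld f6,-8(x1) //iteration k+1: different address and registers,
fadd.d f8,f6,f2 //so no data dependence on iteration k
fsd f8,-8(x1)
Because the two chains are independent, their instructions can be overlapped, which is exactly the loop-level parallelism a compiler or processor tries to exploit.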
