module-3-chapter-2

The document discusses pipelining and superscalar techniques, focusing on linear and non-linear pipeline processors, instruction pipeline design, and arithmetic pipeline design. It covers various models, mechanisms, and design issues related to instruction execution phases, dynamic scheduling, hazard avoidance, and branch handling techniques. Additionally, it highlights fixed-point and floating-point operations, as well as the design of multifunctional arithmetic pipelines.

Pipelining and Superscalar Techniques

• Linear Pipeline Processors


• Non-linear Pipeline Processors
• Instruction Pipeline Design
• Arithmetic Pipeline Design
• Superscalar Pipeline Design

LINEAR PIPELINE PROCESSORS
• Linear Pipeline Processor
o Is a cascade of processing stages which are linearly connected to perform a fixed function over a stream of data flowing from one end to the other.
• Models of Linear Pipeline
o Synchronous Model
o Asynchronous Model

• Clocking and Timing Control


o Clock Cycle
o Pipeline Frequency
o Clock skewing
o Flow-through delay
o Speedup, Efficiency and Throughput
• Optimal number of Stages and Performance-Cost Ratio (PCR)
• Clock cycle: τ = max{τi} + d = τm + d   (τm: delay of the slowest stage, d: latch delay)
• Pipeline frequency: f = 1/τ
• Total time required for n tasks on k stages: Tk = [k + (n-1)]τ
• Speedup factor: Sk = T1/Tk = nkτ / [k + (n-1)]τ = nk / [k + (n-1)]
• Performance/cost ratio: PCR = f / (c + kh) = 1 / [(t/k + d)(c + kh)]   (t: total flow-through delay, c: cost of stage logic, h: cost of each latch)
• Efficiency: Ek = Sk/k = n / [k + (n-1)]
• Throughput: Hk = n / {[k + (n-1)]τ}
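A minimal Python sketch (not part of the slides) that evaluates these formulas; the stage delays, latch delay d, number of tasks n, and the cost figures c and h used below are assumed values for illustration:

```python
# Sketch: evaluating the linear-pipeline performance formulas above.
# All numeric inputs are illustrative assumptions, not values from the slides.

def pipeline_metrics(stage_delays, d, n, c=100.0, h=10.0):
    k = len(stage_delays)                  # number of stages
    tau = max(stage_delays) + d            # clock cycle: slowest stage + latch delay
    f = 1.0 / tau                          # pipeline frequency
    Tk = (k + (n - 1)) * tau               # time for n tasks on the k-stage pipeline
    T1 = n * k * tau                       # time on an equivalent non-pipelined unit
    Sk = T1 / Tk                           # speedup = nk / [k + (n-1)]
    Ek = Sk / k                            # efficiency = n / [k + (n-1)]
    Hk = n / Tk                            # throughput = n / {[k + (n-1)] tau}
    t = sum(stage_delays)                  # total flow-through delay
    PCR = 1.0 / ((t / k + d) * (c + k * h))  # performance/cost ratio
    return dict(tau=tau, f=f, Tk=Tk, Sk=Sk, Ek=Ek, Hk=Hk, PCR=PCR)

print(pipeline_metrics(stage_delays=[10, 12, 9, 11], d=1, n=64))
```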
LINEAR PIPELINE PROCESSORS
NON-LINEAR PIPELINE PROCESSORS

• Dynamic Pipeline
o Static v/s Dynamic Pipeline
o Streamline connection, feed-forward connection and feedback connection

• Reservation and Latency Analysis


o Reservation tables
o Evaluation time

• Latency Analysis
o Latency
o Collision
o Forbidden latencies
o Latency Sequence, Latency Cycle and Average Latency
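As a rough illustration of latency analysis, the Python sketch below derives the forbidden latencies of an assumed 3-stage reservation table (the table itself is made up, not taken from the slides); a latency is forbidden when two marks in the same row lie that many columns apart:

```python
# Sketch: forbidden latencies of a non-linear pipeline from its reservation table.
# reservation[s][t] == 1 means stage s is busy at clock t for one initiation.

reservation = [
    [1, 0, 0, 0, 0, 1],   # stage 1
    [0, 1, 1, 0, 0, 0],   # stage 2
    [0, 0, 0, 1, 1, 0],   # stage 3
]

def forbidden_latencies(table):
    """A latency p is forbidden if two initiations p cycles apart collide,
    i.e. some row of the table has marks p columns apart."""
    forbidden = set()
    for row in table:
        times = [t for t, used in enumerate(row) if used]
        for i in times:
            for j in times:
                if j > i:
                    forbidden.add(j - i)
    return sorted(forbidden)

print("Forbidden latencies:", forbidden_latencies(reservation))
# Any latency sequence avoiding these values is collision-free; repeating such a
# sequence gives a latency cycle, whose mean is the average latency.
```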
INSTRUCTION PIPELINE DESIGN
• Instruction Execution Phases
o E.g. Fetch, Decode, Issue, Execute, Write-back
o In-order Instruction issuing and Reordered Instruction issuing
• E.g. X = Y + Z , A = B x C
• Mechanisms/Design Issues for Instruction Pipelining
o Pre-fetch Buffers
o Multiple Functional Units
o Internal Data Forwarding
o Hazard Avoidance
• Dynamic Scheduling
• Branch Handling Techniques
INSTRUCTION PIPELINE DESIGN

• Fetch: fetches instructions from memory; ideally one per cycle


• Decode: reveals instruction operations to be performed and identifies the resources needed
• Issue: reserves the resources and reads the operands from registers
• Execute: actual processing of the operations indicated by the instruction
• Write Back: writing results into the registers
INSTRUCTION PIPELINE DESIGN
Mechanisms/Design Issues of Instruction Pipeline
• Pre-fetch Buffers
o Sequential Buffers
o Target Buffers
o Loop Buffers
INSTRUCTION PIPELINE DESIGN
Mechanisms/Design Issues of Instruction Pipeline

• Multiple Functional Units


o Reservation Station and Tags
o Slow station as bottleneck stage (see the sketch after this list)
• Subdivision of Pipeline Bottleneck stage
• Replication of Pipeline Bottleneck stage
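A small Python sketch (with assumed stage delays, not from the slides) showing why the slowest stage limits throughput and how subdividing or replicating that stage restores it:

```python
# Sketch: a bottleneck stage limits pipeline throughput to 1/(slowest delay).
# Subdividing the stage into shorter substages, or replicating it and
# interleaving work across the copies, raises its effective service rate.

def throughput(delays, bottleneck_copies=1, bottleneck_subdiv=1):
    """Results per time unit for a pipeline with the given stage delays."""
    slow = max(delays)
    rates = []
    for d in delays:
        if d == slow:
            # subdivision shortens the stage; replication multiplies its rate
            rates.append(bottleneck_copies / (d / bottleneck_subdiv))
        else:
            rates.append(1.0 / d)
    return min(rates)

print(throughput([1, 1, 3, 1]))                        # limited to 1/3 by stage 3
print(throughput([1, 1, 3, 1], bottleneck_subdiv=3))   # back to 1 result per cycle
print(throughput([1, 1, 3, 1], bottleneck_copies=3))   # back to 1 result per cycle
```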
INSTRUCTION PIPELINE DESIGN
Mechanisms/Design Issues of Instruction Pipeline
• Internal Forwarding and Register Tagging
o Internal Forwarding:
• A “short-circuit” technique to replace unnecessary memory accesses by register-register transfers in a sequence of fetch-arithmetic-store operations
o Register Tagging:
• Use of tagged registers, buffers and reservation stations for exploiting concurrent activities among multiple arithmetic units
o Store-Fetch Forwarding
• (M ← R1, R2 ← M) replaced by (M ← R1, R2 ← R1)
o Fetch-Fetch Forwarding
• (R1 ← M, R2 ← M) replaced by (R1 ← M, R2 ← R1)
o Store-Store Overwriting
• (M ← R1, M ← R2) replaced by (M ← R2)
INSTRUCTION PIPELINE DESIGN
Mechanisms/Design Issues of Instruction Pipeline

• Hazard Detection and Avoidance


o Domain or Input Set of an instruction
o Range or Output Set of an instruction
o Data Hazards: RAW, WAR and WAW
o Resolution using Register Renaming approach
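A minimal Python sketch (illustrative, with hypothetical register sets) of how the domain D and range R of two instructions expose RAW, WAR and WAW hazards:

```python
# Sketch: hazard detection between instruction i and a later instruction j,
# using the domain (input set) D and range (output set) R of each instruction.

def hazards(Di, Ri, Dj, Rj):
    """Hazard types created if j follows i without interlocks."""
    found = []
    if Ri & Dj:
        found.append("RAW")   # j reads what i writes (flow dependence)
    if Di & Rj:
        found.append("WAR")   # j overwrites what i still reads (anti-dependence)
    if Ri & Rj:
        found.append("WAW")   # both write the same location (output dependence)
    return found

# i: R1 <- R2 + R3     j: R4 <- R1 * R5   -> RAW on R1
print(hazards(Di={"R2", "R3"}, Ri={"R1"}, Dj={"R1", "R5"}, Rj={"R4"}))

# Register renaming removes WAR/WAW hazards by giving j's result a fresh register.
```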
INSTRUCTION PIPELINE DESIGN
Dynamic Instruction Scheduling

• Idea of Static Scheduling


o Compiler-based scheduling strategy to resolve interlocks among instructions

• Dynamic Scheduling
o Tomasulo’s Algorithm (Register-Tagging Scheme)
• Hardware based dependence-resolution
o Scoreboarding Technique
• Scoreboard: the centralized control unit
• A kind of data-driven mechanism
INSTRUCTION PIPELINE DESIGN
Branch Handling Techniques
• Branch Taken, Branch Target, Delay Slot
• Effect of Branching
o Parameters:
k : No. of stages in the pipeline
n : Total no. of instructions or tasks
p : Percentage of branch instructions over n
q : Percentage of successful branch instructions (branch taken) over p.
b : Delay Slot
τ : Pipeline Cycle Time

o Branch Penalty = q of (p of n) * bτ = pqnbτ


o Effective Execution Time:
• Teff = [k + (n-1)] τ + pqnbτ = [k + (n-1) + pqnb]τ
• Effect of Branching
o Effective Throughput:
• Heff = n/Teff
• Heff = n / {[k + (n-1) + pqnb]τ} = nf / [k + (n-1) + pqnb]
• As n → ∞ and b = k-1
o H*eff = f / [pq(k-1)+1]
• If p=0 and q=0 (no branching occurs)
o H**eff = f = 1/τ
o Performance Degradation Factor
• D = 1 – H*eff / f = pq(k-1) / [pq(k-1)+1]
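A short Python sketch evaluating the branching formulas above; the parameter values in the example call are assumed for illustration:

```python
# Sketch: effect of branching on pipeline performance, per the formulas above.

def branching_effect(k, n, p, q, b, tau):
    """Effective execution time, effective throughput, asymptotic throughput
    (n -> infinity, b = k-1), and the performance degradation factor D."""
    T_eff = (k + (n - 1) + p * q * n * b) * tau
    H_eff = n / T_eff
    f = 1.0 / tau
    H_star = f / (p * q * (k - 1) + 1)
    D = p * q * (k - 1) / (p * q * (k - 1) + 1)
    return T_eff, H_eff, H_star, D

# Assumed example: 8-stage pipeline, 1000 instructions, 20% branches,
# 60% of them taken, delay slot b = k-1 = 7, cycle time tau = 1.
print(branching_effect(k=8, n=1000, p=0.20, q=0.60, b=7, tau=1.0))
```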
INSTRUCTION PIPELINE DESIGN
Branch Handling Techniques



• Branch Prediction
o Static Branch Prediction: based on branch code types
o Dynamic Branch prediction: based on recent branch history
• Strategy 1: Predict the branch direction based on information found at decode stage.
• Strategy 2: Use a cache to store target addresses at effective address calculation stage.
• Strategy 3: Use a cache to store target instructions at fetch stage
o Branch Target Buffer Organization

• Delayed Branches
o A delayed branch of d cycles allows at most d-1 useful instructions to be executed following the branch taken.
o Execution of these instructions should be independent of the branch instruction to achieve a zero branch penalty.
ARITHMETIC PIPELINE DESIGN
• Finite-precision arithmetic
• Overflow and Underflow
• Fixed-Point operations
o Notations:
• Signed-magnitude, one's complement and two's complement notation
o Operations:

• Addition: (n bit, n bit) → (n bit) sum, 1-bit output carry
• Subtraction: (n bit, n bit) → (n bit) difference
• Multiplication: (n bit, n bit) → (2n bit) product
• Division: (2n bit, n bit) → (n bit) quotient, (n bit) remainder
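A small Python sketch (using plain Python ints and an assumed width n = 8) illustrating the operand and result widths listed above:

```python
# Sketch: fixed-point operation widths for n-bit operands (n = 8 assumed).

N = 8
MASK = (1 << N) - 1

def add(a, b):
    s = a + b
    return s & MASK, (s >> N) & 1          # n-bit sum, 1-bit carry out

def multiply(a, b):
    return (a * b) & ((1 << 2 * N) - 1)    # 2n-bit product

def divide(dividend, divisor):
    return dividend // divisor, dividend % divisor   # 2n-bit dividend -> n-bit quotient, n-bit remainder

print(add(200, 100))        # (44, 1): sum wraps to 8 bits, carry out set
print(multiply(200, 100))   # 20000 fits in 16 bits
print(divide(20000, 100))   # (200, 0)
```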
• Floating-Point Numbers
o X = (m, e) representation
• m: mantissa or fraction
• e: exponent with an implied base or radix r.
• Actual value X = m × r^e
o Operations on numbers X = (mx, ex) and Y = (my, ey)
• Addition: (mx × r^(ex−ey) + my) × r^ey   (assuming ex ≥ ey)
• Subtraction: (mx × r^(ex−ey) − my) × r^ey
• Multiplication: (mx × my) × r^(ex+ey)
• Division: (mx / my) × r^(ex−ey)
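A minimal Python sketch of these (m, e) operations, assuming radix r = 2 and ignoring normalization and rounding for brevity:

```python
# Sketch: floating-point (mantissa, exponent) arithmetic with radix R = 2.
# No normalization, rounding, or overflow handling; illustration only.

R = 2

def fadd(x, y):
    (mx, ex), (my, ey) = x, y
    if ex < ey:                      # align so that ex >= ey
        (mx, ex), (my, ey) = (my, ey), (mx, ex)
    return (mx * R ** (ex - ey) + my, ey)

def fsub(x, y):
    my, ey = y
    return fadd(x, (-my, ey))

def fmul(x, y):
    (mx, ex), (my, ey) = x, y
    return (mx * my, ex + ey)

def fdiv(x, y):
    (mx, ex), (my, ey) = x, y
    return (mx / my, ex - ey)

# X = 3 * 2^2 = 12, Y = 5 * 2^0 = 5
print(fadd((3, 2), (5, 0)))   # (17, 0) -> 17
print(fmul((3, 2), (5, 0)))   # (15, 2) -> 60
```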

• Elementary Functions
o Transcendental functions like: Trigonometric, Exponential, Logarithmic, etc.
• Separate units for fixed point operations and floating point operations
• Scalar and Vector Arithmetic Pipelines
• Uni-functional or Static Pipelines
• Arithmetic Pipeline Stages
o Mainly involve hardware to perform add and shift micro-operations
o Addition using: Carry Propagation Adder (CPA) and Carry Save Adder (CSA)
o Shift using: Shift Registers

• Multiplication Pipeline Design
o E.g. multiplying two 8-bit numbers to yield a 16-bit product, using a Wallace tree of CSAs followed by a CPA.
ARITHMETIC PIPELINE DESIGN
Static Arithmetic Pipelines



A × B = P, where P is the 16-bit product.
P = A × B = P0 + P1 + P2 + … + P7, where × and + are arithmetic multiply and add operations and each Pi is a shifted partial product.
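A rough Python model (not the hardware design from the slides) of this summation: the csa function reduces three operands to a sum word and a carry word without carry propagation, the partial products are reduced Wallace-tree style until two words remain, and a final carry-propagate addition yields the 16-bit product. The operand values and the reduction order below are illustrative assumptions.

```python
# Sketch: summing the eight shifted partial products P0..P7 of an 8-bit multiply
# with carry-save adders (CSA) and one final carry-propagate adder (CPA).

def csa(a, b, c):
    """Carry-save adder: three operands -> (sum word, carry word), no propagation."""
    s = a ^ b ^ c
    carry = ((a & b) | (b & c) | (a & c)) << 1
    return s, carry

def multiply_8x8(a, b):
    # Partial products: Pi = (bit i of B) * A, shifted left by i.
    partials = [((b >> i) & 1) * (a << i) for i in range(8)]
    # CSA reduction (done sequentially here; a Wallace tree does it in parallel levels)
    # until only two words remain.
    while len(partials) > 2:
        s, c = csa(partials[0], partials[1], partials[2])
        partials = partials[3:] + [s, c]
    # Final carry-propagate addition produces the 16-bit product.
    return (partials[0] + partials[1]) & 0xFFFF

print(multiply_8x8(0xAB, 0xCD), 0xAB * 0xCD)   # both print 35055
```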
ARITHMETIC PIPELINE DESIGN
Multifunctional Arithmetic Pipelines
• Multifunctional Pipeline:
o Static multifunctional pipeline
o Dynamic multifunctional pipeline

• Case Study: TI/ASC static multifunctional pipeline architecture
