Pipeline 1

Uploaded by

Gopal Chandra Haldar

Pipelining : Basic Concepts

CSE Sem-4
CPU Configuration and Instruction Execution Operations

[Figure: CPU block diagram showing Main Memory, MAR, MDR, CPU Bus, PC, IR, ACC, ALU, CU and a common clock]

1. The address of the next instruction is transferred from PC to MAR, and the instruction is located in main memory (MM).
2. The instruction is copied from memory to MDR.
3. The instruction is transferred to IR to be decoded.
4. The control unit sends signals to the appropriate devices (ALU, ACC, memory) to execute the instruction.
5. The result is stored.
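The five steps above can be traced with a toy simulation. This is only a sketch: the accumulator machine, its four opcodes (LOAD, ADD, STORE, HALT) and the memory layout are hypothetical and are not part of the slides; only the register names PC, MAR, MDR, IR and ACC come from the figure.

```python
# Toy accumulator machine. The opcodes and memory layout are hypothetical,
# chosen only to make the PC -> MAR -> MDR -> IR flow visible.
def run(memory):
    pc, acc = 0, 0
    trace = []
    while True:
        mar = pc                  # step 1: address of next instruction, PC -> MAR
        mdr = memory[mar]         # step 2: instruction copied from memory to MDR
        ir = mdr                  # step 3: instruction moved to IR for decoding
        pc += 1
        op, operand = ir
        trace.append(op)
        if op == "LOAD":          # steps 4-5: execute and store the result
            acc = memory[operand]
        elif op == "ADD":
            acc += memory[operand]
        elif op == "STORE":
            memory[operand] = acc
        elif op == "HALT":
            return acc, trace
        else:
            raise ValueError(f"unknown opcode: {op}")

program = {
    0: ("LOAD", 10),
    1: ("ADD", 11),
    2: ("STORE", 12),
    3: ("HALT", None),
    10: 7,   # data
    11: 5,   # data
    12: 0,   # result location
}
acc, trace = run(program)
print(acc, program[12], trace)
```

Each loop iteration handles exactly one instruction from fetch to store, which is the non-pipelined behaviour that the following slides improve on.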
Why Pipelining?
CPU performance can be improved by:
• Improving the hardware by introducing faster circuits.
• Arranging the hardware so that more than one operation can be performed at the same time.

Pipelining or Pipeline Processing:

• Pipelining is a technique in which the execution of multiple instructions is overlapped.
• Pipelining arranges the hardware elements of the CPU so that its overall performance is increased.
• It allows instructions to be stored and executed in an orderly process.
• A pipeline has two ends, the input end and the output end. Between these ends there are multiple stages (segments); the output of one stage is connected to the input of the next stage, and each stage performs a specific operation.
• Stages are combinations of sequential and combinational circuits that perform arithmetic and logic operations on the data stream flowing through the pipe.
Pipeline vs non-pipeline architecture:

F = Instruction Fetch, D = Decode, O = Operand Fetch, E = Execute, S = Storing result

Clk  1  2  3  4  5  6  7  8  9  10 11 12 13 14 15
p1   F  D  O  E  S
p2   -  -  -  -  -  F  D  O  E  S
p3   -  -  -  -  -  -  -  -  -  -  F  D  O  E  S

Fig. 1: Non-pipelined architecture

Clk  1   2   3   4   5   6   7
S1   F1  F2  F3  -   -   -   -
S2   -   D1  D2  D3  -   -   -
S3   -   -   O1  O2  O3  -   -
S4   -   -   -   E1  E2  E3  -
S5   -   -   -   -   S1  S2  S3

Fig. 2: Pipelined architecture (space-time diagram). Rows are the stages (space) and columns are clock pulses (time); in row S5 the entries S1 to S3 denote storing the result of instructions 1 to 3.

If there are K stages in the pipelined architecture and the number of instructions is n, it takes K clock pulses to complete the first instruction, and after that one instruction is completed at each clock pulse.

So the number of clock pulses required to perform n instructions in a K-stage pipelined architecture = K + (n - 1)

Speed-up of pipelined architecture:
S = CP in non-pipeline / CP in pipeline
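The clock-pulse counts in Fig. 1 and Fig. 2 can be checked with a short calculation. This is a minimal sketch; the function names are illustrative, and the non-pipelined count of n * K assumes, as in Fig. 1, that every stage takes exactly one clock pulse.

```python
def non_pipelined_cycles(n, k):
    # Each instruction runs all k stages before the next one starts.
    return n * k

def pipelined_cycles(n, k):
    # k pulses for the first instruction, then one more per remaining instruction.
    return k + (n - 1)

def speedup(n, k):
    return non_pipelined_cycles(n, k) / pipelined_cycles(n, k)

n, k = 3, 5  # three instructions, five stages, as in the figures
print(non_pipelined_cycles(n, k))   # 15
print(pipelined_cycles(n, k))       # 7
print(round(speedup(n, k), 2))      # 2.14
```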
Difference Between Pipeline and Non-pipeline Architecture

Pipeline Architecture | Non-pipeline Architecture
Pipelining is an implementation technique where multiple instructions are overlapped in execution. | All the actions (fetching, decoding, execution of instructions and storing results into memory) are grouped into a single step.
Many instructions are executed at the same time. | Only one instruction is executed at a time.
Execution time is comparatively less and execution is done in fewer cycles. | Execution takes comparatively more time or a larger number of cycles.
It has a high throughput (number of instructions executed per unit time). | It has a low throughput.
The pipeline is filled by the CPU scheduler from a pool of work which is waiting to occur. | In a non-pipelined system, the CPU scheduler chooses the instruction from the pool of waiting instructions when an execution unit signals that it is free.

CPU scheduling is the process of determining which process will own the CPU for execution while another process is on hold.
Principle of Pipeline
• The problem is divided into a series of subtasks that have to be completed one after another.
• Each subtask can be executed by hardware that operates concurrently with the other pipeline stages.
• All pipeline stages work in sequence, receiving their input from the previous stage and transferring their output to the next stage.
• There is a constant stream of tasks into the pipe, and there is overlapped execution at the subtask level.
• Each stage gets a new input at the beginning of the clock cycle, has a single clock cycle available to perform its operation, and delivers its result to the next stage by the start of the subsequent clock cycle.
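The overlapped execution described in these points can be visualised by generating a space-time diagram. This is a sketch under the assumption that every stage takes one clock cycle and no stalls occur; the function name `space_time` and the `T1`, `T2` labels are illustrative.

```python
def space_time(n, k):
    """Return rows[stage][clock] = task number (1-based), or 0 when the stage is idle."""
    total_clocks = k + (n - 1)
    rows = [[0] * total_clocks for _ in range(k)]
    for task in range(n):
        for stage in range(k):
            # Task `task` enters stage 0 at clock `task` and advances one stage per clock.
            rows[stage][task + stage] = task + 1
    return rows

for stage, row in enumerate(space_time(3, 5), start=1):
    cells = " ".join(f"T{t}" if t else "--" for t in row)
    print(f"S{stage}: {cells}")
```

Each printed row is one stage; reading down a column shows which tasks are in flight during that clock cycle.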
Advantages of Pipeline

• It can reduce the number of cycles needed to perform multiple instructions.

• It can raise the number of instructions that are processed together and lower the delay between completed instructions, which increases the throughput of the system.
• The more pipeline stages a processor has, the more instructions it can process at once. Today's microprocessor manufacturers use pipelines of 2 to 40 stages.
• If pipelining is used, the ALU can perform faster, but the design will be more complex.
• Pipelining in theory increases performance over an un-pipelined core by a factor of the number of stages.
• Pipelined CPUs generally work at a higher clock frequency than the RAM clock frequency, increasing the computer's overall performance.
Disadvantages of Pipeline
• The design of a pipelined processor is complex and costly compared to a non-pipelined processor.
• A non-pipelined processor has a defined instruction throughput. The performance of a pipelined processor is much harder to predict and may vary widely between programs.
• Every branch operation causes a delay: the processor cannot know in advance where to read the next instruction from and must wait for the branch instruction to finish, so the pipeline has to be cleared and is left empty.
• Problems occur when serial instructions are executed concurrently. Any instruction in the pipeline might depend on the output of the previous instruction, causing the pipeline control logic to wait and insert wasted clock cycles into the pipeline until the dependency is resolved.
Types of Pipeline

Pipeline
    Linear pipeline
        Synchronous pipeline
            Uniform delay pipeline
            Non-uniform delay pipeline
        Asynchronous pipeline
    Non-linear pipeline
Synchronous Pipeline Model
• Each stage consists of an input buffer/latch followed by a combinational circuit.
• The latches are fast registers that hold the intermediate results between the stages.
• The output of each combinational circuit is applied to the input latch of the next segment.
• All latches are synchronized with the same clock pulse. When the clock input is high, the latches transfer their data to the next stage simultaneously.

[Figure: Input → L → S1 → L → S2 → L → ... → L → Sn → Output, with every latch (L) driven by a common clock pulse]

Difference between latch and register: A latch loses the information (data) when it is passed on to the next stage. A register retains the information until it is cleared.
Types of Synchronous Pipeline
• Uniform delay pipeline: In this type of pipeline, all the stages take the same time to complete an operation.
In a uniform delay pipeline,
Cycle Time (Tp) = Stage Delay + Latch Delay

• Non-uniform delay pipeline: In this type of pipeline, different stages take different times to complete an operation.
In this type of pipeline,
Cycle Time (Tp) = Maximum(Stage Delay) + Latch Delay

For example, if there are 4 stages with delays of 1 ns, 2 ns, 3 ns, and 4 ns, then
Tp = Maximum(1 ns, 2 ns, 3 ns, 4 ns) + Latch Delay
= 4 ns + Latch Delay
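Both cycle-time formulas reduce to one expression, since in a uniform pipeline the maximum stage delay is simply the common stage delay. A minimal sketch follows; the function name and the 0.5 ns latch delay are assumptions for illustration, as the example above leaves the latch delay unspecified.

```python
def cycle_time(stage_delays, latch_delay=0.0):
    # The slowest stage sets the clock; the latch delay is added on top.
    return max(stage_delays) + latch_delay

# Non-uniform pipeline from the example, with an assumed latch delay of 0.5 ns.
print(cycle_time([1, 2, 3, 4], latch_delay=0.5))   # 4.5
# Uniform pipeline: every stage takes 2 ns.
print(cycle_time([2, 2, 2, 2], latch_delay=0.5))   # 2.5
```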
Performance Metrics for Pipeline
• Clock period: The time required to complete a single stage (stage delay + latch delay). The delay of every interface latch is the same, but the delays of the combinational circuits in different stages differ, so the maximum stage delay is taken as the common stage duration.
Clock period for a pipeline architecture:
$\tau = \max_{1 \le i \le k} \{\tau_i\} + \tau_l$
where $\tau_i$ is the time delay of stage $i$ and
$\tau_l$ is the time delay of a latch.

• Speed-up: The speed-up of a k-stage pipelined processor over an equivalent non-pipelined processor is
$S_k = \frac{T_1}{T_k} = \frac{nk\tau}{[k + (n - 1)]\tau} = \frac{nk}{k + (n - 1)}$
where $n$ is the number of instructions and
$k$ is the number of stages in the pipeline.
• Efficiency: The efficiency of a linear pipeline is the percentage of busy time-space spans over the total time-space span.
Total time-space span = number of clock pulses for pipeline * total stages
= [k + (n - 1)] * k
Busy time-space span = number of instructions * number of stages
= n * k
$\eta = \frac{nk}{[k + (n - 1)]\,k} = \frac{n}{k + (n - 1)}$
= number of instructions / number of clock pulses for pipeline

The larger the number of tasks flowing through the pipeline, the higher the efficiency.

• Throughput: The number of instructions that can be completed by a pipeline per unit time.
W = number of instructions / [number of clock pulses for pipeline * clock period]
$W = \frac{n}{[k + (n - 1)]\,\tau} = \frac{\eta}{\tau}$
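The efficiency and throughput formulas can be evaluated numerically to see how they relate to each other. This is a sketch; the function names and the sample values (n = 100, k = 4, τ = 10 ns) are chosen only for illustration.

```python
def efficiency(n, k):
    # Busy time-space span divided by total time-space span.
    return (n * k) / ((k + (n - 1)) * k)

def throughput(n, k, tau):
    # Instructions completed per unit of time (same unit as tau).
    return n / ((k + (n - 1)) * tau)

n, k, tau = 100, 4, 10  # tau in ns
eta = efficiency(n, k)
w = throughput(n, k, tau)

print(round(eta, 3))                   # 0.971
print(abs(w - eta / tau) < 1e-12)      # True: W = eta / tau
print(round(k * eta, 3))               # 3.883, the speed-up n*k / (k + n - 1)
print(efficiency(10, k) < efficiency(1000, k))  # True: more tasks, higher efficiency
```

The last line illustrates that efficiency approaches 1 as n grows, which is why the speed-up approaches k for long instruction streams.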
Numerical on Synchronous Pipeline
Question 1: Consider a 4-segment pipeline with stage delays (2 ns, 8 ns, 3 ns, 10 ns). Find the time taken to execute 100 tasks in the above pipeline. [Consider that there is no latch delay]

Answer: CPU time for pipeline = (k + n - 1) Tp [k = stages, n = tasks, Tp = clock cycle time]
As the above pipeline is a non-uniform pipeline, Tp = max(2, 8, 3, 10) = 10 ns
CPU time for pipeline = (4 + 100 - 1) x 10 ns = 103 x 10 ns = 1030 ns

Question 2: A 4-stage pipeline has stage delays of 150 ns, 120 ns, 160 ns, and 140 ns. Latches have a delay of 5 ns each. Find the time taken to execute 1000 tasks in the pipeline.

Answer: CPU time for pipeline = (k + n - 1) Tp [k = stages, n = tasks, Tp = clock cycle time]
For a non-uniform pipeline, maximum stage delay = max(150, 120, 160, 140) = 160 ns, so Tp = 160 ns + 5 ns = 165 ns
CPU time for pipeline = (4 + 1000 - 1) x 165 ns
= 1003 x 165 ns = 165,495 ns ≈ 165.5 µs
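Both answers can be reproduced with a few lines of code. This is a minimal sketch; the function name `pipeline_time` is illustrative, and delays are in nanoseconds.

```python
def pipeline_time(stage_delays, n, latch_delay=0):
    k = len(stage_delays)
    tp = max(stage_delays) + latch_delay   # clock cycle time of a non-uniform pipeline
    return (k + n - 1) * tp

# Question 1: no latch delay, 100 tasks.
print(pipeline_time([2, 8, 3, 10], 100))                          # 1030
# Question 2: 5 ns latch delay, 1000 tasks.
print(pipeline_time([150, 120, 160, 140], 1000, latch_delay=5))   # 165495
```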
Asynchronous Pipeline Model

• There are no latches in this model.

• The data flow between adjacent stages is controlled by a handshaking protocol.

• When stage Si is ready to transmit intermediate data, it sends a ready signal to the next stage Si+1.

• Stage Si+1 sends an acknowledgement signal to indicate that it is ready to accept the incoming data.

• Stage Si then sends the data to Si+1.

[Figure: Input → S1 → S2 → ... → Sk → Output, with Data and Ready signals passing forward and ACK signals passing backward between adjacent stages]
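The ready/acknowledge/data sequence listed above can be traced with a small sequential simulation. This is only a sketch of the signal ordering as described on the slide, not a model of real asynchronous hardware: the function name, the log format and the three example stage functions are all hypothetical.

```python
def handshake_transfer(stages, item):
    """Pass one item through the stages, logging the handshake between neighbours."""
    log = []
    data = item
    for i, stage in enumerate(stages):
        data = stage(data)                     # stage S(i+1) processes its input
        if i + 1 < len(stages):
            sender, receiver = f"S{i + 1}", f"S{i + 2}"
            log.append(f"{sender} -> {receiver}: READY")    # sender has data to transmit
            log.append(f"{receiver} -> {sender}: ACK")      # receiver can accept it
            log.append(f"{sender} -> {receiver}: DATA {data}")
    return data, log

# Three hypothetical stages: add 1, double, subtract 3.
stages = [lambda x: x + 1, lambda x: x * 2, lambda x: x - 3]
result, log = handshake_transfer(stages, 5)
print(result)   # 9
for line in log:
    print(line)
```

Because each transfer waits for its own acknowledgement rather than a shared clock, stages with different delays do not have to be slowed to the pace of the slowest one.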

Non-linear Pipeline Model
• A pipeline containing feed-forward and feedback connections in addition to the streamline connection is called a non-linear pipeline.

• Because of the feed-forward and feedback connections, it can be reconfigured to perform different functions at different times. For this reason it is also called a dynamic pipeline. In this pipeline the output does not have to come from the last stage only; for different functions, output can be generated at any stage.

• Non-linear pipelines are used for recursion problems.

[Figure: three-stage non-linear pipeline. Input → S1 → S2 → S3 along the streamline path, with a feed-forward path, feedback paths, and two outputs, Output X and Output Y]
Difference Between Linear and Non-linear Pipeline
Linear pipeline | Non-linear pipeline
Linear pipelines are static pipelines because they are used to perform fixed functions. | Non-linear pipelines are dynamic pipelines because they can be reconfigured to perform variable functions at different times.
A linear pipeline allows only streamline connections. | A non-linear pipeline allows feed-forward and feedback connections in addition to the streamline connection.
It is relatively easy to partition a given function into a sequence of linearly ordered sub-functions. | Function partitioning is relatively difficult because the pipeline stages are interconnected with loops in addition to streamline connections.
The output of the pipeline is produced from the last stage. | The output of the pipeline is not necessarily produced from the last stage.
