Increasing Instruc: Microprocessors W of The Oe

1. Instruction pipelining increases processor efficiency by dividing instruction cycles into multiple phases (fetch, decode, read, execute) so that multiple instructions can be processed simultaneously. 2. In a pipelined processor, the functional units are kept busy almost all the time as new instructions continuously flow through the pipeline. This increases throughput compared to non-pipelined processors. 3. Pipelining works best for RISC processors with single-cycle instructions, but complex CISC instructions may require flushing the pipeline if they span multiple cycles, reducing efficiency.

Uploaded by

Yogabharath

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views

Increasing Instruc: Microprocessors W of The Oe

Uploaded by

Yogabharath

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Introduction to Programmable DSPs 45

2.6 PIPELINING
am
data
per
One of the approaches adopted for increasing the efficiency of the advanced microprocessors as w
as P-DSPs isinstruction pipelining. An
instruction cycle starting with the fetching of an instruc
and ending with the execution of the
an instruction including the time storage results can of the oe
into a number of microinstructions. Execution of each of the microinstructions is also referred to as

one phase of an instruction. For

example, an instruction cycle requiring four microinstructions a o
rd said to be in four phases as follows:
ds, 1. Fetch phase in which the instruction is fetched from the
program memory
2. Decode phase in which the instruction is decoded
in
3. Memory read phase in which the operand required for the execution of the instruction may be
er-
read from the data memory
is
4. Execution phase in which execution as well as the storage of the results in either one of the
ut registers or memory is carried out
Each of the above microinstructions may be carried out separately by four functional units. Let us
assume that each of the above four phases take equal time for completion. In this case in a conven-
tional microprocessor with no pipelining, cach of the functional units is busy only 25% of the time.
This is because only one instruction is processed at the CPU at a time. Figure 2.7 shows wheneach of
the functional unit is busy when a program containing three instructions I1,12, 13 is executed.

Value of T Fetch Decode Read Execute

1 1

11
2
3 1

4
11

5 2

6 2

12
12

9 13
10 13
13
11
13
12

Fig. 2.7 Instruction cycles ofprocessor with no pipelining

instructions
almost all the time by processing a number of
The functional units can be kept busy instructions I1,
in a machine with four functional units, four
simultaneously in the CPU. For example, decode phase,
as shown in Fig. 2.8. When II enters the
12, 13 and 14 can be processed simultaneously
When Il enters the operand read phase, 12 enters the
decode
12 can enter the opcode fetch phase. enters the operand
fetch phase. When Il enters the execute phase, 12
phase and 13 enters the opcode is fully
and 14 enters the opcode fetch phase. The pipeline
read phase, I3 enters the decode phase 14 keep the
useful work to do. The instructions that follow
loaded now and all the functional units have of the
functional units till the
busy is exited. Let T denote the time required for each phase
program
46 Digital Signal Processors

instruction. One clock eycle of the processor corresponds to 7. In a period of 127 only three instruc.
tions can be executed in a machine without pipelining. In the same period nine instructions can he
De
executed as shown in Fig. 2.8. Hence the throughput is increased by a factor of 3 in this case

Value of T Fetch Decode Read Execute

13 12 11

13 12

15 4 13 12
5

I6 l5 14 3
6

17 6 15 14

8 18 7 6 15

9 19 17 16
19 18 7
10
11 19 18
12 19

Fig.2.8 lnstruction cycles of a processor with pipelining

It may be noted that the initial latency of a machine with four phases is 4T. Hence for executing a
program with N instructions, the time required for execution is (N + 4)7, With a non-pipelined ma-
chine, the time required for executing N instructions is 4NT.
Instruction pipeline shown in Fig. 2.8 corresponds to a highly optimistic case. In the case of
processors requiring single clock cycle for execution for each of the instructions in the program, the
throughput shown in Fig. 2.8 can be achieved. This is normally achieved with restricted instruction set
computers (RISC). However in complex instruction set computers (CISC), there are also instructions
with multiple word requiring multiple clock cycles for execution. In this case all the functional units
cannot be kept busy all the time. For example, in the case of call and branch instructions of a P-DSP,
four phases or T states are required for the call/branch instruction to exit execution phase. By that time
two more single word instructions or one double instruction enters the instruction pipeline. These
instructions should not be executed. Hence two words have to be flushed out of the instruction
pipeline before the instructions are fetched starting from the new program address.
To overcome this problem, some of the P-DSPs have special branch/call and return instructions
called as delayed branch/call/return instructions. When the delayed branch instruction is executed, the
program branches to the new program address only after the two -word instructions or the single 2-
word instruction following the branch instruction are completely executed. Similarly, when the de-
layed call instruction is executed, the program calls to the subroutine only after the two I-word
instructions or the single 2-word instruction following the call instruction are completely executed.
When the delayed call/branch/return instructions are executed, there is no need for
line and maximum throughput is obtained. Examples of pipeline operation of delayed as well as
the flushing pipe-
undelayed branch/call instructions are given in Chapter 4
The throughput efficiency of the pipeline may also be reduced because of conflicts between the
instructions in the instruction pipeline in different phases. This happens if the same memory is used to
store the data and program and there is only a single address bus for addressing both the program and
Introduction to Programmable
DSPs 47

ta
data memory. Tms is true in he case of off-chip memory., For example, an instruction in fetcn p1as
maytry to fetch the
instnuction code from a memory chip accessed
that is also by anotherinstruci
that is in the
operand read phase.To avoid the conflict,the operand read phase
will be done ano
irst
hancode feteh phase will be repeated till there is no conflict again.
referred to as aepu
The nuber oi instructions that are processed simultaneously in the CPU, also
of the instruction pIpeline, differs in different families of P-DSPs. The pipeline depths
of some or
P-DSPs are given in Table 2.1.

Assignment Solution Week11
100% (1)
Assignment Solution Week11
5 pages
CO Gate 2023
No ratings yet
CO Gate 2023
6 pages
Lecture 7 - PIPELINING
No ratings yet
Lecture 7 - PIPELINING
16 pages
Unit 1
No ratings yet
Unit 1
44 pages
Computer Organization: An Introduction To RISC Hardware: 6.1 An Overview of Pipelining
No ratings yet
Computer Organization: An Introduction To RISC Hardware: 6.1 An Overview of Pipelining
12 pages
DSP Notes
No ratings yet
DSP Notes
15 pages
11 Processor Structure and Function 20 3 18
No ratings yet
11 Processor Structure and Function 20 3 18
27 pages
Chap-10: Speed and Efficiency
No ratings yet
Chap-10: Speed and Efficiency
29 pages
Ddco5-240207065925-3db65dc3 (1) - Pages-Deleted
No ratings yet
Ddco5-240207065925-3db65dc3 (1) - Pages-Deleted
8 pages
Advantages and Disadvantages MP
No ratings yet
Advantages and Disadvantages MP
5 pages
CO Pipelining PDF notes
No ratings yet
CO Pipelining PDF notes
10 pages
pipelining
No ratings yet
pipelining
47 pages
Pipe Lining
No ratings yet
Pipe Lining
23 pages
Instruction Pipeline Design, Arithmetic Pipeline Deign - Super Scalar Pipeline Design
No ratings yet
Instruction Pipeline Design, Arithmetic Pipeline Deign - Super Scalar Pipeline Design
34 pages
Pipeline: A Simple Implementation of A RISC Instruction Set
No ratings yet
Pipeline: A Simple Implementation of A RISC Instruction Set
16 pages
Session6-Pipelining approach
No ratings yet
Session6-Pipelining approach
11 pages
Pipelining and Others
No ratings yet
Pipelining and Others
34 pages
DSP Unit-5 Solutions
No ratings yet
DSP Unit-5 Solutions
17 pages
CAO-II Module 2 Complete
100% (1)
CAO-II Module 2 Complete
32 pages
Co - Unit Ii - Ii
No ratings yet
Co - Unit Ii - Ii
34 pages
Differentiate Organization and Architecture.: Advanced Computer Architechture Assignment 1
No ratings yet
Differentiate Organization and Architecture.: Advanced Computer Architechture Assignment 1
4 pages
Pipeline Processing
No ratings yet
Pipeline Processing
28 pages
Pipeline Processing
No ratings yet
Pipeline Processing
43 pages
COA CH 6
No ratings yet
COA CH 6
14 pages
CA unit-2 Chapter-2
No ratings yet
CA unit-2 Chapter-2
36 pages
Computer Organization - Hardwired V/s Micro-Programmed Control Unit
No ratings yet
Computer Organization - Hardwired V/s Micro-Programmed Control Unit
9 pages
Instruction Pipeline
No ratings yet
Instruction Pipeline
27 pages
DDCO-Jan25-Unit5
No ratings yet
DDCO-Jan25-Unit5
30 pages
Computer Architecture Pipe Line
No ratings yet
Computer Architecture Pipe Line
28 pages
Chapter 9 - Pipeline and Vector Processing Section 9.1 - Parallel Processing
No ratings yet
Chapter 9 - Pipeline and Vector Processing Section 9.1 - Parallel Processing
10 pages
2 Performance Issue
No ratings yet
2 Performance Issue
4 pages
CH7-Parallel and Pipelined Processing
No ratings yet
CH7-Parallel and Pipelined Processing
23 pages
Unit 5
No ratings yet
Unit 5
43 pages
5.Pipeline and Multiprocessors
No ratings yet
5.Pipeline and Multiprocessors
16 pages
Unit 5 - Pipeling and Multipoessors
No ratings yet
Unit 5 - Pipeling and Multipoessors
74 pages
Pipelining Unit 3
No ratings yet
Pipelining Unit 3
19 pages
ACA Question Bank
No ratings yet
ACA Question Bank
19 pages
2 - Performance Issue
No ratings yet
2 - Performance Issue
4 pages
ACA Unit 2,7th Sem CSE
No ratings yet
ACA Unit 2,7th Sem CSE
13 pages
CSO Lecture Notes Unit - 5
No ratings yet
CSO Lecture Notes Unit - 5
11 pages
Chapter 6
No ratings yet
Chapter 6
71 pages
Pipelining Seminar
No ratings yet
Pipelining Seminar
14 pages
MODULE-5 DDCO_BCS302 DR LAXMI G
No ratings yet
MODULE-5 DDCO_BCS302 DR LAXMI G
7 pages
CSN-221 Pipelines-Quiz: Enrollment No.: 18114031 Name - Hemil Panchiwala
No ratings yet
CSN-221 Pipelines-Quiz: Enrollment No.: 18114031 Name - Hemil Panchiwala
6 pages
ILP - Appendix C PDF
No ratings yet
ILP - Appendix C PDF
52 pages
CS17303 Computer Architecture Notes On Lesson Unit IV - Sumathi
No ratings yet
CS17303 Computer Architecture Notes On Lesson Unit IV - Sumathi
24 pages
Elec327b DSP Processors 1
No ratings yet
Elec327b DSP Processors 1
21 pages
Introduction To Pipelining Introduction To Pipelining
No ratings yet
Introduction To Pipelining Introduction To Pipelining
35 pages
Coa Lecture Unit 3 Pipelining
No ratings yet
Coa Lecture Unit 3 Pipelining
95 pages
Parallelism in Uniprocessor System and Granularity
100% (5)
Parallelism in Uniprocessor System and Granularity
5 pages
Pipelining - Computer Architecture and Organization
No ratings yet
Pipelining - Computer Architecture and Organization
40 pages
Lect3 Pipeline
No ratings yet
Lect3 Pipeline
4 pages
Contact Session 8 - With Annotation-1
No ratings yet
Contact Session 8 - With Annotation-1
47 pages
Module-03
No ratings yet
Module-03
9 pages
Slide 6
No ratings yet
Slide 6
46 pages
Practical Java Programming for IoT, AI, and Blockchain
From Everand
Practical Java Programming for IoT, AI, and Blockchain
Perry Xiao
No ratings yet
The Project Management Handbook: Simplified Agile, Scrum and Devops for BeginnersSim: Tech and computers simplified
From Everand
The Project Management Handbook: Simplified Agile, Scrum and Devops for BeginnersSim: Tech and computers simplified
Jack C. Stanely
No ratings yet
Beginning Software Engineering
From Everand
Beginning Software Engineering
Rod Stephens
4.5/5 (2)
Projects With Microcontrollers And PICC
From Everand
Projects With Microcontrollers And PICC
Guillermo Perez Guillen
5/5 (1)
Bare Metal C: Embedded Programming for the Real World
From Everand
Bare Metal C: Embedded Programming for the Real World
Stephen Oualline
No ratings yet
Interview Questions for IBM Mainframe Developers
From Everand
Interview Questions for IBM Mainframe Developers
Robert Wingate
1/5 (1)
Unit-4 Pipelinie and Vector Processing
No ratings yet
Unit-4 Pipelinie and Vector Processing
33 pages
Chapter - 1 Basic Structure of Computers: 1.1 Computer Types
No ratings yet
Chapter - 1 Basic Structure of Computers: 1.1 Computer Types
46 pages
Von Neumann Architecture vs. Harvard
No ratings yet
Von Neumann Architecture vs. Harvard
22 pages
Get (eBook PDF) Computer Systems: A Programmer's Perspective 3nd Edition free all chapters
100% (11)
Get (eBook PDF) Computer Systems: A Programmer's Perspective 3nd Edition free all chapters
56 pages
16-Bit Microcontroller IP
No ratings yet
16-Bit Microcontroller IP
2 pages
Input Unit: Memory: in Processing Element (PE) or CPU: Output
No ratings yet
Input Unit: Memory: in Processing Element (PE) or CPU: Output
24 pages
CAO - Processor Organization and Control Unit
No ratings yet
CAO - Processor Organization and Control Unit
120 pages
Pipeline Architecture PDF
100% (1)
Pipeline Architecture PDF
42 pages
Computer Archi
No ratings yet
Computer Archi
58 pages
Principles of Linear Pipelining
50% (2)
Principles of Linear Pipelining
71 pages
2-Introduction To Pentium Processor
92% (13)
2-Introduction To Pentium Processor
15 pages
2021 July ITT204-A
No ratings yet
2021 July ITT204-A
3 pages
Chapter1 - Basic Structure of Computers
0% (1)
Chapter1 - Basic Structure of Computers
119 pages
Computer Organization
No ratings yet
Computer Organization
150 pages
Paraphrase
No ratings yet
Paraphrase
51 pages
Design and Implementation of Synthesizable 32-Bit Four Stage Pipelined RISC Processor in FPGA Using Verilog/VHDL
No ratings yet
Design and Implementation of Synthesizable 32-Bit Four Stage Pipelined RISC Processor in FPGA Using Verilog/VHDL
8 pages
UNIT 5 (DSP Processor)
78% (9)
UNIT 5 (DSP Processor)
51 pages
coa mod 3 s4
No ratings yet
coa mod 3 s4
27 pages
Pulserain Fp51-1T Microcontroller: Technical Reference Manual
No ratings yet
Pulserain Fp51-1T Microcontroller: Technical Reference Manual
42 pages
Coa Question Bank
No ratings yet
Coa Question Bank
4 pages
Linley Group WP - SingleChipDataPlaneProcessors
No ratings yet
Linley Group WP - SingleChipDataPlaneProcessors
10 pages
StudyMaterial_CSE_3RD_Computer-System-Architecture
No ratings yet
StudyMaterial_CSE_3RD_Computer-System-Architecture
50 pages
Micunit 1
No ratings yet
Micunit 1
12 pages
Assignment 3
No ratings yet
Assignment 3
3 pages
Computer Architecture and Organization: EE-321 Spring 2021
No ratings yet
Computer Architecture and Organization: EE-321 Spring 2021
53 pages
Computing Without Clocks
100% (1)
Computing Without Clocks
8 pages
Syllabus: Veermata Jijabai Technological Institute
No ratings yet
Syllabus: Veermata Jijabai Technological Institute
41 pages
DSP Unit-5 Final
No ratings yet
DSP Unit-5 Final
97 pages
Gem 5 IO
No ratings yet
Gem 5 IO
161 pages

Increasing Instruc: Microprocessors W of The Oe

Uploaded by

Increasing Instruc: Microprocessors W of The Oe

Uploaded by

Introduction to Programmable DSPs 45

one phase of an instruction. For

Value of T Fetch Decode Read Execute

Fig. 2.7 Instruction cycles ofprocessor with no pipelining

Value of T Fetch Decode Read Execute

Fig.2.8 lnstruction cycles of a processor with pipelining

You might also like