Lecture 06 - Pipelining and Parallelism

Pipelining + Parallelism

Week - 06
22-29 October 2018

Topics to Cover
‘Organizational techniques’ to improve processor speed:
• Pipelining
• Superscalar
• Super-pipeline
Parallelism:
• Instruction-level parallelism (ILP): pipelining, superscalar
• Machine-level parallelism: multicore systems, cluster computers
Flynn’s Taxonomy of Computers
Paper Pattern – Mid-Term (Total Marks = 30)
• Short questions – 5 (2 marks each)
• Long question – 1 (10 marks)
• Numerical questions – 2 (5 × 2 = 10 marks)
1. Multi-Stage Pipeline
[Figure: instruction format – Opcode | Address]

• Instruction pipelining is an organizational approach to improving processor performance.
• As in a pipeline, new inputs are accepted at one end before previously
accepted inputs appear as output at the other end.
• Each step in the instruction cycle (fetch -> decode -> execute) takes at
least one tick of the system clock, called a clock cycle.
• But this does not mean that the processor must wait until all steps are
completed before beginning to process the next instruction.
• The processor can execute the steps of different instructions in parallel, a technique known as pipelining (i.e. overlapping of instruction-processing steps).
Six Stages of an Instruction
• The six stages of an instruction are listed below:
1. Fetch instruction (FI): Read the next expected instruction into a buffer, as indicated by the program counter (PC).
2. Decode instruction (DI): Determine the opcode and the operand specifiers.
3. Calculate operands (CO): Calculate the effective address of each source operand. This may involve displacement, register-indirect, or indirect address calculation.
4. Fetch operand (FO): Fetch each operand from memory into a register.
5. Execute instruction (EI): Perform the indicated operation and produce the result.
6. Write operand (WO): Store the result in memory.
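Before looking at the figures, here is a minimal sketch (not from the slides) of how these six stages can overlap in time. It prints a timing grid showing which stage each instruction occupies in each clock cycle of an ideal pipeline, assuming every stage takes exactly one cycle:

```python
# Minimal sketch (not from the slides): timing grid for an ideal pipeline
# in which every stage takes exactly one clock cycle.
STAGES = ["FI", "DI", "CO", "FO", "EI", "WO"]

def pipeline_trace(n_instructions, stages=STAGES):
    k = len(stages)
    total_cycles = k + (n_instructions - 1)        # see the formula later on
    for i in range(n_instructions):
        cells = []
        for cycle in range(1, total_cycles + 1):
            s = cycle - 1 - i                      # stage index this cycle
            cells.append(stages[s] if 0 <= s < k else "..")
        print(f"I-{i + 1}: " + " ".join(cells))

pipeline_trace(2)
# I-1: FI DI CO FO EI WO ..
# I-2: .. FI DI CO FO EI WO
```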
Non-Pipelined Instruction Execution (Figure Next Slide)
• Let’s assume that each execution stage in the processor requires a
single clock cycle.
• Figure uses a grid to represent a six-stage non-pipelined processor.
• When instruction I-1 has finished stage S6, instruction I-2 begins.
• Twelve clock cycles are required to execute the two instructions.
• In other words, for k execution stages, n instructions require (n*k)
cycles to process.
• This represents a major waste of CPU resources, because each stage is in use only one-sixth of the time.

6-Stage Non-Pipelined Instruction Execution
[Figure: I-1 occupies stages S1–S6 in cycles 1–6, then I-2 occupies them in cycles 7–12; n × k = 2 × 6 = 12 cycles.]


Pipelined Execution (Fig. Next Slide)
• If, on the other hand, a processor supports pipelining, a new
instruction can enter stage S1 during the second clock cycle.
• Meanwhile, the first instruction has entered stage S2.
• This enables the overlapped execution of the two instructions.
• In Figure, two instructions I-1 and I-2, are shown progressing through
the pipeline.
• I-2 enters stage S1 as soon as I-1 has moved to stage S2.
• As a result, only seven clock cycles are required to execute I-1 & I-2.
• When the pipeline is full, all six stages are in use all the time.
• In general, for k execution stages, n instructions require k+(n-1) cycles.
6-Stage Pipelined Instruction Execution
[Figure: a new instruction enters the pipeline, and one completes, every clock cycle; k + (n − 1) = 6 + 1 = 7 cycles for the two instructions.]

Q. In a six-stage pipelined processor, how many instructions can be executed in 12 clock cycles? Ans: 7.
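As a quick check of the two formulas (a small sketch in Python, using the slides’ k = 6):

```python
# Cycle counts for a k-stage processor executing n instructions.
def non_pipelined_cycles(k, n):
    return n * k                 # each instruction uses all k stages alone

def pipelined_cycles(k, n):
    return k + (n - 1)           # one new instruction enters every cycle

print(non_pipelined_cycles(6, 2))    # 12, as in the non-pipelined figure
print(pipelined_cycles(6, 2))        # 7, as in the pipelined figure

# Quiz: with k = 6 and 12 cycles available, k + (n - 1) = 12 gives
print(12 - 6 + 1)                    # n = 7 instructions
```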
2. Superscalar Architecture (Fig. Next Slide)
• A superscalar processor has two or more execution pipelines, making it possible for two instructions to be in the execution stage at the same time. With n pipelines, n instructions can execute during the same clock cycle.
• In the previous pipeline example, we assumed that the ‘instruction
execution’ stage (S4) required a single clock cycle.
• That was an overly simplistic approach.
• What would happen if stage S4 required two clock cycles?
• Then a bottleneck would occur, as shown in Figure next slide.
• Instruction I-2 cannot enter stage S4 until I-1 has completed the stage,
so I-2 has to wait one more cycle before entering stage S4.
Q. Name a way of increasing the efficiency of the pipeline. Ans: The superscalar approach.
Without Super-Scalar Pipelining
• As more instructions enter the pipeline, wasted (stall) cycles occur.
• In general, for k stages where one stage requires 2 execute cycles, n instructions require (k + 2n − 1) cycles to process.

[Figure: I-2 waits one cycle, and I-3 two cycles, to enter stage S4; k + 2n − 1 = 6 + 6 − 1 = 11 cycles for three instructions.]
With Super-Scalar Pipelining (Figure Next Slide)
• When a superscalar processor design is used, multiple instructions can
be in the execution stage at the same time.
• For n-pipelines, n-instructions can execute during the same clock
cycle.
• Let us introduce a second pipeline (superscalar) into our 6-staged
pipeline and assume that execution stage S4 requires two clock cycles.
• In Figure, odd-numbered instructions enter the u-pipeline and even-
numbered instructions enter the v-pipeline.
• This removes the wasted cycles, and it is now possible to process n
instructions in (k + n) cycles.
Two Pipelines (Superscalar)
[Figure: with two pipelines, one instruction completes per clock cycle despite the two-cycle execute stage; k + n = 6 + 4 = 10 cycles for four instructions.]
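A small sketch comparing the two cases (the instruction counts n = 3 and n = 4 are inferred from the cycle totals shown in the two figures; k = 6 throughout):

```python
# A single pipeline whose execute stage (S4) takes two cycles, versus a
# two-pipeline (superscalar) design that hides the extra execute cycle.
def single_pipe_slow_execute(k, n):
    return k + 2 * n - 1         # each further instruction adds two cycles

def superscalar_two_pipes(k, n):
    return k + n                 # per the slides, wasted cycles are removed

print(single_pipe_slow_execute(6, 3))   # 11, as in the bottleneck figure
print(superscalar_two_pipes(6, 4))      # 10, as in the superscalar figure
```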
3. Super-Pipeline
• In a super-pipeline, many pipeline stages perform tasks that require less than half a clock cycle.
• Super-pipelining breaks the stages of a given pipeline into smaller stages (thus making the pipeline deeper) in an attempt to shorten the clock period, enhancing instruction throughput by keeping more instructions in flight at a time.

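A rough numeric sketch of why deepening the pipeline helps (the stage counts and timings below are invented for illustration and are not from the slides):

```python
# Hypothetical numbers: splitting each 1 ns stage of a 6-stage pipeline
# into two 0.5 ns sub-stages yields a 12-stage super-pipeline.
n = 1000                                   # instructions to execute
base_time = (6 + n - 1) * 1.0              # k + (n - 1) cycles at 1 ns each
superpipe_time = (12 + n - 1) * 0.5        # deeper pipeline, shorter clock

print(base_time, superpipe_time)           # 1005.0 ns vs 505.5 ns (~2x)
```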
Super-Pipeline Performance
[Figure: super-pipeline performance comparison.]
‘Super-Scalar’ VS ‘Super-Pipeline’
• A simple pipelined system performs only one pipeline stage per clock cycle.
• A super-pipelined system is capable of performing two pipeline stages per clock cycle.
• A superscalar system performs only one pipeline stage per clock cycle, but does so in each of its parallel pipelines.
Pipeline Hazards/Problems
• Hazards limit pipelining: they prevent the next instruction from executing during its designated clock cycle.
1. Structural hazards: Hardware cannot support some combinations of instructions (two instructions need the same hardware resource in the same cycle).
2. Control hazards: Pipelining of branches causes later instruction fetches to wait for the result of the branch (limiting ILP).
3. Data hazards: An instruction depends on the result of a prior instruction still in the pipeline (data dependency).
 These may result in ‘stalls’ or ‘bubbles’ in the pipeline.
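A data hazard is the easiest of the three to see in code. In the sketch below (ordinary Python assignments standing in for register-level instructions), the second statement needs the value of r1 before the first has written it back, so a pipeline must stall:

```python
# Read-after-write (RAW) dependency: instruction 2 depends on instruction 1.
r2, r3, r5 = 10, 20, 5

r1 = r2 + r3    # instruction 1: result written back in its WO stage
r4 = r1 + r5    # instruction 2: needs r1 in its FO stage -> stall/bubble
print(r4)       # 35
```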

Preparatory Questions (Pipelining)
Q1. What is ‘instruction pipelining’? How can we pipeline instructions?
Q2. In a six-stage pipelined processor, how many instructions can be
executed in 18 clock cycles?
Q3. What is a ‘superscalar’ pipeline? How does it improve processor
performance?
Q4. What is a ‘superpipeline’? How does it differ from a normal
pipeline?
Q5. What are the ‘hazards’ to pipelining?

Parallelism
• Executing two or more operations at the same time is known as parallelism. It is used in ‘high-performance computing’.
• In ‘parallel processing’, the computer carries out multiple data-processing tasks simultaneously.
• Goal of Parallelism
• Parallelism is done to increase the ‘computational speed’ of a
computer system.
• It increases the computer’s processing capability and increases its
throughput, ‘the amount of processing during a given interval of time’.
Types of ‘Parallelism’
• Parallelism can be of two types:
1) Instruction-Level Parallelism (ILP) (parallelism in software) – for a uniprocessor:
   i. Pipelining
   ii. Superscalar
2) Machine Parallelism (parallelism in hardware) – for multiprocessors:
   i. Multi-core processors
   ii. Multi-computers (clusters)
1) Instruction Level Parallelism (ILP)
• Instruction-level parallelism exists when instructions in a sequence
are independent and thus can be executed in parallel by overlapping.
• As an example of the concept of ILP, consider the following two codes:
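(The two fragments themselves did not survive in this copy of the slides; the pair below is a reconstruction assumed from the description that follows.)

```python
# Reconstructed illustration; the operand values are arbitrary.
a = b = c = d = e = f = 1

# Fragment 1: three independent instructions - all can execute in parallel.
x = a + b
y = c + d
z = e + f

# Fragment 2: a dependency chain - each line needs the previous result.
x = a + b
y = x + c
z = y + d
```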

• The three instructions in the first fragment are independent, and in theory all three can be executed in parallel.
• In contrast, the three instructions in the second fragment cannot be executed in parallel, because the second instruction uses the result of the first, and the third instruction uses the result of the second.
Instruction Level Parallelism (ILP)
• Instruction-level parallelism (ILP) is a measure of how many of the operations in a computer program can be performed simultaneously.
• Micro-architectural techniques that are used to exploit ILP include:
i. Instruction pipelining: where the execution of multiple instructions
can be partially overlapped. (You have already studied this)
• In Pipelining, while an instruction is being executed in the ALU, the
next instruction can be read from memory. (overlap fetch & execute)
ii. Superscalar: execution in which multiple execution units are used
to execute multiple instructions in parallel.

ii. Superscalar Approach
• A parallel processing system is able to perform concurrent data
processing to achieve faster execution time.
• In a Superscalar computer, the system has redundant functional units.
• For example, the system may have two or more ALUs and be able to
execute two or more instructions at the same time.
• ‘Parallel processing’ is established by distributing the data among the
multiple functional units.
• The amount of hardware increases with parallel processing, and with it the cost of the system.

Superscalar Processor with Multiple Functional Units (Figure Next Slide)
• For example, the arithmetic, logic, and shift operations can be separated into three units.
• All units are independent of each other, so one number can be shifted while another number is being incremented.
• The operands are diverted to each unit under the supervision of a complex ‘Control Unit’, which coordinates all the activities among the various components.
Multiple Functional Units
• The figure shows one possible way of separating the execution unit into eight functional units operating in parallel.
• The operands in the registers are applied to one of the units, depending on the operation specified by the instruction associated with the operands.
2. Machine Parallelism
• Machine parallelism is a measure of the ability of the processor to
take advantage of instruction-level parallelism (ILP).
i. Multi-core processors: the system may have two or more
processors operating concurrently. (You have already studied this)
• Such a multi-core system supports ‘multi-threaded’ programs and ‘multi-tasking’.
ii. Multi-computers (Clusters): consist of multiple independent
computers organized in a cooperative fashion. (e.g. networks)
• Clusters are ‘interconnected computers’ that can support workloads
that are beyond the capacity of a single multiprocessor computer.
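A minimal sketch of machine-level parallelism on a multi-core system, using Python’s standard multiprocessing module (an illustration, not from the slides): independent tasks are distributed across cores.

```python
# Spread independent CPU-bound tasks across processor cores.
from multiprocessing import Pool

def work(n):
    return sum(i * i for i in range(n))    # independent task, no shared data

if __name__ == "__main__":
    with Pool() as pool:                   # one worker process per core
        results = pool.map(work, [10**6] * 4)
    print(results)
```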
Final Note
• Both instruction-level and machine parallelism are important factors
in enhancing performance.
• A program may not have enough instruction-level parallelism to take full advantage of machine parallelism (e.g. not enough independent instructions, or no support for multiple threads).
• The use of a fixed-length instruction set architecture, as in a RISC,
enhances instruction-level parallelism. (e.g. pipelining)
• On the other hand, limited machine parallelism will limit performance
no matter what the nature of the program.

Types of Parallel Processor Systems
• The normal operation of a computer is to fetch instructions from
memory and to execute them in the processor. (instruction cycle)
• The sequence of instructions read from memory constitutes an
instruction stream.
• The operations performed on the data in the processor constitute a data stream.
• Parallel processing may occur in the ‘instruction stream’, in the ‘data stream’, or in both.
• Flynn classified systems on the basis of these two streams.

Flynn’s Taxonomy of Parallel Processor Systems
• On the basis of ‘instruction’ & ‘data streams’, Flynn classifies the
organization of a computer system by the number of instruction and
data items that are manipulated simultaneously.
• Flynn’s classification: divides computers into four major groups as:
1. Single instruction stream, single data stream (SISD)
2. Single instruction stream, multiple data stream (SIMD)
3. Multiple instruction stream, single data stream (MISD)
4. Multiple instruction stream, multiple data stream (MIMD)

1. Single Instruction, Single Data (SISD)
• In SISD, a single processor executes instructions sequentially from a single instruction stream; each instruction processes one data item stored in a single memory.
• Parallel processing in this case may be achieved by means of ‘multiple functional units’ or by ‘pipeline’ processing.
• Uniprocessor systems fall into this category.
2. Single Instruction, Multiple Data (SIMD)
• In SIMD, the same instruction is executed by all processors (cores), each on different data.
• SIMD represents an organization that includes many processing units
under the supervision of a common control unit.
• All processors receive the same instruction from the control unit but operate on different items of data (e.g. a GPU, Graphics Processing Unit).
• The shared memory unit must contain different modules so that it can
communicate with all processors simultaneously.
• Vector processors & array processors fall into this category.

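NumPy’s vectorized operations give a software-level feel for the SIMD idea (a sketch assuming NumPy is installed; actual SIMD happens in hardware vector registers): one add is applied element-wise across many data items.

```python
# One instruction (conceptually), many data items.
import numpy as np

a = np.array([1, 2, 3, 4])
b = np.array([10, 20, 30, 40])
print(a + b)    # element-wise add: [11 22 33 44]
```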
3. Multiple Instructions, Single Data (MISD)
• In MISD, different instructions (processors) operate on the same data.
• This structure is only of theoretical interest and has not been commercially implemented (since multiple processors operating on the same data would produce the same results).
• Fault-tolerant systems fall into this category: such systems must be able to continue working at a satisfactory level in the presence of faults.
4. Multiple Instruction, Multiple Data (MIMD)
• In MIMD, a multiprocessor system is capable of processing the data streams of several programs (instruction streams) at the same time.
• Each processor uses its own data and executes its own program.
• Multiprocessor systems and multi-computers (clusters) fall into this category.
Figure. A Taxonomy of Parallel Processor Architecture
[Figure: Flynn’s taxonomy – SISD, SIMD, MISD, and MIMD branches.]
Final Note
• Flynn’s classification depends on the distinction between the performance of the ‘control unit’ and the ‘data-processing unit’.
• It emphasizes the ‘behavioural characteristics’ of the computer system rather than its ‘operational and structural interconnections’.
Preparatory Questions (Parallelism)
1. What is ‘parallelism’? Describe its goal.
2. What are the two types of parallelism? Describe.
3. Describe ‘instruction-level parallelism’ and its types.
4. Describe ‘machine-level parallelism’ and its types.
5. Describe ‘Flynn’s taxonomy of parallel processor systems’ and its
types.
