Lect27-parallal-processing

Uploaded by

vharat sharma

Pipeline and Vector Processing

• Parallel Processing
  - Performing data-processing tasks simultaneously in order to increase computational speed
  - Concurrent data processing yields a shorter execution time
  - Multiple functional units:
    » The execution unit is separated into eight functional units that operate in parallel
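[Figure: Processor with multiple functional units. The processor registers, which also connect to memory, feed eight units operating in parallel: adder-subtractor, integer multiply, logic unit, shift unit, incrementer, floating-point add-subtract, floating-point multiply, and floating-point divide.]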
• Pipelining: the technique of decomposing a sequential process into suboperations, with each suboperation executed in a dedicated segment that operates concurrently with all other segments.
• A pipeline is a collection of processing segments through which binary information flows. Each segment performs partial processing dictated by the way the task is partitioned.
  - Pipelining example: Fig. 9-2
  - Multiply and add operation: Ai * Bi + Ci (for i = 1, 2, …, 7)
  - Divided into three suboperation segments:
    » 1) R1 ← Ai, R2 ← Bi : Input Ai and Bi
    » 2) R3 ← R1 * R2, R4 ← Ci : Multiply and input Ci
    » 3) R5 ← R3 + R4 : Add Ci to the product
  - Contents of registers in the pipeline example: Tab. 9-1
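[Figure 9-2: Pipeline for Ai * Bi + Ci. Segment 1: inputs Ai and Bi load registers R1 and R2. Segment 2: the multiplier output loads R3 while Ci loads R4. Segment 3: the adder output loads R5.]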
Clock pulse number   R1   R2   R3      R4   R5
1                    A1   B1   -       -    -
2                    A2   B2   A1*B1   C1   -
3                    A3   B3   A2*B2   C2   A1*B1+C1
4                    A4   B4   A3*B3   C3   A2*B2+C2
5                    A5   B5   A4*B4   C4   A3*B3+C3
6                    A6   B6   A5*B5   C5   A4*B4+C4
7                    A7   B7   A6*B6   C6   A5*B5+C5
8                    -    -    A7*B7   C7   A6*B6+C6
9                    -    -    -       -    A7*B7+C7
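
The register contents in the table can be reproduced with a short simulation. The following is a minimal sketch in Python, not part of the original lecture: the function name `simulate` and the use of `None` for an empty register are assumptions made for illustration.

```python
def simulate(A, B, C):
    """Simulate the three-segment pipeline computing A[i] * B[i] + C[i].

    Returns one tuple (clock, R1, R2, R3, R4, R5) per clock pulse.
    None stands for a register that holds no valid data (hypothetical convention).
    """
    n = len(A)
    R1 = R2 = R3 = R4 = R5 = None
    rows = []
    for clock in range(1, n + 3):  # k + n - 1 pulses with k = 3 segments
        i = clock - 1
        # All registers load on the same clock pulse, so each new value must
        # be computed from the OLD contents: update from the last segment back.
        R5 = R3 + R4 if R3 is not None else None                          # segment 3
        R3, R4 = (R1 * R2, C[i - 1]) if R1 is not None else (None, None)  # segment 2
        R1, R2 = (A[i], B[i]) if i < n else (None, None)                  # segment 1
        rows.append((clock, R1, R2, R3, R4, R5))
    return rows


if __name__ == "__main__":
    for row in simulate([1, 2, 3, 4, 5, 6, 7], [2] * 7, [10] * 7):
        print(row)
```

With seven operand triples the loop runs for nine clock pulses, and the first result appears in R5 at pulse 3, as in the table.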
General considerations
Four-segment pipeline: the operands pass through all four segments in a fixed sequence. Each segment consists of a combinational circuit Si that performs a suboperation on the data stream. The segments are separated by registers Ri that hold the intermediate results.
[Fig.: Four-segment pipeline. Input → S1 → R1 → S2 → R2 → S3 → R3 → S4 → R4, with a common clock applied to all registers.]

Space-time diagram:
» Shows segment utilization as a function of time
» Tasks T1, T2, T3, …, T6 are executed in four segments
» A task is the total operation performed in going through all the segments
» Processing time in the pipeline = 9 clock cycles

Clock cycle   1    2    3    4    5    6    7    8    9
Segment 1     T1   T2   T3   T4   T5   T6
Segment 2          T1   T2   T3   T4   T5   T6
Segment 3               T1   T2   T3   T4   T5   T6
Segment 4                    T1   T2   T3   T4   T5   T6
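
The space-time diagram follows one rule: in clock cycle c, segment s holds task c - s + 1, if that task exists. A minimal Python sketch of that rule is given here for illustration; the function name `space_time` and the `None` marker for an idle slot are assumptions, not part of the lecture.

```python
def space_time(k, n):
    """Return k rows, one per segment, for a k-segment pipeline and n tasks.

    Each row has k + n - 1 entries: the task number occupying the segment
    in that clock cycle, or None when the segment is idle.
    """
    cycles = k + n - 1
    return [
        [(c - s) if 1 <= c - s <= n else None for c in range(1, cycles + 1)]
        for s in range(k)  # s = 0 is segment 1
    ]


if __name__ == "__main__":
    for seg, row in enumerate(space_time(4, 6), start=1):
        cells = ["T%d" % t if t is not None else "--" for t in row]
        print("Segment", seg, " ".join(cells))
```

For k = 4 and n = 6 every row has 9 entries, which matches the 9 clock cycles stated above.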
• Speedup S: nonpipeline time / pipeline time
  - $S = \dfrac{n \, t_n}{(k + n - 1) \, t_p} = \dfrac{6 \cdot 6 t_p}{(4 + 6 - 1) \, t_p} = \dfrac{36 \, t_p}{9 \, t_p} = 4$
    » $n$ : number of tasks (6)
    » $t_n$ : time to complete each task in the nonpipeline circuit (6 cycle times $= 6 t_p$)
    » $t_p$ : clock cycle time (1 clock cycle)
    » $k$ : number of segments (4)
  - As $n \to \infty$, $k + n - 1 \to n$, so $S = t_n / t_p$
  - If we assume that a task takes the same time to process in the pipeline and nonpipeline circuits, then $t_n = k \, t_p$ and

    $S = t_n / t_p = k \, t_p / t_p = k$

    where $k$ is the number of segments.
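
The speedup formula is easy to check numerically. The sketch below is an illustration only; the function name `speedup` and its parameter names are assumptions chosen to mirror the symbols n, k, tn, and tp used above.

```python
def speedup(n, k, t_n, t_p):
    """Speedup of a k-segment pipeline over a nonpipelined unit for n tasks.

    t_n: time per task without pipelining; t_p: pipeline clock cycle time.
    """
    return (n * t_n) / ((k + n - 1) * t_p)


if __name__ == "__main__":
    # The example from the slide: n = 6, k = 4, t_n = 6 cycles, t_p = 1 cycle.
    print(speedup(6, 4, 6, 1))  # 4.0
    # With t_n = k * t_p the speedup approaches k as n grows.
    for n in (10, 1000, 10**6):
        print(n, speedup(n, 4, 4, 1))
```

The second loop illustrates the limiting case: the ratio gets closer to k = 4 as n increases but never reaches it for finite n.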

• Arithmetic Pipeline
  - Floating-point adder pipeline example:
    » Add or subtract two normalized floating-point binary numbers
      $X = A \times 2^a$, $Y = B \times 2^b$
    » Worked example (decimal values): $X = 0.9504 \times 10^3$, $Y = 0.8200 \times 10^2$
  - Four suboperation segments:
    » 1) Compare exponents by subtraction: $3 - 2 = 1$
         $X = 0.9504 \times 10^3$
         $Y = 0.8200 \times 10^2$
    » 2) Align mantissas
         $X = 0.9504 \times 10^3$
         $Y = 0.0820 \times 10^3$
    » 3) Add mantissas
         $Z = 1.0324 \times 10^3$
    » 4) Normalize result
         $Z = 0.10324 \times 10^4$
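[Figure: Pipeline for floating-point addition and subtraction. Inputs are the exponents a, b and the mantissas A, B; registers (R) separate the segments. Segment 1: compare exponents by subtraction. Segment 2: choose exponent and align mantissas. Segment 3: add or subtract mantissas. Segment 4: adjust exponent and normalize result.]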
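
The four suboperations (compare exponents, align mantissas, add mantissas, normalize) can be traced in code. This is a minimal sketch under stated assumptions: it works on decimal (mantissa, exponent) pairs like the worked example, handles addition only, and the function name `fp_add` is hypothetical. It models the sequence of steps, not the concurrent hardware segments.

```python
from decimal import Decimal


def fp_add(x, y):
    """Add two decimal floating-point numbers given as (mantissa, exponent)."""
    (ma, ea), (mb, eb) = x, y

    # Segment 1: compare exponents by subtraction.
    diff = ea - eb

    # Segment 2: choose the larger exponent and align the other mantissa
    # by shifting it right (scaleb multiplies by a power of ten).
    if diff >= 0:
        exp, mb = ea, mb.scaleb(-diff)
    else:
        exp, ma = eb, ma.scaleb(diff)

    # Segment 3: add the aligned mantissas.
    mz = ma + mb

    # Segment 4: normalize so that 0.1 <= |mantissa| < 1, adjusting the exponent.
    while abs(mz) >= 1:
        mz, exp = mz.scaleb(-1), exp + 1
    while mz != 0 and abs(mz) < Decimal("0.1"):
        mz, exp = mz.scaleb(1), exp - 1
    return mz, exp


if __name__ == "__main__":
    print(fp_add((Decimal("0.9504"), 3), (Decimal("0.8200"), 2)))
```

`Decimal` is used instead of `float` so that the intermediate mantissas stay exact decimal values and can be compared with the figures in the worked example.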
Instruction Pipeline
Instruction Cycle
1) Fetch the instruction from memory
2) Decode the instruction
3) Calculate the effective address
4) Fetch the operands from memory
5) Execute the instruction
6) Store the result in the proper place
[Figure: Four-segment CPU pipeline flowchart. Segment 1: fetch instruction from memory. Segment 2: decode instruction and calculate effective address. Segment 3: fetch operand from memory. Segment 4: execute instruction. Decision boxes: "Branch?" and "Interrupt?". Other boxes: interrupt handling, update PC, empty pipe.]
• Example: Four-segment Instruction Pipeline
  - Four-segment CPU pipeline:
    » 1) FI : Instruction fetch
    » 2) DA : Decode instruction and calculate effective address (EA)
    » 3) FO : Operand fetch
    » 4) EX : Execution
  - Timing of instruction pipeline:

Step             1   2   3   4   5   6   7   8   9   10  11  12  13
Instruction 1    FI  DA  FO  EX
            2        FI  DA  FO  EX
   (Branch) 3            FI  DA  FO  EX
            4                FI  -   -   FI  DA  FO  EX
            5                    -   -   -   FI  DA  FO  EX
            6                                    FI  DA  FO  EX
            7                                        FI  DA  FO  EX

(- marks an idle step while the branch in instruction 3 is being resolved.)
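
The timing of the four-segment instruction pipeline with one branch can be generated programmatically. The sketch below is illustrative and rests on one simplifying assumption: every instruction after the branch is fetched (or re-fetched) only after the branch has completed its EX step. The function name `timing` and the dictionary layout are hypothetical.

```python
STAGES = ["FI", "DA", "FO", "EX"]


def timing(n_instr, branch=None):
    """Return {instruction: {step: stage}} for a four-segment pipeline.

    branch: number of the branch instruction, or None for no branch.
    Assumption: instructions after the branch start again one step after
    the branch finishes EX, which delays them by len(STAGES) - 1 steps.
    """
    chart = {}
    for i in range(1, n_instr + 1):
        row = {}
        start = i  # without a branch, instruction i is fetched in step i
        if branch is not None and i > branch:
            if i == branch + 1:
                row[i] = "FI"  # fetched before the branch is decoded, then discarded
            start = i + len(STAGES) - 1
        for offset, stage in enumerate(STAGES):
            row[start + offset] = stage
        chart[i] = row
    return chart


if __name__ == "__main__":
    chart = timing(7, branch=3)
    last_step = max(max(row) for row in chart.values())
    for i, row in chart.items():
        print(i, " ".join(row.get(s, "  ").ljust(2) for s in range(1, last_step + 1)))
```

With seven instructions and a branch at instruction 3, the last instruction completes EX in step 13, compared with step 10 when there is no branch.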
• Pipeline Conflicts: three major difficulties
  - 1) Resource conflicts
    » Two segments access memory at the same time.
    » Can be avoided by using separate instruction and data memories.
  - 2) Data dependency
    » An instruction depends on the result of a previous instruction, but this result is not yet available.
  - 3) Branch difficulties
    » Branch and other instructions (interrupt, return, ...) that change the value of the PC
• Methods for resolving data dependency
  - Hardware methods
    » Hardware interlock
      The hardware forcibly inserts a delay until the result of the previous instruction is available.
    » Operand forwarding
      The result of the previous instruction is passed directly to the ALU (normally it would go through a register).
  - Software methods
    » Delayed load
      No-operation instructions are inserted until the result of the previous instruction is available.
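
The delayed-load idea can be illustrated with a small scheduling pass that inserts no-operation instructions. This is a sketch under explicit assumptions, not a description of any real compiler: instructions are hypothetical `(op, dest, sources)` tuples, only `LOAD` results are treated as late, and a loaded value is assumed to be unavailable to the next `delay` instructions.

```python
NOP = ("NOP", None, ())


def insert_nops(program, delay=1):
    """Insert NOPs so that no instruction reads a LOAD result too early.

    program: list of (op, dest, sources) tuples (hypothetical format).
    delay:   number of following instructions that cannot yet see a loaded value.
    """
    out = []
    for op, dest, srcs in program:
        # Look back over the last `delay` emitted instructions for a LOAD
        # whose destination register this instruction reads.
        for dist in range(1, delay + 1):
            if dist > len(out):
                break
            prev_op, prev_dest, _ = out[-dist]
            if prev_op == "LOAD" and prev_dest in srcs:
                out.extend([NOP] * (delay - dist + 1))
                break
        out.append((op, dest, srcs))
    return out


if __name__ == "__main__":
    program = [
        ("LOAD", "R1", ("A",)),
        ("LOAD", "R2", ("B",)),
        ("ADD", "R3", ("R1", "R2")),
        ("STORE", "C", ("R3",)),
    ]
    for instr in insert_nops(program):
        print(instr[0], instr[1] or "", *instr[2])
```

In this example a single NOP is placed between the second LOAD and the ADD, because the ADD reads R2 immediately after it is loaded. A hardware interlock would produce the same one-step delay without changing the program.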
Assignment

• What is meant by pipelining and by parallel processing?

• Explain vector processing.
