Pipelining and Parallel Processors
Stages of a Pipeline
1. Fetch: The next instruction is read from memory.
2. Decode: The instruction is decoded and its operands are identified.
3. Execute: The operation is executed using the ALU (Arithmetic Logic Unit) or other resources.
Each stage works in parallel with others, processing different instructions at the same time. For
example, while one instruction is being executed, another can be decoded, and yet another fetched.
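To make the overlap concrete, here is a minimal sketch in Python (illustrative only; the three stage names follow the list above, and the instruction labels I1 to I4 are made up) that prints which instruction occupies each stage in every clock cycle:

# Minimal paper simulation of a 3-stage pipeline (Fetch, Decode, Execute).
STAGES = ["Fetch", "Decode", "Execute"]

def simulate(instructions):
    """Print which instruction occupies each stage in every clock cycle."""
    n, k = len(instructions), len(STAGES)
    for cycle in range(n + k - 1):
        row = []
        for s in range(k):
            i = cycle - s                      # instruction index currently in stage s
            row.append(instructions[i] if 0 <= i < n else "-")
        print(f"cycle {cycle + 1}: " + "  ".join(f"{STAGES[s]}:{row[s]}" for s in range(k)))

simulate(["I1", "I2", "I3", "I4"])

In this sketch, four instructions finish in 6 cycles instead of the 12 cycles a purely sequential, one-instruction-at-a-time execution of all three stages would need.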
Throughput refers to the number of instructions completed per unit of time. By overlapping
tasks, pipelining increases throughput significantly.
Speedup is the ratio of the time taken to execute instructions without pipelining to the time
taken with pipelining.
For an ideal pipeline with k stages and no delays, the speedup approaches k, meaning the pipeline is k times faster than a single-stage, non-pipelined execution process.
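As a sketch of the standard calculation (with k denoting the number of stages, n the number of instructions, and T the time per stage; the symbols are introduced here for illustration and are not named in the notes above):

S = \frac{n \cdot k \cdot T}{(k + n - 1) \cdot T} = \frac{nk}{k + n - 1} \approx k \quad \text{for large } n

A non-pipelined processor needs n·k·T to finish n instructions, while a k-stage pipeline finishes them in (k + n - 1)·T, so the ratio approaches k as n grows.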
Pipeline Hazards
Pipeline hazards are situations that prevent the next instruction in the pipeline from executing during
its designated clock cycle. These can disrupt the smooth operation of the pipeline.
1. Structural Hazards:
o Occur when two or more instructions require the same hardware resource at the
same time.
2. Data Hazards:
o Arise when instructions depend on the results of previous instructions that have not
yet completed.
o Example: An instruction tries to use a value that is still being calculated by a previous instruction; this read-after-write case is sketched in the example after this list.
3. Control Hazards:
o Occur due to the change in instruction flow, such as after a branch or jump
instruction.
o Example: A pipeline might fetch the wrong instruction following a branch until the
branch outcome is known.
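A minimal sketch (Python) of the read-after-write check behind data hazards, as referenced in the list above; the tuple-based instruction format, the register names, and the pipeline_depth parameter are assumptions made for illustration, not part of any real instruction set:

# Each instruction is (destination_register, source_registers).
# A read-after-write (RAW) hazard occurs when an instruction reads a register
# that an earlier, still-unfinished instruction writes.

def raw_hazards(instructions, pipeline_depth=3):
    hazards = []
    for i, (_, sources) in enumerate(instructions):
        # Only earlier instructions still in flight (within pipeline_depth) matter.
        for j in range(max(0, i - pipeline_depth + 1), i):
            dest_j, _ = instructions[j]
            if dest_j in sources:
                hazards.append((j, i, dest_j))
    return hazards

program = [
    ("r1", ("r2", "r3")),   # I1: r1 <- r2 + r3
    ("r4", ("r1", "r5")),   # I2: r4 <- r1 + r5  (reads r1 before I1 writes it back)
]
print(raw_hazards(program))  # [(0, 1, 'r1')]

Here I2 reads r1 before I1 has written it back, which is exactly the data hazard described above; a real pipeline would stall or forward the value to resolve it.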
Parallel Processors
Parallel processing involves the use of multiple processors to perform computations simultaneously,
thereby increasing computational power and reducing execution time. It is essential in modern
computing to handle complex and large-scale problems.
Parallel processors are systems that use two or more processors to execute tasks concurrently. Their memory is commonly organized in one of two ways:
o Shared Memory Systems: All processors access a common memory through a shared address space.
o Distributed Memory Systems: Each processor has its own private memory, and communication occurs over a network.
Advantages of parallel processing include:
1. Increased Performance: Dividing work across processors reduces overall execution time.
2. Scalability: Additional processors can be added to handle larger, more complex problems.
3. Fault Tolerance: Some systems can continue operation even if one processor fails.
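As a loose illustration of executing tasks concurrently (not a model of any specific parallel machine), the sketch below uses Python's multiprocessing module, with each worker process standing in for a processor; the four-way split and the sum-of-squares workload are arbitrary choices for the example:

from multiprocessing import Pool

def partial_sum(chunk):
    """Work done independently by one worker (one 'processor')."""
    return sum(x * x for x in chunk)

if __name__ == "__main__":
    data = list(range(1_000_000))
    chunks = [data[i::4] for i in range(4)]         # divide the work among 4 workers
    with Pool(processes=4) as pool:
        total = sum(pool.map(partial_sum, chunks))  # workers run concurrently
    print(total)

Each worker computes its partial result independently and the results are combined at the end, which mirrors how a large problem is divided across processors.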
Parallel processors often face challenges when accessing shared memory and maintaining cache
coherency.
1. Shared Memory Access:
o Multiple processors may need to read or write to the same memory location simultaneously, so accesses must be coordinated.
2. Cache Coherency:
o In systems with multiple processors, each processor may have its private cache.
o Cache Coherency ensures that all caches have the most recent value of shared data.
o Example Problem: If Processor A updates a variable in its cache, Processor B must
see this updated value in its cache.
Common protocols for maintaining cache coherency include:
MESI Protocol: Maintains one of four states for each cache block (Modified, Exclusive, Shared, or Invalid).
Directory-Based Protocols: Use a central directory to track the state of each cache block.
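The sketch below (Python) illustrates only the write-invalidate idea behind such protocols, in a deliberately simplified form: a write by one cache marks the copies held by other caches Invalid, so their next read fetches the updated value. It keeps just three states (Modified, Shared, Invalid) and writes through to memory, so it is not a full MESI implementation, and all class and variable names are made up:

MODIFIED, SHARED, INVALID = "M", "S", "I"

class Cache:
    def __init__(self, name, memory, all_caches):
        self.name, self.memory, self.all_caches = name, memory, all_caches
        self.lines = {}                          # address -> (state, value)

    def read(self, addr):
        state, value = self.lines.get(addr, (INVALID, None))
        if state == INVALID:                     # miss: fetch from memory, mark Shared
            value = self.memory[addr]
            self.lines[addr] = (SHARED, value)
        return self.lines[addr][1]

    def write(self, addr, value):
        for other in self.all_caches:            # invalidate every other copy
            if other is not self and addr in other.lines:
                other.lines[addr] = (INVALID, None)
        self.memory[addr] = value                # write-through, for simplicity
        self.lines[addr] = (MODIFIED, value)

memory = {0x10: 1}
caches = []
a = Cache("A", memory, caches)
b = Cache("B", memory, caches)
caches.extend([a, b])

print(b.read(0x10))   # 1  (B caches the value in the Shared state)
a.write(0x10, 42)     # A's write invalidates B's copy
print(b.read(0x10))   # 42 (B misses and re-reads the updated value)

This reproduces the example problem above: after Processor A writes the variable, Processor B's stale copy is invalidated, so its next read returns the new value.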
Conclusion
Pipelining and parallel processing are foundational concepts in modern computing, enhancing the
speed and efficiency of processors. While pipelining increases instruction throughput by overlapping
stages, parallel processing harnesses the power of multiple processors to handle larger workloads.
However, challenges like pipeline hazards and cache coherency must be addressed to fully realize the
potential of these techniques.
Handling Branch Instructions
Branch instructions pose a significant challenge in instruction pipelines because they can disrupt the sequential flow of program execution. Branches can be either unconditional or conditional, and each type requires specific handling to maintain pipeline efficiency.
1. Unconditional Branch:
o Always alters the program flow by loading the Program Counter (PC) with the target
address.
2. Conditional Branch:
o If the condition is not satisfied, the execution continues with the next sequential
instruction.
Several techniques are employed to mitigate the disruption caused by branch instructions:
1. Prefetch Target Instruction
Process: Prefetch both the target instruction and the next sequential instruction after the branch.
Advantage: Reduces branch penalties by ensuring the correct instruction stream is already
fetched based on the branch outcome.
Challenge: Wastes fetch resources on whichever instruction stream turns out not to be taken.
2. Branch Target Buffer (BTB)
Process:
o Stores addresses of previously executed branch instructions along with their target
addresses.
o When a branch instruction is decoded, the pipeline searches the BTB for the target
address.
Advantage: Faster execution of repetitive branch patterns as target instructions are readily
available.
Fallback: If the target address is not in the BTB, the pipeline fetches it and updates the BTB
for future use.
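A toy sketch (Python) of the lookup-and-update behaviour described for the BTB; the dictionary-based storage and the example addresses are assumptions made for illustration:

# Branch Target Buffer as a simple mapping: branch address -> target address.
class BranchTargetBuffer:
    def __init__(self):
        self.entries = {}

    def lookup(self, branch_addr):
        """Return the predicted target address, or None on a BTB miss."""
        return self.entries.get(branch_addr)

    def update(self, branch_addr, target_addr):
        """Record the resolved target so future fetches can use it."""
        self.entries[branch_addr] = target_addr

btb = BranchTargetBuffer()
print(btb.lookup(0x400))        # None: first time, the pipeline must compute the target
btb.update(0x400, 0x7f0)        # store it once the branch resolves
print(hex(btb.lookup(0x400)))   # 0x7f0: the next occurrence is found immediately

The first lookup misses, so the pipeline computes the target and records it; later occurrences of the same branch find the target address right away.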
3. Loop Buffer
What is it?: A small, high-speed register file maintained by the fetch stage of the pipeline.
Process:
o Stores the instructions of a small program loop in the buffer as they are first fetched.
o On subsequent iterations, executes the loop directly from the buffer without accessing memory.
Condition: Loop mode is removed when execution finally branches out of the loop.
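A rough sketch (Python) of the loop-buffer idea: the body of a small loop is captured once, then fetches that fall inside the loop are served from the buffer instead of memory until execution branches out. The buffer capacity, addresses, and string "instructions" are illustrative assumptions:

class LoopBuffer:
    def __init__(self, capacity=16):
        self.capacity = capacity
        self.start = None           # address of the first instruction in the loop
        self.instructions = []      # captured loop body

    def capture(self, start_addr, body):
        """Load a small loop into the buffer if it fits."""
        if len(body) <= self.capacity:
            self.start, self.instructions = start_addr, list(body)

    def fetch(self, addr, memory):
        """Serve the fetch from the buffer while execution stays inside the loop."""
        if self.start is not None and self.start <= addr < self.start + len(self.instructions):
            return self.instructions[addr - self.start]   # no memory access
        self.start, self.instructions = None, []           # left the loop: loop mode ends
        return memory[addr]

memory = {0: "cmp", 1: "add", 2: "branch 0", 3: "next"}
buf = LoopBuffer()
buf.capture(0, ["cmp", "add", "branch 0"])
print(buf.fetch(1, memory))   # "add": served from the buffer
print(buf.fetch(3, memory))   # "next": loop mode removed, fetched from memory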
4. Branch Prediction
What is it?: Uses logic to predict the outcome of a conditional branch before it is executed.
Process:
o The outcome of the branch is predicted, and instructions are fetched along the predicted path before the branch is resolved.
o If the prediction is incorrect, the pipeline must flush the wrongly fetched instructions and fetch the correct path.
o Static Prediction: Based on simple rules (e.g., "always predict not taken").
o Dynamic Prediction: Based on the run-time history of the branch; one common form is sketched below.
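As an illustrative sketch (Python) of the dynamic form, here is the classic 2-bit saturating counter kept per branch: each actual outcome nudges the counter toward "taken" or "not taken", so one unusual outcome does not immediately flip a well-established prediction. The per-address dictionary and the example address are assumptions:

# 2-bit saturating counter: values 0 and 1 predict "not taken"; 2 and 3 predict "taken".
class TwoBitPredictor:
    def __init__(self):
        self.counters = {}                                 # branch address -> counter (0..3)

    def predict(self, branch_addr):
        return self.counters.get(branch_addr, 0) >= 2      # True means "predict taken"

    def update(self, branch_addr, taken):
        c = self.counters.get(branch_addr, 0)
        self.counters[branch_addr] = min(3, c + 1) if taken else max(0, c - 1)

p = TwoBitPredictor()
for outcome in [True, True, True, False, True]:            # a loop branch that is usually taken
    print(p.predict(0x200), outcome)                       # prediction vs. actual outcome
    p.update(0x200, outcome)

Once the counter saturates at "strongly taken", the single not-taken outcome weakens it but does not flip the prediction, which is the behaviour that makes this scheme effective for loop branches.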
5. Delayed Branch
What is it?: A compiler-level optimization that rearranges the code to minimize branch
penalties.
Process:
o Inserts no-op (no operation) instructions or useful instructions after a branch to keep
the pipeline busy while fetching the target instruction.
Example: The compiler can move an instruction that appears before the branch and does not affect the branch condition into the slot immediately after the branch; it executes regardless of the branch outcome (a sketch follows this subsection).
Advantage: Keeps the pipeline active, reducing idle cycles caused by branch instructions.
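A toy sketch (Python) of the compiler-level rearrangement: each instruction carries the registers it writes and reads, the branch has a single delay slot, and the instruction just before the branch is moved into that slot if the branch does not depend on it, otherwise a no-op is inserted. The instruction encoding and register names are illustrative assumptions:

# Each instruction: (text, registers_written, registers_read).
NOP = ("nop", set(), set())

def fill_delay_slot(block, branch):
    """Move the instruction just before the branch into the delay slot if the
    branch does not read anything that instruction writes; else insert a no-op."""
    if block:
        text, writes, reads = block[-1]
        if not (writes & branch[2]):
            return block[:-1] + [branch, block[-1]]
    return block + [branch, NOP]

block = [
    ("add r1, r2, r3", {"r1"}, {"r2", "r3"}),
    ("sub r4, r5, r6", {"r4"}, {"r5", "r6"}),
]
branch = ("beq r1, r0, L1", set(), {"r1", "r0"})
for ins in fill_delay_slot(block, branch):
    print(ins[0])
# add r1, r2, r3   (stays before the branch, which reads r1)
# beq r1, r0, L1
# sub r4, r5, r6   (independent instruction fills the delay slot)

The subtraction is independent of the branch condition, so it fills the delay slot and executes while the target is being fetched; the addition must stay before the branch because the branch reads r1.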
Technique | Advantage | Challenge
Prefetch Target Instruction | The needed instruction stream is already fetched | Wastes fetch resources on the path not taken
Branch Target Buffer | Fast handling of repetitive branch patterns | A miss requires fetching the target and updating the BTB
Loop Buffer | Small loops execute without memory accesses | Applies only while execution stays inside the loop
Branch Prediction | Avoids stalls when the prediction is correct | A misprediction forces the pipeline to flush
Delayed Branch | Keeps the pipeline active after a branch | Relies on the compiler to fill the delay slot usefully
These techniques aim to minimize the disruption caused by branch instructions, ensuring smoother
execution of the instruction pipeline and improving overall performance.