Unit 5: Processor Organization

-S.R.Milke
The Indirect Cycle
• The execution of an instruction may involve one or more operands in memory, each of which requires a memory access.
• Further, if indirect addressing is used, then additional memory accesses are required.
• We can think of the fetching of indirect addresses as one more instruction stage.
• The main line of activity consists of alternating instruction fetch and instruction execution activities.
• After an instruction is fetched, it is examined to determine whether any indirect addressing is involved.
• If so, the required operands are fetched using indirect addressing; a small sketch of the extra memory access this implies follows below.
• Following execution, an interrupt may be processed before the next instruction fetch.
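To make the extra cost of the indirect cycle concrete, here is a minimal sketch in Python (not from the slides: the dictionary standing in for memory, the addresses, and the values are all invented). It only counts how many memory accesses are needed to obtain an operand with and without indirect addressing.

memory = {100: 250, 250: 7}   # address 100 holds a pointer to address 250; 250 holds the operand

def fetch_operand(address, indirect):
    """Return (operand, number_of_memory_accesses)."""
    accesses = 0
    if indirect:
        address = memory[address]   # extra access: read the operand's address first
        accesses += 1
    operand = memory[address]       # read the operand itself
    accesses += 1
    return operand, accesses

print(fetch_operand(250, indirect=False))  # (7, 1)  direct addressing: one access
print(fetch_operand(100, indirect=True))   # (7, 2)  indirect addressing: one extra access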
Instruction Cycle State Diagram
Data Flow
• The exact sequence of events during an instruction cycle depends on the design of the processor.
• We can, however, indicate in general terms what must happen.
• Let us assume a processor that employs a memory address register (MAR), a memory buffer register (MBR), a program counter (PC), and an instruction register (IR); the data flow of the fetch cycle with these registers is sketched below.
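As a minimal sketch of that data flow, the following assumes the standard fetch cycle (PC to MAR, memory read into MBR, MBR to IR, then PC incremented); the toy instruction memory and its contents are invented for illustration.

memory = {0: "LOAD R1, 300", 1: "ADD R1, R2"}   # toy instruction memory
PC, MAR, MBR, IR = 0, None, None, None

def fetch():
    global PC, MAR, MBR, IR
    MAR = PC             # PC -> MAR: address of the next instruction
    MBR = memory[MAR]    # memory read: the instruction lands in the MBR
    IR = MBR             # MBR -> IR: the instruction is ready to be decoded
    PC = PC + 1          # PC is incremented for the following fetch

fetch()
print(IR, PC)            # prints: LOAD R1, 300 1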
System Attributes to Performance
Clock Rate and CPI
Execution Time (CPU Time)
System Attributes
MIPS Rate
Throughput Rate (Performance)
Instruction Types and CPI
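The slide titles above refer to the standard performance relations without showing them; as a hedged reminder, CPU time = instruction count * CPI / clock rate and MIPS rate = clock rate / (CPI * 10^6). The short sketch below just evaluates these for invented example values.

instruction_count = 2_000_000    # Ic: dynamic instruction count (invented)
cpi = 2.0                        # average clock cycles per instruction (invented)
clock_rate = 500e6               # f: 500 MHz clock (invented)

cpu_time = instruction_count * cpi / clock_rate   # seconds
mips_rate = clock_rate / (cpi * 1e6)              # millions of instructions per second

print(f"CPU time = {cpu_time * 1e3:.1f} ms, MIPS rate = {mips_rate:.0f}")
# CPU time = 8.0 ms, MIPS rate = 250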
Example 1
Example 2
Consider a non-pipelined processor with a clock rate of 2.5 gigahertz and
average cycles per instruction of four. The same processor is upgraded to
a pipelined processor with five stages; but due to the internal pipeline
delay, the clock speed is reduced to 2 gigahertz. Assume that there are
no stalls in the pipeline. The speedup achieved in this pipelined processor is __________.
(A) 3.2
(B) 3.0
(C) 2.2
(D) 2.0
Speedup = ExecutionTimeOld / ExecutionTimeNew

ExecutionTimeOld = CPIold * CycleTimeOld = 4 * (1/2.5) ns = 1.6 ns

Since there are no stalls, CPInew can be assumed to be 1 on average.

ExecutionTimeNew = CPInew * CycleTimeNew = 1 * (1/2) ns = 0.5 ns

Speedup = 1.6 / 0.5 = 3.2, so the correct option is (A).
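A short re-check of the arithmetic in this example (a sketch only; the numbers are the ones given above):

cpi_old, clock_old_ghz = 4, 2.5      # non-pipelined: CPI = 4 at 2.5 GHz
cpi_new, clock_new_ghz = 1, 2.0      # pipelined, no stalls: CPI of 1 at 2 GHz

time_old = cpi_old / clock_old_ghz   # 1.6 ns per instruction
time_new = cpi_new / clock_new_ghz   # 0.5 ns per instruction
print(time_old / time_new)           # 3.2 -> option (A)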


Parallelism
• Computer architects are constantly striving to improve the performance of the machines they design.
• Making the chips run faster by increasing their clock speed is one way. However, most computer architects look to parallelism (doing two or more things at once) as a way to get even more performance for a given clock speed.
• Parallelism comes in two general forms:
1) instruction-level parallelism, and
2) processor-level parallelism.
Instruction-Level Parallelism
• Parallelism is exploited within individual
instructions to get more instructions/sec out
of the machine.
• We will consider two approaches:
– Pipelining
– Superscalar Architectures
Pipelining
• Fetching of instructions from memory is a major bottleneck in instruction execution speed. However, computers have the ability to fetch instructions from memory in advance.
• These instructions are stored in a set of registers called the prefetch buffer.
• Thus, instruction execution is divided into two parts: fetching and actual execution.
• The concept of a pipeline carries this strategy much further.
• Instead of dividing instruction execution into only two parts, it is often divided into many parts (stages), each one handled by a dedicated piece of hardware, all of which can run in parallel; the timing benefit is sketched below.
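As a minimal sketch of why pipelining helps, assuming one clock cycle per stage and no stalls (the ideal-pipeline formula is standard; the instruction and stage counts below are invented):

def cycles_unpipelined(n, k):
    return n * k              # each instruction occupies the hardware for all k stages

def cycles_pipelined(n, k):
    return k + (n - 1)        # fill the pipe once, then one instruction completes per cycle

n, k = 100, 5                                              # invented workload: 100 instructions, 5 stages
print(cycles_unpipelined(n, k), cycles_pipelined(n, k))    # 500 104
print(cycles_unpipelined(n, k) / cycles_pipelined(n, k))   # ~4.8, approaching k as n grows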
Dual Pipelines
• If one pipeline is good, then surely two pipelines
are better.
• Here a single instruction fetch unit fetches pairs of
instructions together and puts each one into its
own pipeline, complete with its own ALU for
parallel operation.
• To be able to run in parallel, the two instructions
must not conflict over resource usage (e.g.,
registers), and neither must depend on the result
of the other.
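A minimal sketch of the issue check described above, assuming a made-up instruction representation of (destination register, list of source registers); a real dual-pipeline issue unit checks more conditions than this.

def can_issue_together(i1, i2):
    """Each instruction is (destination_register, [source_registers])."""
    d1, s1 = i1
    d2, s2 = i2
    if d1 == d2:       # both want to write the same register: resource conflict
        return False
    if d1 in s2:       # the second instruction needs the first one's result: dependence
        return False
    return True

print(can_issue_together(("R1", ["R2", "R3"]), ("R4", ["R5", "R6"])))  # True: independent
print(can_issue_together(("R1", ["R2", "R3"]), ("R4", ["R1", "R6"])))  # False: R1 dependence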
Superscalar Architectures
• Going to four pipelines is conceivable, but doing so duplicates too much hardware.
• Instead, a different approach is used on high-end CPUs.
• The basic idea is to have just a single pipeline but give it
multiple functional units.
• This is a superscalar architecture – using more than one
ALU, so that more than one instruction can be executed in
parallel.
• Implicit in the idea of a superscalar processor is that the
S3 stage can issue instructions considerably faster than the
S4 stage is able to execute them.
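A rough sketch of the issue idea in a superscalar design (the functional-unit names and instruction strings are invented; only the S3/S4 wording is taken from the slide): one issue stage hands each instruction to any free functional unit of the right kind.

functional_units = {"alu1": None, "alu2": None, "load_store": None}   # None means the unit is free

def issue(instruction, unit_kind):
    """Hand the instruction to any free functional unit of the requested kind."""
    for name, busy_with in functional_units.items():
        if busy_with is None and name.startswith(unit_kind):
            functional_units[name] = instruction
            return name
    return None   # every matching unit is busy: the issue stage must wait

print(issue("ADD R1,R2", "alu"))   # alu1
print(issue("SUB R3,R4", "alu"))   # alu2 -- two ALU operations proceed in parallel
print(issue("MUL R5,R6", "alu"))   # None -- no free ALU left this cycle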
Processor-Level Parallelism
• Instruction-level parallelism (pipelining and superscalar operation) rarely wins more than a factor of five or ten in processor speed.
• To get gains of 50, 100, or more, the only way is to design computers with multiple CPUs.
• We will consider three alternative architectures:
– Array Computers
– Multiprocessors
– Multicomputers
Array Computers
• An array processor consists of a large number
of identical processors that perform the same
sequence of instructions on different sets of
data.
• A vector processor is efficient at executing a sequence of operations on pairs of data elements; all of the addition operations are performed in a single, heavily pipelined adder.
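A minimal sketch of the array/vector idea with invented data: one operation (addition) is applied across whole sets of elements. The list comprehension only models the result; on an array or vector processor the element-wise additions would proceed in lockstep or flow through a pipelined adder.

a = [1, 2, 3, 4]
b = [10, 20, 30, 40]

# A single "vector add" produces all the element-wise sums.
c = [x + y for x, y in zip(a, b)]
print(c)   # [11, 22, 33, 44]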
Multiprocessors
• The processing elements in an array processor are not independent CPUs, since there is only one control unit.
• The first parallel system with multiple full-blown CPUs is the multiprocessor.
• This is a system with more than one CPU sharing a common memory, coordinated in software.
• The simplest organization is a single bus with multiple CPUs and one memory all plugged into it.
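A minimal sketch of the shared-memory idea, with two Python threads standing in for two CPUs and a lock as the software coordination (everything here is invented for illustration, not how a hardware multiprocessor is built):

import threading

shared_counter = 0            # the "common memory" both workers see
lock = threading.Lock()       # software coordination

def worker():
    global shared_counter
    for _ in range(100_000):
        with lock:
            shared_counter += 1

threads = [threading.Thread(target=worker) for _ in range(2)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(shared_counter)         # 200000: both workers updated the same shared memory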
Multicomputers
• Although multiprocessors with a small number of
processors (< 64) are relatively easy to build, large ones are
surprisingly difficult to construct.
• The difficulty is in connecting all the processors to the
memory.
• To get around these problems, many designers have simply abandoned the idea of shared memory and instead build systems consisting of large numbers of interconnected computers, each having its own private memory but no common memory.
• These systems are called multicomputers; a sketch of the message-passing style they imply follows below.
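A minimal sketch of that style, with Python multiprocessing queues standing in for the interconnect (the node code and data are invented): each node works only on its private memory, and data moves between nodes only as explicit messages.

from multiprocessing import Process, Queue

def node(inbox, outbox):
    local_memory = inbox.get()       # data arrives only as an explicit message
    outbox.put(sum(local_memory))    # the result travels back the same way

if __name__ == "__main__":
    to_node, from_node = Queue(), Queue()
    p = Process(target=node, args=(to_node, from_node))
    p.start()
    to_node.put([1, 2, 3, 4])        # no shared memory: the data must be sent
    print(from_node.get())           # 10
    p.join()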
