Ex4 Updated

The document contains exercises related to computer architecture, focusing on processor instructions, control signals, resource utilization, instruction execution, and pipelining. It includes questions on instruction latency, data hazards, branch prediction, and the impact of various predictors on performance. The exercises require detailed analysis of instruction execution in single-cycle and pipelined datapaths, as well as the implications of adding NOP instructions and optimizing branch prediction strategies.

Uploaded by

ndvu

ICT1.003 – Computer Architecture

Chapter 4: The Processor exercises
1. Consider the following instruction:
Instruction: and rd, rs1, rs2
Interpretation: Reg[rd] = Reg[rs1] AND Reg[rs2]
a. What are the values of control signals generated by the control unit for this
instruction?
b. Which resources (blocks) perform a useful function for this instruction?
c. Which resources (blocks) produce no output for this instruction? Which resources
produce output that is not used?
2. Consider the following instruction mix:

a. What fraction of all instructions use data memory?
b. What fraction of all instructions use instruction memory?
c. What fraction of all instructions use the sign extend?
d. What is the sign extend doing during cycles in which its output is not needed?

3. In this exercise, we examine in detail how an instruction is executed in a single-cycle
datapath. Problems in this exercise refer to a clock cycle in which the processor fetches
the following instruction word: 0x00c6ba23.
a. What are the values of the ALU control unit’s inputs for this instruction?
b. What is the new PC address after this instruction is executed? Highlight the path
through which this value is determined.
c. For each mux, show the values of its inputs and outputs during the execution of this
instruction. List values that are register outputs as Reg[xn].
d. What are the input values for the ALU and the two add units?
e. What are the values of all inputs for the registers unit?
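As a starting point for exercise 3, the instruction word can be split into its fixed bit fields before deciding which format applies. The following is a minimal sketch, assuming the standard 32-bit RISC-V field positions; the function name `decode_fields` and the dictionary keys are made up for illustration and do not come from the exercise sheet.

```python
def decode_fields(word: int) -> dict:
    """Split a 32-bit RISC-V instruction word into its fixed bit fields.

    Bits 11:7 and 31:25 hold rd and funct7 in R-type instructions, but hold
    immediate bits in some other formats, so they are named by position here.
    """
    return {
        "opcode": word & 0x7F,              # bits 6:0
        "bits_11_7": (word >> 7) & 0x1F,    # rd, or imm[4:0] for stores
        "funct3": (word >> 12) & 0x7,       # bits 14:12
        "rs1": (word >> 15) & 0x1F,         # bits 19:15
        "rs2": (word >> 20) & 0x1F,         # bits 24:20
        "bits_31_25": (word >> 25) & 0x7F,  # funct7, or imm[11:5] for stores
    }


fields = decode_fields(0x00C6BA23)
for name, value in fields.items():
    print(f"{name:>10}: {value:#04x} ({value})")
```

The opcode and funct3 values then identify the instruction in the ISA reference table, which in turn determines how the remaining fields should be interpreted.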

4. Problems in this exercise assume that the logic blocks used to implement a processor’s
datapath have the following latencies:
“Register read” is the time needed after the rising clock edge for the new register value
to appear on the output. This value applies to the PC only. “Register setup” is the
amount of time a register’s data input must be stable before the rising edge of the clock.
This value applies to both the PC and Register File.
a. What is the latency of an R-type instruction (i.e., how long must the clock period be
to ensure that this instruction works correctly)?
b. What is the latency of lw?
c. What is the latency of sw?
d. What is the latency of beq?
e. What is the latency of an arithmetic, logical, or shift I-type (non-load) instruction?
f. What is the minimum clock period for this CPU?
5. In this exercise, we examine how pipelining affects the clock cycle time of the processor.
Problems in this exercise assume that individual stages of the datapath have the
following latencies:

Also, assume that instructions executed by the processor are broken down as
follows:

a. What is the clock cycle time in a pipelined and non-pipelined processor?
b. What is the total latency of an lw instruction in a pipelined and non-pipelined
processor?
c. If we can split one stage of the pipelined datapath into two new stages, each with
half the latency of the original stage, which stage would you split and what is the
new clock cycle time of the processor?
d. Assuming there are no stalls or hazards, what is the utilization of the data memory?
e. Assuming there are no stalls or hazards, what is the utilization of the write-register
port of the “Registers” unit?
6. What is the minimum number of cycles needed to completely execute n instructions on
a CPU with a k-stage pipeline? Justify your formula.
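One way to check a candidate formula for exercise 6 is to count cycles directly. The sketch below assumes an ideal pipeline: one instruction enters per cycle, each stage takes exactly one cycle, and there are no stalls. The function name `cycles_to_finish` is hypothetical.

```python
def cycles_to_finish(n: int, k: int) -> int:
    """Count cycles for n instructions on an ideal k-stage pipeline.

    Assumption: instruction i (0-based) enters stage 1 in cycle i + 1 and
    moves one stage per cycle, so it leaves stage k at the end of cycle i + k.
    """
    if n == 0:
        return 0
    finish_cycles = [i + k for i in range(n)]
    return max(finish_cycles)


# Compare the direct count against a candidate closed-form expression.
for n, k in [(1, 5), (4, 5), (100, 5), (10, 8)]:
    print(n, k, cycles_to_finish(n, k))
```

The first instruction needs k cycles to pass through every stage, and each later instruction completes one cycle after its predecessor, which is the reasoning a justification would need to spell out.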
7. Add NOP instructions to the code below so that it will run correctly on a pipeline that
does not handle data hazards.
addi x11, x12, 5
add x13, x11, x12
addi x14, x11, 15
add x15, x13, x12
8. Consider a version of the pipeline that does not handle data hazards (i.e., the
programmer is responsible for addressing data hazards by inserting NOP instructions
where necessary). Suppose that (after optimization) a typical n-instruction program
requires an additional 1.4*n NOP instructions to correctly handle data hazards.
a. Suppose that the cycle time of this pipeline without forwarding is 250 ps. Suppose
also that adding forwarding hardware will reduce the number of NOPs from 1.4*n to
1.05*n, but increase the cycle time to 300 ps. What is the speedup of this new
pipeline compared to the one without forwarding?
b. Different programs will require different amounts of NOPs. How many NOPs (as a
percentage of code instructions) can remain in the typical program before that
program runs slower on the pipeline with forwarding?
c. Repeat part b; however, this time let x represent the number of NOP instructions
relative to n. (x was equal to 1.4 in b.) Your answer will be with respect to x.
d. Can a program with only 1.075*n NOPs possibly run faster on the pipeline with
forwarding? Explain why or why not.
e. At minimum, how many NOPs (as a percentage of code instructions) must a program
have before it can possibly run faster on the pipeline with forwarding?
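The comparisons in exercise 8 all reduce to execution time = (instructions + NOPs) × cycle time. The following is a minimal sketch of that arithmetic, assuming pipeline fill cycles are negligible for large n; the helper names `exec_time_ps` and `speedup` are illustrative only.

```python
def exec_time_ps(n: int, nops_per_instr: float, cycle_ps: float) -> float:
    """Execution time in ps for n instructions plus nops_per_instr * n NOPs.

    Assumption: one instruction or NOP completes per cycle, and the few
    cycles needed to fill the pipeline are ignored.
    """
    return (n + nops_per_instr * n) * cycle_ps


def speedup(old_time: float, new_time: float) -> float:
    """Speedup of the new design over the old one (values above 1 mean faster)."""
    return old_time / new_time


n = 1000  # arbitrary program size; it cancels out of the ratio
no_forwarding = exec_time_ps(n, 1.4, 250)
with_forwarding = exec_time_ps(n, 1.05, 300)
print(f"speedup with forwarding: {speedup(no_forwarding, with_forwarding):.4f}")
```

Parts b through e can be explored the same way by varying the NOP ratio passed for the forwarding pipeline and finding where the ratio crosses 1.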

9. The importance of having a good branch predictor depends on how often conditional
branches are executed. Together with branch predictor accuracy, this will determine how
much time is spent stalling due to mispredicted branches. In this exercise, assume that
the breakdown of dynamic instructions into various instruction categories is as follows:

Also, assume the following branch predictor accuracies:

a. Stall cycles due to mispredicted branches increase the CPI. What is the extra CPI due
to mispredicted branches with the always-taken predictor? Assume that branch
outcomes are determined in the ID stage and applied in the EX stage, that there are
no data hazards, and that no delay slots are used.
b. What is the CPI for the “always-not-taken” predictor?
c. What is the CPI for the 2-bit predictor?
d. With the 2-bit predictor, what speedup would be achieved if we could convert half of
the branch instructions to some ALU instruction? Assume that correctly and
incorrectly predicted instructions have the same chance of being replaced.
e. With the 2-bit predictor, what speedup would be achieved if we could convert half of
the branch instructions in a way that replaced each branch instruction with two ALU
instructions? Assume that correctly and incorrectly predicted instructions have the
same chance of being replaced.
f. Some branch instructions are much more predictable than others. If we know that
80% of all executed branch instructions are easy-to-predict loop-back branches that
are always predicted correctly, what is the accuracy of the 2-bit predictor on the
remaining 20% of the branch instructions?
10. This exercise examines the accuracy of various branch predictors for the following
repeating pattern (e.g., in a loop) of branch outcomes: T, NT, T, T, NT.
a. What is the accuracy of always-taken and always-not-taken predictors for this
sequence of branch outcomes?
b. What is the accuracy of the 2-bit predictor for the first four branches in this pattern,
assuming that the predictor starts off in the bottom left state (predict not taken)?
c. What is the accuracy of the 2-bit predictor if this pattern is repeated forever?
d. Design a predictor that would achieve a perfect accuracy if this pattern is repeated
forever. Your predictor should be a sequential circuit with one output that provides a
prediction (1 for taken, 0 for not taken) and no inputs other than the clock and the
control signal that indicates that the instruction is a conditional branch.
e. What is the accuracy of your predictor if it is given a repeating pattern that is the
exact opposite of this one?
f. Design a predictor similar to (d), but now your predictor should be able to eventually
(after a warm-up period during which it can make wrong predictions) start perfectly
predicting both this pattern and its opposite. Your predictor should have an input
that tells it what the real outcome was. Hint: this input lets your predictor determine
which of the two repeating patterns it is given.
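Hand-traced answers for exercise 10 can be cross-checked with a small simulator. The sketch below models the 2-bit predictor as a saturating counter (states 0 and 1 predict not taken, states 2 and 3 predict taken). This is an assumption: the state diagram in the course material may use different transitions or a different mapping for the "bottom left" state, so adjust `start_state` and the update rule to match it. The function name `simulate_two_bit` is hypothetical.

```python
from itertools import cycle, islice


def simulate_two_bit(outcomes, start_state: int = 0):
    """Run a 2-bit saturating-counter predictor over a list of outcomes.

    outcomes: iterable of booleans, True for taken and False for not taken.
    Returns (number of correct predictions, final counter state).
    """
    state = start_state
    correct = 0
    for taken in outcomes:
        predict_taken = state >= 2      # upper two states predict taken
        if predict_taken == taken:
            correct += 1
        # Move one step toward the observed outcome, saturating at 0 and 3.
        state = min(state + 1, 3) if taken else max(state - 1, 0)
    return correct, state


pattern = [True, False, True, True, False]  # T, NT, T, T, NT

first_four = simulate_two_bit(pattern[:4], start_state=0)
print("first four branches (correct, final state):", first_four)

# Approximate the "repeated forever" case with many repetitions.
long_run = list(islice(cycle(pattern), 5000))
correct, _ = simulate_two_bit(long_run, start_state=0)
print("long-run accuracy:", correct / len(long_run))
```

The same loop structure can host the always-taken and always-not-taken predictors by replacing the `predict_taken` expression with a constant.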
