Performance Matrices

The document discusses various performance metrics that can be used to evaluate computer systems, including execution time, throughput, component metrics like CPI, and the importance of using real programs for evaluation. It also discusses principles for experimentation like reproducibility and simulation validation.

Uploaded by

akpbbk123

Performance Metrics

Performance metrics
• determine the benefit/lack of benefit of designs
• computer design is too complex to intuit performance &
performance bottlenecks
• have to be careful about what you mean to measure & how
you measure it

Discussion
• good metrics for measuring computer performance
• what they should be used for
• what metrics you shouldn’t use & how metrics are misused

Performance of Computer Systems

Many different factors to take into account when determining
performance:
• Technology
  • circuit speed (clock, MHz)
  • processor technology (how many transistors on a chip)
• Organization
  • type of processor (ILP)
  • configuration of the memory hierarchy
  • type of I/O devices
  • number of processors in the system
• Software
  • quality of the compilers
  • organization & quality of OS, databases, etc.

“Principles” of Experimentation

Meaningful metrics
execution time & component metrics that explain it

Reproducibility
machine configuration, compiler & optimization level, OS, input

Real programs
no toys, kernels, synthetic programs
SPEC is the norm (integer, floating point, graphics, webserver)
TPC-B, TPC-C & TPC-D for database transactions

Simulation
long executions, warm start to mimic steady-state behavior
usually applications only; some OS simulation
simulator “validation” & internal checks for accuracy

Metrics that Measure Performance

Raw speed: peak performance (never attained)

Execution time: time to execute one program from beginning to end
• the “performance bottom line”
• wall clock time, response time
• Unix time function: 13.7u 23.6s 18:27 3%
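
The sample output above can be read with the csh `time` convention: user CPU seconds, system CPU seconds, elapsed wall-clock time, and CPU utilization. A minimal Python sketch, assuming that reading of the four fields (the function name `cpu_utilization` is made up for illustration), recomputes the last field from the first three:

```python
def cpu_utilization(user_s: float, system_s: float, elapsed_s: float) -> float:
    """Fraction of the elapsed (wall-clock) time spent on the CPU for this program."""
    return (user_s + system_s) / elapsed_s

# Fields taken from the sample output: 13.7u 23.6s 18:27 3%
ELAPSED_S = 18 * 60 + 27            # 18:27 elapsed, converted to seconds
UTILIZATION = cpu_utilization(13.7, 23.6, ELAPSED_S)

print(f"CPU time:    {13.7 + 23.6:.1f} s")
print(f"Elapsed:     {ELAPSED_S} s")
print(f"Utilization: {UTILIZATION:.1%}")
```

The program used about 37 seconds of CPU during roughly 18 minutes of wall-clock time, which is why execution time and CPU time have to be kept apart when reporting results.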

Throughput: total amount of work completed in a given time
• transactions (database) or packets (web servers) / second
• an indication of how well hardware resources are being used
• good metrics for chip designers or managers of computer systems

(Often improving execution time will improve throughput & vice versa.)

Component metrics: subsystem performance, e.g., memory behavior
• help explain how execution time was obtained
• pinpoint performance bottlenecks

Execution Time

Performance_A = 1 / ExecutionTime_A

Processor A is faster than processor B, i.e.,

ExecutionTime_A < ExecutionTime_B

Performance_A > Performance_B

Relative Performance

Performance_A / Performance_B = ExecutionTime_B / ExecutionTime_A = n

performance of A is n times greater than B

execution time of B is n times longer than A
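
As a small illustration of the ratio, here is a Python sketch. The function name and the two execution times are hypothetical, chosen only to make the arithmetic easy to follow:

```python
def relative_performance(time_a_s: float, time_b_s: float) -> float:
    """Return n such that A is n times faster than B.

    n = Performance_A / Performance_B = ExecutionTime_B / ExecutionTime_A
    """
    return time_b_s / time_a_s

# Hypothetical measurements: A takes 10 s, B takes 15 s on the same program.
N = relative_performance(time_a_s=10.0, time_b_s=15.0)
print(f"A is {N} times faster than B")
```

Note that the ratio is taken over execution times for the same program and input; comparing times from different workloads would not give a meaningful n.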

CPU Execution Time

The time the CPU spends executing an application
• no memory effects
• no I/O
• no effects of multiprogramming

CPUExecutionTime = CPUClockCycles * ClockCycleTime

The clock is specified either as a cycle time (clock period) or as a rate (frequency)
• clock cycle time = 1 / clock cycle rate

CPUExecutionTime = CPUClockCycles / ClockCycleRate

• clock cycle rate of 1 MHz = cycle time of 1 μs
• clock cycle rate of 1 GHz = cycle time of 1 ns
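
The two forms of the equation give the same answer. A short Python sketch follows; the function names and the cycle count are hypothetical and only serve the illustration:

```python
def cpu_time_from_cycle_time(cycles: float, cycle_time_s: float) -> float:
    """CPUExecutionTime = CPUClockCycles * ClockCycleTime."""
    return cycles * cycle_time_s

def cpu_time_from_rate(cycles: float, rate_hz: float) -> float:
    """CPUExecutionTime = CPUClockCycles / ClockCycleRate."""
    return cycles / rate_hz

CYCLES = 2_000_000_000      # hypothetical: 2 billion clock cycles
RATE_HZ = 1e9               # 1 GHz
CYCLE_TIME_S = 1 / RATE_HZ  # 1 ns

T_RATE = cpu_time_from_rate(CYCLES, RATE_HZ)
T_PERIOD = cpu_time_from_cycle_time(CYCLES, CYCLE_TIME_S)
print(f"{T_RATE:.3f} s using the rate, {T_PERIOD:.3f} s using the cycle time")
```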

CPI

CPUClockCycles = NumberOfInstructions * CPI

Average number of clock cycles per instruction
• throughput metric
• component metric, not a measure of performance
• used for processor organization studies, given a fixed compiler & ISA

Can have different CPIs for classes of instructions
e.g., floating point instructions take longer than integer instructions

CPUClockCycles = Σ_i (CPI_i × C_i)

where CPI_i = the CPI for a particular class of instructions
where C_i = the number of instructions of the ith class that have been executed

Improving part of the architecture can improve a CPI_i
• Talk about the contribution to CPI of a class of instructions
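
The per-class sum and the "contribution to CPI" of each class can be sketched in Python. The instruction mix below is hypothetical (made-up CPIs and counts, not measurements), and the helper names are introduced only for this example:

```python
# Hypothetical instruction mix: class name -> (CPI_i, C_i).
# These numbers are invented for illustration, not measured on any machine.
MIX = {
    "integer": (1, 600),
    "floating point": (4, 200),
    "load/store": (2, 200),
}

def total_cycles(mix):
    """CPUClockCycles = sum over classes of CPI_i * C_i."""
    return sum(cpi * count for cpi, count in mix.values())

def average_cpi(mix):
    """Overall CPI = total cycles / total instructions."""
    instructions = sum(count for _, count in mix.values())
    return total_cycles(mix) / instructions

def cpi_contribution(mix):
    """Share of the overall CPI due to each class: CPI_i * C_i / total instructions."""
    instructions = sum(count for _, count in mix.values())
    return {name: cpi * count / instructions for name, (cpi, count) in mix.items()}

print("cycles:", total_cycles(MIX))
print("average CPI:", average_cpi(MIX))
for name, share in cpi_contribution(MIX).items():
    print(f"  {name}: {share:.2f}")
```

In this made-up mix the floating point class is only 20% of the instructions but contributes the largest share of the CPI, which is the kind of observation the per-class breakdown is meant to surface.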

CPU Execution Time

CPUExecutionTime = NumberOfInstructions * CPI * ClockCycleTime

To measure:
• execution time: depends on all 3 factors
  • time the program
• number of instructions: determined by the ISA
  • programmable hardware counters
  • profiling
    • count number of times each basic block is executed
  • instruction sampling
• CPI: determined by the ISA & implementation
  • simulator: interpret (in software) every instruction &
    calculate the number of cycles it takes to execute it
• clock cycle time: determined by the implementation & process technology

Factors are interdependent:

• RISC: increases instructions/program, but decreases CPI &
clock cycle time because the instructions are simple
• CISC: decreases instructions/program, but increases CPI &
clock cycle time because many instructions are more complex
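
To see how the three factors trade off, here is a Python sketch of the product. Both design points are hypothetical; the instruction counts, CPIs and cycle times are invented to show the direction of each effect and say nothing about any real RISC or CISC machine:

```python
def cpu_execution_time(instructions: float, cpi: float, cycle_time_s: float) -> float:
    """CPUExecutionTime = NumberOfInstructions * CPI * ClockCycleTime."""
    return instructions * cpi * cycle_time_s

# Hypothetical design points for the same program (illustrative numbers only).
DESIGNS = {
    # more instructions, lower CPI, shorter cycle
    "RISC-like": {"instructions": 1.2e9, "cpi": 1.2, "cycle_time_s": 0.5e-9},
    # fewer instructions, higher CPI, longer cycle
    "CISC-like": {"instructions": 1.0e9, "cpi": 2.0, "cycle_time_s": 0.6e-9},
}

TIMES = {name: cpu_execution_time(**factors) for name, factors in DESIGNS.items()}
for name, seconds in TIMES.items():
    print(f"{name}: {seconds:.2f} s")
```

With other made-up numbers the comparison could go the other way; the point is only that no single factor decides the outcome.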

Metrics Not to Use

MIPS (millions of instructions per second)
instruction count / (execution time * 10^6) =
clock rate / (CPI * 10^6)
- instruction set-dependent (even true for similar architectures)
- implementation-dependent
- compiler technology-dependent
- program-dependent
+ intuitive: the higher, the better
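
A Python sketch of why MIPS can mislead: two hypothetical compilations of the same program on the same 1 GHz machine. All instruction counts and CPIs are invented for the example. The version with the higher MIPS rating is the slower one.

```python
CLOCK_RATE_HZ = 1e9  # hypothetical 1 GHz machine

def execution_time(instructions: float, cpi: float, clock_rate_hz: float = CLOCK_RATE_HZ) -> float:
    """Execution time = instruction count * CPI / clock rate."""
    return instructions * cpi / clock_rate_hz

def mips(instructions: float, exec_time_s: float) -> float:
    """MIPS = instruction count / (execution time * 10^6)."""
    return instructions / (exec_time_s * 1e6)

# Version X: many simple instructions. Version Y: fewer, more complex ones.
TIME_X = execution_time(10e9, cpi=1.0)
TIME_Y = execution_time(4e9, cpi=2.0)
MIPS_X = mips(10e9, TIME_X)
MIPS_Y = mips(4e9, TIME_Y)

print(f"X: {TIME_X:.1f} s, {MIPS_X:.0f} MIPS")
print(f"Y: {TIME_Y:.1f} s, {MIPS_Y:.0f} MIPS")
```

Y finishes sooner yet rates half the MIPS of X, which illustrates the compiler- and instruction-set dependence listed above.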

MFLOPS (millions of floating point operations per second)
floating point operations / (execution time * 10^6)
+ FP operations are independent of FP instruction implementation
- different machines implement different FP operations
- different FP operations take different amounts of time
- only measures FP code

static metrics (code size)


Means

Measuring the performance of a workload
• arithmetic: used for averaging execution times
• harmonic: used for averaging rates ("the average of",
as opposed to "the average statistic of")
• weighted means: the programs are executed with different frequencies, for example:

Means

            FP Ops   Time (secs)
                     Computer A   Computer B   Computer C
program 1   100               1           10           20
program 2   100            1000          100           20
total                      1001          110           40
arith mean                500.5           55           20

            FP Ops   Rate (FLOPS)
                     Computer A   Computer B   Computer C
program 1   100             100           10            5
program 2   100             0.1            1            5
harm mean                   0.2          1.8            5
arith mean                 50.1          5.5            5

Computer C is ~25 times faster than A when measuring execution time
Still true when measuring FLOPS (a rate) with the harmonic mean
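
The means can be recomputed from the per-program times. The Python sketch below uses the 100 FP operations per program and the execution times given in the table; the dictionary layout and the helper name `summarize` are choices made for this example:

```python
from statistics import harmonic_mean, mean

FP_OPS = 100  # floating point operations per program (from the table)

# Execution times in seconds for program 1 and program 2 (from the table).
TIMES = {"A": [1, 1000], "B": [10, 100], "C": [20, 20]}

def summarize(times):
    """Arithmetic mean of the times, plus both means of the derived rates."""
    rates = [FP_OPS / t for t in times]
    return {
        "arith_mean_time": mean(times),
        "harm_mean_rate": harmonic_mean(rates),
        "arith_mean_rate": mean(rates),
    }

SUMMARY = {name: summarize(times) for name, times in TIMES.items()}
for name, stats in SUMMARY.items():
    print(name, {key: round(value, 2) for key, value in stats.items()})
```

The harmonic mean of the rates equals total operations divided by total time (for Computer B, 200 / 110 ≈ 1.8), so it ranks the machines the same way execution time does. The arithmetic mean of the rates would instead rank Computer A first, even though A takes by far the longest to run the workload.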

Speedup

Speedup = ExecutionTime_beforeImprovement / ExecutionTime_afterImprovement

Amdahl’s Law:
Performance improvement from speeding up a part of a computer system is limited by the proportion of time the enhancement is used.
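
Amdahl's Law is usually written as Speedup = 1 / ((1 - f) + f / s), where f is the fraction of the original execution time that the enhancement applies to and s is the speedup of that part. The symbols f and s, the function name, and the example numbers below are not from the slides; this is a minimal Python sketch under that standard formulation:

```python
def amdahl_speedup(fraction_enhanced: float, speedup_enhanced: float) -> float:
    """Overall speedup when a fraction of the execution time is sped up.

    fraction_enhanced: share of the original time the enhancement applies to (0..1)
    speedup_enhanced:  how much faster that share becomes (> 0)
    """
    return 1.0 / ((1.0 - fraction_enhanced) + fraction_enhanced / speedup_enhanced)

# Hypothetical case: 40% of the time is spent in code that becomes 10x faster.
OVERALL = amdahl_speedup(0.4, 10.0)
# Upper bound if that 40% took no time at all.
LIMIT = 1.0 / (1.0 - 0.4)

print(f"overall speedup: {OVERALL:.4f}")
print(f"limit as the enhanced part approaches zero time: {LIMIT:.4f}")
```

Even an unboundedly fast enhancement cannot push the overall speedup past 1 / (1 - f), which is the limit the law describes.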
