(2010-02-27) Measuring Performance

This document discusses various methods for measuring computer performance, including benchmarks. It describes common metrics like instructions per cycle (IPC), millions of instructions per second (MIPS), and millions of floating-point operations per second (MFLOPS). It also discusses different types of benchmarks, including program kernels, toy programs, synthetic benchmarks, and the SPEC benchmark suites which use real-world programs modified to minimize I/O effects. The SPEC benchmarks provide two metrics - SPECratio measures how many times faster a system can perform a single task compared to a reference machine, while SPECrate measures throughput by calculating how many benchmark tasks can be completed within a time interval.

Uploaded by

i_2loveu3235

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

93 views

(2010-02-27) Measuring Performance

Uploaded by

i_2loveu3235

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 11

Measuring performance

Kosarev Nikolay
MIPT
Feb, 2010

Agenda
Performance measures
Benchmarks
Summarizing results

Performance measures
Time to perform an individual operation
The first metric. Used if most instructions take the same
execution time.

Instruction mix
Idea is to categorize all instructions into classes by cycles
required to execute an instruction. Average instruction
execution time is calculated (IPC if measured in cycles).
Gibson instruction mix [1970]. Proposed weights for a set of
predefined instruction classes (based on programs running on
IBM 704 and 650)
Depends on the program executed, instruction set. Could be
optimized by compiler. Ignores major performance impacts
(memory hierarchy etc.)

Performance measures (cont.)

MIPS (millions of instructions per second)
Depends on instruction set (the heart of the differences between
RISC and CISC).

Relative MIPS. DEC VAX-11/780 (1 MIPS computer, reference

machine). Relative MIPS of machine M for predefined benchmark:

MFLOPS

(millions of floating-point operations per second)

Metric for supercomputers, tries but not corrects the primary

MIPS shortcoming

Performance measures (cont.)

Execution time
Ultimate measure of performance for a given application, consistent
across systems.
Total execution time (elapsed time). Includes system-overhead
effects (I/O operation, memory paging, time-sharing load, etc).
CPU time. Time spent for execution of application only by
microprocessor.
Better to report both measures for the end user.

Benchmarks
Program kernels
Small programs extracted from real applications. E.g. Livermore
Fortran Kernels (LFK) [1986].
Dont stress memory hierarchy in a realistic fashion, ignore operating
system.

Toy programs
Real applications but too small to characterize programs that are
likely to be executed by the users of a system. E.g. quicksort.

Synthetic benchmarks
Artificial programs, try to match profile and behavior of real
application. E.g. Whetstone [1976], Dhrystone [1984].
Ignore interactions between instructions (due to new ordering) that
lead to pipeline stalls, change of memory locality.

Benchmarks (cont.)
SPEC
SPEC (Standard Performance Evaluation Corporation)
Benchmark suites consist of real programs modified to be portable
and to minimize the effect of I/O activities on performance
5 SPEC generations: SPEC89, SPEC92, SPEC95, SPEC2000 and
SPEC2006 (used to measure desktop and server CPU performance)
Benchmarks organized in two suites: CINT and CFP
2 derived metrics: SPECratio and SPECrate
SPECSFS, SPECWeb (file server and web server benchmarks)
measure performance of I/O activities (from disk or network traffic)
as well as the CPU

Benchmarks (cont.)

Benchmarks (cont.)
SPECratio is a speed metric
How fast a computer can complete single task
Execution time normalized to a reference computer. Formula:

It measures how many times faster than a reference machine one

system can perform a task
Reference machine used for SPEC CPU2000/SPEC CPU2006 is Sun
UltraSPARC II system at 296MHz
Choice of the reference computer is irrelevant in performance
comparisons.

Benchmarks (cont.)
SPECrate is a throughput metric
Measures how many tasks the system completes within an
arbitrary time interval
Measured elapsed time from when all copies of one
benchmark are launched simultaneously until the last copy
finishes
Each benchmark measured independently
User is free to choose # of benchmark copies to run in order
to maximize performance
Formula
Reference factor normalization factor; benchmark duration is normalized to standard job length
(benchmark with the longest SPEC reference time). Unit time used to convert to unit of time more
appropriate for work (e.g. week)

Phy Interface Pci Express Sata Usb31 Architectures PIPE - Rev6 - 2 - 1
No ratings yet
Phy Interface Pci Express Sata Usb31 Architectures PIPE - Rev6 - 2 - 1
187 pages
Riscv Boom
No ratings yet
Riscv Boom
85 pages
Citi Quantum Computing
No ratings yet
Citi Quantum Computing
152 pages
Optimization For CSFB Call Delay
100% (1)
Optimization For CSFB Call Delay
11 pages
DDR5 Sdram
No ratings yet
DDR5 Sdram
2 pages
AMD Gem5 APU Simulator Micro 2015 Final PDF
No ratings yet
AMD Gem5 APU Simulator Micro 2015 Final PDF
62 pages
Efabless Caravel "Harness" Soc: Preliminary
No ratings yet
Efabless Caravel "Harness" Soc: Preliminary
30 pages
Constructing Effective UVM Testbench For DRAM Memory Controllers
No ratings yet
Constructing Effective UVM Testbench For DRAM Memory Controllers
5 pages
IB Hacme Casino User Guide
No ratings yet
IB Hacme Casino User Guide
31 pages
141 4 Motor Calculations
0% (1)
141 4 Motor Calculations
20 pages
Winshuttle Technical Architecture Guide Winshuttle Platform Whitepaper en
No ratings yet
Winshuttle Technical Architecture Guide Winshuttle Platform Whitepaper en
8 pages
Spec Cpu 2006
No ratings yet
Spec Cpu 2006
13 pages
A Case For CXL-Centric Sever Processors
No ratings yet
A Case For CXL-Centric Sever Processors
13 pages
Transparent Page Placement For CXL-Enabled Tiered Memory
No ratings yet
Transparent Page Placement For CXL-Enabled Tiered Memory
14 pages
01 Tutorial Intro Share
No ratings yet
01 Tutorial Intro Share
21 pages
Riscv Rocket Chip Tutorial Bootcamp Jan2015
No ratings yet
Riscv Rocket Chip Tutorial Bootcamp Jan2015
30 pages
The Berkeley Out - of - Order Machine (Boom!) : An Open - Source Industry - Compeeeve, Synthesizable, Parameterized Risc - V Processor
100% (1)
The Berkeley Out - of - Order Machine (Boom!) : An Open - Source Industry - Compeeeve, Synthesizable, Parameterized Risc - V Processor
45 pages
My CXL Presentation
100% (1)
My CXL Presentation
25 pages
03 Building Custom Socs
No ratings yet
03 Building Custom Socs
30 pages
CPU Design HOWTO PDF
No ratings yet
CPU Design HOWTO PDF
21 pages
2023 CXL DesignTradeoffs IEEE Micro
No ratings yet
2023 CXL DesignTradeoffs IEEE Micro
9 pages
GDC2003 Memory Optimization 18mar03
No ratings yet
GDC2003 Memory Optimization 18mar03
60 pages
Interrupts
No ratings yet
Interrupts
59 pages
PCIE Protocol
No ratings yet
PCIE Protocol
29 pages
Xeon D 1500 Datasheet Vol 1
No ratings yet
Xeon D 1500 Datasheet Vol 1
608 pages
RVfpga GettingStartedGuide
100% (1)
RVfpga GettingStartedGuide
103 pages
Risc V PDF
No ratings yet
Risc V PDF
117 pages
Pulpissimo: Datasheet: The Pulp Team
No ratings yet
Pulpissimo: Datasheet: The Pulp Team
101 pages
Chapter 4 - Cache Memory: Luis Tarrataca
No ratings yet
Chapter 4 - Cache Memory: Luis Tarrataca
159 pages
DRAM Command Guide
No ratings yet
DRAM Command Guide
2 pages
21CS43 - Module 1
No ratings yet
21CS43 - Module 1
21 pages
RISCV
No ratings yet
RISCV
451 pages
Intel SATA Controller
No ratings yet
Intel SATA Controller
59 pages
PCI SIG Arch Overview
No ratings yet
PCI SIG Arch Overview
37 pages
UVM Based Verification Environment For USB 3 Physical Layer and LTSSM of Link Layer
No ratings yet
UVM Based Verification Environment For USB 3 Physical Layer and LTSSM of Link Layer
5 pages
14.25 Tao Liu Richard Ho UVM Based RISC V Processor Verification Platform
No ratings yet
14.25 Tao Liu Richard Ho UVM Based RISC V Processor Verification Platform
22 pages
PCS White Paper
No ratings yet
PCS White Paper
14 pages
VLSI Lecture02 OpenIDEA (정무경)
No ratings yet
VLSI Lecture02 OpenIDEA (정무경)
69 pages
Block Diagram of Intel Atom Processor
No ratings yet
Block Diagram of Intel Atom Processor
23 pages
Ch04 The Memory System
No ratings yet
Ch04 The Memory System
45 pages
ASU DDR5 Digital Presentation
No ratings yet
ASU DDR5 Digital Presentation
59 pages
Verdi: Automated Debug System
No ratings yet
Verdi: Automated Debug System
8 pages
Intel 82802 Firmware Hub
No ratings yet
Intel 82802 Firmware Hub
53 pages
Boom Template Github
No ratings yet
Boom Template Github
11 pages
Pci-Express (Peripheral Component Interconnect) : Root Complex
No ratings yet
Pci-Express (Peripheral Component Interconnect) : Root Complex
5 pages
AMD64 Architecture Programmers Manual
No ratings yet
AMD64 Architecture Programmers Manual
386 pages
PCIe Enumeration and Setup Via ChatGPT
No ratings yet
PCIe Enumeration and Setup Via ChatGPT
22 pages
DDR4 White Paper
No ratings yet
DDR4 White Paper
8 pages
Arnold An eFPGA-Augmented RISC-V SoC For Low Power Iot End Nodes
No ratings yet
Arnold An eFPGA-Augmented RISC-V SoC For Low Power Iot End Nodes
14 pages
William Stallings Computer Organization and Architecture 9 Edition
No ratings yet
William Stallings Computer Organization and Architecture 9 Edition
52 pages
7 Series Memory Controllers
100% (1)
7 Series Memory Controllers
36 pages
File: /home/binod/documents/allfmca p/rocket-chip-master/README - MD Page 1 of 7
No ratings yet
File: /home/binod/documents/allfmca p/rocket-chip-master/README - MD Page 1 of 7
7 pages
Slides CW Benini
No ratings yet
Slides CW Benini
23 pages
Coresight v3 0 Architecture Specification IHI0029E
No ratings yet
Coresight v3 0 Architecture Specification IHI0029E
280 pages
Layering Protocol Verif
No ratings yet
Layering Protocol Verif
8 pages
Embedded System: 1 History
No ratings yet
Embedded System: 1 History
11 pages
Csa Mod 2
100% (1)
Csa Mod 2
28 pages
Embedded Systems Design - 2: Dr. N. Mathivanan
No ratings yet
Embedded Systems Design - 2: Dr. N. Mathivanan
10 pages
NVM Express 1 - 1a
No ratings yet
NVM Express 1 - 1a
166 pages
Pcie Intel Specification
No ratings yet
Pcie Intel Specification
9 pages
Got and PLT PDF
100% (1)
Got and PLT PDF
9 pages
Tcl 8.5 Network Programming
From Everand
Tcl 8.5 Network Programming
Wojciech Kocjan
No ratings yet
Chapter4 Performance
No ratings yet
Chapter4 Performance
36 pages
IT401 Computer Organization and Architecture: Prasun Ghosal
No ratings yet
IT401 Computer Organization and Architecture: Prasun Ghosal
30 pages
Performance: Latency
No ratings yet
Performance: Latency
7 pages
SMT and CMP Architectures
100% (3)
SMT and CMP Architectures
19 pages
EEF011 Computer Architecture 計算機結構: Exploiting Instruction-Level Parallelism with Software Approaches
0% (1)
EEF011 Computer Architecture 計算機結構: Exploiting Instruction-Level Parallelism with Software Approaches
40 pages
Content: - Introduction To Pipeline Hazard - Structural Hazard - Data Hazard - Control Hazard
No ratings yet
Content: - Introduction To Pipeline Hazard - Structural Hazard - Data Hazard - Control Hazard
27 pages
MULTITHREADING
No ratings yet
MULTITHREADING
30 pages
MULTIcycle OPERATIONS
No ratings yet
MULTIcycle OPERATIONS
24 pages
Limitation of ILP
No ratings yet
Limitation of ILP
28 pages
Lec18-Static BRANCH PREDICTION VLIW
No ratings yet
Lec18-Static BRANCH PREDICTION VLIW
40 pages
Compiler Techniques For Exposing ILP
No ratings yet
Compiler Techniques For Exposing ILP
26 pages
3.hardware Support For Exposing Parallelism
No ratings yet
3.hardware Support For Exposing Parallelism
21 pages
Prgramming in C PDF
No ratings yet
Prgramming in C PDF
70 pages
1.symmetric and Distributed Shared Memory Architectures
79% (19)
1.symmetric and Distributed Shared Memory Architectures
29 pages
HLTE321E.11 User ManualV0.1 DOCjm
No ratings yet
HLTE321E.11 User ManualV0.1 DOCjm
45 pages
GIS Climate Change and Disaster Managment
No ratings yet
GIS Climate Change and Disaster Managment
15 pages
AR600E
100% (1)
AR600E
1 page
Rate G.711, Ie If The Voice Stream Is Not Compressed
No ratings yet
Rate G.711, Ie If The Voice Stream Is Not Compressed
15 pages
RMU
No ratings yet
RMU
20 pages
Hydraulic With POR) Control
No ratings yet
Hydraulic With POR) Control
3 pages
Wget A Noobs Guide
No ratings yet
Wget A Noobs Guide
7 pages
Ext Dir Comand
No ratings yet
Ext Dir Comand
6 pages
T&D-May 2011
No ratings yet
T&D-May 2011
92 pages
Predicting House Sale Price Using Fuzzy Logic, Artificial Neural Network and K-Nearest Neighbor
No ratings yet
Predicting House Sale Price Using Fuzzy Logic, Artificial Neural Network and K-Nearest Neighbor
6 pages
Đề thi lớp 10 - BẠC LIÊU 2023
No ratings yet
Đề thi lớp 10 - BẠC LIÊU 2023
3 pages
NWC HaTrang
No ratings yet
NWC HaTrang
15 pages
Avic-F960bt CRT5513
No ratings yet
Avic-F960bt CRT5513
8 pages
Accounting Analytics 1
No ratings yet
Accounting Analytics 1
44 pages
Web Technology LAB MANUAL-2024 (1)
No ratings yet
Web Technology LAB MANUAL-2024 (1)
50 pages
Anurag Nayak Report
No ratings yet
Anurag Nayak Report
36 pages
Bicicleta Lugano (CT700-DA)
No ratings yet
Bicicleta Lugano (CT700-DA)
1 page
Am STR 06029 01
No ratings yet
Am STR 06029 01
19 pages
Adding Sub Screen in CS01: Requirement
No ratings yet
Adding Sub Screen in CS01: Requirement
10 pages
List of Drawings TEFCO 16-05-2024
No ratings yet
List of Drawings TEFCO 16-05-2024
1 page
Spectroquant Prove 600: Technical Data Sheet
No ratings yet
Spectroquant Prove 600: Technical Data Sheet
3 pages
Chapter 1 - Economics of Power Generation
100% (2)
Chapter 1 - Economics of Power Generation
18 pages
b4j Wolverine Rmax4
No ratings yet
b4j Wolverine Rmax4
618 pages
Smart Care Service Pre-Installation Checklist
No ratings yet
Smart Care Service Pre-Installation Checklist
8 pages
Web Application Security
No ratings yet
Web Application Security
55 pages