Advanced Buses

The document discusses the AMBA Multi-layer AHB interconnect which enables parallel access between masters and slaves. It is fully compatible with AHB wrappers and is a topology rather than protocol evolution. The multi-layer AHB uses a flexible matrix to connect multiple AHB layers, with arbitration stages to handle requests between layers. It can implement hierarchical systems by making slaves local to layers or connecting multiple slaves and masters per layer.


AMBA Multi-layer AHB

„ Enables parallel access paths between multiple masters and slaves
„ Fully compatible with AHB wrappers
„ It is a topology (not a protocol) evolution
„ Pure combinational matrix (scales poorly)

[Figure: Master1 and Master2 connected through the AHB interconnect matrix to several slaves]
Multi-Layer AHB implementation
„ The matrix is completely flexible and can be adapted
„ MUXes are the point arbitration stages
„ An AHB layer can be AHB-Lite: single master, no req/grant, no split/retry
Multi-layer AHB
„ A layer losing arbitration is made to wait by means of HREADY
„ While a layer is waited, the input stage samples its pipelined address and control signals
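As a rough illustration (not part of the AMBA specification), the Python sketch below models one output port of the matrix: the port arbitrates among the layers requesting its slave, stalls the losing layers by deasserting HREADY, and holds their sampled address/control in an input stage until they win. The layer names, the round-robin policy and the InputStage class are assumptions made for the example.

```python
# Minimal model of one slave port of a multi-layer AHB matrix (illustrative only).

class InputStage:
    """Holds the pipelined address/control of a layer that is being waited."""
    def __init__(self):
        self.held = None          # (haddr, hwrite) captured while stalled

    def capture(self, haddr, hwrite):
        if self.held is None:     # sample once, when the layer is first waited
            self.held = (haddr, hwrite)

    def release(self):
        held, self.held = self.held, None
        return held


class SlavePort:
    def __init__(self, layers):
        self.layers = layers                       # layer names, e.g. ["CPU", "DMA"]
        self.stages = {l: InputStage() for l in layers}
        self.last_winner = -1                      # round-robin pointer (assumed policy)

    def cycle(self, requests):
        """requests: {layer: (haddr, hwrite)} for layers addressing this slave.
        Returns (winner, granted_transfer, hready_per_layer)."""
        hready = {l: True for l in self.layers}
        if not requests:
            return None, None, hready
        # Simple round-robin arbitration among requesting layers (assumption).
        order = self.layers[self.last_winner + 1:] + self.layers[:self.last_winner + 1]
        winner = next(l for l in order if l in requests)
        self.last_winner = self.layers.index(winner)
        for l, (haddr, hwrite) in requests.items():
            if l != winner:
                hready[l] = False                  # losing layer is waited via HREADY
                self.stages[l].capture(haddr, hwrite)
        granted = self.stages[winner].release() or requests[winner]
        return winner, granted, hready


port = SlavePort(["CPU", "DMA"])
print(port.cycle({"CPU": (0x1000, False), "DMA": (0x2000, True)}))
print(port.cycle({"DMA": (0x2000, True)}))         # DMA retries and now wins
```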
Hierarchical systems

• Slaves accessed only by masters on a given layer can be made local to that layer
Multiple slaves
Multiple slaves can appear as a single slave to the matrix:
• combine low-bandwidth slaves
• group slaves accessed only by one master (e.g. a DMA controller)

Alternatively, a slave can be an AHB-to-APB bridge, allowing connection to multiple low-bandwidth slaves.
Multiple masters per layer

Combine masters that have low bandwidth requirements on a single layer.
Putting it all together…
The interconnect matrix and Slave4 are used for across-layer communication.
Dual port slaves

Common for off-chip SDRAM controllers:
• Master1: bandwidth-limited, high-priority traffic with low-latency requirements
• Master2: default traffic
Traffic mismatches
ƒ Independent tasks (matrix multiply): more than 2x
ƒ With & without semaphore synchronization
ƒ 8 processors (small cache)

[Chart: execution time for Shared, Bridging and MultiLayer interconnects, with and without semaphore synchronization; one case is annotated "Lower speedup!"]

Traffic mismatches degrade the benefits of topology evolution.


Crossbars

„ Application-level speedup at the cost of increased complexity in the crossbar logic
„ Scales poorly
„ area and delay scale with N²
„ Impractical beyond 10x10!
STBus
„ On-chip interconnect solution by STMicroelectronics
„ Multiple outstanding transactions with out-of-order completion
„ Type 1-3: increasing complexity (and performance)
„ Supports packets (request and response)
„ Support for protection, caches, locking
„ Deployed in a number of large-scale SoCs at STM
Transaction mapping
• Transaction level: the transaction is split into a request/response packet pair
• Packet level: request packet and response packet
• Cell level: each packet is broken down into a number of cells (tokens) depending on the bus width
• Signal level: physical encoding (e.g., req/gnt handshaking to transfer a cell)

Example: a LD8 transaction on a 32-bit STBus maps to 1 request packet and 1 response packet, i.e. 1 request cell and 2 response cells.
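A back-of-the-envelope sketch of the cell-level mapping, assuming (as in the example above) that the request packet of a load fits in a single cell and the response packet carries only the read data; the helper below is illustrative and not part of the STBus specification.

```python
import math

def stbus_cells(request_bytes, data_bytes, bus_width_bits):
    """Rough cell count for a load: the request carries address/opcode only,
    the response carries the read data. Each cell is one bus-width token."""
    cell_bytes = bus_width_bits // 8
    request_cells = max(1, math.ceil(request_bytes / cell_bytes))
    response_cells = max(1, math.ceil(data_bytes / cell_bytes))
    return request_cells, response_cells

# LD8 (8-byte load) on a 32-bit STBus: 1 request cell, 2 response cells.
print(stbus_cells(request_bytes=4, data_bytes=8, bus_width_bits=32))  # -> (1, 2)
```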
Type 1-2-3

[Table: feature comparison of STBus Type 1, Type 2 and Type 3; Type 2 is marked as equivalent to AHB functionality]
Topology – Shared Bus

Low performance, low cost


Topology – Full Crossbar

High performance, high wiring complexity and cost


Read on STBus

[Waveform: read transaction on the STBus]

Analysis: Protocol differences

[Waveforms comparing AMBA and STBus transfer timing]
Protocol matching
Building blocks: STBus node, upsize converter, downsize converter, frequency converter.

STBus at work

[Block diagram: IP traffic generators (IPTG) and IP blocks, including a VLIW LX core, connected through a Type 2 node (128 bit, 166 MHz) and Type 3 nodes (64 bit, 250 MHz and 166 MHz), with an LMI and an off-chip memory controller]
Critical overview

„ The protocol is not fully transaction-centric
„ An initiator cannot be connected directly to a target
„ Packets are atomic on the interconnect
„ An initiator cannot issue or receive multiple packets at the same time
„ Large data transfers may starve other initiators
„ Complex bridge engineering
„ Bridges are protocol-specific
AMBA 3.0 (AMBA AXI)
• High-bandwidth, low-latency designs
• High-frequency operation
• Flexibility in the implementation
• Backward compatible with AHB and APB
• Burst-based transactions with only the first address issued
• Address information can be issued ahead of the actual data transfer
• Multiple outstanding addresses
• Out-of-order transaction completion
• Easy addition of register stages for timing closure
Topology – Partial Crossbar

Design paradigm change

[Figure: masters and slaves attach to the communication architecture through point-to-point AXI initiator/target interfaces]

• Point-to-point interface specification
• Independent of the details of the communication architecture
• The communication architecture can freely evolve
• Transaction-based specification of the interface
• Open Core Protocol (OCP) is another example of this paradigm
Internal data lanes
[Figure: AXI masters and slaves connected through internal data lanes, shown both as a crossbar and as a shared bus]

Most systems use one of three interconnect approaches:
- shared address and data buses
- shared address buses and multiple data buses
- multi-layer, with multiple address and data buses
Channel-based Architecture
„ Five groups of signals
„ Read Address: "AR" signal name prefix
„ Read Data: "R" signal name prefix
„ Write Address: "AW" signal name prefix
„ Write Data: "W" signal name prefix
„ Write Response: "B" signal name prefix

[Figure: the five channels - read address, write address, write data, read data, write response]

Channels are independent and asynchronous with respect to each other.


Read transaction

[Waveform: read transaction; a single address is issued for the burst transfer]

Write transaction
Channels - One way flow

Write Address channel (AW): AWVALID, AWADDR, AWLEN, AWSIZE, AWBURST, AWLOCK, AWCACHE, AWPROT, AWID, AWREADY
Write Data channel (W): WVALID, WLAST, WDATA, WSTRB, WID, WREADY
Read Data channel (R): RVALID, RLAST, RDATA, RRESP, RID, RREADY
Write Response channel (B): BVALID, BRESP, BID, BREADY

„ Channel: a set of unidirectional information signals
„ Valid/Ready handshake mechanism
„ READY is the only return signal
„ Valid: the source interface has valid data/control signals
„ Ready: the destination interface is ready to accept data
„ Last: indicates the last word of a burst transaction
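A minimal cycle-level sketch of the valid/ready handshake in Python: a word is transferred only in a cycle where both VALID and READY are high, and the source keeps VALID asserted with stable data until the destination accepts it. The source/sink behaviour chosen below is an assumption for the example.

```python
def handshake(words, ready_pattern):
    """Simulate one channel: 'words' is the stream the source wants to send,
    'ready_pattern' gives the destination's READY value in each cycle.
    A transfer happens only when VALID and READY are both high."""
    sent, idx, trace = [], 0, []
    for cycle, ready in enumerate(ready_pattern):
        valid = idx < len(words)          # source asserts VALID while it has data
        data = words[idx] if valid else None
        if valid and ready:               # handshake: both high in the same cycle
            sent.append(data)
            idx += 1                      # only now may the source change its data
        trace.append((cycle, valid, ready, data))
    return sent, trace

sent, trace = handshake(["D0", "D1", "D2"], [0, 1, 1, 0, 1, 1])
print(sent)   # ['D0', 'D1', 'D2'] -- D0 is held stable until READY rises in cycle 1
```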
Burst support
• Variable-length bursts, from 1 to 16 data transfers per burst
• Bursts with a transfer size of 8-1024 bits
• Wrapping, incrementing and non-incrementing bursts
• Atomic operations, using locked accesses
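The sketch below derives the per-beat addresses of a burst from the single issued address, following the usual AXI rules for FIXED, INCR and WRAP bursts (wrap boundary = number of beats × bytes per beat); the helper itself is an illustrative assumption, not code from the specification.

```python
def burst_addresses(start, size_bytes, length, burst="INCR"):
    """Beat addresses of an AXI-style burst issued with a single start address.
    size_bytes: bytes per beat; length: number of beats (1..16);
    burst: 'FIXED' (non-incrementing), 'INCR' or 'WRAP'."""
    if burst == "FIXED":
        return [start] * length
    if burst == "WRAP":
        # Wrap boundary is aligned to (beats * bytes-per-beat).
        total = size_bytes * length
        lower = (start // total) * total
        addrs, addr = [], start
        for _ in range(length):
            addrs.append(addr)
            addr += size_bytes
            if addr >= lower + total:
                addr = lower              # wrap back to the boundary
        return addrs
    # INCR: simply increment by the beat size.
    return [start + i * size_bytes for i in range(length)]

print([hex(a) for a in burst_addresses(0x1008, 4, 4, "INCR")])  # 0x1008..0x1014
print([hex(a) for a in burst_addresses(0x1008, 4, 4, "WRAP")])  # 0x1008, 0x100c, 0x1000, 0x1004
```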
AMBA 2.0 AHB Burst
ADDRESS: A11 A12 A13 A14 A21 A22 A23 A31
DATA:    D11 D12 D13 D14 D21 D22 D23 D31

„ AHB Burst
„ Address and Data are locked together
„ Two pipeline stages
„ HREADY controls pipeline operation
AXI - One Address for Burst

ADDRESS: A11 A21 A31
DATA:    D11 D12 D13 D14 D21 D22 D23 D31

„ AXI Burst
„ One Address for entire burst
AXI - Outstanding Transactions

ADDRESS: A11 A21 A31
DATA:    D11 D12 D13 D14 D21 D22 D23 D31

„ AXI Burst
„ One Address for entire burst
„ Allows multiple outstanding addresses
Problem: Slow slave

ADDRESS: A11 A21 A31
DATA:    D11 D12

„ If one slave is very slow, all data is held up
Out-of-Order Completion
ADDRESS: A11 A21 A31
DATA:    D21 D22 D23 D31 D11 D12 D13 D14

„ Out-of-order completion is allowed
„ Fast slaves may return data ahead of slow slaves
„ Each transaction has an ID attached (given by the master interface)
„ Channels have ID signals (AWID, RID, etc.)
„ Transactions with the same ID must be ordered
„ The interconnect in a multi-master system must append another tag to the ID to make each master's ID unique
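A small sketch of this ID-extension idea: the interconnect prefixes each master's transaction ID with the master's port number, so IDs from different masters can never collide, while transactions issued by the same master keep their original (ordered) ID. The bit widths below are assumptions for the example.

```python
MASTER_ID_BITS = 4   # width of the IDs generated by each master IF (assumed)

def tag_id(master_port, master_id):
    """Interconnect-side ID: master port number prepended to the master's ID."""
    assert 0 <= master_id < (1 << MASTER_ID_BITS)
    return (master_port << MASTER_ID_BITS) | master_id

def untag_id(tagged):
    """Recover (master_port, master_id) so the response is routed back correctly."""
    return tagged >> MASTER_ID_BITS, tagged & ((1 << MASTER_ID_BITS) - 1)

# Two masters happen to use the same local ID 0x3: the tagged IDs differ.
print(hex(tag_id(0, 0x3)), hex(tag_id(1, 0x3)))   # 0x3 0x13
print(untag_id(tag_id(1, 0x3)))                   # (1, 3)
```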


AXI - Data Interleaving

ADDRESS: A11 A21 A31
DATA:    D21 D22 D11 D23 D12 D31 D13 D14

„ Returned data can even be interleaved
„ Gives maximum use of the data bus
„ Note - data within a burst is always in order
Burst read

VALID stays high until READY goes high; the valid/ready handshake regulates the data transfer.

Overlapping burst read

The address of the second burst is issued before the first burst completes.

Burst write
Register slices for max frequency

[Figure: a register slice inserted on the write data channel (WID, WDATA, WSTRB, WLAST, WVALID, WREADY)]

„ Channels are asynchronous
„ Register slices can be applied across any channel
„ Allows maximum frequency of operation by turning delay into latency
„ Allows the system topology to be matched to performance requirements
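A behavioural sketch of a register slice on a valid/ready channel: a small two-entry buffer that registers both the forward and the backward paths, adding one cycle of latency in exchange for shorter combinational paths. This is a common way to build such a slice, shown here in Python for illustration; it is not ARM's implementation.

```python
from collections import deque

class RegisterSlice:
    """Two-entry buffer between an upstream and a downstream valid/ready interface."""
    def __init__(self):
        self.buf = deque()

    def cycle(self, up_valid, up_data, down_ready):
        """One clock cycle. Returns (down_valid, down_data, up_ready)."""
        up_ready = len(self.buf) < 2          # registered backward path: accept while not full
        down_valid = len(self.buf) > 0
        down_data = self.buf[0] if down_valid else None
        # End-of-cycle updates: pop what the downstream accepted,
        # push what the upstream handed over.
        if down_valid and down_ready:
            self.buf.popleft()
        if up_valid and up_ready:
            self.buf.append(up_data)
        return down_valid, down_data, up_ready

slice_ = RegisterSlice()
for cyc, (v, d, r) in enumerate([(1, "D0", 0), (1, "D1", 1), (0, None, 1), (0, None, 1)]):
    print(cyc, slice_.cycle(v, d, r))   # D0 appears downstream one cycle after it is accepted
```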
Comparison
Memories configured with 2 wait states.

AHB: the arbitration latency and the slave response latency cannot be hidden.

STBus, low buffering: a new request is started while the previous response is still being processed.

STBus, high buffering: several requests are started while the responses are being processed.

AXI: the complex arbitration interleaves the transactions.
Scalability
„ Highly parallel benchmark (no slave bottlenecks)
[Charts: relative execution time of AHB, AXI, STBus and STBus (B) for 2, 4, 6 and 8 cores; left: 1 kB caches (low bus traffic), right: 256 B caches (high bus traffic)]
Scalability
[Charts: interconnect busy fraction and interconnect usage efficiency for AHB, AXI, STBus and STBus (B) with 2, 4, 6 and 8 cores]

„ With increasing contention, AXI and STBus show 80%+ efficiency, while AHB stays below 50%
„ Shared bus architectures saturate
Networks-on-Chip (NoCs)
Same paradigm as wide area networks and as large-scale multiprocessors.

[Figure: IP cores attach through network interfaces (NI), as masters or slaves, to a network of switches; a packet consists of a header, a payload and a tail and is transferred as a sequence of flits]

Clean separation at the session layer: cores issue end-to-end transactions, while the network deals with lower-level issues.
Modularity at the HW level: only two building blocks, the network interface and the switch.
Physical-design awareness: path segmentation and regular routing.
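An illustrative sketch of the initiator-side network interface: a transaction is packetized into a header flit (routing and command information), payload flits and a tail flit, sized to the link width. The field layout and the flit width are assumptions for the example.

```python
FLIT_BYTES = 4   # link width of the example network (assumed)

def packetize(dest, command, payload):
    """Split one transaction into HEADER / PAYLOAD / TAIL flits."""
    flits = [("HEADER", {"dest": dest, "cmd": command, "len": len(payload)})]
    for i in range(0, len(payload), FLIT_BYTES):
        flits.append(("PAYLOAD", payload[i:i + FLIT_BYTES]))
    flits.append(("TAIL", None))   # tail closes the packet (e.g. could carry a checksum)
    return flits

# A write of 10 bytes becomes 1 header flit, 3 payload flits and 1 tail flit.
for flit in packetize(dest=3, command="WRITE", payload=bytes(range(10))):
    print(flit)
```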
Shared buses vs NoCs
NoC pros:

- Each integrated IP core adds bus load capacitance
+ Only point-to-point, one-way links are used

- Bus timing problems in deep sub-micron designs
+ Better suited to the GALS paradigm

- Arbiter delay grows with the number of masters; the arbiter is instance-specific
+ Routing decisions are distributed; switches can be re-instantiated

- Bus bandwidth is shared among all masters
+ Aggregate bandwidth scales with the network dimension
Shared buses vs NoCs
NoC cons:

+ Once the bus is granted, bus access latency is zero
- Latency is unpredictable due to network congestion

+ Very low silicon cost
- High area cost

+ Simple bus-IP core interface
- The network-IP core interface can be very complex (e.g. packetization)

+ Design guidelines are well known
- New design paradigm