Tms320c64x Architecture

The TMS320C64x is a family of 16-bit VLIW DSPs from Texas Instruments that can process information at rates up to 8000 MIPS. The C64x has enhanced features over previous C62x DSPs such as double the number of registers, support for packed 8-bit and 64-bit data types, increased access to registers in the opposite register file, and additional functional units. Common applications of the C64x include wireless infrastructure equipment, DSL systems, and cable modems.

Uploaded by

Muru Gan

0% found this document useful (0 votes)

2K views

Tms320c64x Architecture

Uploaded by

Muru Gan

You are on page 1/ 29

TMS320C64x

• TMS320C64x is a family of 16-bit Very Long

Instruction Word (VLIW) DSP from Texas Instruments
• At clock rates of up to 1 GHz, C64x DSPs can process
information at rates up to 8000 MIPS
• C64x DSPs can do more work each cycle with built-in
extensions.
• They can process all C62x object code unmodified
(but not vice-versa)
Applications for the C64x

TMS320C64x can be used as a CPU in the following

devices:

 Wireless local base stations;

 Remote access server (RAS);
 Digital subscriber loop (DSL) systems;
 Cable modems;
 Multichannel telephony systems;
 Pooled modems;
New extensions

• Register file enhancements

• Data path extensions
• Packed data processing
• Additional functional unit hardware
• Increased orthogonality
Register file enhancements

• The ’C64x register file has double the number of

general-purpose registers than the ’C62x/’C67x cores
• There are 32 32-bit registers per data path
A0-A31 for file A and B0-B31 for file B
• A0 may also be used as a condition register bringing
the total to six condition registers.
• In all ’C6000 devices, registers A4-A7 and B4-B7 can
be used for circular addressing.
Packed data processing
• The ’C64x register file supports all the ’C62x data
types and extends this by additionally supporting
packed 8-bit types and 64-bit fixed-point data types.
• Packed data types store either four 8-bit values or
two 16-bit values in a single 32-bit register or four 16-
bit values in a 64-bit register pair.
• Besides being able to perform all the ’C62x
instructions, the ’C64x also contains many 8–bit and
16–bit extensions to the instruction set.
Eg: MPYU4 instruction performs four 8x8 unsigned
multiplies with a single instruction on a .M unit.
Data path extensions
• On the ’C64x, all eight of the functional units have
access to the register file on the opposite side via a
cross path.
• on the ’C62x/’C67x, only six functional units have
access to the register file on the opposite side via a
cross path; the .D units do not have a data cross
path.
• The ’C64x pipelines data cross path accesses
allowing multiple units per side to read the same
cross path source simultaneously.
• In ’C62x/’C67x, only one functional unit per data path
per execute packet could get an operand from the
opposite register file.
Additional Functional Unit Hardware
• the .L units can perform byte shifts and the .M units
can perform bi-directional variable shifts in addition to
the .S unit’s ability to do shifts.
• Bit-count and rotate hardware on the .M unit extends
support for bit-level algorithms such as binary
morphology, image metric calculations and encryption
algorithms.
Increased Orthogonality
• The .D unit can now perform 32-bit logical
instructions in addition to the .S and .L units.
• Also, the .D unit now directly supports load and store
instructions for double-word data values
Block diagram
L1 Program cache
Direct-mapped
SDRAM 16 K Bytes total

EMIF A
SBSRAM

ZBT RAM EMIF B

Enhanced L2
DMA
CPU
Memory
FIFO Controller 1024K
CORE
(64-channel) bytes
SRAM

.
I/O devices

L1 Data cache
2-way set-associative
16 K Bytes total
C64X CPU
Architecture Overview
• 2 (almost) identical fixed-point data paths that
each contain
– 1 ALU (The .L Unit)
– 1 Shifter (The .S Unit)
– 1 Multiplier (The .M Unit)
– 1 Adder/Subtractor used for address
generation (The .D Unit)
– 1 register file containing thirty-two 32-bit
registers
• The 8 execution units in the 2 data paths are
capable of executing up to 8 instructions in
parallel.
• Can operate on 8-, 16-, 32-, and 40-bit data

• Can perform double-word (64-bit) loads and

stores by using 2 registers for the one operation.
General-Purpose Register Files
 The C64x register file contains 32 32-bit registers (A0-
A31 for file A and B0-B31 for file B);
 can be used for data, pointers or conditions
 Values larger than 32 bits (40-bit long and 64-bit float
quantities) are stored in register pairs.
 Packed data types are: four 8-bit values or two 16-bit
values in a single 32-bit register, four 16-bit values in a
64-bit register pair.

Odd register 39 32 31 Even register 0

Zero filled
Delay Slots
• Delay slots mean “how many CPU cycles come
between the current instruction and when the
results of the instruction can be used by another
instruction”
• Single Cycle Instructions: 0 delay slots
• 16x16 Single Multiply and .M Unit non-multiply
Instructions: 1 delay slot
• Store: 0 delay slots
– If a load occurs before a store (either in parallel or not),
then the old data is loaded from memory before the new
data is stored.
– If a load occurs after a store, (either in parallel or not), then
the new data is stored before the data is loaded.
• C64x Multiply Extensions: 3 delay slots
• Load: 4 delay slots
• Branch: 5 delay slots
– The branch target is in the PG slot when the branch
condition is determined in E1. There are 5 slots between
PG and E1 when the branch target begins executing useful
code again.
Memory
 The C64x has different spaces for program and data memory;
 Uses two-level cache memory scheme;

Internal Memory
The C64x has a 32-bit byte-addressable memory with the
following features:

 Separate data and program address spaces;

 Large on chip RAM, up to 7MB;

 2-level cache;
 Single internal program memory port with an
instruction-fetch bandwidth of 256 bits;

 Two 64-bit internal data memory ports;

Memory Map (Internal and External
Memory)
• Level 1 Program Cache is 128 Kbit direct
mapped
• Level 1 Data cache is 128Kbit 2-way set-
associative
• Shared Level 2 Program/Data
Memory/Cache of 4Mbit
– Can be configured as mapped memory
– Cache (up to 256 Kbytes)
– Combination of the two
Memory Buses
• Instruction fetch using 32-bit address bus
and 256-bit data bus
• two 64-bit load buses (LD1 and LD2)
• two 64-bit store buses (ST1 and ST2)
Interrupts
• 16 prioritized interrupts: INT_00 to INT_15
• INT_00 has the highest priority and is dedicated
to RESET. This halts the CPU and returns it to
a known state
• The first four interrupts (INT_00 – INT_03) are
fixed and non maskable
• INT_01 – INT_03 are generally used to alert the
CPU of an impending hardware problem, such
as an imminent power failure
• The remaining interrupts are maskable and can
be programmed
Interrupt Performance
Consideration
• Overhead for all CPU interrupts is 7 cycles
• Interrupt latency is 11 cycles
• Interrupts can be recognized every 2
cycles
• 2 occurrences of a specific interrupt can
be recognized in 2 cycles
Peripheral Set
• 2 multichannel buffered audio serial ports
• 2 inter-integrated circuit bus modules (I2Cs)
• 3 multichannel buffered serial ports (McBSPs)
• 3 32-bit general-purpose timers
• 1 user-configurable 16-bit or 32-bit host-port interface
(HPI16/HPI32)
• 1 16-pin general-purpose input/output port (GP0) with
programmable interrupt/event generation modes
• 1 32-bit glueless external memory interface (EMIFA),
capable of interfacing to synchronous and asynchronous
memories and peripherals.
ZBT RAM
• Zero Bus Turnaround (ZBT) is a synchronous SRAM
architecture optimized for networking and
telecommunications applications.
• It can increase the internal bandwidth of a switch
fabric when compared to standard SyncBurst SRAM.
• The ZBT architecture is optimized for switching and
other applications with highly random READs and
WRITEs.
• ZBT SRAMs eliminate all idle cycles when turning the
data bus around from a WRITE operation to a READ
operation
Packaging – Top View
Packaging - Bottom View
Sum of products example

C code: TI TMS C64x code:

int DotP(short* m, short* n, int count) { LOOP:

int i, product, sum = 0; [A0] SUB .L1 A0, 1, A0
for(i = 0; i < count; i++)
| | [!A0] ADD .S1 A6, A5, A5
{
|| MPY .M1X B4, A4, A6
product = m[i] * n[i];
| | [B0] BDEC .S2 LOOP, B0
sum+=product;
} LDH .D1T1 *A3++, A4
return(sum); LDH .D2T2 *B5++, B4
}
Another code example
MIPS:

loop: LW R1, 0(R11)

MUL R2, R1, R10
SW R2, 0(R12)
ADDI R12, R12, #-4
ADDI R11, R11, #-4
BGTZ R12, loop

TI TMS C64x:

ADDK .S1 #-4,A11 || LDW .D1 A1,0(A11) || MVK .S2 #-4,B1

ADDK .S1 #-4,A11 || LDW .D1 A1,0(A11) || MUL .M1 A1,A10,A2 || ADDK .S2 #-12,B12
loop: ADDK .S1 #-4,A11 || LDW .D1 A1,0(A11) || MUL .M1 A1,A10,A2 || STW .D2x A2,0(B12) ||
ADD .L2 B12,B1,B12 || BGTZ .S2 B12, loop

ADD .L2 B12, B1, B12 || MUL .M1 A1,A10,A2 || STW .D2x A2,0(B12)
ADD .L2 B12, B1, B12 || STW .D2x A2,0(B12)
Special purpose instructions
Instruction Description Example Application
BITC4 Bit counter Machine vision
GMPY4 Galois Field MPY Reed Solomon support
SHFL Bit interleaving Convolution encoder
DEAL Bit de-interleaving Cable modem
SWAP4 Byte swap Endian swap
XPNDx Bit expansion Graphics
MPYHIx, MPYLIx Extended precision 16x32 MPYs Audio
AVGx Quad 8-bit, Dual 16-bit average Motion compensation
SUBABS4 Quad 8-bit Absolute of Motion estimation
differences
SSHVL, SSHVR Signed variable shift GSM
THE END

Microcontroller 8051
No ratings yet
Microcontroller 8051
72 pages
Arm-Cortex m3
No ratings yet
Arm-Cortex m3
13 pages
Hardware Interfaces To 8051: 1. LCD 2. Keyboard 3. ADC 4. DAC 5. Stepper Motor 6. DC Motor
No ratings yet
Hardware Interfaces To 8051: 1. LCD 2. Keyboard 3. ADC 4. DAC 5. Stepper Motor 6. DC Motor
32 pages
Serial Communication Bus-Interface (Unit3)
100% (2)
Serial Communication Bus-Interface (Unit3)
44 pages
Experiment No. 1: Aim: Study of Tanner Tools THEORY: Tanner Tools
No ratings yet
Experiment No. 1: Aim: Study of Tanner Tools THEORY: Tanner Tools
24 pages
8-Bit Microprocessor: VLSI Architecture Project Report On
No ratings yet
8-Bit Microprocessor: VLSI Architecture Project Report On
35 pages
InternalArchitecture 8086 - PPT
100% (1)
InternalArchitecture 8086 - PPT
21 pages
LPC2148 UART Programming
No ratings yet
LPC2148 UART Programming
5 pages
Application Specific Processors
No ratings yet
Application Specific Processors
8 pages
FPGA PPT Presentation On Flow
No ratings yet
FPGA PPT Presentation On Flow
21 pages
CSL331 System Software and Microprocessor Lab
No ratings yet
CSL331 System Software and Microprocessor Lab
144 pages
Keil Interfacing Programs For 8051
No ratings yet
Keil Interfacing Programs For 8051
10 pages
Program For Searching A Number or Character in String For 8086
No ratings yet
Program For Searching A Number or Character in String For 8086
23 pages
Chapter 1.1-8085 Architecture-Introduction
100% (2)
Chapter 1.1-8085 Architecture-Introduction
34 pages
TMS320C54xx Instruction Set
No ratings yet
TMS320C54xx Instruction Set
338 pages
6th Unit DSP
No ratings yet
6th Unit DSP
34 pages
ARM7TDMI Processor
No ratings yet
ARM7TDMI Processor
44 pages
Cache Memory-Direct Mapping
0% (1)
Cache Memory-Direct Mapping
30 pages
GSM Based Home Security System
No ratings yet
GSM Based Home Security System
20 pages
Lecture 18 Conditional Jumps Instructions PDF
No ratings yet
Lecture 18 Conditional Jumps Instructions PDF
7 pages
Real Time Systems - 7th Sem - ECE - VTU - Unit 1 - Introduction To Real Time Systems - Ramisuniverse
100% (6)
Real Time Systems - 7th Sem - ECE - VTU - Unit 1 - Introduction To Real Time Systems - Ramisuniverse
10 pages
8051 Timer Counter
No ratings yet
8051 Timer Counter
40 pages
Keyboard Interfacing
No ratings yet
Keyboard Interfacing
24 pages
Fundamentals of Programming With DSK6713
100% (1)
Fundamentals of Programming With DSK6713
13 pages
Unit I Embedded Computing: 2 Marks Questions
No ratings yet
Unit I Embedded Computing: 2 Marks Questions
28 pages
8085
No ratings yet
8085
67 pages
Ahb-Apb Bridge
No ratings yet
Ahb-Apb Bridge
15 pages
8096 Microcontroller
67% (3)
8096 Microcontroller
51 pages
Question Bank: Discrete Fourier Transforms & Fast Fourier Transforms
No ratings yet
Question Bank: Discrete Fourier Transforms & Fast Fourier Transforms
10 pages
Verilog Code of ROM With Testbenches
No ratings yet
Verilog Code of ROM With Testbenches
10 pages
8086 Instruction Format
No ratings yet
8086 Instruction Format
60 pages
Elective III-410252C-Embedded and Real Time Operating Systems Question Bank
100% (1)
Elective III-410252C-Embedded and Real Time Operating Systems Question Bank
2 pages
The 8086 Microprocessor Supports 8 Types of Instructions
No ratings yet
The 8086 Microprocessor Supports 8 Types of Instructions
6 pages
AP Unit 3
No ratings yet
AP Unit 3
133 pages
Multiplexer, Demultiplexer and Encoder With Simulation and RTL Schematic
No ratings yet
Multiplexer, Demultiplexer and Encoder With Simulation and RTL Schematic
26 pages
DR Raj Kamal RTOSMobSystem
No ratings yet
DR Raj Kamal RTOSMobSystem
66 pages
Arm Assembly Programs
No ratings yet
Arm Assembly Programs
8 pages
Programmable Interrupt Controller: Submitted To
No ratings yet
Programmable Interrupt Controller: Submitted To
15 pages
ARM INstruction Set
No ratings yet
ARM INstruction Set
6 pages
Department of Electronics and Communication 8051 Microcontroller and Interfacing Objective Questions and Answers Iii Year Unit - I
No ratings yet
Department of Electronics and Communication 8051 Microcontroller and Interfacing Objective Questions and Answers Iii Year Unit - I
20 pages
Unit 2 Architecture of 8051 Microcontroller
No ratings yet
Unit 2 Architecture of 8051 Microcontroller
25 pages
3 Stage and 5 Stage ARM
No ratings yet
3 Stage and 5 Stage ARM
4 pages
Dsp12 - PP 8 Point Radix-2 Dit-Fft
No ratings yet
Dsp12 - PP 8 Point Radix-2 Dit-Fft
37 pages
Microcontroller Notes MODULE 1
100% (2)
Microcontroller Notes MODULE 1
49 pages
Dspa 17ec751 M5
No ratings yet
Dspa 17ec751 M5
34 pages
EE8551-Microprocessors and Microcontrollers
No ratings yet
EE8551-Microprocessors and Microcontrollers
13 pages
8086 Interview Questions:: 8086 Microprocessor
No ratings yet
8086 Interview Questions:: 8086 Microprocessor
20 pages
DC Lab Exp6 17l238 Rep
No ratings yet
DC Lab Exp6 17l238 Rep
12 pages
Questions From 10 Question Papers: Module 1: Number Systems and Codes
No ratings yet
Questions From 10 Question Papers: Module 1: Number Systems and Codes
4 pages
Week 6: Assignment Solutions
No ratings yet
Week 6: Assignment Solutions
4 pages
Module 1
No ratings yet
Module 1
79 pages
Dual-Port Memory Block Diagram PDF
No ratings yet
Dual-Port Memory Block Diagram PDF
8 pages
Characteristics of DSP
100% (1)
Characteristics of DSP
15 pages
Programmable Interval Timer - 8254
No ratings yet
Programmable Interval Timer - 8254
27 pages
A-Overview of EE3463: Intro To Micro's - Lab 1
No ratings yet
A-Overview of EE3463: Intro To Micro's - Lab 1
48 pages
Archi 8086
No ratings yet
Archi 8086
3 pages
3 2nd Theory
No ratings yet
3 2nd Theory
9 pages
new lec18
No ratings yet
new lec18
28 pages
IA32 Instruction Set (Short Form)
No ratings yet
IA32 Instruction Set (Short Form)
79 pages
80386dx_basics.microprocessor
No ratings yet
80386dx_basics.microprocessor
55 pages
Question Bank
No ratings yet
Question Bank
2 pages
Ec1483-Hci-Lesson Plan
No ratings yet
Ec1483-Hci-Lesson Plan
4 pages
Hci-Unit-2-Short Notes
No ratings yet
Hci-Unit-2-Short Notes
28 pages
Web Interface
No ratings yet
Web Interface
25 pages
Enggcoladdress
No ratings yet
Enggcoladdress
13 pages
Fluke - An - pm668
No ratings yet
Fluke - An - pm668
6 pages
Information Technology in Insurance Sector
0% (1)
Information Technology in Insurance Sector
21 pages
EXamen 8 Ciberseguridad
No ratings yet
EXamen 8 Ciberseguridad
14 pages
Syllabus Primavera P6
No ratings yet
Syllabus Primavera P6
6 pages
Working Procedure of Email CN 1
No ratings yet
Working Procedure of Email CN 1
4 pages
Acapdev Lab Experiment 7
No ratings yet
Acapdev Lab Experiment 7
9 pages
Vba 21 4192 Are
No ratings yet
Vba 21 4192 Are
2 pages
CV Assignment 2 RecognitionAR
No ratings yet
CV Assignment 2 RecognitionAR
5 pages
Can DFS Replication Communicate With FRS
No ratings yet
Can DFS Replication Communicate With FRS
29 pages
LECTURE2 Historical Antecedents of Science and Technology
No ratings yet
LECTURE2 Historical Antecedents of Science and Technology
5 pages
Finisar Ftlx6875mcc 10g DWDM 80km Multi-Rate High Optical Output Tunable SFP T-SFP Optical Transceiver Product Specification A01
No ratings yet
Finisar Ftlx6875mcc 10g DWDM 80km Multi-Rate High Optical Output Tunable SFP T-SFP Optical Transceiver Product Specification A01
14 pages
Ceh PDF
No ratings yet
Ceh PDF
16 pages
Problems in Uncertainty With Solutions Physics 1
No ratings yet
Problems in Uncertainty With Solutions Physics 1
13 pages
Why Micropython: Marco Zennaro, PHD Ictp
No ratings yet
Why Micropython: Marco Zennaro, PHD Ictp
35 pages
User's Manual For Thermal Printer
No ratings yet
User's Manual For Thermal Printer
17 pages
FM Tuner MODEL V-2952: Features
No ratings yet
FM Tuner MODEL V-2952: Features
2 pages
Computer Assisted Language Learning
50% (2)
Computer Assisted Language Learning
5 pages
FOC DOCUMENTATION Sem - 2 NEwwwww
No ratings yet
FOC DOCUMENTATION Sem - 2 NEwwwww
74 pages
Manually Refreshing Materialized Views and Creating Refresh Groups in Oracle
No ratings yet
Manually Refreshing Materialized Views and Creating Refresh Groups in Oracle
2 pages
Aesthetic Wallpapers - Google Search 3
No ratings yet
Aesthetic Wallpapers - Google Search 3
1 page
Assessment management plan
No ratings yet
Assessment management plan
5 pages
Data Sheet: GL Fa-Gm
No ratings yet
Data Sheet: GL Fa-Gm
4 pages
Nekobin
No ratings yet
Nekobin
2 pages
A1000
No ratings yet
A1000
8 pages
Nec NL3224BC35-20 TFT LCD Module Specification Preliminary
No ratings yet
Nec NL3224BC35-20 TFT LCD Module Specification Preliminary
28 pages
A Brief Computer History
No ratings yet
A Brief Computer History
8 pages
MSRS & FTView Integration V1.6
No ratings yet
MSRS & FTView Integration V1.6
258 pages
Borders Maths Coursework
100% (2)
Borders Maths Coursework
5 pages
XX1 - My Quarq Conventional - F - Quarq - Power - Meters - Users - Guide PDF
No ratings yet
XX1 - My Quarq Conventional - F - Quarq - Power - Meters - Users - Guide PDF
136 pages
E-Bomb: A Weapon of Electrical Mass Destruction
100% (1)
E-Bomb: A Weapon of Electrical Mass Destruction
24 pages