Final Instruction Set

The document discusses the Thumb-2 instruction set used by the ARM Cortex M4 processor. It focuses on key features of Thumb-2 including its combination of 16-bit and 32-bit instructions to provide both code density and performance. The document then provides details on data types, commonly used directives, instruction sets, addressing modes, arithmetic and logical operations, and data conversion instructions supported by the Cortex M4 processor.

Module 4

ARM CORTEX M4-Instruction Set Summary

Dr. Nisha K C R
Professor-Dept. of ECE
New Horizon College of Engineering

Dr.K C R Nisha_Professor-ECE
Thumb 2 Technology
• Thumb-2 is a superset of the Thumb instruction set, adding new 32-bit instructions for more complex operations.
• Thumb-2 combines 16-bit and 32-bit instructions.
• Thumb-2 technology extended the Thumb Instruction Set Architecture (ISA) into a highly efficient and powerful instruction set that delivers significant benefits in ease of use, code size, and performance.
• The extended instruction set in Thumb-2 is a superset of the previous 16-bit Thumb instruction set, with additional 16-bit instructions alongside 32-bit instructions. It allows more complex operations to be carried out in Thumb state, which improves efficiency by reducing the number of switches between ARM state and Thumb state.
• Aimed at small-memory devices such as microcontrollers, and to keep the processor small, the Cortex-M3 and Cortex-M4 support only the Thumb-2 (and traditional Thumb) instruction set.

Data Types in Cortex M4

Commonly Used Directives

Instruction Set - Cortex M4

LIST OF SYMBOLS

Moving data within the processor
• Move data from one register to another
• Move data between a register and a special register (e.g., CONTROL, PRIMASK, FAULTMASK, BASEPRI)
• Move an immediate constant into a register

Immediate Addressing

Memory Access Instructions

Optional Modifier to mention the data

Indexed Addressing Mode

Data in Memory

Indexed Addressing Mode

Offset Data in Memory

Illustration

Memory Access Instructions

Example

LDR R3, [R0, R2, LSL #2] ; Read memory [R0+(R2 << 2)] into R3
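As a rough illustration of the address calculation in the example above, the following Python sketch models memory as a dictionary keyed by word address. The function name `ldr_reg_offset`, the `memory` dictionary, and the sample values are assumptions made for this illustration only.

```python
MASK32 = 0xFFFFFFFF

def ldr_reg_offset(memory, base, index, shift):
    """Model LDR Rt, [Rn, Rm, LSL #shift]: read the word at Rn + (Rm << shift)."""
    address = (base + (index << shift)) & MASK32  # addresses wrap at 32 bits
    return memory[address]

# Hypothetical word array starting at 0x20000000; element i lives at base + 4*i.
memory = {0x20000000 + 4 * i: 100 + i for i in range(8)}

# Equivalent of: R0 = 0x20000000, R2 = 5, LDR R3, [R0, R2, LSL #2]
r3 = ldr_reg_offset(memory, base=0x20000000, index=5, shift=2)
print(r3)
```

Shifting the index left by 2 multiplies it by 4, which is why this form is convenient for indexing arrays of 32-bit words.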
Memory Access Instructions

Post Indexing
LDR R0, [R1], #offset ; Read memory[R1] into R0, then update R1 to R1+offset

STR R0, [R1], #12 ; Store R0 to memory[R1], then update R1 to R1+12
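A minimal sketch of post-indexed addressing, assuming a dictionary-based memory and register file (`ldr_post_index`, `memory`, and `regs` are hypothetical names). It shows that the access uses the old base value and that the base register is updated afterwards, which makes the mode useful for stepping through an array.

```python
def ldr_post_index(memory, regs, rt, rn, offset):
    """Model LDR Rt, [Rn], #offset: load from [Rn] first, then Rn = Rn + offset."""
    regs[rt] = memory[regs[rn]]
    regs[rn] = (regs[rn] + offset) & 0xFFFFFFFF

memory = {0x20000000: 11, 0x20000004: 22, 0x20000008: 33}
regs = {"R0": 0, "R1": 0x20000000}

values = []
for _ in range(3):  # walk the array one word at a time
    ldr_post_index(memory, regs, "R0", "R1", 4)
    values.append(regs["R0"])
```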

Multiple load and Multiple store Instructions

LDR R4,=0x20000000 ; Set R4 to 0x20000000 (address)

LDMIA R4, {R0-R3} ; Read 4 words and store them to R0 - R3

Multiple load and Multiple store Instructions
• Suppose you want to load a subset of the registers, for example registers r0 to r3, from memory, where the data starts at address 0xBEEF0000 and continues upward in memory. The instruction would simply be
LDMIA r9, {r0-r3}
where the base register r9 holds the address 0xBEEF0000.
• The addressing mode used here is called Increment After, or IA. This says to increment the address after each value has been loaded from memory.

Multiple load and Multiple store
Instructions
• This has the same effect as four separate LDR instructions, or
LDR r0, [r9]
LDR r1, [r9, #4]
LDR r2, [r9, #8]
LDR r3, [r9, #12]
• At the end of the load sequence, register r9 has not been changed and still
holds the value 0xBEEF0000. If you wanted to load data into registers r0
through r3 and r12, you could simply add it to the end of the list, i.e.,
LDMIA r9, {r0-r3, r12}
• The lowest register will always be loaded from the lowest address in
memory, and the highest register will be loaded from the highest address.
For example, you could say
LDMIA r9, {r5, r3, r0-r2, r14}
and register r0 will be loaded first, followed by registers r1, r2, r3, r5, and r14.

For example, if register r10 contained 0x4000,
LDMIA r10, {r0, r1, r4}
would begin by loading register r0 with data from address 0x4000. The address is then
incremented by one word, so the second register, r1, is loaded with data from 0x4004, and
register r4 is loaded with data from 0x4008. Note that the base register itself is not updated
after the instruction completes, because no "!" suffix is given. The other suffixes indicate
whether the address is changed before or after each load or store, as well as whether it is
incremented or decremented, as shown in the figure. On the Cortex-M3 and Cortex-M4, only the
IA (Increment After) and DB (Decrement Before) forms are available.
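The register ordering and optional base update described above can be sketched in Python as follows. This is a simplified model under stated assumptions: registers are identified by their numbers, memory is a dictionary, and `ldmia` is a hypothetical helper name rather than anything defined by the architecture.

```python
def ldmia(memory, regs, base, reg_list, writeback=False):
    """Model LDMIA Rn{!}, {reg_list} with registers given as numbers 0-15."""
    address = regs[base]
    for r in sorted(set(reg_list)):  # lowest register gets the lowest address
        regs[r] = memory[address]
        address += 4                 # increment after each load
    if writeback:                    # only when the "!" suffix is present
        regs[base] = address

memory = {0x4000: 0xAA, 0x4004: 0xBB, 0x4008: 0xCC}
regs = {n: 0 for n in range(16)}
regs[10] = 0x4000

# LDMIA r10, {r0, r1, r4}; the order written in the list does not matter.
ldmia(memory, regs, base=10, reg_list=[4, 0, 1])
```

Calling the same helper with `writeback=True` corresponds to `LDMIA r10!, {...}` and leaves r10 pointing just past the last word read.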

Stack
• PUSH and POP make it conceptually easy to work with stacks, since the instruction implicitly contains the addressing mode.
• Suppose a stack starts at address 0x20000200, grows downward in memory (a full descending stack), and has two words pushed onto it with the following code:
        AREA Example3, CODE, READONLY
        ENTRY
SRAM_BASE EQU 0x20000200
        LDR sp, =SRAM_BASE ; initial stack pointer
        LDR r3, =0xBABEFACE
        LDR r4, =0xDEADBEEF
        PUSH {r3} ; SP = 0x200001FC, holds 0xBABEFACE
        PUSH {r4} ; SP = 0x200001F8, holds 0xDEADBEEF

        POP {r4} ; r4 = 0xDEADBEEF, SP = 0x200001FC
        POP {r3} ; r3 = 0xBABEFACE, SP = 0x20000200
stop    B stop ; stop program
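To make the stack pointer movement concrete, here is a small Python model of a full descending stack using the same starting address and data as the listing above. The class name `FullDescendingStack` and its dictionary-based memory are assumptions for illustration.

```python
class FullDescendingStack:
    """Full descending stack: SP points at the last pushed word and grows down."""

    def __init__(self, sp):
        self.sp = sp
        self.memory = {}

    def push(self, value):
        self.sp -= 4                              # decrement first
        self.memory[self.sp] = value & 0xFFFFFFFF  # then store

    def pop(self):
        value = self.memory[self.sp]              # read first
        self.sp += 4                              # then increment
        return value

stack = FullDescendingStack(0x20000200)
stack.push(0xBABEFACE)   # PUSH {r3}
stack.push(0xDEADBEEF)   # PUSH {r4}
sp_after_pushes = stack.sp
r4 = stack.pop()         # POP {r4}
r3 = stack.pop()         # POP {r3}
```

Popping in the reverse order of pushing restores both registers and returns the stack pointer to its initial value.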
Multiple load and Multiple store Instructions

LDR R8,=0x8000 ; Set R8 to 0x8000 (address)

STMIA R8!, {R0-R3} ; Store R0-R3 to memory; R8 changes to 0x8010 after the store

Push and Pop Instruction

Example

Recall - Memory Access Instructions

Arithmetic Operation

The carry bit is set on addition when the unsigned result wraps past the maximum value (from 255 to 0 in an 8-bit illustration, from 0xFFFFFFFF to 0 in a 32-bit register). The carry bit is cleared on subtraction when a borrow occurs, that is, when the result wraps from 0 to the maximum value.
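The carry behaviour can be checked with a short Python sketch for 32-bit values. The helper names `adds` and `subs` are assumptions that mirror the flag-setting ADDS and SUBS instructions; only the result and the carry flag are modelled.

```python
MASK32 = 0xFFFFFFFF

def adds(a, b):
    """Return (result, carry) for a 32-bit ADDS."""
    total = (a & MASK32) + (b & MASK32)
    return total & MASK32, int(total > MASK32)  # carry set on unsigned overflow

def subs(a, b):
    """Return (result, carry) for a 32-bit SUBS; carry = 1 means no borrow."""
    a &= MASK32
    b &= MASK32
    return (a - b) & MASK32, int(a >= b)        # carry cleared when a borrow occurs
```

For example, `adds(0xFFFFFFFF, 1)` wraps to 0 with the carry set, while `subs(0, 1)` wraps to 0xFFFFFFFF with the carry cleared.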

Arithmetic Data Operation

Generalized format

Note: If the optional S suffix is present, the condition code bits are updated according to the result.

Condition code bits contain the status of the previous arithmetic operation.
Logical Shift and Rotate Operations

Software uses Logical and Shift operations to

• Combine Information
• Extract Information
• Test Information

Unary Operation
Produces its result given a single input parameter
Ex. Negate, Complement, Increment, Decrement

Logical Operations

Generalized format

Note: The Z and N condition code bits are updated

Some forms of the flexible second operand (Operand2) may also affect the carry bit

Shift and Rotate Operations

Data Conversion Operations
Instructions are available for handling signed and unsigned extension of data; for example, to convert an 8-bit value to 32 bits, or a 16-bit value to 32 bits. Both the signed and unsigned instructions are available in 16-bit and 32-bit forms.

Data Conversion Operations
• For SXTB/SXTH, the data are sign extended using bit[7]/bit[15] of Rn.
• With UXTB and UXTH, the value is zero extended to 32 bits.

• Illustration: if R0 is 0x55AA8765 (0101 0101 1010 1010 1000 0111 0110 0101):

SXTB R1, R0 ; R1 = 0x00000065 ; 0000 0000 0000 0000 0000 0000 0110 0101
SXTH R1, R0 ; R1 = 0xFFFF8765 ; 1111 1111 1111 1111 1000 0111 0110 0101
UXTB R1, R0 ; R1 = 0x00000065 ; 0000 0000 0000 0000 0000 0000 0110 0101
UXTH R1, R0 ; R1 = 0x00008765 ; 0000 0000 0000 0000 1000 0111 0110 0101

• These instructions are useful for converting between different data types.
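The four extension operations can be modelled in a few lines of Python. The function names simply reuse the mnemonics in lower case and are not part of any library; the optional rotation of the 32-bit forms is not modelled.

```python
def sxtb(x):
    """Sign extend bits [7:0] to 32 bits."""
    v = x & 0xFF
    return v | 0xFFFFFF00 if v & 0x80 else v

def sxth(x):
    """Sign extend bits [15:0] to 32 bits."""
    v = x & 0xFFFF
    return v | 0xFFFF0000 if v & 0x8000 else v

def uxtb(x):
    """Zero extend bits [7:0] to 32 bits."""
    return x & 0xFF

def uxth(x):
    """Zero extend bits [15:0] to 32 bits."""
    return x & 0xFFFF

r0 = 0x55AA8765
results = [hex(f(r0)) for f in (sxtb, sxth, uxtb, uxth)]
```

With R0 = 0x55AA8765, only SXTH produces leading ones, because bit 15 of the input is set while bit 7 is clear.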

Data Conversion Operations

The 32-bit forms of these instructions can access high registers and can optionally rotate the input data before the extension operation.

Reverse Operation
• These instructions are usually used for converting data between little endian and big endian.
• The 16-bit form of these instructions can only access low registers (R0 to R7).

• REV reverses the byte order in a data word, and REV16 reverses the byte order inside each half-word.
For example, if R0 is 0x12345678, executing the following:
REV R1, R0 ; R1 will be 0x78563412
REV16 R2, R0 ; R2 will be 0x34127856
• REVSH reverses the byte order of the lower half-word and then sign extends the result.
For example, if R0 is 0x33448899, running:
REVSH R1, R0 ; R1 will become 0xFFFF9988
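The byte reversals above can be reproduced with a short Python sketch. The helper names are assumptions: `rev` models the full word reversal, `rev16` models the byte swap inside each half-word (the REV16 mnemonic), and `revsh` models the lower half-word swap followed by sign extension.

```python
def rev(x):
    """Reverse the four bytes of a 32-bit word."""
    return int.from_bytes((x & 0xFFFFFFFF).to_bytes(4, "little"), "big")

def rev16(x):
    """Swap the two bytes inside each 16-bit half-word."""
    x &= 0xFFFFFFFF
    return ((x & 0x00FF00FF) << 8) | ((x & 0xFF00FF00) >> 8)

def revsh(x):
    """Swap the bytes of the lower half-word, then sign extend to 32 bits."""
    low = x & 0xFFFF
    swapped = ((low & 0xFF) << 8) | (low >> 8)
    return swapped | 0xFFFF0000 if swapped & 0x8000 else swapped
```

Applying `rev` twice returns the original word, which is a quick sanity check when converting between little endian and big endian data.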

Bit field processing Instruction
To make the Cortex-M3 and Cortex-M4 processors an excellent architecture for control applications, these processors support a number of bit-field processing operations.

• BFC (Bit Field Clear) clears 1 to 32 adjacent bits in any position of a register.
The syntax of the instruction is:
BFC <Rd>, <#lsb>, <#width>

For example:
LDR R0,=0x1234FFFF ; 0001 0010 0011 0100 1111 1111 1111 1111
BFC R0, #4, #8 ; clear R0[11:4]
This will give R0 = 0x1234F00F.
Bit field processing Instruction

• BFI (Bit Field Insert) copies 1 to 32 bits (#width) from one register to any location (#lsb) in another register. The syntax is:

BFI <Rd>, <Rn>, <#lsb>, <#width>

For example:
LDR R0,=0x12345678
LDR R1,=0x3355AACC

BFI R1, R0, #8, #16 ; Insert R0[15:0] into R1[23:8]

This will give R1 = 0x335678CC.

Note:
The parameter #lsb indicates the least significant bit of the bit field (in other words, where to start in the bit pattern going from right to left) and should be in the range of 0 to 31. The #width parameter indicates the width of the bit field; this parameter should be in the range of 1 to (32-lsb).
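Both bit-field operations reduce to building a mask from #lsb and #width. The sketch below is a plain Python model with hypothetical helper names (`field_mask`, `bfc`, `bfi`); it also enforces the parameter ranges given in the note.

```python
MASK32 = 0xFFFFFFFF

def field_mask(lsb, width):
    """Mask with `width` ones starting at bit `lsb`, after range checking."""
    if not 0 <= lsb <= 31 or not 1 <= width <= 32 - lsb:
        raise ValueError("lsb must be 0..31 and width must be 1..(32 - lsb)")
    return ((1 << width) - 1) << lsb

def bfc(rd, lsb, width):
    """Model BFC Rd, #lsb, #width: clear the selected bit field."""
    return rd & ~field_mask(lsb, width) & MASK32

def bfi(rd, rn, lsb, width):
    """Model BFI Rd, Rn, #lsb, #width: insert Rn[width-1:0] at Rd[lsb]."""
    mask = field_mask(lsb, width)
    return (rd & ~mask & MASK32) | ((rn << lsb) & mask)
```

Bits of the destination outside the selected field are left unchanged in both cases.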

Compare and Test

The compare and test instructions are used to update the flags in the APSR, which may then be used by a conditional branch or conditional execution.
Program flow control
Several types of program flow control instructions are available:

• Branch instructions
• Function call instructions
• Conditional branch
• Combined compare and conditional branch
• Conditional execution (IF-THEN instruction)
• Table branch

A number of instructions can cause branch operations:

• Branch instructions (e.g., B, BX)
• A data processing instruction that updates R15 (PC) (e.g., MOV, ADD)
• A memory read instruction that writes to PC (e.g., LDR, LDM, POP)
Program flow control
Function Call Instructions
• To call a function, the Branch and Link (BL) instruction or Branch and Link with eXchange
(BLX) instructions can be used (Table 5.32).
• They execute the branch and at the same time save the return address to the Link
Register (LR), so that the processor can branch back to the original program after the
function call is completed.
When these instructions are executed:
• The Program Counter is set to the branch target address.
• The Link Register (LR/R14) is updated to hold the return address, which is the address of
the instruction after the executed BL/BLX instruction.
• If the instruction is BLX, the Thumb bit in EPSR will also be updated using the LSB of the
register holding the branch target address.

Program flow control

Conditional Branch Instructions

Conditional branches are executed conditionally based on
the current value in APSR (N, Z, C, and V flags, as shown in Table 5.33).

Program flow control

        CMP R0, #1 ; compare R0 to 1
        BEQ p2 ; if equal, go to p2
        MOVS R3, #1 ; R3 = 1
        B p3 ; go to p3
p2 ; label p2
        MOVS R3, #2 ; R3 = 2
p3 ; label p3
        ... ; other subsequent operations
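The following Python sketch shows, under simplifying assumptions, how CMP produces the N, Z, C and V flags and how a few condition codes are evaluated from them. The names `cmp_flags`, `CONDITIONS`, and `select_r3` are hypothetical, and only a handful of the condition codes are included.

```python
MASK32 = 0xFFFFFFFF

def cmp_flags(a, b):
    """Return the (N, Z, C, V) flags that CMP a, b (computing a - b) would set."""
    a &= MASK32
    b &= MASK32
    result = (a - b) & MASK32
    n = result >> 31                      # sign bit of the result
    z = int(result == 0)                  # result is zero
    c = int(a >= b)                       # no borrow
    v = ((a ^ b) & (a ^ result)) >> 31    # signed overflow
    return n, z, c, v

CONDITIONS = {
    "EQ": lambda n, z, c, v: z == 1,
    "NE": lambda n, z, c, v: z == 0,
    "HI": lambda n, z, c, v: c == 1 and z == 0,   # unsigned higher
    "GT": lambda n, z, c, v: z == 0 and n == v,   # signed greater than
    "LT": lambda n, z, c, v: n != v,              # signed less than
}

def select_r3(r0):
    """Mirror the branch example: R3 = 2 when R0 == 1, otherwise R3 = 1."""
    return 2 if CONDITIONS["EQ"](*cmp_flags(r0, 1)) else 1
```

The same flag values can satisfy a signed condition and an unsigned one differently: comparing 0xFFFFFFFF with 1 is "less than" when read as signed (-1 < 1) but "higher" when read as unsigned.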

Program flow control
CONDITIONAL EXECUTION- IF Then Instructions

• Besides conditional branches, Cortex-M3 and Cortex-M4 processors also support conditional execution.
• After an IT (IF-THEN) instruction is executed, up to four of the subsequent instructions can be conditionally executed based on the condition specified by the IT instruction and the APSR value.
• An IT instruction block consists of an IT instruction, with conditional execution details, followed by one to four conditional execution instructions.
• The conditional execution instructions can be data processing instructions or memory access instructions.
• The last conditional execution instruction in the IT block can also be a conditional branch instruction.
• The IT instruction statement contains the IT instruction opcode with up to three additional optional suffixes of "T" (then) and "E" (else), followed by the condition to check against, which uses the same condition symbols as conditional branches.
• The "T"/"E" suffixes indicate how many subsequent instructions are inside the IT instruction block, and whether they should or should not be executed if the condition is met.
Program flow control
CONDITIONAL EXECUTION- IF Then Instructions
Different combinations of "T" and "E" sequences are possible:
• Just one conditional execution instruction: IT
• Two conditional execution instructions: ITT, ITE
• Three conditional execution instructions: ITTT, ITTE, ITET, ITEE
• Four conditional execution instructions: ITTTT, ITTTE, ITTET, ITTEE, ITETT, ITETE, ITEET, ITEEE
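The list above follows a simple rule: the first instruction in the block is always a "then", and each of up to three further instructions is marked T or E. A short Python sketch (the function name `it_mnemonics` is an assumption) enumerates the combinations:

```python
from itertools import product

def it_mnemonics():
    """List every IT mnemonic: 'IT' plus zero to three 'T'/'E' suffix letters."""
    names = []
    for extra in range(4):  # number of instructions after the first one
        for suffix in product("TE", repeat=extra):
            names.append("IT" + "".join(suffix))
    return names

names = it_mnemonics()
print(len(names), names)
```

This gives 1 + 2 + 4 + 8 = 15 mnemonics in total, matching the four groups listed.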

To find Factorial

        AREA Prog2, CODE, READONLY
        ENTRY ; mark first instruction to execute
        MOV r6, #10 ; load n = 10 into r6
        MOV r7, #1 ; result starts at 1, so 0! = 1
loop    CMP r6, #0
        ITTT GT ; start of IF-THEN block
        MULGT r7, r6, r7 ; r7 = r7 * r6
        SUBGT r6, r6, #1 ; r6 = r6 - 1
        BGT loop ; end of IF-THEN block
stop    B stop ; stop program

Euclid's algorithm for computing the GCD of two positive integers (a, b) can be written as
while (a != b) {
    if (a > b) a = a - b;
    else b = b - a;
}
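For reference, the same subtraction-based loop can be run directly in Python. The function name `gcd_by_subtraction` and the input check are additions for this sketch; the loop body follows the pseudocode above.

```python
def gcd_by_subtraction(a, b):
    """Euclid's algorithm using repeated subtraction, for positive integers."""
    if a <= 0 or b <= 0:
        raise ValueError("both inputs must be positive integers")
    while a != b:
        if a > b:
            a -= b   # a = a - b
        else:
            b -= a   # b = b - a
    return a

print(gcd_by_subtraction(48, 18))
```

The check on the inputs matters because the loop never terminates if either value is zero.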

Table branch instructions
The Cortex-M3 and Cortex-M4 support two table branch instructions to implement branch tables:
TBB (Table Branch Byte) and TBH (Table Branch Half-word).
• TBB is used when all the entries in the branch table are organized as a byte array (the branch offset is less than 2 x 2^8 = 512 bytes).
• TBH is used when all the entries are organized as a half-word array (the branch offset is less than 2 x 2^16 = 128K bytes).

The TBB instruction has the syntax:

TBB [Rn, Rm]

where Rn stores the base address of the branch table and Rm is the branch table index.

The syntax of TBH is:

TBH [Rn, Rm, LSL #1]
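As a sketch of how the table entry becomes a branch target: the entry selected by the index is doubled and added to the PC value, where the PC reads as the address of the table branch instruction plus 4. The helper names and the sample addresses below are assumptions for illustration.

```python
def table_branch_target(pc, table, index):
    """Model TBB/TBH: branch to PC + 2 * table[index].

    pc is the value the instruction reads for PC (its own address + 4);
    table holds unsigned byte (TBB) or half-word (TBH) entries.
    """
    return (pc + 2 * table[index]) & 0xFFFFFFFF

def max_forward_offset(entry_bits):
    """Largest reachable offset for 8-bit (TBB) or 16-bit (TBH) entries."""
    return 2 * (2 ** entry_bits - 1)

# Hypothetical TBB at 0x08000100, so PC reads as 0x08000104.
target = table_branch_target(0x08000104, [2, 6, 10], index=1)
print(hex(target))
```

Because entries are unsigned, table branches only go forward, and the largest offsets (510 bytes for TBB, 131070 bytes for TBH) stay just under the 512-byte and 128K-byte limits quoted above.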

Saturation operations
• Saturation is commonly used in signal processing. For example, after certain operations such as amplification, the amplitude of a signal can exceed the maximum allowed output range.
• If the value is adjusted by simply cutting off the most significant bits, the resulting signal waveform could be completely distorted, as shown in the figure.
• The saturation operation reduces the distortion by forcing the value to the maximum allowed value.
• The distortion still exists, but if the value does not exceed the maximum range by too much, it is less noticeable.

Saturation operations
• The Cortex-M3 and Cortex-M4 processors support two instructions that provide saturation adjustment of signed and unsigned data:
SSAT (for signed data) and USAT (for unsigned data).

For example, if a 32-bit signed value is to be saturated into a 16-bit signed value, the following instruction can be used:
SSAT R1, #16, R0

Similarly, you can convert a 32-bit signed value to a 16-bit unsigned value using:

USAT R1, #16, R0
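A simplified Python model of the two instructions is sketched below, together with a truncating helper for comparison. The names `ssat`, `usat`, and `wrap16` are assumptions; the second element of each returned pair indicates whether saturation occurred, which is the situation in which the processor sets the Q flag in the APSR.

```python
def ssat(value, bits):
    """Model SSAT Rd, #bits, Rn on a signed input; returns (result, saturated)."""
    low, high = -(1 << (bits - 1)), (1 << (bits - 1)) - 1
    result = max(low, min(high, value))
    return result, result != value

def usat(value, bits):
    """Model USAT Rd, #bits, Rn on a signed input; returns (result, saturated)."""
    high = (1 << bits) - 1
    result = max(0, min(high, value))
    return result, result != value

def wrap16(value):
    """Keep only the low 16 bits and reinterpret them as signed (no saturation)."""
    value &= 0xFFFF
    return value - 0x10000 if value & 0x8000 else value

sample = 40000  # exceeds the 16-bit signed maximum of 32767
print(ssat(sample, 16), wrap16(sample))
```

Saturating 40000 clips it to 32767, whereas simply truncating to 16 bits turns it into a large negative number, which is the kind of distortion saturation is meant to avoid.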

Saturation operations
• Algorithms for handling speech data, adaptive control algorithms, and routines for filtering are often sensitive to quantization effects when implemented on a microprocessor or microcontroller.
• Saturated math is one approach to limiting such effects, especially when dealing with signed data.

• For example, consider the digital waveform in Figure 7.7, possibly the output of an adaptive predictor, where the values are represented by 16-bit signed integers.
• The largest positive value in a register would be 0x00007FFF and the most negative value would be 0xFFFF8000.
• If this signal were scaled in some way, it is quite possible that the largest value would overflow, effectively flipping the MSB of a value so that a positive number suddenly becomes negative, and the waveform might appear as in Figure 7.8.
• Using saturated math instructions, the signal would get clipped, and the waveform might appear as in Figure 7.9, not correcting the values but at least keeping them within limits.

Multiply and MAC Instructions

SIMD Instructions

For details, refer to textbook chapter 5, Table 5.53.

Floating Point
In C programming, you can define data as floating point values. For example, a value can be declared as single precision:
float pi = 3.141592F;

or double precision:
double pi = 3.1415926535897932384626433832795;

Floating point data allows the processor to handle a much wider data range (compared to integers or fixed point data) as well as very small values.

Floating point represents both very small and very large numbers using a limited number of bits in computer memory.
Ex.
0.00000000005 = 0.5 x 10^-10 -> small number
50000000000 = 5 x 10^10 -> large number
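To see how a 32-bit word covers both ends of this range, the sketch below splits a single precision value into its sign, exponent, and fraction fields, assuming the usual IEEE 754 single precision layout. The function name `float32_fields` is an assumption; `struct` is from the Python standard library.

```python
import struct

def float32_fields(value):
    """Round a value to single precision and split the 32-bit pattern into fields."""
    bits = struct.unpack("<I", struct.pack("<f", value))[0]
    sign = bits >> 31                 # 1 bit
    exponent = (bits >> 23) & 0xFF    # 8 bits, biased by 127
    fraction = bits & 0x7FFFFF        # 23 bits
    return bits, sign, exponent, fraction

for x in (1.5, 5e10, 5e-11):
    bits, sign, exponent, fraction = float32_fields(x)
    print(x, hex(bits), sign, exponent - 127, hex(fraction))
```

The large and small examples differ mainly in the exponent field (unbiased exponents of 35 and -35), which is what lets the same number of bits represent such different magnitudes.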

Special registers- CONTROL Registers

Floating Point Registers
• The Cortex-M4 processor has an optional floating point unit.
• In the architecture, the FPU is viewed as a co-processor.
• To be consistent with other ARM architectures, the floating point unit is defined as Co-Processor #10 and #11 in the CPACR programmer's model.
• It provides additional registers for floating point data processing, as well as a Floating Point Status and Control Register (FPSCR).

Floating Point Registers-Overview

• The FPU adds a number of registers to the processor system:
• CPACR (Co-processor Access Control Register) in the SCB (System Control Block)
– The CPACR register allows you to enable or disable the FPU. It is located at address 0xE000ED88.
• Floating point register bank
• Floating point Status and Control Register (FPSCR)

Floating Point Registers-Overview
• Floating point register bank
– The floating point register bank contains thirty-two 32-bit registers, which can be organized as sixteen 64-bit double-word registers.
– S0 to S15 are caller saved registers.
– S16 to S31 are callee saved registers.

Floating Point Registers
• S0 to S31/D0 to D15
• Each of the 32-bit registers S0 to S31 ("S" for single precision) can be accessed using floating point instructions, or accessed as a pair using the names D0 to D15 ("D" for double-word/double-precision).
• For example, S1 and S0 are paired together to become D0, and S3 and S2 are paired together to become D1.

Note: The floating point unit in the Cortex-M4 does not support double precision floating point calculations (floating point instructions can still be used for transferring double precision data).
Floating Point Status Control Registers

• Floating point status and control register (FPSCR)

• The FPSCR contains various bit fields for two purposes:
– To define some of the floating point operation behaviors
– To provide status information about the floating point operation results

Floating Point Registers

Memory-mapped floating point unit control registers

• In addition to the floating point register bank and FPSCR, the
floating point unit also introduces several additional memory-
mapped registers into the system.
– For example, the Coprocessor Access Control Register (CPACR) is used
to enable or disable the floating point unit.
– By default the floating point unit is disabled to reduce power
consumption.

Note: Before using any floating point instructions, the floating point unit
must be enabled by programming the CPACR register

Moving data within the processor

For the Cortex-M4 processor with the floating point unit, one can also:
• Move data between a register in the core register bank and a register in the floating point unit register bank
• Move data between registers in the floating point register bank
• Move data between a floating point system register (such as the FPSCR, the Floating point Status and Control Register) and a core register
• Move immediate data into a floating point register
Memory Access Instructions-FPU

Floating Point Instructions

Addition of Floating point numbers
Ex.1

Ex.2
        LDR r0, =0xE000ED88 ; address of CPACR
        LDR r1, [r0] ; read-modify-write: read CPACR
        ORR r1, r1, #(0xF << 20) ; enable CP10, CP11 (full access)
        STR r1, [r0] ; write back to CPACR
        DSB ; wait for the store to complete
        ISB ; restart the pipeline now that the FPU is enabled
        VMOV.F32 s0, #1.5 ; single-precision 1.5 (0x3FC00000)
        VMOV.F32 s1, s0 ; s1 = 1.5
        VADD.F32 s2, s1, s0 ; 1.5 + 1.5 = 3.0
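The bit pattern 0x3FC00000 that appears in the listing can be decoded with a short Python check, assuming IEEE 754 single precision. The helper names `bits_to_float32` and `float32_to_bits` are assumptions for this sketch. It suggests that the pattern represents 1.5 rather than 1.0, so the sum comes out as 3.0.

```python
import struct

def bits_to_float32(bits):
    """Interpret a 32-bit pattern as an IEEE 754 single precision value."""
    return struct.unpack("<f", struct.pack("<I", bits))[0]

def float32_to_bits(value):
    """Return the 32-bit pattern of a value rounded to single precision."""
    return struct.unpack("<I", struct.pack("<f", value))[0]

s0 = bits_to_float32(0x3FC00000)  # value loaded into s0
s1 = s0                           # VMOV s1, s0
s2 = s1 + s0                      # VADD s2, s1, s0
print(s0, s2, hex(float32_to_bits(s2)))
```

For comparison, single precision 1.0 has the bit pattern 0x3F800000.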
Floating point numbers
Ex.3
; Function: x^3 + 2x - 8 (input x in s1, result returned in s0)

        VMOV.F32 s0, #2.0 ; use s0 to hold 2.0 temporarily
        VMUL.F32 s2, s1, s1 ; initial squaring of input
        VMUL.F32 s3, s1, s0 ; multiply input by 2
        VMOV.F32 s0, #8.0 ; use s0 to hold 8.0 temporarily
        VMUL.F32 s2, s2, s1 ; finish cubing of input
        VSUB.F32 s3, s3, s0 ; subtract off 8.0 from 2x
        VADD.F32 s0, s2, s3 ; add in x^3 to return reg
        BX lr ; return
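The sequence of operations can be traced in Python, rounding to single precision after every step to approximate what each floating point instruction does. The helpers `f32` and `poly` are hypothetical names, and the mapping of the input to s1 and the result to s0 is read from the listing.

```python
import struct

def f32(value):
    """Round a Python float to single precision, as each operation would."""
    return struct.unpack("<f", struct.pack("<f", value))[0]

def poly(x):
    """Evaluate x^3 + 2x - 8 in the same order as the listing."""
    s1 = f32(x)
    s2 = f32(s1 * s1)     # VMUL.F32 s2, s1, s1
    s3 = f32(s1 * 2.0)    # VMUL.F32 s3, s1, s0 with s0 = 2.0
    s2 = f32(s2 * s1)     # VMUL.F32 s2, s2, s1
    s3 = f32(s3 - 8.0)    # VSUB.F32 s3, s3, s0 with s0 = 8.0
    return f32(s2 + s3)   # VADD.F32 s0, s2, s3

print(poly(2.0))
```

For x = 2.0 the result is 8 + 4 - 8 = 4.0.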

Data exchange from one memory location to another

Pointer    Address     Data        Register
R0+R5      20000000    12345678    R3
R1+R5      20000050    15112522    R4

GCD- Lab manual
        AREA MYCODE, CODE, READONLY
        ENTRY
        EXPORT START
START
        LDR R0, =4 ; a = 4
        LDR R1, =2 ; b = 2
LOOP    CMP R0, R1
        BEQ STOP ; a == b: R0 holds the GCD
        BLT LESS ; a < b
        SUBS R0, R0, R1 ; a = a - b
        B LOOP
LESS    SUB R1, R1, R0 ; b = b - a
        B LOOP
STOP    B STOP
        END
Reference
1. Joseph Yiu, The Definitive Guide to ARM Cortex-M3 and Cortex-M4 Processors, 3rd Edition, Elsevier, 2014.
2. William Hohl et al., ARM Assembly Language: Fundamentals and Techniques, CRC Press.