ROP tutorial

This document provides a tutorial on Return-Oriented Programming (ROP) focusing on stack memory allocation, function calls, and how to manipulate the program counter (PC) for arbitrary code execution. It explains the mechanics of function calls, recursive functions, and how to create loops using stack pointer manipulation. Additionally, it includes examples of hackstrings and their behavior in the context of ROP, emphasizing the importance of controlling the stack and understanding function addresses for successful exploitation.

Uploaded by

tahoangphuc111

ROP Tutorial

Stack
The stack is used for automatic (stack-based) memory allocation. Read Wikipedia:
https://en.wikipedia.org/wiki/Stack-based_memory_allocation
Short explanation:
• When a function needs an amount of memory, it decreases the stack pointer (SP) by
that many bytes, and owns the memory pointed to by SP.
• When a function no longer needs that memory, it increases the stack pointer.
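
The two bullets above can be sketched as a toy model in Python. This is only an illustration: the class name ToyStack, its methods, and the starting value of SP are made up here and are not part of the calculator.

```python
class ToyStack:
    """Toy model of a downward-growing stack (not the real nX/U8 hardware)."""

    def __init__(self, sp):
        self.sp = sp

    def allocate(self, nbytes):
        # Taking memory: SP goes down, and the function owns [SP, SP + nbytes).
        self.sp -= nbytes
        return self.sp

    def release(self, nbytes):
        # Giving memory back: SP goes up again.
        self.sp += nbytes


stack = ToyStack(sp=0x8E00)   # arbitrary starting value for the demo
block = stack.allocate(4)
print(hex(block))             # 0x8dfc
stack.release(4)
print(hex(stack.sp))          # 0x8e00
```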

Function call
For the nX/U8 core, when a function calls another function, the return address (the address
of the command right after the call) is stored in the LCSR:LR register, and the arguments are
(often) stored in the registers R0-R15 (and occasionally pushed on the stack, although I can't
recall any example right now).

For example, if the function f, taking one 1-byte argument in r2, has the structure
0:1234 st r2, 08124h
0:1236 rt

and a snippet of code

0:2466 mov r2, #3
0:2468 bl 0:1234h
0:246c mov r0, #0

is executed, the following will happen:

• At 0:2466, r2 is set to decimal value 3.

• At 0:2468, when the command bl 0:1234h is executed,

– The LCSR:LR register value is set to 0:246ch (the address of the command
right after the bl command) (that is, LCSR = 0 and LR = 246ch)
– The CSR:PC register value is set to 0:1234h.
• Next, 0:1234 is executed.

• One command later, at 0:1236, when the rt command is executed, the value of CSR:PC is
set to the value of LCSR:LR (which is 0:246ch).

• Then, the command at 0:246ch is executed.
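
The walkthrough above can be replayed with a small Python sketch. It models only the four registers involved; the class ToyCpu and the insn_len parameter are assumptions made for this illustration (the length 4 is read off the listing, where bl sits at 0:2468 and the next command at 0:246c).

```python
class ToyCpu:
    """Toy model of the call/return registers (not a full nX/U8 emulator)."""

    def __init__(self, csr, pc):
        self.csr, self.pc = csr, pc     # current code segment and program counter
        self.lcsr, self.lr = 0, 0       # link registers written by bl

    def bl(self, target_csr, target_pc, insn_len=4):
        # Save the address of the command right after the bl, then jump.
        self.lcsr, self.lr = self.csr, self.pc + insn_len
        self.csr, self.pc = target_csr, target_pc

    def rt(self):
        # Return: CSR:PC takes the value of LCSR:LR.
        self.csr, self.pc = self.lcsr, self.lr


cpu = ToyCpu(csr=0, pc=0x2468)
cpu.bl(0, 0x1234)
after_bl = (cpu.csr, cpu.pc, cpu.lcsr, cpu.lr)
cpu.pc = 0x1236          # the st command at 0:1234 has run; rt is next
cpu.rt()
after_rt = (cpu.csr, cpu.pc)
print([hex(v) for v in after_bl], [hex(v) for v in after_rt])
```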

Recursive function
Next, when a function a that is called by another function parent needs to call yet another
function child, it needs to remember the value of LCSR:LR from when it was called (by parent),
because the bl to child overwrites LCSR:LR.
That is done by the command
push lr

which allocates 4 bytes of memory on the stack and stores the values of lcsr and lr there.
When the function returns, instead of
pop lr
rt

(which would restore the values of lr and lcsr and then do the normal return), it
executes
pop pc

For more details, read the nX/U8 core instruction manual. But basically, the
LR (2 bytes) is on the top, then the LCSR (1 byte), then an unused byte.
Note that, although the CPU allows up to 16 code/data segments, the calculators only use
2 segments, so only the least significant bit of lcsr is important; all other bits are always 0.
Moreover, because of word alignment of the PC, the least significant bit of the PC is always zero.
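
To make the 4-byte layout concrete, here is a minimal Python sketch of what push lr stores and what pop pc reads back. The function names are made up for this illustration, and the masking follows the two notes above (only bit 0 of lcsr matters, bit 0 of the PC is dropped); the real core may treat the other bits differently.

```python
def push_lr_frame(lcsr, lr, unused=0x00):
    """The 4 bytes stored by push lr, lowest address (top of stack) first."""
    return bytes([lr & 0xFF, lr >> 8, lcsr, unused])


def pop_pc(frame):
    """Return (csr, pc) as loaded by pop pc from 4 bytes on the top of the stack."""
    pc = (frame[0] | (frame[1] << 8)) & 0xFFFE   # word alignment: bit 0 is dropped
    csr = frame[2] & 0x01                        # only 2 segments are used
    return csr, pc


frame = push_lr_frame(lcsr=0, lr=0x246C)
print(frame.hex(" "))        # 6c 24 00 00
print(pop_pc(frame))         # (0, 9324), i.e. 0:246ch
```

Since pop pc simply trusts whatever 4 bytes are on the top of the stack, anything that can write those bytes decides where execution continues.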

So, how does that help with arbitrary code execution?

As we observed, there are a lot of pop pc commands in the code, and those commands set
the program counter to whatever is on the top of the stack. Therefore, if we execute a pop pc
command and can control what is on the stack, we can make the PC jump to an arbitrary
location.
The remaining problem is to write a program in ROP.

Example
As an example, let's try to understand the hackstring that was used to understand how
F030 worked, taken from #286.
Note that you should look up the commands in the disassembly listing of the 570es+/991es+
ROM to follow what I'm saying.
<from #52 character> cv24 M 1 - 0 cv26 X - Int cs23 0 - cv24 M 1 - 0 cv26 cs4
- A 4 0 - ! cs32 0 -
First, convert it to hexadecimal for ease of understanding. You know where to look for a
character table, for example you can use the Javascript code in #301.
?? ?? ?? ... (there are 52 bytes) ... ?? ?? ??
ee 54 31 ?? 30 f0 58 ?? 6a 27 30 ??
ee 54 31 ?? 30 f0 04 ?? 41 34 30 ?? 57 b6 30 ??

The values of ?? are not important, the behavior of the hackstring is the same for all (non-
null) values of ??.
And what does this hackstring do?
First, the 100-byte hackstring is repeated in memory, from the input area (8154h) to the
end of the writable memory (8e00h).
(Why does that happen? Basically, strcpy(0x81b8, 0x8154) gets called, and all 100 bytes
in the range 0x8154..0x81b8 (including 0x8154, excluding 0x81b8) are non-null. The strcpy
in the calculator is implemented like this (pseudo-code):
void strcpy(char* dest, const char* src) {
    while (*src != '\0') {
        *dest = *src;
        ++dest;
        ++src;
    }
}

Because dest is 100 bytes after src, the bytes being read are the ones that were just
written, so the copy keeps going. Fortunately, the memory in the range 0x8e00..0xefff is
non-writable (and so always reads as null), so that doesn't loop forever.)
That is, the byte at 8154h has the value of the first byte in the hackstring, the byte at 8155h
has the value of the second byte in the hackstring, ..., and the byte at address x (8154h <= x <
8e00h) has the value of byte number (x - 8154h) mod 100 (0-indexing) of the hackstring.
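
The repetition can be reproduced with a small Python simulation of the overlapping copy. It is a sketch under assumptions: memory is modelled as a dictionary, everything at or above 8e00h reads as null and ignores writes, and the hackstring content is a placeholder of 100 arbitrary non-null bytes.

```python
INPUT, DEST, RAM_END = 0x8154, 0x81B8, 0x8E00   # addresses taken from the text


def read(mem, addr):
    # Addresses at or above RAM_END are modelled as non-writable and always null.
    return mem.get(addr, 0) if addr < RAM_END else 0


def write(mem, addr, value):
    if addr < RAM_END:
        mem[addr] = value


def calc_strcpy(mem, dest, src):
    # Same loop as the pseudo-code: copy until a null byte is read.
    while read(mem, src) != 0:
        write(mem, dest, read(mem, src))
        dest += 1
        src += 1


hackstring = bytes(range(1, 101))               # placeholder: any 100 non-null bytes
mem = {INPUT + i: b for i, b in enumerate(hackstring)}
calc_strcpy(mem, DEST, INPUT)

periodic = all(mem[x] == hackstring[(x - INPUT) % 100] for x in range(INPUT, RAM_END))
print(periodic)                                  # True
```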

Then, (eventually) when the PC is at address 0:2768h, LR = 0:2768h and SP = 8da4h. As
you can calculate (8da4h - 8154h = 3152, and 3152 mod 100 = 52), the 4 bytes at addresses
8da4h .. 8da7h have the values of bytes 52 .. 55 of the hackstring (0-indexing). Now the
4 bytes ee 54 31 ?? are on the top of the stack.
When the pop pc command is executed, pc = 54eeh and csr = 1. (remember the
endianness)
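
The offset arithmetic and the little-endian decoding above can be checked with a few lines of Python. The names INPUT, LENGTH and hackstring_offset are made up for this illustration, and 0xff stands in for the ?? byte.

```python
INPUT = 0x8154   # start of the input area, from the text
LENGTH = 100     # length of the hackstring


def hackstring_offset(addr):
    """0-indexed position in the hackstring of the byte stored at addr."""
    return (addr - INPUT) % LENGTH


# Bytes 52..55 of the hackstring; 0xff is a stand-in for the "don't care" ?? byte.
top_of_stack = bytes.fromhex("ee 54 31 ff")

offset = hackstring_offset(0x8DA4)
pc = top_of_stack[0] | (top_of_stack[1] << 8)   # little-endian: low byte first
csr = top_of_stack[2] & 0x01                    # only the lowest bit matters

print(offset, f"{csr}:{pc:04x}h")               # 52 1:54eeh
```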
When the command at 1:54eeh is executed, er0 = 0f030h and r2 = 58h.
... hopefully you can deduce what will happen next, at least until the command 0:154f0h is
executed the second time. You should be able to figure out that the top of the stack at that
time is
41 34 30 ?? 57 b6 30 ??

To understand the remaining 8 bytes of the hackstring, you need to know about some
function addresses.
First, remember that a function which calls another function must start with push lr
(there may be some commands that do not change lr before that) and end with pop pc.
So, if we make the pc point right after the push lr command, eventually a pop pc will be
executed with the same sp, and the 4 bytes on the top of the stack are not changed (assuming
the function really wants to return to its caller). That way we can keep control
of the pc.
So, some useful functions in the calculator:
• 0:343eh: A function that, given an address in er0 and a number in r2, prints r2
lines on the screen using the string pointed to by er0.
• 0:b654h: A function taking no parameter and returning nothing (i.e., void
f(void)), which blinks the cursor and waits for the user to press the Shift key before
returning.

So the 8 bytes
41 34 30 ?? 57 b6 30 ??

(in the above hackstring) call those two functions in order.

Note that I just mentioned the multiline print function is at 0:343eh. Why use the bytes
41 34 30 (so that pop pc sets the pc to 0:3441h)?

First, the command actually executed is at 0:3440h: because of word alignment, the value of pc
is always even. (Not so for sp, be careful!)
Now assume we set the pc to 0:343eh. It will push the value of lr on the top of the stack,
then execute some commands until 0:3478h, and then set the value of pc to the value of lr
we pushed earlier. That is not what we want (as we can't set lr to a desired value).
That's also the reason why we don't like gadgets ending with rt.
Instead, we jump to the command right after the push lr. That way the pop pc will pop
the top of the stack at that time, and we can control the value of pc.
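
As a quick check, the following Python sketch decodes the two 4-byte groups and compares each target with the function entry plus the size of push lr. PUSH_LR_SIZE = 2 is an assumption for this illustration (it matches 0:343eh versus 0:3440h, and it presumes push lr is the very first command of both functions); 0xff stands in for the ?? bytes.

```python
def pop_pc(frame):
    """(csr, pc) loaded by pop pc; bit 0 of the PC is dropped by word alignment."""
    lr = frame[0] | (frame[1] << 8)
    return frame[2] & 0x01, lr & 0xFFFE


PUSH_LR_SIZE = 2   # assumption: push lr is 2 bytes long and comes first

functions = [
    ("multiline print", 0x343E, "41 34 30 ff"),
    ("wait for Shift",  0xB654, "57 b6 30 ff"),
]

targets = []
for name, entry, raw in functions:
    csr, pc = pop_pc(bytes.fromhex(raw))
    targets.append((csr, pc))
    skips_push = pc == entry + PUSH_LR_SIZE
    print(f"{name}: jumps to {csr}:{pc:04x}h, right after push lr: {skips_push}")
```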
(Some notes regarding this:
(Summary: jumping to the command right after push lr is always correct, but other options
may also be correct.)
1. If we jump to a command before the push lr, the pc will often jump to some
unexpected place, as the command right before push lr is often pop pc or rt.
2. If we jump to the push lr command, the value of pc after the corresponding pop pc
will depend on the value of lr at the start of the function. This is only useful if we can
control the value of lr. A similar situation happens if we jump to the beginning of (or
inside) a function returning with rt: we need to control the value of lr.
3. If we jump to the command right after the push lr, the function is executed and the
corresponding pop pc will pop the top of the stack. Good.
4. If we jump to some position after the push lr, the function body is executed except
for some commands at the start.
Typically, option (3) is used because it's the simplest, but occasionally option (2) or (4)
should be used instead if typing them into the calculator takes significantly fewer keystrokes.
I will (try to) explain this part later if some hackstring uses that.
)

Loops
Next, about loops.
The only way to loop in return-oriented programming is to modify the value of the stack
pointer sp. If you search for sp in the disassembly, you can see that the only commands
that modify sp and are sufficiently near a (later) rt or pop pc (such that we can easily reason
about what will happen when the commands between the one modifying sp and the
return command are executed) are:
• mov sp, er14 (often followed by pop er14 and pop pc)
• add sp, #... (a positive amount means popping that many bytes from the stack)

Only the first one is (currently) used for looping.

So, to loop we should:
• When sp = A and a pop pc command is executed, execute something that doesn't
modify the stack.
• Execute a gadget with pop er14; pop pc, with the value A-2 on the top of the stack.
• Execute a gadget with mov sp, er14; pop er14; pop pc.
(It's possible to loop with something that does modify the stack; more information later.)
When the last gadget mov sp, er14; pop er14; pop pc is executed, the following
happens:
• mov sp, er14: sp is set to the value A - 2.
• pop er14: sp is increased by 2 bytes, so sp = A. er14 contains whatever was in those 2 bytes.
• pop pc: pc takes the value stored at A.

This is an infinite loop. I have never written a conditional statement/loop, but it should be
possible with some table lookup/etc.
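
The three steps can be traced with a toy stack machine in Python. Everything concrete here is hypothetical: the gadget addresses GADGET_BODY, GADGET_POP14 and GADGET_PIVOT, the value of A, and the class ToyMachine exist only for this sketch, and the code segment is ignored.

```python
import struct

A = 0x8BB0              # hypothetical stack address where the loop starts
GADGET_BODY = 0x1000    # hypothetical: does some work; pop pc
GADGET_POP14 = 0x2000   # hypothetical: pop er14; pop pc
GADGET_PIVOT = 0x3000   # hypothetical: mov sp, er14; pop er14; pop pc


def frame(pc):
    # The 4 bytes consumed by pop pc: LR low, LR high, LCSR, unused byte.
    return struct.pack("<HBB", pc, 0, 0)


chain = (frame(GADGET_BODY) + frame(GADGET_POP14)
         + struct.pack("<H", A - 2) + frame(GADGET_PIVOT))
stack = {A + i: b for i, b in enumerate(chain)}
stack[A - 2] = stack[A - 1] = 0   # the 2 bytes below A that the pivot's pop er14 discards


class ToyMachine:
    def __init__(self, sp):
        self.sp, self.pc, self.er14, self.body_runs = sp, 0, 0, 0

    def pop16(self):
        value = stack[self.sp] | (stack[self.sp + 1] << 8)
        self.sp += 2
        return value

    def pop_pc(self):
        self.pc = self.pop16() & 0xFFFE
        self.sp += 2                  # skip LCSR and the unused byte

    def step(self):
        if self.pc == GADGET_BODY:
            self.body_runs += 1       # stands in for work that leaves the stack alone
        elif self.pc == GADGET_POP14:
            self.er14 = self.pop16()
        elif self.pc == GADGET_PIVOT:
            self.sp = self.er14       # mov sp, er14
            self.er14 = self.pop16()  # pop er14
        self.pop_pc()                 # every gadget ends with pop pc


m = ToyMachine(sp=A)
m.pop_pc()                # the initial pop pc executed with sp = A
for _ in range(9):        # 9 gadgets = 3 trips around the loop
    m.step()
print(m.body_runs, hex(m.pc), hex(m.sp))
```

After every third gadget the machine is back in the same state (pc at the body gadget, sp = A + 4), which is why the chain never terminates.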

Example:
Using the hackstring in #290:
<52 characters> cv24 M 1 - Fvar cv26 cv40 - Int cs23 0 - tan-1 D 0 - cs26
cv26 cs16 D 1 - cv12 = 0 - sin 2 0 - 0 cv34 Int cs23 0 - (-) cs32 0 - frac
Ans ^ cs32 0 - <2 remaining characters>

Hexadecimal:
?? ... (52 bytes) ... ??
ee 54 31 ?? 46 f0 fe ?? 6a 27 30 ?? b2 44 30 ?? 40 f0
1d 44 31 ?? e2 3d 30 ?? a0 32 30 ?? 30 f8
6a 27 30 ?? 60 b6 30 ?? ae 8b 5e b6 30 ??
?? ??

Only the 10 bytes before the final ?? ?? are related to looping.

60 b6 30 ?? corresponds to the address 0:b660h. The commands from 0:b660h to 0:b662h
are executed.
Then, ae 8b is popped into er14 (that is, er14 = 8baeh).
5e b6 30 ?? corresponds to the address 0:b65eh. The commands from 0:b65eh to 0:b662h
are executed, which effectively jumps to the start of the loop.
There are a lot of mov sp, er14; pop er14; pop pc command sequences.
Choose the one that you find easiest to type.
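
To see where this loop lands, the Python sketch below pulls er14 out of the hexadecimal listing and maps the resulting stack address back to a hackstring offset. It assumes the hackstring is again repeated through memory from 8154h with period 100, as in the first example; 0xff stands in for every ?? byte, and the variable names are made up for this illustration.

```python
INPUT, LENGTH = 0x8154, 100

# Bytes 52..99 of the hackstring from #290; 0xff replaces every "don't care" ??.
tail = bytes.fromhex(
    "ee 54 31 ff 46 f0 fe ff 6a 27 30 ff b2 44 30 ff 40 f0 "
    "1d 44 31 ff e2 3d 30 ff a0 32 30 ff 30 f8 "
    "6a 27 30 ff 60 b6 30 ff ae 8b 5e b6 30 ff "
    "ff ff"
)

loop_bytes = tail[-12:-2]                    # the 10 bytes before the final ?? ??
er14 = loop_bytes[4] | (loop_bytes[5] << 8)  # ae 8b, little-endian
loop_start = er14 + 2                        # sp after mov sp, er14; pop er14
offset = (loop_start - INPUT) % LENGTH

print(hex(er14), hex(loop_start), offset)    # 0x8bae 0x8bb0 52
```

Under that assumption the final pop pc reads byte 52 of the hackstring again (ee 54 31 ??), so the whole chain restarts from an earlier copy in memory.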

Loops which modify the stack

What is "modify the stack" and what is "not modify the stack"? This is pretty self-explanatory.
Note:
• A pop command only increases the stack pointer; it does not change the values on the
stack.
• A push command often modifies the stack, unless it's possible to prove that the stack
value is always equal to the pushed value. That includes push lr.
So, to loop we should:
• When sp = A and a pop pc command is executed, execute something that may
modify the stack.
• Restore the stack to its original content.
• Execute a gadget with pop er14; pop pc, with the value A-2 on the top of the stack.
• Execute a gadget with mov sp, er14; pop er14; pop pc.
Typically the stack is restored to its original content by calling the null-terminated string
copy function on a 100-byte region, before the used stack content, that is not modified. As
it's quite hard to determine which region is "not modified", this is often done by trial and
error.
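
The restore step can be illustrated with a Python simulation. It is only a sketch under assumptions: the damaged range and the address of the clean copy are picked arbitrarily, the hackstring content is a placeholder, and on the real calculator finding a region that is truly unmodified is the hard part.

```python
INPUT, RAM_END, LENGTH = 0x8154, 0x8E00, 100

hackstring = bytes(range(1, 101))            # placeholder: any 100 non-null bytes
mem = bytearray(0x10000)                     # everything outside the copies reads as null
for x in range(INPUT, RAM_END):
    mem[x] = hackstring[(x - INPUT) % LENGTH]
original = bytes(mem)

# Hypothetical damage: some gadget pushed values over part of the chain.
for x in range(0x8BA0, 0x8BB0):
    mem[x] = 0xAA
damaged = bytes(mem) != original


def calc_strcpy(dest, src):
    # Same loop as the calculator's strcpy; writes outside RAM are ignored.
    while mem[src] != 0:
        if dest < RAM_END:
            mem[dest] = mem[src]
        dest += 1
        src += 1


clean = INPUT + 20 * LENGTH                  # hypothetical untouched copy at 0x8924
calc_strcpy(clean + LENGTH, clean)
restored = bytes(mem) == original
print(damaged, restored)                     # True True
```

The copy works for the same reason the hackstring was repeated in the first place: the destination runs 100 bytes ahead of the source, so each damaged byte is rewritten before it is read.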

Example

TODO

TODO: Is there any unclear part (that you can't understand)?

from user202729
