0% found this document useful (0 votes)

96 views

Quick Primer On LLVM IR: (For Those Already Familiar With LLVM IR, Feel Free To)

The document provides an overview of LLVM IR, including: - LLVM IR is a low-level intermediate representation used by the LLVM compiler framework that is platform-independent. - Compilers are split into front-end, middle-end, and back-end components that take LLVM IR as input/output and optimize/compile it. - LLVM IR examples are shown for a simple C program, demonstrating its static typing and use of registers.

Uploaded by

Chinmay Agnihotri

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

96 views

Quick Primer On LLVM IR: (For Those Already Familiar With LLVM IR, Feel Free To)

Uploaded by

Chinmay Agnihotri

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 13

Quick primer on LLVM IR

(For those already familiar with LLVM IR, feel free to jump to the next
section).

LLVM IR is a low-level intermediate representation used by the LLVM

compiler framework. You can think of LLVM IR as a platform-
independent assembly language with an infinite number of function
local registers.

When developing compilers there are huge benefits with compiling

your source language to an intermediate representation (IR) 1 instead of
compiling directly to a target architecture (e.g. x86). As many
optimization techniques are general (e.g. dead code elimination,
constant propagation), these optimization passes may be performed
directly on the IR level and thus shared between all targets 2.

Compilers are therefore often split into three components, the front-
end, middle-end and back-end; each with a specific task that takes IR
as input and/or produces IR as output.

 Front-end: compiles source language to IR.

 Middle-end: optimizes IR.
 Back-end: compiles IR to machine code.

Example program in LLVM IR assembly

To get a glimpse of what LLVM IR assembly may look like, lets consider
the following C program.
1
2 int f(int a, int b) {
3 return a + 2*b;
4}
5
6 int main() {
7 return f(10, 20);
}

Using Clang3, the above C code compiles to the following LLVM IR

assembly.

1
2
3 define i32 @f(i32 %a, i32 %b) {
4 ; <label>:0
5 %1 = mul i32 2, %b
6 %2 = add i32 %a, %1
7 ret i32 %2
8 }
9
1 define i32 @main() {
0 ; <label>:0
1 %1 = call i32 @f(i32 10, i32 20)
1 ret i32 %1
1 }
2

By looking at the LLVM IR assembly above, we may observe a few

noteworthy details about LLVM IR, namely:

 LLVM IR is statically typed (i.e. 32-bit integer values are denoted

with the i32 type).
 Local variables are scoped to each function (i.e. %1 in
the @main function is different from %1 in the @f function).
 Unnamed (temporary) registers are assigned local IDs
(e.g. %1, %2) from an incrementing counter in each function.
 Each function may use an infinite number of registers (i.e. we are
not limited to 32 general purpose registers).
 Global identifiers (e.g. @f) and local identifiers (e.g. %a, %1) are
distinguished by their prefix (@ and %, respectively).
 Most instructions do what you’d think, mul performs
multiplication, add addition, etc.
 Line comments are prefixed with ; as is quite common for
assembly languages.

The structure of LLMV IR assembly

The contents of an LLVM IR assembly file denotes a module. A module

contains zero or more top-level entities, such as global
variables and functions.

A function declaration contains zero basic blocks and a function

definition contains one or more basic blocks (i.e. the body of the
function).

A more detailed example of an LLVM IR module is given below,

including the global definition @foo and the function
definition @f containing three basic blocks
(%entry, %block_1 and %block_2).

1 ; Global variable initialized to the 32-bit integer value 21.

2 @foo = global i32 21
3
4 ; f returns 42 if the condition cond is true, and 0 otherwise.
5 define i32 @f(i1 %cond) {
6 ; Entry basic block of function containing zero non-branching instructions and a
7 ; conditional branching terminator instruction.
8 entry:
9 ; The conditional br terminator transfers control flow to block_1 if %cond
1 ; is true, and to block_2 otherwise.
0 br i1 %cond, label %block_1, label %block_2
1
1 ; Basic block containing two non-branching instructions and a return terminator.
1 block_1:
2 %tmp = load i32, i32* @foo
1 %result = mul i32 %tmp, 2
3 ret i32 %result
1
4 ; Basic block with zero non-branching instructions and a return terminator.
1 block_2:
5 ret i32 0
1 }
6
1
7
1
8
1
9
2
0
2
1
2
2

Basic block

A basic block is a sequence of zero or more non-branching instructions

followed by a branching instruction (referred to as the terminator
instruction). The key idea behind a basic block is that if a single
instruction of the basic block is executed, then all instructions of the
basic block are executed. This notion simplifies control flow analysis.

Instruction

An instruction is a non-branching LLVM IR instruction, usually

performing a computation or accessing memory (e.g. add, load), but
not changing the control flow of the program.

Terminator instruction

A terminator instruction is at the end of each basic block, and

determines where to transfer control flow once the basic block finishes
executing. For instance ret terminators returns control flow back to the
caller function, and br terminators branches control flow either
conditionally or unconditionally.

Static Single Assignment form

One very important property of LLVM IR is that it is in SSA-form (Static

Single Assignment), which essentially means that each register is
assigned exactly once. This property simplifies data flow analysis.

To handle variables that are assigned more than once in the original
source code, a notion of phi instructions are used in LLVM IR.
A phi instruction essentially returns one value from a set of incoming
values, based on the control flow path taken during execution to reach
the phi instruction. Each incoming value is therefore associated with a
predecessor basic block.

For a concrete example, consider the following LLVM IR function.

1
2
3
4
5
6 define i32 @f(i32 %a) {
7 ; <label>:0
8 switch i32 %a, label %default [
9 i32 42, label %case1
1 ]
0
1 case1:
1 %x.1 = mul i32 %a, 2
1 br label %ret
2
1 default:
3 %x.2 = mul i32 %a, 3
1 br label %ret
4
1 ret:
5 %x.0 = phi i32 [ %x.2, %default ], [ %x.1, %case1 ]
1 ret i32 %x.0
6 }
1
7
1
8

The phi instruction (sometimes referred to as phi nodes) in the above

example essentially models the set of possible incoming values as
distinct assignment statements, exactly one of which is executed
based on the control flow path taken to reach the basic block of
the phi instruction during execution. One way to illustrate the
corresponding data flow is as follows:

In general, when developing compilers which translates source code

into LLVM IR, all local variables of the source code may be transformed
into SSA-form, with the exception of variables of which the address is
taken.

To simplify the implementation of LLVM front-ends, one

recommendation is to model local variables in the source language as
memory allocated variables (using alloca), model assignments to local
variables as store to memory, and uses of local variables as load from
memory. The reason for this is that it may be non-trivial to directly
translate a source language into LLVM IR in SSA-form. As long as the
memory accesses follows certain patters, we may then rely on
the mem2reg LLVM optimization pass to translate memory allocate local
variables to registers in SSA-form (using phi nodes where necessary).

LLVM IR library in pure Go

The two main libraries for working with LLVM IR in Go are:

 llvm.org/llvm/bindings/go/llvm: the official LLVM bindings for the

Go programming language.
 github.com/llir/llvm: a pure Go library for interacting with LLVM
IR.

The official LLVM bindings for Go uses Cgo to provide access to the rich
and powerful API of the LLVM compiler framework, while
the llir/llvm project is entirely written in Go and relies on LLVM IR to
interact with the LLVM compiler framework.

This post focuses on llir/llvm, but should generalize to working with

other libraries as well.

Why write a new library?

The primary motivation for developing a pure Go library for interacting

with LLVM IR was to make it more fun to code compilers and static
analysis tools that rely on and interact with the LLVM compiler
framework. In part because the compile time of projects relying on the
official LLVM bindings for Go could be quite substantial (Thanks
to @aykevl, the author of TinyGo, there are now ways to speed up the
compile time by dynamically linking against a system-installed version
of LLVM4).
Another leading motivation was to try and design an idiomatic Go API
from the ground up. The main difference between the API of the LLVM
bindings for Go and llir/llvm is how LLVM values are modelled. In the
LLVM bindings for Go, LLVM values are modelled as a concrete struct
type, which essentially contains every possible method of every
possible LLVM value. My personal experience with using this API is that
it was difficult to know what subsets of methods you were allowed to
invoke for a given value. For instance, to retrieve the Opcode of an
instruction, you’d invoke the InstructionOpcode method – which is
quite intuitive. However, if you happen to invoke the Opcode method
instead (which is used to retrieve the Opcode of constant expressions),
you’d get the runtime errors “cast<Ty>() argument of incompatible
type!”.

The llir/llvm library was therefore designed to provide compile time

guarantees by further relying on the Go type system. LLVM values
in llir/llvm are modelled as an interface type. This approach only
exposes the minimum set of methods shared by all values, and if you
want to access more specific methods or fields, you’d use a type
switch (as illustrated in the analysis example below).

Usage examples

Now, lets consider a few concrete usage examples. Given that we have
a library to work with, what may we wish to do with LLVM IR?

Firstly, we may want to parse LLVM IR produced by other tools, such as

Clang and the LLVM optimizer opt (see the input example below).

Secondly, we may want to process LLVM IR to perform analysis of our

own (e.g. custom optimization passes) or implement interpreters and
Just-in-Time compilers (see the analysis example below).

Thirdly, we may want to produce LLVM IR to be consumed by other

tools. This is the approach taken when developing a front-end for a
new programming language (see the output example below).

Input example - Parsing LLVM IR

1 // This example program parses an LLVM IR assembly file, and prints the parsed
2 // module to standard output.
3 package main
4
5
6
7
8
9
1
0
1 import (
1 "fmt"
1
2 "github.com/llir/llvm/asm"
1 )
3
1 func main() {
4 // Parse LLVM IR assembly file.
1 m, err := asm.ParseFile("foo.ll")
5 if err != nil {
1 panic(err)
6 }
1 // process, interpret or optimize LLVM IR.
7
1 // Print LLVM IR module.
8 fmt.Println(m)
1 }
9
2
0
2
1

Analysis example - Processing LLVM IR

1 // This example program analyses an LLVM IR module to produce a callgraph in

2 // Graphviz DOT format.
3 package main
4
5 import (
6 "bytes"
7 "fmt"
8 "io/ioutil"
9
1 "github.com/llir/llvm/asm"
0 "github.com/llir/llvm/ir"
1 )
1
1 func main() {
2 // Parse LLVM IR assembly file.
1 m, err := asm.ParseFile("foo.ll")
3 if err != nil {
1 panic(err)
4 }
1 // Produce callgraph of module.
5 callgraph := genCallgraph(m)
1 // Output callgraph in Graphviz DOT format.
6 if err := ioutil.WriteFile("callgraph.dot", callgraph, 0644); err != nil {
1 panic(err)
7 }
1 }
8
1 // genCallgraph returns the callgraph in Graphviz DOT format of the given LLVM IR
9 // module.
2 func genCallgraph(m *ir.Module) []byte {
0 buf := &bytes.Buffer{}
2 buf.WriteString("digraph {\n")
1 // For each function of the module.
2 for _, f := range m.Funcs {
2 // Add caller node.
2 caller := f.Ident()
3 fmt.Fprintf(buf, "\t%q\n", caller)
2 // For each basic block of the function.
4 for _, block := range f.Blocks {
2 // For each non-branching instruction of the basic block.
5 for _, inst := range block.Insts {
2 // Type switch on instruction to find call instructions.
6 switch inst := inst.(type) {
2 case *ir.InstCall:
7 callee := inst.Callee.Ident()
2 // Add edges from caller to callee.
8 fmt.Fprintf(buf, "\t%q -> %q\n", caller, callee)
2 }
9 }
3 // Terminator of basic block.
0 switch term := block.Term.(type) {
3 case *ir.TermRet:
1 // do something.
3 _ = term
2 }
3 }
3 }
3 buf.WriteString("}")
4 return buf.Bytes()
3 }
5
3
6
3
7
3
8
3
9
4
0
4
1
4
2
4
3
4
4
4
5
4
6
4
7
4
8
4
9
5
0
5
1
5
2
5
3
5
4
5
5
5
6
5
7
5
8
5
9
6
0

Output example - Producing LLVM IR

1 // This example produces LLVM IR code equivalent to the following C code, which
2 // implements a pseudo-random number generator.
3 //
4 // int abs(int x);
5 //
6 // int seed = 0;
7 //
8 // // ref: https://en.wikipedia.org/wiki/Linear_congruential_generator
9 // // a = 0x15A4E35
1 // // c = 1
0 // int rand(void) {
1 // seed = seed*0x15A4E35 + 1;
1 // return abs(seed);
1
2
1 // }
3 package main
1
4 import (
1 "fmt"
5
1 "github.com/llir/llvm/ir"
6 "github.com/llir/llvm/ir/constant"
1 "github.com/llir/llvm/ir/types"
7 )
1
8 func main() {
1 // Create convenience types and constants.
9 i32 := types.I32
2 zero := constant.NewInt(i32, 0)
0 a := constant.NewInt(i32, 0x15A4E35) // multiplier of the PRNG.
2 c := constant.NewInt(i32, 1) // increment of the PRNG.
1
2 // Create a new LLVM IR module.
2 m := ir.NewModule()
2
3 // Create an external function declaration and append it to the module.
2 //
4 // int abs(int x);
2 abs := m.NewFunc("abs", i32, ir.NewParam("x", i32))
5
2 // Create a global variable definition and append it to the module.
6 //
2 // int seed = 0;
7 seed := m.NewGlobalDef("seed", zero)
2
8 // Create a function definition and append it to the module.
2 //
9 // int rand(void) { ... }
3 rand := m.NewFunc("rand", i32)
0
3 // Create an unnamed entry basic block and append it to the `rand` function.
1 entry := rand.NewBlock("")
3
2 // Create instructions and append them to the entry basic block.
3 tmp1 := entry.NewLoad(seed)
3 tmp2 := entry.NewMul(tmp1, a)
3 tmp3 := entry.NewAdd(tmp2, c)
4 entry.NewStore(tmp3, seed)
3 tmp4 := entry.NewCall(abs, tmp3)
5 entry.NewRet(tmp4)
3
6 // Print the LLVM IR assembly of the module.
3 fmt.Println(m)
7 }
3
8
3
9
4
0
4
1
4
2
4
3
4
4
4
5
4
6
4
7
4
8
4
9
5
0
5
1
5
2
5
3
5
4
5
5
5
6
5
7
5
8
5
9
6
0
6
1
6
2
6
3

Closing notes
The design and implementation of llir/llvm has been guided by a
community of people who have contributed – not only by writing code –
but through shared discussions, pair-programming sessions, bug
hunting, profiling investigations, and most of all, a curiosity for learning
and taking on exciting challenges.

One particularly challenging part of the llir/llvm project has been to

construct an EBNF grammar for LLVM IR covering the entire LLVM IR
assembly language as of LLVM v7.0. This was challenging, not because
the process itself is difficult, but because there existed no official
grammar covering the entire language. Several community projects
have attempted to define a formal grammar for LLVM IR assembly, but
these have, to the best of our knowledge, only covered subsets of the
language.

The exciting part of having a grammar for LLVM IR is that it enables a

lot of interesting projects. For instance, generating syntactically valid
LLVM IR assembly to be used for fuzzing tools and libraries consuming
LLVM IR (the same approach as taken by GoSmith). This could be used
for cross-validation efforts between LLVM projects implemented in
different languages, and also help tease out potential security
vulnerabilities and bugs in implementations.

Programming With Miranda
No ratings yet
Programming With Miranda
312 pages
Jeffrey Dean CSE Summa Sum1990
No ratings yet
Jeffrey Dean CSE Summa Sum1990
34 pages
Tiger Language
No ratings yet
Tiger Language
52 pages
PrecisionRTL Style
No ratings yet
PrecisionRTL Style
345 pages
Spring Framework Reference Documentation PDF
No ratings yet
Spring Framework Reference Documentation PDF
1,194 pages
LLVM
No ratings yet
LLVM
474 pages
The LLVM Compiler Framework and Infrastructure
No ratings yet
The LLVM Compiler Framework and Infrastructure
61 pages
C Programming: Core Concepts and Techniques
From Everand
C Programming: Core Concepts and Techniques
William Smith
No ratings yet
Object Files in LLVM
No ratings yet
Object Files in LLVM
19 pages
Developed by University of Illinois at Urbana-Champaign CIS Dept Cisc 471 Matthew Warner
No ratings yet
Developed by University of Illinois at Urbana-Champaign CIS Dept Cisc 471 Matthew Warner
9 pages
Compiler Design Code Optimization
No ratings yet
Compiler Design Code Optimization
5 pages
Three Address Code
100% (1)
Three Address Code
19 pages
Target Code Generation: Utkarsh Jaiswal 11CS30038
No ratings yet
Target Code Generation: Utkarsh Jaiswal 11CS30038
15 pages
Unit-Iv: Mathematics, Biology and Computers For Chemists
No ratings yet
Unit-Iv: Mathematics, Biology and Computers For Chemists
90 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
62 pages
3.1 Static Random Access Memory (SRAM)
No ratings yet
3.1 Static Random Access Memory (SRAM)
6 pages
Chapter - 2 Instruction Set Architecture 2.1 Memory Locations and Addresses
No ratings yet
Chapter - 2 Instruction Set Architecture 2.1 Memory Locations and Addresses
11 pages
Compiler Design
No ratings yet
Compiler Design
14 pages
Thesis Hisham PDF
No ratings yet
Thesis Hisham PDF
151 pages
T Diagrams
100% (1)
T Diagrams
22 pages
An Introduction To GCC For The GNU Compilers GCC and G Revised and Updated
No ratings yet
An Introduction To GCC For The GNU Compilers GCC and G Revised and Updated
89 pages
Mos Ram
No ratings yet
Mos Ram
15 pages
Antlr C Sharp Code Generation Using Visual
No ratings yet
Antlr C Sharp Code Generation Using Visual
8 pages
Types of Modelling PDF
No ratings yet
Types of Modelling PDF
14 pages
Fulltext01 PDF
No ratings yet
Fulltext01 PDF
168 pages
Compiler Record
No ratings yet
Compiler Record
48 pages
Dynamic Code Generation With Java Compiler API in Java 6: by Swaminathan Bhaskar 10/10/2009
No ratings yet
Dynamic Code Generation With Java Compiler API in Java 6: by Swaminathan Bhaskar 10/10/2009
14 pages
Cfengine 3 Concepts Guide
No ratings yet
Cfengine 3 Concepts Guide
62 pages
Gretl Guide
No ratings yet
Gretl Guide
336 pages
Life Cycle of Source Program Compiler Design
No ratings yet
Life Cycle of Source Program Compiler Design
10 pages
Instruction Scheduler in LLVM
No ratings yet
Instruction Scheduler in LLVM
20 pages
Z-Functional Programming in Haskell
No ratings yet
Z-Functional Programming in Haskell
26 pages
Phyml Maximum Likelihood Trees
No ratings yet
Phyml Maximum Likelihood Trees
37 pages
Vlsi Implementation of 32KB Sleepy Sram Thesis
No ratings yet
Vlsi Implementation of 32KB Sleepy Sram Thesis
50 pages
Haskell Design Patterns - Sample Chapter
No ratings yet
Haskell Design Patterns - Sample Chapter
27 pages
tc1 6 Architecture Vol1
100% (1)
tc1 6 Architecture Vol1
225 pages
Smart Syntax Highlighting For Dynamic Language Case: Common Lisp in Emacs
No ratings yet
Smart Syntax Highlighting For Dynamic Language Case: Common Lisp in Emacs
61 pages
Lecture 8 VHDL Test Benches
No ratings yet
Lecture 8 VHDL Test Benches
101 pages
Csc3205-Symbol-Table
100% (1)
Csc3205-Symbol-Table
13 pages
Tutorial LLV M Back End Cpu 0
No ratings yet
Tutorial LLV M Back End Cpu 0
605 pages
Kaleidoscope - Implementing A Language With LLVM in Objective Caml
No ratings yet
Kaleidoscope - Implementing A Language With LLVM in Objective Caml
142 pages
Write An LLVMBackend Tutorial For Cpu 0
No ratings yet
Write An LLVMBackend Tutorial For Cpu 0
189 pages
Module 3 Problem Solving and Reasoning
No ratings yet
Module 3 Problem Solving and Reasoning
31 pages
Compiler Writing Tools
100% (2)
Compiler Writing Tools
17 pages
Create New Language
No ratings yet
Create New Language
26 pages
COA_Module4
No ratings yet
COA_Module4
19 pages
Wolfe, Cave-Guided Search. An Alternative To The Feature Integration Model of Visual Search
No ratings yet
Wolfe, Cave-Guided Search. An Alternative To The Feature Integration Model of Visual Search
15 pages
An Implementation of Smart Contracts by PDF
No ratings yet
An Implementation of Smart Contracts by PDF
9 pages
24 Steps of Compiler Design
No ratings yet
24 Steps of Compiler Design
11 pages
RTL Coding Styles That Yield Simulation and Synthesis Mismatches
No ratings yet
RTL Coding Styles That Yield Simulation and Synthesis Mismatches
15 pages
Concrete Semantics With Isabelle/HOL
No ratings yet
Concrete Semantics With Isabelle/HOL
308 pages
Verilog Tutorial: Fall 2005-2006
No ratings yet
Verilog Tutorial: Fall 2005-2006
25 pages
3
No ratings yet
3
14 pages
Best Coding Practices in C
No ratings yet
Best Coding Practices in C
11 pages
Resource Reservation System
No ratings yet
Resource Reservation System
75 pages
Two Pass Assembler Code
No ratings yet
Two Pass Assembler Code
4 pages
Elementary Cellular Automata
No ratings yet
Elementary Cellular Automata
16 pages
The Next 700 Programming Languages
100% (1)
The Next 700 Programming Languages
10 pages
Barbara Liskov, Programming With Abstract Data Types
100% (1)
Barbara Liskov, Programming With Abstract Data Types
10 pages
NW.js Essentials
From Everand
NW.js Essentials
Alessandro Benoit
No ratings yet
Mastering Nim Programming: High-Performance Metaprogramming and Compile-Time Execution
From Everand
Mastering Nim Programming: High-Performance Metaprogramming and Compile-Time Execution
Robert Johnson
No ratings yet
VOS3000 Web Interface Developing Manual
No ratings yet
VOS3000 Web Interface Developing Manual
198 pages
Vsphere Automation SDK 65 Net Programming Guide
No ratings yet
Vsphere Automation SDK 65 Net Programming Guide
70 pages
Hardeep Singh Resume
No ratings yet
Hardeep Singh Resume
6 pages
PowerFactory API v3
No ratings yet
PowerFactory API v3
80 pages
Deepti Project Report
No ratings yet
Deepti Project Report
63 pages
Card Issuance System Selection Guide: Access
No ratings yet
Card Issuance System Selection Guide: Access
16 pages
Ribbons 1 1
No ratings yet
Ribbons 1 1
126 pages
Real Time Data Viewer With Witsml
No ratings yet
Real Time Data Viewer With Witsml
99 pages
Resume Ankit Rai
No ratings yet
Resume Ankit Rai
2 pages
Report of Ug Siemens NX Modeling
No ratings yet
Report of Ug Siemens NX Modeling
22 pages
Keerthi P: Contact: 832-856-2493
No ratings yet
Keerthi P: Contact: 832-856-2493
8 pages
Ole
No ratings yet
Ole
5 pages
LightScribe Public Windows SDK Documentation
100% (1)
LightScribe Public Windows SDK Documentation
23 pages
EAST System Architecture
No ratings yet
EAST System Architecture
76 pages
Making VAT Digital HMRC Info 29-5-19
No ratings yet
Making VAT Digital HMRC Info 29-5-19
2 pages
Detecting Malicious Facebook Applications: Ntroduction
No ratings yet
Detecting Malicious Facebook Applications: Ntroduction
99 pages
Implementation of DCM Module For AUTOSAR Version 4.0: Deepika C. K., Bjyu G., Vishnu V. S
No ratings yet
Implementation of DCM Module For AUTOSAR Version 4.0: Deepika C. K., Bjyu G., Vishnu V. S
8 pages
Java Card
No ratings yet
Java Card
21 pages
Omni Flow Computer 3000-6000
100% (1)
Omni Flow Computer 3000-6000
2 pages
Studies On Application of Cloud Computing Techniques in GIS: YANG Jinnan, WU Sheng
No ratings yet
Studies On Application of Cloud Computing Techniques in GIS: YANG Jinnan, WU Sheng
4 pages
Docu46538 White Paper EMC Documentum D2 External Widgets
No ratings yet
Docu46538 White Paper EMC Documentum D2 External Widgets
15 pages
Sri Ramakrishna Institute of Technology: Academic Year 2015 - 16
No ratings yet
Sri Ramakrishna Institute of Technology: Academic Year 2015 - 16
5 pages
CSA Guide v4 FINAL PDF
100% (2)
CSA Guide v4 FINAL PDF
152 pages
Control-M 9 Ports Diagram PDF
50% (2)
Control-M 9 Ports Diagram PDF
1 page
TeamViewer API Documentation
No ratings yet
TeamViewer API Documentation
60 pages
Jowua v5n4 5 PDF
No ratings yet
Jowua v5n4 5 PDF
17 pages
Introduction of IPTV BMS Subsystem V1.0
No ratings yet
Introduction of IPTV BMS Subsystem V1.0
39 pages
Spring Modules Validation
No ratings yet
Spring Modules Validation
11 pages

Quick Primer On LLVM IR: (For Those Already Familiar With LLVM IR, Feel Free To)

Uploaded by

Quick Primer On LLVM IR: (For Those Already Familiar With LLVM IR, Feel Free To)

Uploaded by

Quick primer on LLVM IR

LLVM IR is a low-level intermediate representation used by the LLVM

When developing compilers there are huge benefits with compiling

 Front-end: compiles source language to IR.

Example program in LLVM IR assembly

Using Clang3, the above C code compiles to the following LLVM IR

By looking at the LLVM IR assembly above, we may observe a few

 LLVM IR is statically typed (i.e. 32-bit integer values are denoted

The structure of LLMV IR assembly

The contents of an LLVM IR assembly file denotes a module. A module

A function declaration contains zero basic blocks and a function

A more detailed example of an LLVM IR module is given below,

1 ; Global variable initialized to the 32-bit integer value 21.

A basic block is a sequence of zero or more non-branching instructions

An instruction is a non-branching LLVM IR instruction, usually

A terminator instruction is at the end of each basic block, and

Static Single Assignment form

One very important property of LLVM IR is that it is in SSA-form (Static

For a concrete example, consider the following LLVM IR function.

The phi instruction (sometimes referred to as phi nodes) in the above

In general, when developing compilers which translates source code

To simplify the implementation of LLVM front-ends, one

LLVM IR library in pure Go

 llvm.org/llvm/bindings/go/llvm: the official LLVM bindings for the

This post focuses on llir/llvm, but should generalize to working with

Why write a new library?

The primary motivation for developing a pure Go library for interacting

The llir/llvm library was therefore designed to provide compile time

Firstly, we may want to parse LLVM IR produced by other tools, such as

Secondly, we may want to process LLVM IR to perform analysis of our

Thirdly, we may want to produce LLVM IR to be consumed by other

Input example - Parsing LLVM IR

Analysis example - Processing LLVM IR

1 // This example program analyses an LLVM IR module to produce a callgraph in

Output example - Producing LLVM IR

One particularly challenging part of the llir/llvm project has been to

The exciting part of having a grammar for LLVM IR is that it enables a

You might also like