Unit 1: Compiler Design

The document discusses compiler design and concepts related to compilers. It defines a compiler as a translator that converts a high-level language into machine language. It describes the main phases of compilation as lexical analysis, syntax analysis, semantic analysis, intermediate code generation, code optimization, and code generation. It also discusses single-pass and multi-pass compilers, bootstrapping, finite state machines, regular expressions, and optimization of deterministic finite automata.


Unit 1

Compiler Design
Introduction to Compiler
• A compiler is a translator that converts a high-level
language into machine language.
• The high-level language is written by the developer; the
machine language is what the processor can understand.
• A compiler is also used to report errors to the programmer.
• The main purpose of a compiler is to translate code
written in one language into another without changing the
meaning of the program.
• When you execute a program written in a HLL
programming language, the execution happens in two parts.
• In the first part, the source program is compiled and
translated into an object program (low-level language).
• In the second part, the object program is translated into
the target program through the assembler.
Fig 1 : Execution process of source program in Compiler
Compiler Phases

• The compilation process consists of a sequence of
phases. Each phase takes the source program in
one representation and produces output in another
representation. Each phase takes its input from the
previous stage.
• The various phases of a compiler are:
Fig 2 : phases of compiler
• Lexical Analysis:
The lexical analyzer phase is the first phase of the compilation
process. It takes source code as input, reads the
source program one character at a time, and groups the
characters into meaningful lexemes. The lexical analyzer
represents these lexemes in the form of tokens.
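As a sketch of this phase, here is a minimal regular-expression-based tokenizer in Python (the token names and patterns are illustrative assumptions, not taken from any particular compiler):

```python
import re

# Illustrative token patterns (names and patterns are assumptions)
TOKEN_SPEC = [
    ("NUMBER", r"\d+"),
    ("ID",     r"[A-Za-z_]\w*"),
    ("OP",     r"[+\-*/=]"),
    ("SKIP",   r"\s+"),
]

def tokenize(source):
    """Read the source one match at a time and emit (token, lexeme) pairs."""
    pattern = "|".join(f"(?P<{name}>{pat})" for name, pat in TOKEN_SPEC)
    tokens = []
    for m in re.finditer(pattern, source):
        if m.lastgroup != "SKIP":          # whitespace is discarded
            tokens.append((m.lastgroup, m.group()))
    return tokens

print(tokenize("sum = a + 42"))
```

Each character of `sum = a + 42` is consumed exactly once, and the lexemes come out paired with their token names.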
• Syntax Analysis
Syntax analysis is the second phase of the compilation
process. It takes tokens as input and generates a parse
tree as output. In the syntax analysis phase, the parser
checks whether the expression formed by the tokens is
syntactically correct.
• Semantic Analysis
Semantic analysis is the third phase of the compilation process. It
checks whether the parse tree follows the rules of the language.
The semantic analyzer keeps track of identifiers, their types, and
expressions. The output of the semantic analysis phase is the
annotated syntax tree.
• Intermediate Code Generation
In intermediate code generation, the compiler translates the
source code into an intermediate code. The intermediate code
lies between the high-level language and the machine
language. It should be generated in such a way that it can
easily be translated into the target machine code.
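Three-address code is a common intermediate form. Below is a minimal sketch that flattens a nested expression into three-address instructions (the tuple representation of expressions and the temporary names t1, t2, ... are assumptions for illustration):

```python
def to_tac(expr, temps=None, code=None):
    """Flatten a nested (op, left, right) tuple into three-address code."""
    if temps is None:
        temps, code = [0], []
    if isinstance(expr, str):          # a leaf: variable or constant
        return expr, code
    op, left, right = expr
    l, _ = to_tac(left, temps, code)
    r, _ = to_tac(right, temps, code)
    temps[0] += 1
    t = f"t{temps[0]}"                 # fresh temporary for this result
    code.append(f"{t} = {l} {op} {r}")
    return t, code

# a + b * c  →  t1 = b * c ; t2 = a + t1
result, code = to_tac(("+", "a", ("*", "b", "c")))
for line in code:
    print(line)
```

For `a + b * c` this emits `t1 = b * c` followed by `t2 = a + t1`, mirroring operator precedence.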
• Code Optimization
Code optimization is an optional phase. It is used to improve
the intermediate code so that the output of the program
runs faster and takes less space. It removes
unnecessary lines of code and rearranges the sequence of
statements to speed up program execution.
• Code Generation
Code generation is the final stage of the compilation process.
It takes the optimized intermediate code as input and maps
it to the target machine language. The code generator translates
the intermediate code into the machine code of the specified
computer.
Compiler Passes

A pass refers to the number of times the compiler goes
through the source code. There are single-pass compilers and
multi-pass compilers. A single-pass compiler goes through the
program only once; in other words, it allows the source code
to pass through each compilation unit only once, and it
immediately translates each code section into its final
machine code.
A multi-pass compiler goes through the source code several
times; in other words, it allows the source code to pass
through each compilation unit several times. Each pass takes
the result of the previous pass as input and creates
intermediate outputs, so the code improves with each
pass. The final code is generated after the final pass. Multi-
pass compilers perform additional tasks such as intermediate
code generation, machine-dependent code optimization, and
machine-independent code optimization.
Difference Between Phases and Passes of Compiler
Definition
Phases refer to the units or steps in the compilation process.
Passes, in contrast, refer to the total number of times the
compiler goes through the source code before converting it
into the target machine code. Thus, this is the main difference
between the phases and passes of a compiler.
There are six main phases in the compilation process, while
there are two types of compilers: single-pass and multi-pass.
Hence, this is another difference between the phases and
passes of a compiler.
Conclusion
A compiler is special software that converts a high-level
language into machine language. The main difference between
phases and passes of a compiler is that phases are the steps
in the compilation process, while passes are the number of
times the compiler traverses the source code.
Bootstrapping

• Bootstrapping is widely used in compiler development.
• Bootstrapping is used to produce a self-hosting
compiler, i.e. a compiler that can compile its own
source code.
• A bootstrap compiler is used to compile the compiler, and
then this compiled compiler can be used to compile
everything else, as well as future versions of itself.
A compiler can be characterized by three languages:
1. Source Language
2. Target Language
3. Implementation Language
The T-diagram shows a compiler SCIT for source language S
and target language T, implemented in language I.
Follow these steps to produce a compiler for a new language L
on machine A:
1. Create a compiler SCAA for a subset S of the desired
language L, written in language A; this compiler
runs on machine A.
2. Create a compiler LCSA for the full language L, written in
the subset S of L.
3. Compile LCSA using the compiler SCAA to obtain LCAA. LCAA is a
compiler for language L, which runs on machine A and produces
code for machine A.

The process described by the T-diagrams is called bootstrapping.


Finite state machine

• A finite state machine is used to recognize patterns.
• A finite automaton takes a string of symbols as
input and changes its state accordingly. When a
desired symbol is found in the input, the transition
occurs.
• During a transition, the automaton can either move to the
next state or stay in the same state.
• An FA has two outcomes: accept or reject. When
the input string has been successfully processed and the
automaton has reached a final state, the string is accepted;
otherwise it is rejected.
A finite automaton consists of the following:

Q: finite set of states
∑: finite set of input symbols
q0: initial state
F: set of final states
δ: transition function

The transition function can be defined as
δ: Q x ∑ → Q
FA is classified into two types:

1. DFA (deterministic finite automata)
2. NDFA (non-deterministic finite automata)
DFA

DFA stands for Deterministic Finite Automata.
Deterministic refers to the uniqueness of the
computation: in a DFA, each input character takes the
machine to exactly one state. A DFA does not allow the
null move, which means it cannot change state without
consuming an input character.
A DFA has five tuples {Q, ∑, q0, F, δ}:
Q: set of all states
∑: finite set of input symbols, where δ: Q x ∑ → Q
q0: initial state
F: set of final states
δ: transition function
Example
See an example of deterministic finite automata:
Q = {q0,q1,q2}
∑ = {0, 1}
q0 = {q0}
F = {q2}
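The transition function δ for this example appears in a figure that is not reproduced here, so the table below is a hypothetical one: a DFA over {0, 1} with these three states that accepts strings containing the substring "01". Simulating a DFA is a direct loop over the five tuples:

```python
# Hypothetical transition table (the original figure is not reproduced):
# this DFA accepts strings that contain the substring "01".
delta = {
    ("q0", "0"): "q1", ("q0", "1"): "q0",
    ("q1", "0"): "q1", ("q1", "1"): "q2",
    ("q2", "0"): "q2", ("q2", "1"): "q2",
}

def dfa_accepts(string, start="q0", final={"q2"}):
    """Run the DFA: exactly one next state per (state, symbol) pair."""
    state = start
    for symbol in string:
        state = delta[(state, symbol)]
    return state in final

print(dfa_accepts("1101"))   # contains "01" → True
print(dfa_accepts("110"))    # no "01"  → False
```

Note that the loop never branches: determinism means `delta` has exactly one entry per (state, symbol) pair.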
NDFA
• NDFA refers to Non-Deterministic Finite Automata. For a particular
input, it can transition to any number of states. An NDFA accepts the
NULL move, which means it can change state without reading a symbol.

• An NDFA also has five tuples, the same as a DFA, but it has a
different transition function.

• The transition function of an NDFA can be defined as:

δ: Q x ∑ → 2^Q
Example :
See an example of non deterministic finite automata:
Q = {q0, q1, q2}  
∑ = {0, 1}  
q0 = {q0}  
F = {q2}  
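Again the transitions were given in a figure; assuming, for illustration, an NFA with these states that accepts strings ending in "01", the simulation tracks a set of possible states, reflecting δ: Q x ∑ → 2^Q:

```python
# Hypothetical NFA transitions (the original figure is not reproduced):
# this NFA accepts strings that end in "01". Note each (state, symbol)
# pair maps to a SET of states — δ: Q x ∑ → 2^Q.
delta = {
    ("q0", "0"): {"q0", "q1"}, ("q0", "1"): {"q0"},
    ("q1", "1"): {"q2"},
}

def nfa_accepts(string, start="q0", final={"q2"}):
    """Track the set of all states the NFA could be in simultaneously."""
    current = {start}
    for symbol in string:
        nxt = set()
        for state in current:
            nxt |= delta.get((state, symbol), set())
        current = nxt
    return bool(current & final)

print(nfa_accepts("1101"))   # ends in "01" → True
print(nfa_accepts("0110"))   # ends in "10" → False
```

Missing (state, symbol) entries simply contribute no successor states, which is how non-determinism "dies out" on a wrong guess.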
Regular expression

• A regular expression is a pattern that defines a set of
strings. It is used to denote regular languages.
• It is also used to match character combinations in
strings. String-searching algorithms use such patterns to
find operations on strings.
• In a regular expression, x* means zero or more
occurrences of x. It can generate {ε, x, xx, xxx,
xxxx, .....}.
• In a regular expression, x+ means one or more
occurrences of x. It can generate {x, xx, xxx, xxxx, .....}.
Operations on Regular Language

The various operations on regular languages are:

• Union: If L and M are two regular languages, then their
union L ∪ M is also regular.
L ∪ M = {s | s is in L or s is in M}

• Concatenation: If L and M are two regular languages,
then their concatenation L.M is also regular.
L.M = {st | s is in L and t is in M}

• Kleene closure: If L is a regular language, then its
Kleene closure L* is also a regular language.
L* = zero or more occurrences of strings from language L.
Example
Write the regular expression for the language:
L = {ab^n w : n ≥ 3, w ∈ (a, b)+}
Solution:
Each string of language L starts with "a" followed by at
least three b's, then at least one more "a" or "b"; the
strings look like abbba, abbbbbba, abbbbbbbb,
abbbb.....a.
So the regular expression is:
r = ab^3b*(a+b)+
Here + is the positive closure, i.e. (a+b)+ = (a+b)* − ε
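The regular expression ab^3b*(a+b)+ can be checked with Python's re module, rewriting the alternation (a+b) as the character class [ab]:

```python
import re

# a, then at least three b's, then at least one symbol from {a, b}:
# ab^3 b* (a+b)+ written in Python regex syntax.
pattern = re.compile(r"ab{3,}[ab]+")

for s in ["abbba", "abbbbbba", "abbb", "abba"]:
    print(s, bool(pattern.fullmatch(s)))
```

Note that "abbb" is rejected: the (a+b)+ part requires at least one symbol after the three b's, so w may not be empty.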
Optimization of DFA

To optimize a DFA, follow these steps:

Step 1: Remove all states that are unreachable from the
initial state via any sequence of transitions of the DFA.

Step 2: Draw the transition table for the remaining states.

Step 3: Split the transition table into two tables, T1
and T2. T1 contains all the final states and T2 contains the
non-final states.

Step 4: Find the similar rows in T1 such that:

1. δ(q, a) = p
2. δ(r, a) = p

That is, find two states which have the same transitions
on a and b, and remove one of them.

• Step 5: Repeat step 4 until no similar rows remain
in transition table T1.
• Step 6: Repeat steps 4 and 5 for table T2.
• Step 7: Combine the reduced T1 and T2 tables.
The combined transition table is the transition table of the
minimized DFA.
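The table-splitting steps above are a form of partition refinement. Below is a compact sketch (Moore's minimization algorithm, run on a small illustrative DFA in which q1 and q2 are equivalent; the DFA itself is an assumption, not the one from the figure):

```python
def minimize(states, alphabet, delta, finals):
    """Split {final, non-final}, then keep refining until stable (Moore)."""
    partition = [set(finals), set(states) - set(finals)]
    changed = True
    while changed:
        changed = False
        new_partition = []
        for block in partition:
            # group states whose transitions land in the same blocks
            groups = {}
            for s in block:
                key = tuple(
                    next(i for i, b in enumerate(partition) if delta[(s, a)] in b)
                    for a in alphabet
                )
                groups.setdefault(key, set()).add(s)
            new_partition.extend(groups.values())
            if len(groups) > 1:
                changed = True
        partition = new_partition
    return partition

# Illustrative DFA where q1 and q2 behave identically
delta = {
    ("q0", "0"): "q1", ("q0", "1"): "q2",
    ("q1", "0"): "q3", ("q1", "1"): "q3",
    ("q2", "0"): "q3", ("q2", "1"): "q3",
    ("q3", "0"): "q3", ("q3", "1"): "q3",
}
blocks = minimize({"q0", "q1", "q2", "q3"}, ["0", "1"], delta, {"q3"})
print(blocks)   # q1 and q2 end up in the same block
```

Each block of the final partition becomes one state of the minimized DFA, which is exactly what combining the reduced T1 and T2 tables does.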
Example
Solution:
• Step 1: In the given DFA, q2 and q4 are unreachable
states, so remove them.
• Step 2: Draw the transition table for the rest of the states.
Step 3:
Now divide the rows of the transition table into two sets:
1. One set contains the rows which start from non-final
states.
2. The other set contains the rows which start from final
states.

Step 4: Set 1 has no similar rows, so set 1 stays the same.
Step 5: In set 2, row 1 and row 2 are similar, since q3
and q5 transit to the same states on 0 and 1. So remove q5
and replace q5 by q3 in the rest.
Step 6: Now combine set 1 and set 2.

Now this is the transition table of the minimized DFA.


LEX
• Lex is a program that generates a lexical analyzer. It is
used with the YACC (Yet Another Compiler Compiler)
parser generator.
• The lexical analyzer is a program that transforms an
input stream into a sequence of tokens.
• Lex reads the specification and produces C source
code implementing the lexical analyzer.
The function of Lex is as follows:
First, the lexical-analyzer specification is written as a
program lex.l in the Lex language. Then the Lex compiler
runs the lex.l program and produces a C program lex.yy.c.
Finally, the C compiler compiles the lex.yy.c program and
produces an object program a.out.
a.out is the lexical analyzer that transforms an input stream
into a sequence of tokens.
Lex file format
A Lex program is separated into three sections by %%
delimiters. The format of Lex source is as follows:

{ definitions }
%%
{ rules }
%%
{ user subroutines }
• Definitions include declarations of constants, variables,
and regular definitions.
• Rules are statements of the form p1 {action1} p2
{action2} .... pn {actionn},
where pi describes a regular expression
and actioni describes the action the lexical analyzer
should take when pattern pi matches a lexeme.
• User subroutines are auxiliary procedures needed by
the actions. The subroutines can be compiled separately
and loaded with the lexical analyzer.
Formal grammar

• A formal grammar is a set of rules. It is used to identify
correct or incorrect strings of tokens in a language. The
formal grammar is represented as G.
• A formal grammar is used to generate all possible strings
over the alphabet that are syntactically correct in the
language.
• Formal grammar is used mostly in the syntax analysis
phase (parsing), particularly during compilation.
A formal grammar G is written as follows:
G = <V, N, P, S>
Where:
N describes a finite set of non-terminal symbols.
V describes a finite set of terminal symbols.
P describes a set of production rules.
S is the start symbol.
Example:
V = {a, b}, N = {S, R, B}
Production rules:
S = bR
R = aR
R = aB
B = b
Through these productions we can produce strings like bab, baab,
baaab, etc.
These productions describe strings of the shape ba^n b (n ≥ 1).

Fig : Formal grammar


BNF Notation

• BNF stands for Backus-Naur Form. It is used to write
a formal representation of a context-free grammar. It is
also used to describe the syntax of a programming
language.
• BNF notation is basically just a variant of a context-free
grammar.
In BNF, productions have the form:
Left side → definition
Where leftside ∈ (Vn ∪ Vt)+ and definition ∈ (Vn ∪ Vt)*. In
BNF, the leftside contains one non-terminal.
We can define several productions with the same
leftside. The alternatives are separated by the vertical
bar symbol "|".
Consider, for example, the productions:
S → aSa
S → bSb
S → c
In BNF, we can represent the above grammar as follows:
S → aSa | bSb | c
YACC

• YACC stands for Yet Another Compiler Compiler.
• YACC provides a tool to produce a parser for a given
grammar.
• YACC is a program designed to compile an LALR(1)
grammar, where LALR means Look-Ahead LR
(left-to-right scan with look-ahead).
• It is used to produce the source code of the syntactic
analyzer of the language produced by an LALR(1)
grammar.
• The input of YACC is a rule set or grammar, and the output
is a C program.
These are some points about YACC:
Input: a CFG - file.y
Output: a parser y.tab.c (yacc)
• The output file "file.output" contains the parsing tables.
• The file "file.tab.h" contains declarations.
• The parser function is called yyparse().
• The parser expects to use a function called yylex() to get
tokens.
The basic operational sequence is as follows:
1. gram.y — the file containing the desired grammar in YACC format.
2. YACC — the YACC program processes this file.
3. y.tab.c — the C source program created by YACC.
4. C compiler — compiles the generated C source.
5. a.out — the executable file that will parse the grammar given in
gram.y.
Context free grammar
A context-free grammar is a formal grammar which is used to generate all
possible strings in a given formal language.
A context-free grammar G can be defined by four tuples as:
G = (V, T, P, S)
Where G describes the grammar,
T describes a finite set of terminal symbols,
V describes a finite set of non-terminal symbols,
P describes a set of production rules, and
S is the start symbol.
In a CFG, the start symbol is used to derive the string. You
can derive the string by repeatedly replacing a non-
terminal by the right-hand side of a production, until all
non-terminals have been replaced by terminal symbols.
Example:
L = {wcw^R | w ∈ (a, b)*}
Production rules:
S → aSa
S → bSb
S → c
Now check that the string abbcbba can be derived from the given CFG:
S ⇒ aSa
  ⇒ abSba
  ⇒ abbSbba
  ⇒ abbcbba
By applying the productions S → aSa and S → bSb recursively, and
finally applying the production S → c, we get the string abbcbba.
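The derivation above can also be replayed mechanically by rewriting the non-terminal S step by step:

```python
# Replay the derivation S => aSa => abSba => abbSbba => abbcbba
# by replacing S with the chosen right-hand side at each step.
steps = ["aSa", "bSb", "bSb", "c"]   # productions applied, in order

sentential = "S"
for rhs in steps:
    sentential = sentential.replace("S", rhs, 1)
    print(sentential)
```

The final sentential form contains only terminals, so abbcbba is in the language.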
Capabilities of CFG

The various capabilities of CFG are:

• Context-free grammar is useful for describing most
programming languages.
• If the grammar is properly designed, an efficient parser
can be constructed from it automatically.
• Using associativity and precedence information,
suitable grammars for expressions can be constructed.
• Context-free grammar is capable of describing nested
structures, like balanced parentheses, matching begin-end
pairs, corresponding if-then-else's, and so on.
Derivation
• A derivation is a sequence of production rule applications. It
is used to obtain the input string through these production
rules. During parsing we have to take two decisions. These
are as follows:
• We have to decide which non-terminal is to be
replaced.
• We have to decide the production rule by which the
non-terminal will be replaced.
• There are two common orders for choosing which
non-terminal to replace.
Left-most Derivation

In the left-most derivation, the input is scanned and
replaced with the production rules from left to right. So in a
left-most derivation we read the input string from left to
right.
Example:
Production rules:
S = S + S  
S = S - S  
S = a | b |c  
Input:
a-b+c
The left-most derivation is:
S = S + S  
S = S - S + S  
S = a - S + S  
S = a - b + S  
S = a - b + c  
Right-most Derivation

In the right-most derivation, the input is scanned and
replaced with the production rules from right to left. So in a
right-most derivation we read the input string from right
to left.
Example : S = S + S  
S = S - S  
S = a | b |c  
Input:
a–b+c
The right-most derivation is:
S = S - S  
S = S - S + S  
S = S - S + c  
S = S - b + c  
S = a - b + c  
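Both derivation orders can be replayed mechanically; the sketch below rewrites either the leftmost or the rightmost S at each step, and the step lists mirror the two examples above:

```python
def derive(steps, leftmost=True):
    """Rewrite S step by step, replacing the leftmost or rightmost S."""
    sentential = "S"
    trace = []
    for rhs in steps:
        if leftmost:
            sentential = sentential.replace("S", rhs, 1)   # leftmost S
        else:
            i = sentential.rindex("S")                     # rightmost S
            sentential = sentential[:i] + rhs + sentential[i + 1:]
        trace.append(sentential)
    return trace

print(derive(["S+S", "S-S", "a", "b", "c"], leftmost=True))
print(derive(["S-S", "S+S", "c", "b", "a"], leftmost=False))
```

Both traces end in the same string "a-b+c": the two derivations differ only in the order of the rewriting, not in the result.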
Parse tree

• A parse tree is the graphical representation of symbols. The
symbols can be terminals or non-terminals.
• In parsing, the string is derived using the start symbol.
The root of the parse tree is that start symbol.
• A parse tree follows the precedence of operators. The
deepest sub-tree is traversed first, so the operator in a
parent node has lower precedence than the operator in its
sub-tree.
The parse tree follows these points:

• All leaf nodes have to be terminals.
• All interior nodes have to be non-terminals.
• An in-order traversal gives the original input string.
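The last point can be checked on a small parse tree for a - b + c (the nested-tuple representation of the tree is an assumption for illustration):

```python
# Parse tree for "a-b+c" with grammar S = S+S | S-S | a | b | c:
# tuples are (left, root, right), leaves are terminal strings.
tree = (("a", "-", "b"), "+", "c")

def inorder(node):
    """An in-order traversal of the parse tree recovers the input string."""
    if isinstance(node, str):
        return node
    left, root, right = node
    return inorder(left) + root + inorder(right)

print(inorder(tree))   # → a-b+c
```

Note also that "-" sits deeper in the tree than "+", matching the left-to-right evaluation of a - b + c.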
Construct a parse tree for E → E + E | E * E | id
Construct a parse tree for S → SS* | SS+ | a
Ambiguity

A grammar is said to be ambiguous if there exists more
than one leftmost derivation, more than one rightmost
derivation, or more than one parse tree for a given input
string. If the grammar is not ambiguous, then it is called
unambiguous.
Example:
S = aSb | SS
S = ε
For the string aabb, the above grammar generates two
parse trees.
If a grammar is ambiguous, it is not good for compiler
construction. No method can automatically detect and
remove ambiguity in general, but you can remove ambiguity
by rewriting the whole grammar without ambiguity.
Problem:

Check whether the grammar G with production rules
X → X+X | X*X | X | a
is ambiguous or not.
Solution:
Let's find the derivation trees for the string
"a+a*a". It has two leftmost derivations.
Derivation 1 − X → X+X → a+X → a+X*X → a+a*X → a+a*a
Parse tree 1 −
Derivation 2 − X → X*X → X+X*X → a+X*X → a+a*X → a+a*a
Parse tree 2 −

Since there are two parse trees for the single string
"a+a*a", the grammar G is ambiguous.
