From Regular Expressions To Automata

The document discusses techniques for converting regular expressions to finite state automata (FSAs), which are used for lexical analysis in compilers. It covers converting a non-deterministic FSA (NFA) to a deterministic FSA (DFA) using subset construction, and simulating an NFA directly using the subset construction approach. It also describes constructing an NFA from a regular expression using a syntax-directed approach based on the regular expression's parse tree.

Uploaded by

Nishat Afroj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views

From Regular Expressions To Automata

Uploaded by

Nishat Afroj

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 33

From Regular Expressions to

Automata
Compiler Design Lexical Analysis
s.l. dr. ing. Ciprian-Bogdan Chirila
[email protected]
http://www.cs.upt.ro/~chirila
Outline
 Conversion of a NFA to DFA
 Simulation of an NFA
 Construction of an NFA from a Regular
Expression
From Regular Expressions to
Automata
 regular expression describes
◦ lexical analyzers
◦ pattern processing software
 implies simulation of DFA or NFA
 NFA simulation is less straightforward
 Techniques
◦ to convert NFA to DFA
◦ the subset construction technique
 simulating NFA directly
 when NFA to DFA is time consuming
◦ to convert regular expression to NFA and then to
DFA
Conversion of a NFA to a DFA
 subset construction
◦ each state of DFA corresponds to a set of
NFA states
 DFA states may be exponential in number
of NFA states
 for lexical analysis NFA and DFA
◦ have approximately the same number of
states
◦ the exponential behavior is not seen
Subset construction of an DFA from
an NFA
 Input
◦ an NFA N
 Output
◦ DFA D accepting the same language as N
 Method
◦ to construct a transition table Dtran for D
◦ each state of D is a set of NFA states
◦ to construct Dtran so D will simulate in parallel
all possible moves N can make on a given input
string
◦ to deal with ε –transitions of N properly
Operations on NFA states
Operation Description
ε-closure(s) set of NFA states reachable from NFA state s on ε-
transition alone
ε-closure(T) set of NFA states reachable from some NFA state s in set
T on ε-transitions alone
move(T,a) set of NFA states to which there is a transition on input
symbol a from some state s in T
Transitions
 s0 – startstate
 N can be in any states of ε-closure(s0)
 reading input string x
◦ N can be in the set of states T after
 reading input a
◦ N can go in ε-closure(move(T, a))
 accepting states of D are all sets of N
states that include at least one accepting
state of N
The Subset Construction
while(there is an unmarked state T in Dstates)
{
mark T;
for(each input symbol a)
{
U=ε-closure(move(T,a));
if (U is not in Dstates)
add U as unmarked state to Dstates;
Dtran[T,a]=U;
}
}
Computing ε-closure(T)
push all states of T onto stack;
initialize ε-closure(T) to T;
while(stack is not empty)
{
pop t, the top element, off stack;
for(each state u with an edge from t to u labeled ε)
if(u is not in ε-closure(T))
{
add u to ε-enclosure(T);
push u onto stack;
}
}
Example (a|b)*abb

 A= ε-closure(0) or A={0,1,2,4,7}
Example (a|b)*abb
 A={0,1,2,4,7}
 Dtran(A,a)= ε-closure(move(A,a))
 from {0,1,2,4,7} only {2,7} have a
transition on a to {3,8}
Example (a|b)*abb
 Dtran[A,a]= ε-closure(move(A,a)) =
ε-closure({3,8}) = {1,2,3,4,6,7,8}
 Dtran[A,a]=B
Example (a|b)*abb
 from {0,1,2,4,7} only {4} has a transition
on b to {5}
 Dtran[A,b]= ε-closure({5})={1,2,4,5,6,7}
 Dtran[A,b]=C
…
Example (a|b)*abb
NFA State DFA State a b
{0,1,2,4,7} A B C
{1,2,3,4,6,7,8} B B D
{1,2,4,5,6,7} C B C
{1,2,4,5,6,7,9} D B E
{1,2,3,5,6,7,10} E B C
Simulation of an NFA
 strategy in text editing programs
◦ to construct a NFA from a regular expression
◦ to simulate NFA using on-the-fly subset construction
 Input
◦ input string x terminated by eof
◦ NFA N
 start state s0
 accepting states F
 transition function move
 Output
◦ yes / no
 Method
◦ to keep the current states S reached from s0
◦ if c is the next input read by nextChar()
◦ we compute move(S,c) and then we use ε-closure()
Algorithm: Simulating an NFA
01 S=ε-closure(s0);
02 c=nextChar();
03 while(c!=eof) {
04 S=ε-enclosure(move(S,c));
05 c=nextChar();
06 }
07 if(S∩F!=ø) return “yes”;
08 else return “no”;
Implementation of NFA Simulation
 two stacks each holding a set of NFA
states

 a boolean array alreadyOn

 a two dimensional array move[s,a]

NFA Simulation Data Structures
 two stacks each holding a set of NFA
states
◦ used for the values of S in both sides of assign
= operator in line 4
S=ε-enclosure(move(S,c));
 right side – oldStates
 left side – newStates
◦ newStates->oldStates
NFA Simulation Data Structures
 boolean array alreadyOn
◦ indexed by NFA states
◦ indicates which states are in newStates
◦ array and stack hold the same information
◦ it is much faster to interrogate the array than
to search the stack
 two dimensional array move[s,a]
◦ the entries are set of states
◦ implemented by linked lists
Implementation of step 1
01 S=ε-closure(s0);

addState(s)
{
push s onto newStates;
alreadyOn[s]=TRUE;
for(t on move[s,ε])
if(!alreadyOn(t))
addState(t);
}
Implementation of step 4
04 S=ε-enclosure(move(S,c));

for (s on oldStates)
{
for (t on move[s,c])
if(!alreadyOn[t])
addState(t);
pop s from oldStates;
}

for (s on newStates)
{
pop s from newStates;
push s onto oldStates;
alreadyOn[s]=FALSE;
}
Construction of an NFA from a
Regular Expression
 to convert a regular expression to a NFA
 McNaughton-Yamada-Thompson
algorithm
 syntax-directed
◦ it works recursively up the parse tree of the
regular expression
 for each subexpression a NFA with a
single accepting state is built
Construction of an NFA from a
Regular Expression
 Input
◦ regular expression r over an alphabet Σ
 Output
◦ An NFA accepting L(r)
 Method
◦ to parse r into constituent subexpressions
◦ basis rules for handling subexpressions with no
operators
◦ inductive rules for creating larger NFAs from
subexpressions NFAs
 union, concatenation, closure
Basis Rules for Constructing NFA
 for expression ε

 for expression a
NFA for the Union of Two Regular
Expressions
 r=s|t
 N(s) and N(t) are NFA’s for regular
expressions s and t
NFA for the Concatenation of Two
Regular Expressions
 r=st
 N(s) and N(t) are NFA’s for regular
expressions s and t
Induction Rules for Constructing
NFA
 r=s*
 N(s) is the NFA for the regular expressions
s

 r=(s)
◦ L(r)=L(s)
◦ N(s) is equivalent to N(r)
Example
 parse tree for (a|b)*abb
Example
 NFA for r1

 NFA for r2
Example
 NFA for r3=r1 | r2
Example
 NFA for r5=(r3)*
Example
 NFA for r7=r5r6

 …
Bibliography
 Alfred V. Aho, Monica S. Lam, Ravi Sethi,
Jeffrey D. Ullman – Compilers, Principles,
Techniques and Tools, Second Edition,
2007

Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
3 - Lecture 07
No ratings yet
3 - Lecture 07
70 pages
Unit 01 - Part 3
No ratings yet
Unit 01 - Part 3
18 pages
SEM04a-NFA Construction and Minimum DFA
No ratings yet
SEM04a-NFA Construction and Minimum DFA
48 pages
CS-352 - Spring 2024 - Lec4
No ratings yet
CS-352 - Spring 2024 - Lec4
38 pages
Aho-3 7
No ratings yet
Aho-3 7
5 pages
Lec 6
No ratings yet
Lec 6
27 pages
Lec2 0 NFA
No ratings yet
Lec2 0 NFA
30 pages
Dfa 1
No ratings yet
Dfa 1
23 pages
Can We Build A Finite Automaton For Every Regular Expression?, - Build FA Based On The Definition of Regular Expression
No ratings yet
Can We Build A Finite Automaton For Every Regular Expression?, - Build FA Based On The Definition of Regular Expression
66 pages
2_4 Finite Automata
No ratings yet
2_4 Finite Automata
23 pages
Compiler Design Lab manual
No ratings yet
Compiler Design Lab manual
32 pages
lec_4_ch_2
No ratings yet
lec_4_ch_2
39 pages
Non Deterministic Finite Automata (NFA)
No ratings yet
Non Deterministic Finite Automata (NFA)
26 pages
Lesson 13
No ratings yet
Lesson 13
35 pages
Lab Assignment-I
No ratings yet
Lab Assignment-I
6 pages
CC lec 5
No ratings yet
CC lec 5
24 pages
4-Lexical Analysis Part3
No ratings yet
4-Lexical Analysis Part3
37 pages
Lecture 3 Lexical Analyzer
No ratings yet
Lecture 3 Lexical Analyzer
44 pages
Lecture02 Scanning 2
No ratings yet
Lecture02 Scanning 2
79 pages
Lecture 4 - NFA To DFA
No ratings yet
Lecture 4 - NFA To DFA
38 pages
02 Automata
No ratings yet
02 Automata
78 pages
Compiler Design and Construction6
No ratings yet
Compiler Design and Construction6
23 pages
Lecture 3
No ratings yet
Lecture 3
30 pages
Two Issues in Lexical Analysis
No ratings yet
Two Issues in Lexical Analysis
11 pages
Lec 4
No ratings yet
Lec 4
17 pages
Patterns, Automata, and Regular Expressions
No ratings yet
Patterns, Automata, and Regular Expressions
4 pages
TOC Lec3
No ratings yet
TOC Lec3
51 pages
paska08b
No ratings yet
paska08b
44 pages
Dfa and Nfa
No ratings yet
Dfa and Nfa
50 pages
Week-2 Lecture 2 Lexical Analysis
No ratings yet
Week-2 Lecture 2 Lexical Analysis
15 pages
5_6280299294167667209
No ratings yet
5_6280299294167667209
8 pages
CPSC 388 - Compiler Design and Construction: Scanner - Regular Expressions To DFA
No ratings yet
CPSC 388 - Compiler Design and Construction: Scanner - Regular Expressions To DFA
23 pages
Transition Diagram
No ratings yet
Transition Diagram
13 pages
Finite Automata-Topic RE to NFA
No ratings yet
Finite Automata-Topic RE to NFA
25 pages
Lecture 5-FSMs-NFA-2-DFA
No ratings yet
Lecture 5-FSMs-NFA-2-DFA
62 pages
Lec02 Lexicalanalyzer
100% (1)
Lec02 Lexicalanalyzer
50 pages
Lect 07
No ratings yet
Lect 07
46 pages
CS 346: Compilers: Lexical Analyzer Lexical Analyzer
No ratings yet
CS 346: Compilers: Lexical Analyzer Lexical Analyzer
52 pages
lec-2
No ratings yet
lec-2
10 pages
Finite Automata
No ratings yet
Finite Automata
16 pages
ICS312 Set 29: Deterministic Finite Automata Nondeterministic Finite Automata
No ratings yet
ICS312 Set 29: Deterministic Finite Automata Nondeterministic Finite Automata
21 pages
c2 2 PDF
No ratings yet
c2 2 PDF
28 pages
lecture 3
No ratings yet
lecture 3
29 pages
Cse384 Compiler Design Laboratory Lab Manual
No ratings yet
Cse384 Compiler Design Laboratory Lab Manual
55 pages
Finite Automata
No ratings yet
Finite Automata
34 pages
Unit 2 Pattern Matches
No ratings yet
Unit 2 Pattern Matches
36 pages
Lexical
No ratings yet
Lexical
30 pages
CS 160
No ratings yet
CS 160
4 pages
UNIT 1 FiniteAutomata
No ratings yet
UNIT 1 FiniteAutomata
39 pages
Chapter 3 implementation_of_lexical_analysis
No ratings yet
Chapter 3 implementation_of_lexical_analysis
63 pages
[6] Lambda-NFAs and Conversions
No ratings yet
[6] Lambda-NFAs and Conversions
3 pages
Flat CH 2
No ratings yet
Flat CH 2
86 pages
Thomson Const Algo
No ratings yet
Thomson Const Algo
4 pages
CMP3008 LN3 NonDeterminism
No ratings yet
CMP3008 LN3 NonDeterminism
40 pages
Lecture 08
No ratings yet
Lecture 08
39 pages
04 Regular Expressions & FAs
No ratings yet
04 Regular Expressions & FAs
46 pages
Finite Autometa PDF
No ratings yet
Finite Autometa PDF
40 pages
Laplace Transforms Essentials
From Everand
Laplace Transforms Essentials
Morteza Shafii-Mousavi
3.5/5 (3)
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
NFA To Minimized DFA
No ratings yet
NFA To Minimized DFA
32 pages
Regular Expression
No ratings yet
Regular Expression
46 pages
CH 2 - Finite Automata
No ratings yet
CH 2 - Finite Automata
72 pages
Optimization of DFA-Based Pattern Matchers
No ratings yet
Optimization of DFA-Based Pattern Matchers
2 pages
Compiler Design: Lexical Analysis Sample Exercises and Solutions
No ratings yet
Compiler Design: Lexical Analysis Sample Exercises and Solutions
30 pages
Unit-II (Introduction To Finite Automata)
No ratings yet
Unit-II (Introduction To Finite Automata)
80 pages
Equivalence NFA To DFA
No ratings yet
Equivalence NFA To DFA
11 pages
Solved Problems On Automato
No ratings yet
Solved Problems On Automato
50 pages
Complete Download Automata and Computability: A Programmer’s Perspective Ganesh Gopalakrishnan PDF All Chapters
100% (1)
Complete Download Automata and Computability: A Programmer’s Perspective Ganesh Gopalakrishnan PDF All Chapters
55 pages
Toa - Lecture Notes-15
No ratings yet
Toa - Lecture Notes-15
24 pages
Toc MCQ
No ratings yet
Toc MCQ
1,103 pages
Compiler Notes 1
No ratings yet
Compiler Notes 1
21 pages
Tutorial3-NFA To DFA Conversion
No ratings yet
Tutorial3-NFA To DFA Conversion
1 page
Compiler Design Note1
No ratings yet
Compiler Design Note1
111 pages
Cosc261 Notes 1
No ratings yet
Cosc261 Notes 1
17 pages
TOC Unit I MCQ
No ratings yet
TOC Unit I MCQ
4 pages
IT5502 Compiler Engineering Ass I
No ratings yet
IT5502 Compiler Engineering Ass I
2 pages
200 Problem Set 6
No ratings yet
200 Problem Set 6
7 pages
Compiler Design - Compilers Principles and Practice - A.hosking - Compiler Course Slides
No ratings yet
Compiler Design - Compilers Principles and Practice - A.hosking - Compiler Course Slides
237 pages
Automata Theory Lecture Notes
No ratings yet
Automata Theory Lecture Notes
145 pages
TOC Module-1 Notes
No ratings yet
TOC Module-1 Notes
19 pages
Assignment 02(Second Semester 2024-2025)
No ratings yet
Assignment 02(Second Semester 2024-2025)
4 pages
Compiler Design and Construction Note
No ratings yet
Compiler Design and Construction Note
97 pages
At 1 & 2 Unit Questions
No ratings yet
At 1 & 2 Unit Questions
3 pages
Compiler Design Lecture Notes
No ratings yet
Compiler Design Lecture Notes
308 pages
Recognition of Tokens: The Question Is How To Recognize The Tokens?
No ratings yet
Recognition of Tokens: The Question Is How To Recognize The Tokens?
15 pages
12.conversion of NDFA to DFA
No ratings yet
12.conversion of NDFA to DFA
3 pages
ATC Class Work Finite Automata Book by Padma Reddy
No ratings yet
ATC Class Work Finite Automata Book by Padma Reddy
18 pages
Deterministic and Non Deterministic
No ratings yet
Deterministic and Non Deterministic
23 pages

From Regular Expressions To Automata

Uploaded by

From Regular Expressions To Automata

Uploaded by

From Regular Expressions to

 a boolean array alreadyOn

 a two dimensional array move[s,a]

You might also like