
23CS2204

Compiler Design
Dr. Sadu Chiranjeevi
Assistant Professor
Department of Computer Science and Engineering
[email protected]

How to describe tokens?
• Programming language tokens can be described by regular languages
• Regular languages
– are easy to understand
– have a well-understood and useful theory
– have efficient implementations
• Regular languages are discussed in great detail in the “Theory of Computation” course
How to specify tokens
• Regular definitions
– Let ri be a regular expression and di be a distinct name
– A regular definition is a sequence of definitions of the form
      d1 → r1
      d2 → r2
      …
      dn → rn
– where each ri is a regular expression over Σ ∪ {d1, d2, …, di-1}
Examples
• My fax number
  91-(512)-259-7586
• Σ = digit ∪ {-, (, )}
• country → digit²       (91)
• area → ‘(‘ digit³ ‘)’  (512)
• exchange → digit³      (259)
• phone → digit⁴         (7586)
• number → country ‘-’ area ‘-’ exchange ‘-’ phone
Examples

• My email address
  [email protected]
• Σ = letter ∪ {@, . }
• letter → a | b | … | z | A | B | … | Z
• name → letter+
• address → name ‘@’ name ‘.’ name ‘.’ name
Examples …
• Identifier
  letter → a | b | … | z | A | B | … | Z
  digit → 0 | 1 | … | 9
  identifier → letter (letter|digit)* | _ (letter|digit)*

• Unsigned number in C
  digit → 0 | 1 | … | 9
  digits → digit+
  fraction → ’.’ digits | ε
  exponent → (E ( ‘+’ | ‘-’ | ε) digits) | ε
  number → digits fraction exponent
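The “unsigned number” definition can also be checked by hand, one sub-definition at a time. The sketch below mirrors digits, fraction, and exponent directly; the function names are mine, not from the slides:

```c
#include <ctype.h>

/* digits = digit+ ; returns pointer past the digits, or NULL if none */
static const char *scan_digits(const char *p) {
    if (!isdigit((unsigned char)*p)) return 0;
    while (isdigit((unsigned char)*p)) p++;
    return p;
}

/* number = digits fraction exponent (illustrative hand-coded checker) */
int is_unsigned_number(const char *s) {
    const char *p = scan_digits(s);          /* digits */
    if (!p) return 0;
    if (*p == '.') {                         /* fraction = '.' digits | eps */
        p = scan_digits(p + 1);
        if (!p) return 0;
    }
    if (*p == 'E') {                         /* exponent = (E (+|-|eps) digits) | eps */
        p++;
        if (*p == '+' || *p == '-') p++;
        p = scan_digits(p);
        if (!p) return 0;
    }
    return *p == '\0';                       /* accept only if all input consumed */
}
```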
Regular expressions in specifications
• Regular expressions describe many useful languages
• Regular expressions are only specifications; implementation is still required
• Given a string s and a regular expression R, does s ∈ L(R)?
• The solution to this problem is the basis of lexical analyzers
• However, just the yes/no answer is not sufficient
• Goal: partition the input into tokens
  1. Write a regular expression for the lexemes of each token
     • number → digit+
  2. Construct R matching all lexemes of all tokens
     • R = R1 + R2 + R3 + …
  3. Let the input be x1…xn
     • for 1 ≤ i ≤ n, check x1…xi ∈ L(R)
  4. x1…xi ∈ L(R) ⟹ x1…xi ∈ L(Rj) for some j
     • the smallest such j is the token class of x1…xi
  5. Remove x1…xi from the input; go to (3)
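Steps (1)–(5) can be sketched as a brute-force matcher. The two token classes, all names, and the quadratic prefix loop below are illustrative simplifications of the scheme above, not the slides’ actual algorithm (real scanners use automata instead of re-checking every prefix):

```c
#include <ctype.h>
#include <stddef.h>
#include <string.h>

enum { TOK_NONE = 0, TOK_NUM = 1, TOK_ID = 2 };   /* priority order: R1, R2 */

/* Is the prefix s[0..n) in L(R_cls)? */
static int in_L(int cls, const char *s, size_t n) {
    size_t i;
    if (n == 0) return 0;
    if (cls == TOK_NUM) {                          /* number -> digit+ */
        for (i = 0; i < n; i++)
            if (!isdigit((unsigned char)s[i])) return 0;
        return 1;
    }
    if (!isalpha((unsigned char)s[0])) return 0;   /* id -> letter (letter|digit)* */
    for (i = 1; i < n; i++)
        if (!isalnum((unsigned char)s[i])) return 0;
    return 1;
}

/* Class of the longest prefix of s in L(R); *len receives its length. */
int next_token(const char *s, size_t *len) {
    size_t i, best_len = 0;
    int best = TOK_NONE;
    for (i = 1; i <= strlen(s); i++) {             /* step (3): check x1..xi */
        int cls;
        for (cls = TOK_NUM; cls <= TOK_ID; cls++)  /* step (4): smallest j wins */
            if (in_L(cls, s, i)) { best = cls; best_len = i; break; }
    }
    *len = best_len;                               /* step (5): caller removes prefix */
    return best;
}
```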
Transition Diagrams
• Regular expressions are declarative specifications
• A transition diagram is an implementation
• A transition diagram consists of
– An input alphabet belonging to Σ
– A set of states S
– A set of transitions statei --input--> statej
– A set of final states F
– A start state n
• The transition s1 --a--> s2 is read:
  in state s1, on input a, go to state s2
• If the end of input is reached in a final state, then accept
• Otherwise, reject
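A transition diagram with these components can be coded directly as a transition function over states. The toy diagram below (strings over {a, b} with an even number of a’s) is my example for illustration, not one from the slides:

```c
/* States S = {0, 1}; start state = 0; final states F = {0}.
 * Transitions: on 'a' flip parity, on 'b' stay; anything else rejects. */
static int delta(int state, char c) {
    if (c == 'a') return 1 - state;   /* state_i --a--> state_j */
    if (c == 'b') return state;       /* state_i --b--> state_i */
    return -1;                        /* character not in the input alphabet */
}

int accepts(const char *s) {
    int state = 0;                    /* start state */
    for (; *s; s++) {
        state = delta(state, *s);
        if (state < 0) return 0;      /* reject on illegal input */
    }
    return state == 0;                /* accept iff end of input in a final state */
}
```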
Pictorial notation
• A state: a circle
• A final state: a double circle
• A transition: an arrow between states
• Transition from state i to state j on an input a:

      (i) --a--> (j)
How to recognize tokens
• Consider
    relop → < | <= | = | <> | >= | >
    id → (letter|_) (letter|digit)*
    num → digit+ (‘.’ digit+)? (E (‘+’|’-’)? digit+)?
    delim → blank | tab | newline
    ws → delim+

• Construct an analyzer that will return <token, attribute> pairs
Transition diagram for relops
  (diagram not reproduced: the start state branches on <, =, >, with further
  states for <=, <> and >=; states marked * retract the last input character)

Transition diagram for identifier

  start --letter|_--> (1) --other--> accept *
  (state 1 loops on letter|digit; * means the last character is retracted)

Transition diagram for white spaces

  start --delim--> (1) --other--> accept *
  (state 1 loops on delim)

Transition diagram for unsigned numbers
  (diagram not reproduced)
Implementation of transition diagrams
Token nexttoken() {
    while (1) {
        switch (state) {
        ……
        case 10:
            c = nextchar();
            if (isletter(c)) state = 10;
            else if (isdigit(c)) state = 10;
            else state = 11;
            break;
        ……
        }
    }
}
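A complete, runnable version of the identifier fragment above might look like the sketch below. The function name, return convention, and length output are my additions; the state numbering and the retraction in the starred final state follow the diagram:

```c
#include <ctype.h>

/* Scan one identifier -- (letter|_)(letter|digit)* -- at the start of input.
 * Returns 1 and the identifier's length in *len, or 0 if none matches. */
int scan_identifier(const char *input, int *len) {
    int state = 9;                         /* start state */
    const char *p = input;
    for (;;) {
        char c = *p++;                     /* c = nextchar() */
        if (state == 9) {                  /* first character: letter or '_' */
            if (isalpha((unsigned char)c) || c == '_') state = 10;
            else return 0;                 /* not an identifier */
        } else {                           /* state 10: loop on letter|digit */
            if (isalnum((unsigned char)c)) state = 10;
            else {                         /* state 11, marked '*': retract */
                *len = (int)(p - input) - 1;  /* do not count the lookahead char */
                return 1;
            }
        }
    }
}
```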
Lexical analyzer generator
• Input to the generator
– List of regular expressions in priority order
– An associated action for each regular expression (generates the kind of
  token and other bookkeeping information)

• Output of the generator
– A program that reads the input character stream and breaks it into tokens
– Reports lexical errors (unexpected characters), if any
LEX: A lexical analyzer generator

  Token specifications (lex.l) --> [LEX compiler] --> lex.yy.c
                                   (C code for the lexical analyzer)

  lex.yy.c --> [C compiler] --> a.out (object code: the lexical analyzer)

  Input program --> [lexical analyzer] --> tokens
Format of Lex file
• A Lex program is separated into three sections by %% delimiters. The format
  of a Lex source file is as follows:

{ definitions }
%%
{ rules }
%%
{ user subroutines }
Format of Lex file
• Definitions include declarations of constants, variables and regular
  definitions.

• Rules are statements of the form
      p1 {action1}
      p2 {action2}
      ....
      pn {actionn}
  where pi is a regular expression and actioni describes what action the
  lexical analyzer should take when pattern pi matches a lexeme.

• User subroutines are auxiliary procedures needed by the actions. The
  subroutines can be compiled separately and loaded with the lexical analyzer.
Lex Program
/* Lex program to count the number of words per line */
%{
#include <stdio.h>
#include <string.h>
int i = 0;
%}

/* Rules section */
%%
[a-zA-Z0-9]+   { i++; }                   /* rule for counting words */
"\n"           { printf("%d\n", i); i = 0; }
%%

int yywrap(void) { return 1; }

int main()
{
    /* yylex() starts the analysis */
    yylex();
    return 0;
}
