lecture 4

This document discusses syntax analysis in compilers, focusing on the role of parsers and the use of context-free grammars (CFG) to define the structure of programming languages. It outlines different types of parsers, including top-down and bottom-up parsers, and explains how syntax trees are constructed from tokens. Additionally, it provides formal definitions and examples of context-free grammars used in programming language constructs.

Uploaded by

enochmack04

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

lecture 4

Uploaded by

enochmack04

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 26

COMPILERS

Lecture 4
Lecture Outline
■ Syntax Analysis
■ Role of the Parser
■ Grammars
■ Context-free Grammars
Introduction
■ The purpose of syntax analysis (also known as parsing) is
to recombine the tokens the lexical analysis splits.
– Not back into a list of characters, but into something that
reflects the structure of the text.
– This “something” is typically a data structure called the
syntax tree/parse tree of the text.
■ The syntax analysis must also reject invalid texts by
reporting syntax errors
Introduction
■ By design, every programming language has precise rules
that prescribe the syntactic structure of well-formed
programs.
– For example in C, a program is made up of functions, functions
out of declarations and statements, statement out of expressions,
and so on.
■ The syntax of programming language constructs can be
specified by context-free grammars or BNF (Backus-Naur
Form) notation
The Role of the Syntax Analyzer
■ The parser reconstructs a derivation by which a given Context
Free Grammar (CFG) can generate a given input string.
– CFG is a recursive notation for describing sets of strings and
imposing a structure on each such string.
■ The syntax analyzer must also reject invalid texts by reporting
syntax errors.
■ Same basic strategy:
– A notation suitable for human understanding is transformed into
a machine-like low-level notation suitable for efficient execution.
– This process is called parser generation.
Types of Parsers
■ There are three general types of parsers for grammars:
universal, top-down, and bottom-up.
■ Universal parsing methods such as the Cocke-Younger-
Kasami algorithm and Earley's algorithm can parse any
grammar.
– These general methods are, however, too inefficient to
use in production compilers.
Types of Parsers: Top-down Parser
■ The top-down parser is the parser that generates parse
for the given input string with the help of grammar
productions by expanding the non-terminals
– It starts from the start symbol and ends on the terminals.
– It uses left most derivation
– Top-down methods build parse trees from the top (root) to the
bottom (leaves)
Types of Parsers: Top-down Parser
■ Top-down parser is classified into 2 types:
– Recursive descent parser is also known as the Brute force parser
or the backtracking parser. It generates the parse tree by using
brute force and backtracking.
– Non-recursive descent parser is also known as LL(1) parser or
predictive parser or without backtracking parser or dynamic
parser. It uses a parsing table to generate the parse tree instead
of backtracking.
Types of Parsers: Bottom-up Parser
■ Bottom-up Parser is the parser that generates the parse
tree for the given input string with the help of grammar
productions by compressing the non-terminals.
– It starts from non-terminals and ends on the start symbol.
– It uses the reverse of the rightmost derivation.
– Bottom-up methods start from the leaves and work their way up to
the root.
Types of Parsers: Bottom-up Parser
■ Bottom-up parser is classified into two types: LR parser, and
Operator precedence parser
■ LR parser is the bottom-up parser that generates the parse
tree for the given string by using unambiguous grammar. It
follows the reverse of the rightmost derivation.
■ LR parser is of four types:
– LR(0)
– SLR(1)
– LALR(1)
– CLR(1)
Types of Parsers: Bottom-up Parser
■ Operator precedence parser generates the parse tree
from given grammar and string but the only condition is
two consecutive non-terminals and epsilon never appears
on the right-hand side of any production.
Types of Parsers
Syntax Analysis
■ There are a number of tasks that might be conducted
during parsing, such as
– collecting information about various tokens into the
symbol table,
– performing type checking and other kinds of semantic
analysis, and
– generating intermediate code.
Syntax Tree/Parse Tree
■ The syntax tree is a tree structure.
– The leaves of this tree are the tokens found by the lexical
analysis.
– If the leaves are read from left to right, the sequence is
the same as in the input text.
■ What is important in the syntax tree is how these leaves
are combined to form the structure of the tree and how
the interior nodes of the tree are labelled.
Syntax Tree
Context-free Grammar
■ The notation we use for human manipulation is context-
free grammars, which is a recursive notation for
describing sets of strings and imposing a structure on
each such string.
– Context-free grammars describe sets of strings, i.e.,
languages.
– A context-free grammar also defines structure on the
strings in the language it defines.
Context-free Grammar
■ It recursively defines several sets of strings. Each set is
denoted by a name, which is called a nonterminal.
– The set of nonterminals is disjoint from the set of
terminals.
■ One of the nonterminals are chosen to denote the
language described by the grammar.
– This is called the start symbol of the grammar.
Context-free Grammar
■ The sets are described by a number of productions.
■ Each production describes some of the possible strings
that are contained in the set denoted by a nonterminal.
■ A production has the form:
𝑁 → 𝑋! … 𝑋"
■ where 𝑁 is a nonterminal and 𝑋! … 𝑋" are zero or more
symbols, each of which is either a terminal or a
nonterminal.
Example
𝐴→𝑎
■ Says that the set denoted by the nonterminal A contains
the one-character string a.
𝐵→
𝐵 → 𝑎𝐵
– where the first production indicates that the empty string
is part of the set B.
■ Productions with empty right-hand sides are called empty
productions.
Example
■ The examples have used only one nonterminal per
grammar.
■ When several nonterminals are used, we must make it
clear which of these is the start symbol.
■ By convention (if nothing else is stated), the nonterminal
on the left-hand side of the first production is the start
symbol.
Formal definition of Context Free Grammar
A context free grammar is a 4-tuple 𝑉, Σ, 𝑅, 𝑆 , where
■ 𝑉 is a finite set called variables,
■ Σ is a finite set, disjoint from 𝑉, called the terminals,
■ 𝑅 is a finite set of rules, with each rule being a variable
and a string from variables and terminals, and
■ 𝑆 ∈ 𝑉 is the start variables.
CFG
■ If u, v, and w are strings of variables and terminals, and
𝐴 → 𝑤 is a rule of the grammar, we say that 𝑢𝐴𝑣 yields
𝑢𝑤𝑣, written 𝑢𝐴𝑣 ⇒ 𝑢𝑤𝑣.
■ Say that u derives v, written 𝑢 ⇒∗ 𝑣, if a sequence
𝑢! , 𝑢$ , 𝑢% , … 𝑢& exists for 𝑘 ≥ 0 and 𝑢 ⇒ 𝑢! ⇒ 𝑢$ ⇒ ⋯ ⇒
𝑢& = 𝑣
■ The language of the grammar is 𝑤 ∈ Σ ∗ 𝑆 ⇒∗ 𝑤
Example
■ As an example, the grammar
T→R
T → aTa
R→b
R → bR
■ has T as start symbol and denotes the set of strings that
start with any number of as followed by a non-zero
number of b’s and then the same number of a’s with
which it started.
Context-free Grammar
■ When writing a grammar for a programming language, one
normally starts by dividing the constructs of the language into
different syntactic categories.
– A syntactic category is a sub-language that embodies a
particular concept.
■ Examples of common syntactic categories in programming
languages are:
– Expressions are used to express calculation of values.
– Statements express actions that occur in a particular
sequence.
– Declarations express properties of names used in other parts
of the program.
Examples
Simple expression grammar

𝐸𝑥𝑝 → 𝐸𝑥𝑝 + 𝐸𝑥𝑝

𝐸𝑥𝑝 → 𝐸𝑥𝑝 − 𝐸𝑥𝑝
𝐸𝑥𝑝 → 𝐸𝑥𝑝 ∗ 𝐸𝑥𝑝
𝐸𝑥𝑝 → 𝐸𝑥𝑝/𝐸𝑥𝑝
𝐸𝑥𝑝 → 𝒏𝒖𝒎
𝐸𝑥𝑝 → (𝐸𝑥𝑝)
Example
Simple statement grammar

𝑆𝑡𝑎𝑡 → 𝒊𝒅 ≔ 𝐸𝑥𝑝
𝑆𝑡𝑎𝑡 → 𝑆𝑡𝑎𝑡; 𝑆𝑡𝑎𝑡;
𝑆𝑡𝑎𝑡 → 𝒊𝒇 𝐸𝑥𝑝 𝒕𝒉𝒆𝒏 𝑆𝑡𝑎𝑡 𝒆𝒍𝒔𝒆 𝑆𝑡𝑎𝑡
𝑆𝑡𝑎𝑡 → 𝒊𝒇 𝐸𝑥𝑝 𝒕𝒉𝒆𝒏 𝑆𝑡𝑎𝑡

You Are Asked To Write A MapReduce Program With Py...
No ratings yet
You Are Asked To Write A MapReduce Program With Py...
5 pages
1 Syntax Analyzer
No ratings yet
1 Syntax Analyzer
33 pages
12.2Unit 2
No ratings yet
12.2Unit 2
25 pages
UNIT- 4 AI
No ratings yet
UNIT- 4 AI
35 pages
1 Syntax Analyzer
No ratings yet
1 Syntax Analyzer
33 pages
Syntax Analysis: The Role of Parser
No ratings yet
Syntax Analysis: The Role of Parser
3 pages
Syntax Analysis
No ratings yet
Syntax Analysis
58 pages
II. Parser: Syntax Analysis
No ratings yet
II. Parser: Syntax Analysis
18 pages
Unit 2
No ratings yet
Unit 2
45 pages
Syntax Analysis (Part-I)
No ratings yet
Syntax Analysis (Part-I)
88 pages
Lecture05-Syntax Analysis-CFG
No ratings yet
Lecture05-Syntax Analysis-CFG
19 pages
CSC 461 Final
No ratings yet
CSC 461 Final
170 pages
CD Module2 16 03 23 PDF
No ratings yet
CD Module2 16 03 23 PDF
36 pages
4 - Syntax Analyzer (CFG)
No ratings yet
4 - Syntax Analyzer (CFG)
41 pages
Unit-2 PCD
No ratings yet
Unit-2 PCD
36 pages
COMPILER DESIGN UNIT 2
No ratings yet
COMPILER DESIGN UNIT 2
44 pages
Chapter 3 (1)
No ratings yet
Chapter 3 (1)
43 pages
4 - Syntax Analyzer(CFG) Copy
No ratings yet
4 - Syntax Analyzer(CFG) Copy
42 pages
Describing Syntax and Semantics: CSE 325/CSE 425: Concepts of Programming Language
No ratings yet
Describing Syntax and Semantics: CSE 325/CSE 425: Concepts of Programming Language
46 pages
3-Module 2 - Role of Parser - Parse Tree-02-08-2024
No ratings yet
3-Module 2 - Role of Parser - Parse Tree-02-08-2024
76 pages
Principals of Programming Language 1.2
No ratings yet
Principals of Programming Language 1.2
86 pages
toc 2
No ratings yet
toc 2
23 pages
Unit 2 - Sessions 1 - 2
No ratings yet
Unit 2 - Sessions 1 - 2
133 pages
Unit 2 - Sessions 1 - 2
No ratings yet
Unit 2 - Sessions 1 - 2
36 pages
Describing Syntax and Semantics
No ratings yet
Describing Syntax and Semantics
6 pages
Chapter – 3
No ratings yet
Chapter – 3
46 pages
Mod - 3 (2)
No ratings yet
Mod - 3 (2)
51 pages
Unit 2
No ratings yet
Unit 2
29 pages
Word Level Analysis
No ratings yet
Word Level Analysis
49 pages
Morphological Parsing
No ratings yet
Morphological Parsing
19 pages
Day 5 - Syntax Analysis
No ratings yet
Day 5 - Syntax Analysis
46 pages
Why Syntax Analysis?
No ratings yet
Why Syntax Analysis?
15 pages
2 Syntax Analysis - Introduction
No ratings yet
2 Syntax Analysis - Introduction
8 pages
Unit-3 Syntax Analysis
No ratings yet
Unit-3 Syntax Analysis
319 pages
FLAT 1
No ratings yet
FLAT 1
16 pages
Unit2 TopDownParsing
No ratings yet
Unit2 TopDownParsing
12 pages
MODULE 3 Syntax Analysis
100% (1)
MODULE 3 Syntax Analysis
182 pages
What Is Parsing: Parsing Is The Process of Analyzing An Input Sequence in Order
No ratings yet
What Is Parsing: Parsing Is The Process of Analyzing An Input Sequence in Order
9 pages
toc mod3
No ratings yet
toc mod3
72 pages
SEN 317 Lecture 3
No ratings yet
SEN 317 Lecture 3
10 pages
ATCD PPT Module-3
No ratings yet
ATCD PPT Module-3
136 pages
SPCC - 5
No ratings yet
SPCC - 5
19 pages
Chapter 3- Syntax Analysis
No ratings yet
Chapter 3- Syntax Analysis
9 pages
Chapter 3 Syntax Analysis
No ratings yet
Chapter 3 Syntax Analysis
54 pages
Lecture 5
No ratings yet
Lecture 5
28 pages
Types of Parser
No ratings yet
Types of Parser
17 pages
CD Unit 2
No ratings yet
CD Unit 2
19 pages
2. Simple Syntax Directed Translation
No ratings yet
2. Simple Syntax Directed Translation
51 pages
Chapter 3
No ratings yet
Chapter 3
77 pages
Compiler Construction Material
No ratings yet
Compiler Construction Material
98 pages
nlp unit 2
No ratings yet
nlp unit 2
13 pages
Notes On Formal Grammars: What Is A Grammar?
No ratings yet
Notes On Formal Grammars: What Is A Grammar?
8 pages
Unit Iii
No ratings yet
Unit Iii
17 pages
Compiler Construction Material
No ratings yet
Compiler Construction Material
94 pages
2.1 - Lexical Analysis
No ratings yet
2.1 - Lexical Analysis
102 pages
Chapter 3 Syntax Analysis
No ratings yet
Chapter 3 Syntax Analysis
20 pages
1.describing Syntax and Semantics
No ratings yet
1.describing Syntax and Semantics
110 pages
Unit 2
No ratings yet
Unit 2
10 pages
FLAT Unitt-1
No ratings yet
FLAT Unitt-1
9 pages
Syntax & Semantics
No ratings yet
Syntax & Semantics
34 pages
Ian Talks Regex A-Z
From Everand
Ian Talks Regex A-Z
Ian Eress
No ratings yet
Linux Programming: Dear Students
No ratings yet
Linux Programming: Dear Students
2 pages
How To Read XML Files in Datastage Server Edition
No ratings yet
How To Read XML Files in Datastage Server Edition
18 pages
CS304 - MCQS Object Oriented Programming
No ratings yet
CS304 - MCQS Object Oriented Programming
10 pages
Madhukrishna Nipankar Resume.pdf
No ratings yet
Madhukrishna Nipankar Resume.pdf
1 page
Resume in Latex
No ratings yet
Resume in Latex
12 pages
SAP HANA SQL Script Reference en
No ratings yet
SAP HANA SQL Script Reference en
156 pages
QB104451 2013 Regulation PDF
No ratings yet
QB104451 2013 Regulation PDF
12 pages
11-AutoSys Basic Commands Quick Reference
100% (1)
11-AutoSys Basic Commands Quick Reference
1 page
Eeq 5 V 4 JCDQR 4 Emxk 21
No ratings yet
Eeq 5 V 4 JCDQR 4 Emxk 21
4 pages
A "Crash Course" To (Matlab And) C Programming For TTT4225
No ratings yet
A "Crash Course" To (Matlab And) C Programming For TTT4225
49 pages
Introduction To Python Solutions
No ratings yet
Introduction To Python Solutions
36 pages
Introduction To Octave - Sandeep Nagar
No ratings yet
Introduction To Octave - Sandeep Nagar
80 pages
The Complete Rust Programming Reference Guide Rahul Sharma download
100% (5)
The Complete Rust Programming Reference Guide Rahul Sharma download
65 pages
Error List
No ratings yet
Error List
350 pages
Nic 225296
No ratings yet
Nic 225296
830 pages
Syllabi MTech Artificial Intelligence
No ratings yet
Syllabi MTech Artificial Intelligence
56 pages
Pranav Goel: Objective
No ratings yet
Pranav Goel: Objective
3 pages
50 Coding Interview Questions V2
No ratings yet
50 Coding Interview Questions V2
37 pages
Full Stack Development (R20a0516)
No ratings yet
Full Stack Development (R20a0516)
131 pages
Name Null? Type Emp - No Not Null Number (5) Last - Name VARCHAR2 (10) Dept - No Not Null Number (5) Salary NUMBER (6,2)
No ratings yet
Name Null? Type Emp - No Not Null Number (5) Last - Name VARCHAR2 (10) Dept - No Not Null Number (5) Salary NUMBER (6,2)
22 pages
Cabañasrd, From Native To Cross-Platform With Xamarin Forms (3/4)
No ratings yet
Cabañasrd, From Native To Cross-Platform With Xamarin Forms (3/4)
11 pages
Command Injection Essence
No ratings yet
Command Injection Essence
11 pages
ODEX Programmers Guide PDF
0% (1)
ODEX Programmers Guide PDF
96 pages
Expp 10
No ratings yet
Expp 10
4 pages
CS10-8L: Computer Programming Laboratory Machine Problem #3: Variables, Input and Output
No ratings yet
CS10-8L: Computer Programming Laboratory Machine Problem #3: Variables, Input and Output
4 pages
Varianta 4
No ratings yet
Varianta 4
19 pages
JS Interview Questions-1
No ratings yet
JS Interview Questions-1
25 pages
MD070 Application Extensions Technical Design
No ratings yet
MD070 Application Extensions Technical Design
18 pages
Paper A Parallel Bloom Filter String Searching Algorithm
No ratings yet
Paper A Parallel Bloom Filter String Searching Algorithm
28 pages

lecture 4

Uploaded by

lecture 4

Uploaded by

COMPILERS

𝐸𝑥𝑝 → 𝐸𝑥𝑝 + 𝐸𝑥𝑝

You might also like