SlideShare a Scribd company logo
Syntax – Intro and Overview

           CS331
Syntax
• Syntax defines what is grammatically valid in a programming
  language
   – Set of grammatical rules
   – E.g. in English, a sentence cannot begin with a period
   – Must be formal and exact or there will be ambiguity in a
     programming language
• We will study three levels of syntax
   – Lexical
       • Defines the rules for tokens: literals, identifiers, etc.
   – Concrete
       • Actual representation scheme down to every semicolon, i.e. every lexical
         token
   – Abstract
       • Description of a program’s information without worrying about specific
         details such as where the parentheses or semicolons go
BNF Grammar
• BNF = Backus-Naur Form to specify a grammar
   – Equivalent to a context free grammar
• Set of rewriting rules (a rule that can be applied multiple
  times) defined on a set of nonterminal symbols, a set of
  terminal symbols, and a start symbol
   – Terminals, ∑ : Basic alphabet from which programs are
     constructed. E.g., letters, digits, or keywords such as “int”,
     “main”, “{“, “}”
   – Nonterminals, N : Identify grammatical categories
   – Start Symbol: One of the nonterminals which identifies the
     principal category. E.g., “Sentence” for english, “Program” for a
     programming language
Rewriting Rules
• Rewriting Rules, ρ
   – Written using the symbols  and |
     | is a separator for alternative definitions, i.e. “OR”
      is used to define a rule, i.e. “IS”

   – Format
       • LHS  RHS1 | RHS2 | RHS3 | …
       • LHS is a single nonterminal
       • RHS is any sequence of terminals and nonterminals
Sample Grammars
• Grammar for subset of English
      Sentence  Noun Verb
      Noun  Jack | Jill
      Verb  eats | bites
• Grammar for a digit
      Digit  0 | 1 | 2 | 3 | 4 | 5 | 6 |7 |8 |9
• Grammar for signed integers
      SignedInteger  Sign Integer
      Sign  + | -
      Integer  Digit | Digit Integer
• Grammar for subset of Java
      Assignment  Variable = Expression
      Expression  Variable | Variable + Variable | Variable – Variable
      Variable  X | Y
Derivation
 • Process of parsing data using a grammar
     – Apply rewrite rules to non-terminals on the RHS of an existing
       rule
     – To match, the derivation must terminate and be composed of
       terminals only
 • Example
       Digit  0 | 1 | 2 | 3 | 4 | 5 | 6 |7 |8 |9
       Integer  Digit | Digit Integer
     – Is 352 an Integer?
         Integer → Digit Integer → 3 Integer →
         3 Digit Integer → 3 5 Integer →
         3 5 Digit → 3 5 2

Intermediate formats are called sentential forms
This was called a Leftmost Derivation since we replaced the
leftmost nonterminal symbol each time (could also do Rightmost)
Derivation and Parse Trees
• The derivation can be
  visualized as a parse
  tree
                  Integer

          Digit             Integer

                  Digit               Integer
      3

             5                Digit

                      2
Parse Tree Sketch for Programs
BNF and Languages
• The language defined by a BNF grammar is the set of all
  strings that can be derived
   – Language can be infinite, e.g. case of integers
• A language is ambiguous if it permits a string to be parsed
  into two separate parse trees
   – Generally want to avoid ambiguous grammars
   – Example:
       • Expr  Integer | Expr + Expr | Expr * Expr | Expr - Expr
       • Parse: 3*4+1
            – Expr * Expr → Integer * Expr →
              3 * Expr → 3 * Expr+Expr → … 3 * 4 + 1
            – Expr + Expr → Expr + Integer → Expr + 1
              Expr * Expr +1 → … 3 * 4 + 1
Ambiguity
• Example for
    AmbExp  Integer | AmbExp – AmbExp
    2-3-4
Ambiguous IF Statement
Dangling ELSE:

       if (x<0)
       if (y<0) { y=y-1 }
       else { y=0 };

Does the else go with the first or second if?
Dangling Else Ambiguity
How to fix ambiguity?
• Use explicit grammar without ambiguity
  – E.g., add an “ENDIF” for every “IF”
  – Java makes a separate category for if-else vs. if:
     IfThenStatement  If (Expr) Statement
     IfThenElseStatement  If (Expr) StatementNoShortIf else
       Statement
     StatementNoShortIf contains everything except
       IfThenStatement, so the else always goes with the
       IfThenElse statement not the IfThenStatement
• Use precedence on symbols
Alternative to BNF
• The use of regular expressions is an alternate
  way to express a language
Regex to EBNF
• The book uses some deviations from “standard”
  regular expressions in Extended Backus Naur
  Format (defined in a few slides)
    { M } means zero or more occurrences of M
    ( M | N) means one of M or N must be chosen
    [M]      means M is optional

  Use “{“ to mean the literal { not the regex {
RegEx Examples
• Booleans
   – “true” | “false”
• Integers
   – (0-9)+
• Identifiers
   – (a-zA-Z){a-zA-Z0-9}
• Comments (letters/space only)
   – “//”{a-zA-Z }(“r” | “n” | “rn”)
• Regular expressions seem pretty powerful
   – Can you write one for the language anbn? (i.e. n a’s followed by n
     b’s)
Extended BNF
• EBNF – variation of BNF that simplifies
  specification of recursive rules using regular
  expressions in the RHS of the rule
• Example:
   – BNF rule
        Expr  Term | Expr + Term | Expr – Term
        Term  Factor | Term * Factor | Term / Factor
   – EBNF equivalent
        Expr  Term { [+|-] Term }
        Term  Factor { [* | / ] Factor }
• EBNF tends to be shorter and easier to read
EBNF
• Consider:
Expr  Term{ (+|-) Term }
Term  Factor { (* | / ) Factor }
Factor  Identifier | Literal | (Expr)

Parse for X+2*Y
BNF and Lexical Analysis
• Lexicon of a programming language – set of all
  nonterminals from which programs are written
• Nonterminals – referred to as tokens
   – Each token is described by its type (e.g. identifier,
     expression) and its value (the string it represents)
   – Skipping whitespace or comments


                                              or punctuation
Categories of Lexical Tokens
• Identifiers
• Literals
   Includes Integers, true, false, floats, chars
• Keywords
   bool char else false float if int main true while
• Operators
   = || && == != < <= > >= + - * / % ! [ ]
• Punctuation
   ;.{}()

   Issues to consider: Ignoring comments, role of whitespace,
      distinguising the < operator from <=, distinguishing
      identifiers from keywords like “if”
A Simple Lexical Syntax for a Small
             Language, Clite
Primary → Identifier [ "["Expression"]" ] | Literal | "("Expression")"
       | Type "("Expression")"

Identifier → Letter { Letter | Digit }
Letter → a | b | … | z | A | B | … Z
Digit → 0 | 1 | 2 | … | 9
Literal → Integer | Boolean | Float | Char
Integer → Digit { Digit }
Boolean → true | false
Float → Integer . Integer
Char → ‘ ASCIICHAR ‘
Major Stages in Compilation
• Lexical Analysis
   – Translates source into a stream of Tokens, everything else
     discarded
• Syntactic Analysis
   – Parses tokens, detects syntax errors, develops abstract
     representation
• Semantic Analysis
   – Analyze the parse for semantic consistency, transform into a
     format the architecture can efficiently run on
• Code Generation
   – Use results of abstract representation as a basis for generating
     executable machine code
Lexical Analysis & Compiling
               Process




Difficulties: 1 to many mapping from HL source to machine code
       Translation must be correct
       Translation should be efficient
Lexical Analysis of Clite
• Lexical Analysis – transforms a program
  into tokens (type, value). The rest is tossed.
• Example Clite program:
           // Simple Program
           int main() {
              int x;
              x = 3;
           }

Result of Lexical Analysis:
Lexical Analysis (2)
Result of Lexical Analysis:
 1 Type: Int Value:    int
 2 Type: Main Value:   main
 3 Type: LeftParen     Value: (   // Simple Program
 4 Type: RightParen    Value: )   int main() {
 5 Type: LeftBrace     Value: {      int x;
 6 Type: Int Value:    int           x = 3;
 7 Type: Identifier    Value: x   }
 8 Type: Semicolon     Value: ;
 9 Type: Identifier    Value: x
 10 Type: Assign       Value: =
 11 Type: IntLiteral   Value: 3
 12 Type: Semicolon    Value: ;
 13 Type: RightBrace   Value: }
 14 Type: Eof Value:   <<EOF>>
Lexical Analysis of Clite in Java
public class TokenTester {
     public static void main (String[] args) {
      Lexer lex = new Lexer (args[0]);
      Token t;
      int i = 1;

     do
     {
        t = lex.next();
        System.out.println(i+" Type: "+t.type()
                   +"tValue: "+t.value());
        i++;
      } while (t != Token.eofTok);
     }
 }

The source code for how the Lexer and Token classes are arranged
is the topic of chapter 3
Lexical to Concrete
• From the stream of tokens generated by our
  lexical analyzer we can now parse them
  using a concrete syntax
Concrete EBNF Syntax for Clite
Program → int main ( ) { Declarations Statements }
Declarations → { Declaration }
Declaration → Type Identifier [ "["Integer"]" ] { , Identifier ["["Integer"]"] };
Type → int | bool | float | char
Statements →{ Statement }
Statement → ; | Block | Assignment | IfStatement | WhileStatement
Block → { Statements }
Assignment → Identifier ["["Expression"]" ] = Expression ;
IfStatement → if "(" Expression ")" Statement [ else Statement ]
WhileStatement → while "("Expression")" Statement




                                                            Concrete Syntax;
                                                            Higher than lexical
                                                            syntax!
Concrete EBNF Syntax for Clite
Expression → Conjunction { || Conjunction }
Conjunction →Equality { && Equality }
Equality → Relation [ EquOp Relation ]
EquOp → == | !=
Relation → Addition [ RelOp Addition ]
RelOp → < | <= | > | >=                                  References lexical
Addition → Term { AddOp Term }                           syntax
AddOp → + | -
Term → Factor { MulOp Factor }
MulOp → * | / | %
Factor → [ UnaryOp ] Primary
UnaryOp → - | !
Primary → Identifier [ "["Expression"]" ] | Literal | "("Expression")" |
         Type "(" Expression ")"
Syntax Diagram
• Alternate way to specify a language
• Popularized with Pascal
• Not any more powerful than BNF, EBNF, or regular
  expressions
Linking Syntax and Semantics
• What we’ve described so far has been
  concrete syntax
  – Defines all parts of the language above the
    lexical level
     • Assignments, loops, functions, definitions, etc.
     • Uses BNF or variant to describe the language
• An abstract syntax links the concrete syntax
  to the semantic level
Abstract Syntax
• Defines essential syntactic elements without
  describing how they are concretely
  constructed
• Consider the following Pascal and C loops
   Pascal                            C
   while i<n do begin                while (i<n) {
     i:=i+1                                i=i+1;
   end                               }

Small differences in concrete syntax; identical abstract construct
Abstract Syntax Format
• Defined using rules of the form
  – LHS = RHS
     • LHS is the name of an abstract syntactic class
     • RHS is a list of essential components that define the
       class
        – Similar to defining a variable. Data type or abstract
          syntactic class, and name
        – Components are separated by ;
• Recursion naturally occurs among the
  definitions as with BNF
Abstract Syntax Example
• Loop
    Loop = Expression test ; Statement body
   – The abstract class Loop has two components, a test which is a
    member of the abstract class Expression, and a body which is a
    member of an abstract class Statement

• Nice by-product: If parsing abstract syntax in Java, it
  makes sense to actually define a class for each abstract
  syntactic class, e.g.
      class Loop extends Statement {
        Expression test;
        Statement body;
      }
Abstract Syntax of Clite
Program = Declarations decpart; Statements body;
Declarations = Declaration*
Declaration = VariableDecl   |   ArrayDecl
VariableDecl = Variable v; Type t
ArrayDecl = Variable v; Type t; Integer size
Type = int | bool | float | char
Statements = Statement*
Statement = Skip | Block | Assignment |
            Conditional | Loop
Skip =
Block = Statements
Conditional = Expression test;
             Statement thenbranch, elsebranch
Loop = Expression test; Statement body
Assignment = VariableRef target;   Expression source
Expression = VariableRef | Value | Binary | Unary
Abstract Syntax of Clite
VariableRef = Variable | ArrayRef
Binary = Operator op; Expression term1, term2
Unary = UnaryOp op; Expression term
Operator = BooleanOp | RelationalOp | ArithmeticOp
BooleanOp = && | ||
RelationalOp = = | ! | != | < | <= | > | >=
ArithmeticOp = + | - | * | /
UnaryOp = ! | -
Variable = String id
ArrayRef = String id; Expression index
Value = IntValue | BoolValue | FloatValue | CharValue
IntValue = Integer intValue
FloatValue = Float floatValue
BoolValue = Boolean boolValue
CharValue = Character charValue
Java AbstractSyntax for Clite
class Loop extends Statement {
  Expression test;
  Statement body;
}
Class Assignment extends Statement {
  // Assignment = Variable target; Expression source
  Variable target;
  Expression source;
}


…
Much more… see the file (when available)
Abstract Syntax Tree
• Just as we can build a parse tree from a BNF grammar, we
  can build an abstract syntax tree from an abstract syntax
• Example for: x+2*y
  Expression = Variable | Value | Binary
  Binary = Operator op ; Expression term1, term2




                                                   Binary node




              Expr
Sample Clite Program
• Compute nth fib number
Abstract Syntax for Loop of Clite Program
Concrete and Abstract Syntax
• Aren’t the two redundant?
   – A little bit
• The concrete syntax tells the programmer exactly
  what to write to have a valid program
• The abstract syntax allows valid programs in two
  different languages to share common abstract
  representations
   – It is closer to semantics
   – We need both!
What’s coming up?
• Semantic analysis
   – Do the types match? What does this mean?
      char a=‘c’;
      int sum=0;
      sum = sum = a;
• Can associate machine code with the abstract
  parse
   – Code generation
   – Code optimization
Ad

More Related Content

What's hot (20)

Finite automata
Finite automataFinite automata
Finite automata
Bipul Roy Bpl
 
Lecture: Context-Free Grammars
Lecture: Context-Free GrammarsLecture: Context-Free Grammars
Lecture: Context-Free Grammars
Marina Santini
 
Asymptotic notations
Asymptotic notationsAsymptotic notations
Asymptotic notations
Nikhil Sharma
 
COMPILER DESIGN
COMPILER DESIGNCOMPILER DESIGN
COMPILER DESIGN
Vetukurivenkatashiva
 
pointer-to-object-.pptx
pointer-to-object-.pptxpointer-to-object-.pptx
pointer-to-object-.pptx
ZaibunnisaMalik1
 
Lecture 3,4
Lecture 3,4Lecture 3,4
Lecture 3,4
shah zeb
 
NLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological ParsingNLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological Parsing
Hemantha Kulathilake
 
Introduction TO Finite Automata
Introduction TO Finite AutomataIntroduction TO Finite Automata
Introduction TO Finite Automata
Ratnakar Mikkili
 
Master method
Master method Master method
Master method
Rajendran
 
Context free grammar
Context free grammarContext free grammar
Context free grammar
Radhakrishnan Chinnusamy
 
Problem solving in Artificial Intelligence.pptx
Problem solving in Artificial Intelligence.pptxProblem solving in Artificial Intelligence.pptx
Problem solving in Artificial Intelligence.pptx
kitsenthilkumarcse
 
Intermediate code generator
Intermediate code generatorIntermediate code generator
Intermediate code generator
sanchi29
 
Lecture 1 - Lexical Analysis.ppt
Lecture 1 - Lexical Analysis.pptLecture 1 - Lexical Analysis.ppt
Lecture 1 - Lexical Analysis.ppt
NderituGichuki1
 
Regular Expression Examples.pptx
Regular Expression Examples.pptxRegular Expression Examples.pptx
Regular Expression Examples.pptx
GhulamRabani9
 
Algorithms Lecture 2: Analysis of Algorithms I
Algorithms Lecture 2: Analysis of Algorithms IAlgorithms Lecture 2: Analysis of Algorithms I
Algorithms Lecture 2: Analysis of Algorithms I
Mohamed Loey
 
language , grammar and automata
language , grammar and automatalanguage , grammar and automata
language , grammar and automata
ElakkiyaS11
 
Operator precedence
Operator precedenceOperator precedence
Operator precedence
Akshaya Arunan
 
Chapter _4_Semantic Analysis .pptx
Chapter _4_Semantic Analysis .pptxChapter _4_Semantic Analysis .pptx
Chapter _4_Semantic Analysis .pptx
ArebuMaruf
 
Automata theory
Automata theoryAutomata theory
Automata theory
Pardeep Vats
 
Minimization of DFA
Minimization of DFAMinimization of DFA
Minimization of DFA
International Institute of Information Technology (I²IT)
 

Viewers also liked (20)

The History of Programming Languages
The History of Programming LanguagesThe History of Programming Languages
The History of Programming Languages
nTier Custom Solutions
 
A Brief History of Programming
A Brief History of ProgrammingA Brief History of Programming
A Brief History of Programming
jxyz
 
History of programming
History of programmingHistory of programming
History of programming
Sharwin Calimlim
 
Lect 1. introduction to programming languages
Lect 1. introduction to programming languagesLect 1. introduction to programming languages
Lect 1. introduction to programming languages
Varun Garg
 
2 1 bussistem
2 1 bussistem2 1 bussistem
2 1 bussistem
putusumaye
 
Use case diagrams
Use case diagramsUse case diagrams
Use case diagrams
Mir Majid
 
Climbing the Abstract Syntax Tree (Bulgaria PHP 2016)
Climbing the Abstract Syntax Tree (Bulgaria PHP 2016)Climbing the Abstract Syntax Tree (Bulgaria PHP 2016)
Climbing the Abstract Syntax Tree (Bulgaria PHP 2016)
James Titcumb
 
Hugging Abstract Syntax Trees: A Pythonic Love Story (OSDC 2010)
Hugging Abstract Syntax Trees: A Pythonic Love Story (OSDC 2010)Hugging Abstract Syntax Trees: A Pythonic Love Story (OSDC 2010)
Hugging Abstract Syntax Trees: A Pythonic Love Story (OSDC 2010)
Tom Lee
 
Dynamic Semantics Specification and Interpreter Generation
Dynamic Semantics Specification and Interpreter GenerationDynamic Semantics Specification and Interpreter Generation
Dynamic Semantics Specification and Interpreter Generation
Eelco Visser
 
Applying SOS to MDE
Applying SOS to MDEApplying SOS to MDE
Applying SOS to MDE
Tjerk W
 
Ch1
Ch1Ch1
Ch1
kinnarshah8888
 
Finite automata
Finite automataFinite automata
Finite automata
Dr. Abhineet Anand
 
Ch5b
Ch5bCh5b
Ch5b
kinnarshah8888
 
Chapter Seven(1)
Chapter Seven(1)Chapter Seven(1)
Chapter Seven(1)
bolovv
 
Ch2
Ch2Ch2
Ch2
kinnarshah8888
 
01. introduction
01. introduction01. introduction
01. introduction
babaaasingh123
 
Programming Language
Programming LanguageProgramming Language
Programming Language
Education Front
 
4 evolution-of-programming-languages
4 evolution-of-programming-languages4 evolution-of-programming-languages
4 evolution-of-programming-languages
Rohit Shrivastava
 
Programming languages
Programming languagesProgramming languages
Programming languages
Archana Maharjan
 
A Few of My Favorite (Python) Things
A Few of My Favorite (Python) ThingsA Few of My Favorite (Python) Things
A Few of My Favorite (Python) Things
Michael Pirnat
 
A Brief History of Programming
A Brief History of ProgrammingA Brief History of Programming
A Brief History of Programming
jxyz
 
Lect 1. introduction to programming languages
Lect 1. introduction to programming languagesLect 1. introduction to programming languages
Lect 1. introduction to programming languages
Varun Garg
 
Use case diagrams
Use case diagramsUse case diagrams
Use case diagrams
Mir Majid
 
Climbing the Abstract Syntax Tree (Bulgaria PHP 2016)
Climbing the Abstract Syntax Tree (Bulgaria PHP 2016)Climbing the Abstract Syntax Tree (Bulgaria PHP 2016)
Climbing the Abstract Syntax Tree (Bulgaria PHP 2016)
James Titcumb
 
Hugging Abstract Syntax Trees: A Pythonic Love Story (OSDC 2010)
Hugging Abstract Syntax Trees: A Pythonic Love Story (OSDC 2010)Hugging Abstract Syntax Trees: A Pythonic Love Story (OSDC 2010)
Hugging Abstract Syntax Trees: A Pythonic Love Story (OSDC 2010)
Tom Lee
 
Dynamic Semantics Specification and Interpreter Generation
Dynamic Semantics Specification and Interpreter GenerationDynamic Semantics Specification and Interpreter Generation
Dynamic Semantics Specification and Interpreter Generation
Eelco Visser
 
Applying SOS to MDE
Applying SOS to MDEApplying SOS to MDE
Applying SOS to MDE
Tjerk W
 
Chapter Seven(1)
Chapter Seven(1)Chapter Seven(1)
Chapter Seven(1)
bolovv
 
4 evolution-of-programming-languages
4 evolution-of-programming-languages4 evolution-of-programming-languages
4 evolution-of-programming-languages
Rohit Shrivastava
 
A Few of My Favorite (Python) Things
A Few of My Favorite (Python) ThingsA Few of My Favorite (Python) Things
A Few of My Favorite (Python) Things
Michael Pirnat
 
Ad

Similar to Syntax (20)

Lexical analysis Compiler design pdf to read
Lexical analysis Compiler design pdf to readLexical analysis Compiler design pdf to read
Lexical analysis Compiler design pdf to read
shubhamsingaal
 
Lexical analysis compiler design to read and study
Lexical analysis compiler design to read and studyLexical analysis compiler design to read and study
Lexical analysis compiler design to read and study
shubhamsingaal
 
Lecture 02 lexical analysis
Lecture 02 lexical analysisLecture 02 lexical analysis
Lecture 02 lexical analysis
Iffat Anjum
 
Compiler Design_Lexical Analysis phase.pptx
Compiler Design_Lexical Analysis phase.pptxCompiler Design_Lexical Analysis phase.pptx
Compiler Design_Lexical Analysis phase.pptx
RushaliDeshmukh2
 
atc 3rd module compiler and automata.ppt
atc 3rd module compiler and automata.pptatc 3rd module compiler and automata.ppt
atc 3rd module compiler and automata.ppt
ranjan317165
 
Control structure
Control structureControl structure
Control structure
baran19901990
 
Compiler Design in Engineering for Designing
Compiler Design in Engineering for DesigningCompiler Design in Engineering for Designing
Compiler Design in Engineering for Designing
Adityaverma750841
 
3a. Context Free Grammar.pdf
3a. Context Free Grammar.pdf3a. Context Free Grammar.pdf
3a. Context Free Grammar.pdf
TANZINTANZINA
 
Lexical1
Lexical1Lexical1
Lexical1
ASHOK KUMAR REDDY
 
Bishwadeep Bose_Compiler Design.pdf made by student
Bishwadeep Bose_Compiler Design.pdf made by studentBishwadeep Bose_Compiler Design.pdf made by student
Bishwadeep Bose_Compiler Design.pdf made by student
debmalyachaki5
 
Lecture3 lexical analysis
Lecture3 lexical analysisLecture3 lexical analysis
Lecture3 lexical analysis
Mahesh Kumar Chelimilla
 
Lexical analysis - Compiler Design
Lexical analysis - Compiler DesignLexical analysis - Compiler Design
Lexical analysis - Compiler Design
Kuppusamy P
 
Chapter-3 compiler.pptx course materials
Chapter-3 compiler.pptx course materialsChapter-3 compiler.pptx course materials
Chapter-3 compiler.pptx course materials
gadisaAdamu
 
CH 2.pptx
CH 2.pptxCH 2.pptx
CH 2.pptx
Obsa2
 
Computational model language and grammar bnf
Computational model language and grammar bnfComputational model language and grammar bnf
Computational model language and grammar bnf
Taha Shakeel
 
02. Chapter 3 - Lexical Analysis NLP.ppt
02. Chapter 3 - Lexical Analysis NLP.ppt02. Chapter 3 - Lexical Analysis NLP.ppt
02. Chapter 3 - Lexical Analysis NLP.ppt
charvivij
 
Lecture 04 syntax analysis
Lecture 04 syntax analysisLecture 04 syntax analysis
Lecture 04 syntax analysis
Iffat Anjum
 
Unitiv 111206005201-phpapp01
Unitiv 111206005201-phpapp01Unitiv 111206005201-phpapp01
Unitiv 111206005201-phpapp01
riddhi viradiya
 
CS-4337_03_Chapter3- syntax and semantics.pdf
CS-4337_03_Chapter3- syntax and semantics.pdfCS-4337_03_Chapter3- syntax and semantics.pdf
CS-4337_03_Chapter3- syntax and semantics.pdf
FutureKids1
 
Lexicalanalyzer
LexicalanalyzerLexicalanalyzer
Lexicalanalyzer
Royalzig Luxury Furniture
 
Lexical analysis Compiler design pdf to read
Lexical analysis Compiler design pdf to readLexical analysis Compiler design pdf to read
Lexical analysis Compiler design pdf to read
shubhamsingaal
 
Lexical analysis compiler design to read and study
Lexical analysis compiler design to read and studyLexical analysis compiler design to read and study
Lexical analysis compiler design to read and study
shubhamsingaal
 
Lecture 02 lexical analysis
Lecture 02 lexical analysisLecture 02 lexical analysis
Lecture 02 lexical analysis
Iffat Anjum
 
Compiler Design_Lexical Analysis phase.pptx
Compiler Design_Lexical Analysis phase.pptxCompiler Design_Lexical Analysis phase.pptx
Compiler Design_Lexical Analysis phase.pptx
RushaliDeshmukh2
 
atc 3rd module compiler and automata.ppt
atc 3rd module compiler and automata.pptatc 3rd module compiler and automata.ppt
atc 3rd module compiler and automata.ppt
ranjan317165
 
Compiler Design in Engineering for Designing
Compiler Design in Engineering for DesigningCompiler Design in Engineering for Designing
Compiler Design in Engineering for Designing
Adityaverma750841
 
3a. Context Free Grammar.pdf
3a. Context Free Grammar.pdf3a. Context Free Grammar.pdf
3a. Context Free Grammar.pdf
TANZINTANZINA
 
Bishwadeep Bose_Compiler Design.pdf made by student
Bishwadeep Bose_Compiler Design.pdf made by studentBishwadeep Bose_Compiler Design.pdf made by student
Bishwadeep Bose_Compiler Design.pdf made by student
debmalyachaki5
 
Lexical analysis - Compiler Design
Lexical analysis - Compiler DesignLexical analysis - Compiler Design
Lexical analysis - Compiler Design
Kuppusamy P
 
Chapter-3 compiler.pptx course materials
Chapter-3 compiler.pptx course materialsChapter-3 compiler.pptx course materials
Chapter-3 compiler.pptx course materials
gadisaAdamu
 
CH 2.pptx
CH 2.pptxCH 2.pptx
CH 2.pptx
Obsa2
 
Computational model language and grammar bnf
Computational model language and grammar bnfComputational model language and grammar bnf
Computational model language and grammar bnf
Taha Shakeel
 
02. Chapter 3 - Lexical Analysis NLP.ppt
02. Chapter 3 - Lexical Analysis NLP.ppt02. Chapter 3 - Lexical Analysis NLP.ppt
02. Chapter 3 - Lexical Analysis NLP.ppt
charvivij
 
Lecture 04 syntax analysis
Lecture 04 syntax analysisLecture 04 syntax analysis
Lecture 04 syntax analysis
Iffat Anjum
 
Unitiv 111206005201-phpapp01
Unitiv 111206005201-phpapp01Unitiv 111206005201-phpapp01
Unitiv 111206005201-phpapp01
riddhi viradiya
 
CS-4337_03_Chapter3- syntax and semantics.pdf
CS-4337_03_Chapter3- syntax and semantics.pdfCS-4337_03_Chapter3- syntax and semantics.pdf
CS-4337_03_Chapter3- syntax and semantics.pdf
FutureKids1
 
Ad

Recently uploaded (20)

Social Problem-Unemployment .pptx notes for Physiotherapy Students
Social Problem-Unemployment .pptx notes for Physiotherapy StudentsSocial Problem-Unemployment .pptx notes for Physiotherapy Students
Social Problem-Unemployment .pptx notes for Physiotherapy Students
DrNidhiAgarwal
 
How to Customize Your Financial Reports & Tax Reports With Odoo 17 Accounting
How to Customize Your Financial Reports & Tax Reports With Odoo 17 AccountingHow to Customize Your Financial Reports & Tax Reports With Odoo 17 Accounting
How to Customize Your Financial Reports & Tax Reports With Odoo 17 Accounting
Celine George
 
Presentation on Tourism Product Development By Md Shaifullar Rabbi
Presentation on Tourism Product Development By Md Shaifullar RabbiPresentation on Tourism Product Development By Md Shaifullar Rabbi
Presentation on Tourism Product Development By Md Shaifullar Rabbi
Md Shaifullar Rabbi
 
GDGLSPGCOER - Git and GitHub Workshop.pptx
GDGLSPGCOER - Git and GitHub Workshop.pptxGDGLSPGCOER - Git and GitHub Workshop.pptx
GDGLSPGCOER - Git and GitHub Workshop.pptx
azeenhodekar
 
Multi-currency in odoo accounting and Update exchange rates automatically in ...
Multi-currency in odoo accounting and Update exchange rates automatically in ...Multi-currency in odoo accounting and Update exchange rates automatically in ...
Multi-currency in odoo accounting and Update exchange rates automatically in ...
Celine George
 
Metamorphosis: Life's Transformative Journey
Metamorphosis: Life's Transformative JourneyMetamorphosis: Life's Transformative Journey
Metamorphosis: Life's Transformative Journey
Arshad Shaikh
 
New Microsoft PowerPoint Presentation.pptx
New Microsoft PowerPoint Presentation.pptxNew Microsoft PowerPoint Presentation.pptx
New Microsoft PowerPoint Presentation.pptx
milanasargsyan5
 
How to Set warnings for invoicing specific customers in odoo
How to Set warnings for invoicing specific customers in odooHow to Set warnings for invoicing specific customers in odoo
How to Set warnings for invoicing specific customers in odoo
Celine George
 
Ultimate VMware 2V0-11.25 Exam Dumps for Exam Success
Ultimate VMware 2V0-11.25 Exam Dumps for Exam SuccessUltimate VMware 2V0-11.25 Exam Dumps for Exam Success
Ultimate VMware 2V0-11.25 Exam Dumps for Exam Success
Mark Soia
 
How to manage Multiple Warehouses for multiple floors in odoo point of sale
How to manage Multiple Warehouses for multiple floors in odoo point of saleHow to manage Multiple Warehouses for multiple floors in odoo point of sale
How to manage Multiple Warehouses for multiple floors in odoo point of sale
Celine George
 
Exploring-Substances-Acidic-Basic-and-Neutral.pdf
Exploring-Substances-Acidic-Basic-and-Neutral.pdfExploring-Substances-Acidic-Basic-and-Neutral.pdf
Exploring-Substances-Acidic-Basic-and-Neutral.pdf
Sandeep Swamy
 
Geography Sem II Unit 1C Correlation of Geography with other school subjects
Geography Sem II Unit 1C Correlation of Geography with other school subjectsGeography Sem II Unit 1C Correlation of Geography with other school subjects
Geography Sem II Unit 1C Correlation of Geography with other school subjects
ProfDrShaikhImran
 
Political History of Pala dynasty Pala Rulers NEP.pptx
Political History of Pala dynasty Pala Rulers NEP.pptxPolitical History of Pala dynasty Pala Rulers NEP.pptx
Political History of Pala dynasty Pala Rulers NEP.pptx
Arya Mahila P. G. College, Banaras Hindu University, Varanasi, India.
 
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulsepulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
sushreesangita003
 
K12 Tableau Tuesday - Algebra Equity and Access in Atlanta Public Schools
K12 Tableau Tuesday  - Algebra Equity and Access in Atlanta Public SchoolsK12 Tableau Tuesday  - Algebra Equity and Access in Atlanta Public Schools
K12 Tableau Tuesday - Algebra Equity and Access in Atlanta Public Schools
dogden2
 
Handling Multiple Choice Responses: Fortune Effiong.pptx
Handling Multiple Choice Responses: Fortune Effiong.pptxHandling Multiple Choice Responses: Fortune Effiong.pptx
Handling Multiple Choice Responses: Fortune Effiong.pptx
AuthorAIDNationalRes
 
Quality Contril Analysis of Containers.pdf
Quality Contril Analysis of Containers.pdfQuality Contril Analysis of Containers.pdf
Quality Contril Analysis of Containers.pdf
Dr. Bindiya Chauhan
 
How to Manage Opening & Closing Controls in Odoo 17 POS
How to Manage Opening & Closing Controls in Odoo 17 POSHow to Manage Opening & Closing Controls in Odoo 17 POS
How to Manage Opening & Closing Controls in Odoo 17 POS
Celine George
 
Biophysics Chapter 3 Methods of Studying Macromolecules.pdf
Biophysics Chapter 3 Methods of Studying Macromolecules.pdfBiophysics Chapter 3 Methods of Studying Macromolecules.pdf
Biophysics Chapter 3 Methods of Studying Macromolecules.pdf
PKLI-Institute of Nursing and Allied Health Sciences Lahore , Pakistan.
 
Unit 6_Introduction_Phishing_Password Cracking.pdf
Unit 6_Introduction_Phishing_Password Cracking.pdfUnit 6_Introduction_Phishing_Password Cracking.pdf
Unit 6_Introduction_Phishing_Password Cracking.pdf
KanchanPatil34
 
Social Problem-Unemployment .pptx notes for Physiotherapy Students
Social Problem-Unemployment .pptx notes for Physiotherapy StudentsSocial Problem-Unemployment .pptx notes for Physiotherapy Students
Social Problem-Unemployment .pptx notes for Physiotherapy Students
DrNidhiAgarwal
 
How to Customize Your Financial Reports & Tax Reports With Odoo 17 Accounting
How to Customize Your Financial Reports & Tax Reports With Odoo 17 AccountingHow to Customize Your Financial Reports & Tax Reports With Odoo 17 Accounting
How to Customize Your Financial Reports & Tax Reports With Odoo 17 Accounting
Celine George
 
Presentation on Tourism Product Development By Md Shaifullar Rabbi
Presentation on Tourism Product Development By Md Shaifullar RabbiPresentation on Tourism Product Development By Md Shaifullar Rabbi
Presentation on Tourism Product Development By Md Shaifullar Rabbi
Md Shaifullar Rabbi
 
GDGLSPGCOER - Git and GitHub Workshop.pptx
GDGLSPGCOER - Git and GitHub Workshop.pptxGDGLSPGCOER - Git and GitHub Workshop.pptx
GDGLSPGCOER - Git and GitHub Workshop.pptx
azeenhodekar
 
Multi-currency in odoo accounting and Update exchange rates automatically in ...
Multi-currency in odoo accounting and Update exchange rates automatically in ...Multi-currency in odoo accounting and Update exchange rates automatically in ...
Multi-currency in odoo accounting and Update exchange rates automatically in ...
Celine George
 
Metamorphosis: Life's Transformative Journey
Metamorphosis: Life's Transformative JourneyMetamorphosis: Life's Transformative Journey
Metamorphosis: Life's Transformative Journey
Arshad Shaikh
 
New Microsoft PowerPoint Presentation.pptx
New Microsoft PowerPoint Presentation.pptxNew Microsoft PowerPoint Presentation.pptx
New Microsoft PowerPoint Presentation.pptx
milanasargsyan5
 
How to Set warnings for invoicing specific customers in odoo
How to Set warnings for invoicing specific customers in odooHow to Set warnings for invoicing specific customers in odoo
How to Set warnings for invoicing specific customers in odoo
Celine George
 
Ultimate VMware 2V0-11.25 Exam Dumps for Exam Success
Ultimate VMware 2V0-11.25 Exam Dumps for Exam SuccessUltimate VMware 2V0-11.25 Exam Dumps for Exam Success
Ultimate VMware 2V0-11.25 Exam Dumps for Exam Success
Mark Soia
 
How to manage Multiple Warehouses for multiple floors in odoo point of sale
How to manage Multiple Warehouses for multiple floors in odoo point of saleHow to manage Multiple Warehouses for multiple floors in odoo point of sale
How to manage Multiple Warehouses for multiple floors in odoo point of sale
Celine George
 
Exploring-Substances-Acidic-Basic-and-Neutral.pdf
Exploring-Substances-Acidic-Basic-and-Neutral.pdfExploring-Substances-Acidic-Basic-and-Neutral.pdf
Exploring-Substances-Acidic-Basic-and-Neutral.pdf
Sandeep Swamy
 
Geography Sem II Unit 1C Correlation of Geography with other school subjects
Geography Sem II Unit 1C Correlation of Geography with other school subjectsGeography Sem II Unit 1C Correlation of Geography with other school subjects
Geography Sem II Unit 1C Correlation of Geography with other school subjects
ProfDrShaikhImran
 
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulsepulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
sushreesangita003
 
K12 Tableau Tuesday - Algebra Equity and Access in Atlanta Public Schools
K12 Tableau Tuesday  - Algebra Equity and Access in Atlanta Public SchoolsK12 Tableau Tuesday  - Algebra Equity and Access in Atlanta Public Schools
K12 Tableau Tuesday - Algebra Equity and Access in Atlanta Public Schools
dogden2
 
Handling Multiple Choice Responses: Fortune Effiong.pptx
Handling Multiple Choice Responses: Fortune Effiong.pptxHandling Multiple Choice Responses: Fortune Effiong.pptx
Handling Multiple Choice Responses: Fortune Effiong.pptx
AuthorAIDNationalRes
 
Quality Contril Analysis of Containers.pdf
Quality Contril Analysis of Containers.pdfQuality Contril Analysis of Containers.pdf
Quality Contril Analysis of Containers.pdf
Dr. Bindiya Chauhan
 
How to Manage Opening & Closing Controls in Odoo 17 POS
How to Manage Opening & Closing Controls in Odoo 17 POSHow to Manage Opening & Closing Controls in Odoo 17 POS
How to Manage Opening & Closing Controls in Odoo 17 POS
Celine George
 
Unit 6_Introduction_Phishing_Password Cracking.pdf
Unit 6_Introduction_Phishing_Password Cracking.pdfUnit 6_Introduction_Phishing_Password Cracking.pdf
Unit 6_Introduction_Phishing_Password Cracking.pdf
KanchanPatil34
 

Syntax

  • 1. Syntax – Intro and Overview CS331
  • 2. Syntax • Syntax defines what is grammatically valid in a programming language – Set of grammatical rules – E.g. in English, a sentence cannot begin with a period – Must be formal and exact or there will be ambiguity in a programming language • We will study three levels of syntax – Lexical • Defines the rules for tokens: literals, identifiers, etc. – Concrete • Actual representation scheme down to every semicolon, i.e. every lexical token – Abstract • Description of a program’s information without worrying about specific details such as where the parentheses or semicolons go
  • 3. BNF Grammar • BNF = Backus-Naur Form to specify a grammar – Equivalent to a context free grammar • Set of rewriting rules (a rule that can be applied multiple times) defined on a set of nonterminal symbols, a set of terminal symbols, and a start symbol – Terminals, ∑ : Basic alphabet from which programs are constructed. E.g., letters, digits, or keywords such as “int”, “main”, “{“, “}” – Nonterminals, N : Identify grammatical categories – Start Symbol: One of the nonterminals which identifies the principal category. E.g., “Sentence” for english, “Program” for a programming language
  • 4. Rewriting Rules • Rewriting Rules, ρ – Written using the symbols  and | | is a separator for alternative definitions, i.e. “OR”  is used to define a rule, i.e. “IS” – Format • LHS  RHS1 | RHS2 | RHS3 | … • LHS is a single nonterminal • RHS is any sequence of terminals and nonterminals
  • 5. Sample Grammars • Grammar for subset of English Sentence  Noun Verb Noun  Jack | Jill Verb  eats | bites • Grammar for a digit Digit  0 | 1 | 2 | 3 | 4 | 5 | 6 |7 |8 |9 • Grammar for signed integers SignedInteger  Sign Integer Sign  + | - Integer  Digit | Digit Integer • Grammar for subset of Java Assignment  Variable = Expression Expression  Variable | Variable + Variable | Variable – Variable Variable  X | Y
  • 6. Derivation • Process of parsing data using a grammar – Apply rewrite rules to non-terminals on the RHS of an existing rule – To match, the derivation must terminate and be composed of terminals only • Example Digit  0 | 1 | 2 | 3 | 4 | 5 | 6 |7 |8 |9 Integer  Digit | Digit Integer – Is 352 an Integer? Integer → Digit Integer → 3 Integer → 3 Digit Integer → 3 5 Integer → 3 5 Digit → 3 5 2 Intermediate formats are called sentential forms This was called a Leftmost Derivation since we replaced the leftmost nonterminal symbol each time (could also do Rightmost)
  • 7. Derivation and Parse Trees • The derivation can be visualized as a parse tree Integer Digit Integer Digit Integer 3 5 Digit 2
  • 8. Parse Tree Sketch for Programs
  • 9. BNF and Languages • The language defined by a BNF grammar is the set of all strings that can be derived – Language can be infinite, e.g. case of integers • A language is ambiguous if it permits a string to be parsed into two separate parse trees – Generally want to avoid ambiguous grammars – Example: • Expr  Integer | Expr + Expr | Expr * Expr | Expr - Expr • Parse: 3*4+1 – Expr * Expr → Integer * Expr → 3 * Expr → 3 * Expr+Expr → … 3 * 4 + 1 – Expr + Expr → Expr + Integer → Expr + 1 Expr * Expr +1 → … 3 * 4 + 1
  • 10. Ambiguity • Example for AmbExp  Integer | AmbExp – AmbExp 2-3-4
  • 11. Ambiguous IF Statement Dangling ELSE: if (x<0) if (y<0) { y=y-1 } else { y=0 }; Does the else go with the first or second if?
  • 13. How to fix ambiguity? • Use explicit grammar without ambiguity – E.g., add an “ENDIF” for every “IF” – Java makes a separate category for if-else vs. if: IfThenStatement  If (Expr) Statement IfThenElseStatement  If (Expr) StatementNoShortIf else Statement StatementNoShortIf contains everything except IfThenStatement, so the else always goes with the IfThenElse statement not the IfThenStatement • Use precedence on symbols
  • 14. Alternative to BNF • The use of regular expressions is an alternate way to express a language
  • 15. Regex to EBNF • The book uses some deviations from “standard” regular expressions in Extended Backus Naur Format (defined in a few slides) { M } means zero or more occurrences of M ( M | N) means one of M or N must be chosen [M] means M is optional Use “{“ to mean the literal { not the regex {
  • 16. RegEx Examples • Booleans – “true” | “false” • Integers – (0-9)+ • Identifiers – (a-zA-Z){a-zA-Z0-9} • Comments (letters/space only) – “//”{a-zA-Z }(“r” | “n” | “rn”) • Regular expressions seem pretty powerful – Can you write one for the language anbn? (i.e. n a’s followed by n b’s)
  • 17. Extended BNF • EBNF – variation of BNF that simplifies specification of recursive rules using regular expressions in the RHS of the rule • Example: – BNF rule Expr  Term | Expr + Term | Expr – Term Term  Factor | Term * Factor | Term / Factor – EBNF equivalent Expr  Term { [+|-] Term } Term  Factor { [* | / ] Factor } • EBNF tends to be shorter and easier to read
  • 18. EBNF • Consider: Expr  Term{ (+|-) Term } Term  Factor { (* | / ) Factor } Factor  Identifier | Literal | (Expr) Parse for X+2*Y
  • 19. BNF and Lexical Analysis • Lexicon of a programming language – set of all nonterminals from which programs are written • Nonterminals – referred to as tokens – Each token is described by its type (e.g. identifier, expression) and its value (the string it represents) – Skipping whitespace or comments or punctuation
  • 20. Categories of Lexical Tokens • Identifiers • Literals Includes Integers, true, false, floats, chars • Keywords bool char else false float if int main true while • Operators = || && == != < <= > >= + - * / % ! [ ] • Punctuation ;.{}() Issues to consider: Ignoring comments, role of whitespace, distinguising the < operator from <=, distinguishing identifiers from keywords like “if”
  • 21. A Simple Lexical Syntax for a Small Language, Clite Primary → Identifier [ "["Expression"]" ] | Literal | "("Expression")" | Type "("Expression")" Identifier → Letter { Letter | Digit } Letter → a | b | … | z | A | B | … Z Digit → 0 | 1 | 2 | … | 9 Literal → Integer | Boolean | Float | Char Integer → Digit { Digit } Boolean → true | false Float → Integer . Integer Char → ‘ ASCIICHAR ‘
  • 22. Major Stages in Compilation • Lexical Analysis – Translates source into a stream of Tokens, everything else discarded • Syntactic Analysis – Parses tokens, detects syntax errors, develops abstract representation • Semantic Analysis – Analyze the parse for semantic consistency, transform into a format the architecture can efficiently run on • Code Generation – Use results of abstract representation as a basis for generating executable machine code
  • 23. Lexical Analysis & Compiling Process Difficulties: 1 to many mapping from HL source to machine code Translation must be correct Translation should be efficient
  • 24. Lexical Analysis of Clite • Lexical Analysis – transforms a program into tokens (type, value). The rest is tossed. • Example Clite program: // Simple Program int main() { int x; x = 3; } Result of Lexical Analysis:
  • 25. Lexical Analysis (2) Result of Lexical Analysis: 1 Type: Int Value: int 2 Type: Main Value: main 3 Type: LeftParen Value: ( // Simple Program 4 Type: RightParen Value: ) int main() { 5 Type: LeftBrace Value: { int x; 6 Type: Int Value: int x = 3; 7 Type: Identifier Value: x } 8 Type: Semicolon Value: ; 9 Type: Identifier Value: x 10 Type: Assign Value: = 11 Type: IntLiteral Value: 3 12 Type: Semicolon Value: ; 13 Type: RightBrace Value: } 14 Type: Eof Value: <<EOF>>
  • 26. Lexical Analysis of Clite in Java public class TokenTester { public static void main (String[] args) { Lexer lex = new Lexer (args[0]); Token t; int i = 1; do { t = lex.next(); System.out.println(i+" Type: "+t.type() +"tValue: "+t.value()); i++; } while (t != Token.eofTok); } } The source code for how the Lexer and Token classes are arranged is the topic of chapter 3
  • 27. Lexical to Concrete • From the stream of tokens generated by our lexical analyzer we can now parse them using a concrete syntax
  • 28. Concrete EBNF Syntax for Clite Program → int main ( ) { Declarations Statements } Declarations → { Declaration } Declaration → Type Identifier [ "["Integer"]" ] { , Identifier ["["Integer"]"] }; Type → int | bool | float | char Statements →{ Statement } Statement → ; | Block | Assignment | IfStatement | WhileStatement Block → { Statements } Assignment → Identifier ["["Expression"]" ] = Expression ; IfStatement → if "(" Expression ")" Statement [ else Statement ] WhileStatement → while "("Expression")" Statement Concrete Syntax; Higher than lexical syntax!
  • 29. Concrete EBNF Syntax for Clite Expression → Conjunction { || Conjunction } Conjunction →Equality { && Equality } Equality → Relation [ EquOp Relation ] EquOp → == | != Relation → Addition [ RelOp Addition ] RelOp → < | <= | > | >= References lexical Addition → Term { AddOp Term } syntax AddOp → + | - Term → Factor { MulOp Factor } MulOp → * | / | % Factor → [ UnaryOp ] Primary UnaryOp → - | ! Primary → Identifier [ "["Expression"]" ] | Literal | "("Expression")" | Type "(" Expression ")"
  • 30. Syntax Diagram • Alternate way to specify a language • Popularized with Pascal • Not any more powerful than BNF, EBNF, or regular expressions
  • 31. Linking Syntax and Semantics • What we’ve described so far has been concrete syntax – Defines all parts of the language above the lexical level • Assignments, loops, functions, definitions, etc. • Uses BNF or variant to describe the language • An abstract syntax links the concrete syntax to the semantic level
  • 32. Abstract Syntax • Defines essential syntactic elements without describing how they are concretely constructed • Consider the following Pascal and C loops Pascal C while i<n do begin while (i<n) { i:=i+1 i=i+1; end } Small differences in concrete syntax; identical abstract construct
  • 33. Abstract Syntax Format • Defined using rules of the form – LHS = RHS • LHS is the name of an abstract syntactic class • RHS is a list of essential components that define the class – Similar to defining a variable. Data type or abstract syntactic class, and name – Components are separated by ; • Recursion naturally occurs among the definitions as with BNF
  • 34. Abstract Syntax Example • Loop Loop = Expression test ; Statement body – The abstract class Loop has two components, a test which is a member of the abstract class Expression, and a body which is a member of an abstract class Statement • Nice by-product: If parsing abstract syntax in Java, it makes sense to actually define a class for each abstract syntactic class, e.g. class Loop extends Statement { Expression test; Statement body; }
  • 35. Abstract Syntax of Clite Program = Declarations decpart; Statements body; Declarations = Declaration* Declaration = VariableDecl | ArrayDecl VariableDecl = Variable v; Type t ArrayDecl = Variable v; Type t; Integer size Type = int | bool | float | char Statements = Statement* Statement = Skip | Block | Assignment | Conditional | Loop Skip = Block = Statements Conditional = Expression test; Statement thenbranch, elsebranch Loop = Expression test; Statement body Assignment = VariableRef target; Expression source Expression = VariableRef | Value | Binary | Unary
  • 36. Abstract Syntax of Clite VariableRef = Variable | ArrayRef Binary = Operator op; Expression term1, term2 Unary = UnaryOp op; Expression term Operator = BooleanOp | RelationalOp | ArithmeticOp BooleanOp = && | || RelationalOp = = | ! | != | < | <= | > | >= ArithmeticOp = + | - | * | / UnaryOp = ! | - Variable = String id ArrayRef = String id; Expression index Value = IntValue | BoolValue | FloatValue | CharValue IntValue = Integer intValue FloatValue = Float floatValue BoolValue = Boolean boolValue CharValue = Character charValue
  • 37. Java AbstractSyntax for Clite class Loop extends Statement { Expression test; Statement body; } Class Assignment extends Statement { // Assignment = Variable target; Expression source Variable target; Expression source; } … Much more… see the file (when available)
  • 38. Abstract Syntax Tree • Just as we can build a parse tree from a BNF grammar, we can build an abstract syntax tree from an abstract syntax • Example for: x+2*y Expression = Variable | Value | Binary Binary = Operator op ; Expression term1, term2 Binary node Expr
  • 39. Sample Clite Program • Compute nth fib number
  • 40. Abstract Syntax for Loop of Clite Program
  • 41. Concrete and Abstract Syntax • Aren’t the two redundant? – A little bit • The concrete syntax tells the programmer exactly what to write to have a valid program • The abstract syntax allows valid programs in two different languages to share common abstract representations – It is closer to semantics – We need both!
  • 42. What’s coming up? • Semantic analysis – Do the types match? What does this mean? char a=‘c’; int sum=0; sum = sum = a; • Can associate machine code with the abstract parse – Code generation – Code optimization

Editor's Notes

  • #15: RegExpr for CS A331, MM/DD/YY or MM/DD/YYYY