
Compiler Construction (CSC 409)

1. Core Concepts

Front: What is a compiler?

Back: A translator that converts high-level code (e.g., C++) into machine code all at once.

Example: C code → .exe file.

Analogies: Like translating an entire book into another language.

Front: What is an interpreter?

Back: Executes code line-by-line (e.g., Python).

Example: Runs print("Hello") directly without creating an executable.

Key Difference: Interpreter = Tour Guide; Compiler = Book Translator.

2. Phases of Compilation (Mnemonic: Lazy Squirrels Sell Intermediate Coconuts Gracefully)

Front: List the 6 phases of compilation in order.

Back:

1. Lexical Analysis
2. Syntax Analysis
3. Semantic Analysis
4. Intermediate Code Generation
5. Code Optimization
6. Code Generation

Front: What does lexical analysis do?

Back: Converts source code into tokens (e.g., id, =, +).


Example: a = b + 5 → Tokens: id(a), =, id(b), +, num(5).
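The tokenization step can be sketched with regular expressions. This is a minimal illustration, not a real compiler front end; the token names (id, op, num) and the patterns are assumptions chosen to mirror the example above.

```python
import re

# Illustrative token patterns (order matters: num is tried before id).
TOKEN_SPEC = [
    ("num", r"\d+"),
    ("id", r"[a-zA-Z_]\w*"),
    ("op", r"[=+\-*/]"),
    ("skip", r"\s+"),
]
PATTERN = re.compile("|".join(f"(?P<{name}>{rx})" for name, rx in TOKEN_SPEC))

def tokenize(source):
    """Convert a source string into (token_type, lexeme) pairs."""
    return [(m.lastgroup, m.group())
            for m in PATTERN.finditer(source)
            if m.lastgroup != "skip"]

print(tokenize("a = b + 5"))
# [('id', 'a'), ('op', '='), ('id', 'b'), ('op', '+'), ('num', '5')]
```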

Front: What is the role of syntax analysis?

Back: Validates code structure using grammar rules and builds a parse tree.

Example: Detects a + = b as invalid (adjacent + and =).

Front: What happens in semantic analysis?

Back: Checks meaning (types, scope).

Example: int x = "text"; → Error (type mismatch).
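A semantic check of this kind can be sketched as a small Python function; check_assignment and its toy type names are hypothetical, written only to mirror the int x = "text"; example.

```python
def check_assignment(declared_type, value):
    # Infer a toy type for the value, then compare it with the declaration.
    if isinstance(value, bool):
        inferred = "bool"
    elif isinstance(value, int):
        inferred = "int"
    elif isinstance(value, float):
        inferred = "float"
    else:
        inferred = "string"
    if inferred != declared_type:
        raise TypeError(f"cannot assign {inferred} to {declared_type}")
    return value

check_assignment("int", 5)         # accepted
# check_assignment("int", "text")  # raises TypeError: type mismatch
```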

Front: What is intermediate code generation?

Back: Creates platform-independent code (e.g., three-address code).

Example:

temp1 = b * 60.0
a = temp1 + c
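Generating three-address code can be sketched as a recursive walk over an expression tree; gen_tac and the nested-tuple tree format are illustrative assumptions, not a standard API.

```python
def gen_tac(node, code, counter):
    # Emit three-address code for a nested tuple like ("*", "b", "60.0").
    # Returns the name (variable or temporary) holding the node's value.
    if isinstance(node, str):
        return node
    op, left, right = node
    l = gen_tac(left, code, counter)
    r = gen_tac(right, code, counter)
    counter[0] += 1
    temp = f"t{counter[0]}"
    code.append(f"{temp} = {l} {op} {r}")
    return temp

code = []
gen_tac(("+", ("*", "b", "60.0"), "c"), code, [0])
print(code)
# ['t1 = b * 60.0', 't2 = t1 + c']
```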

Front: What is code optimization?

Back: Improves code efficiency.

Example: Replacing x = x + 0 → x.
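This kind of optimization can be sketched as a peephole pass over three-address instructions; the string-based instruction format is a simplification for illustration.

```python
def peephole(instructions):
    # Minimal peephole sketch: drop algebraic identities like x = x + 0.
    out = []
    for ins in instructions:
        dest, _, expr = ins.partition(" = ")
        if expr == f"{dest} + 0" or expr == f"0 + {dest}":
            continue  # x = x + 0 is a no-op; remove it
        out.append(ins)
    return out

print(peephole(["t1 = b * 60", "x = x + 0", "a = t1 + c"]))
# ['t1 = b * 60', 'a = t1 + c']
```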

Front: What is code generation?

Back: Converts intermediate code to machine-specific instructions.

Example:

MOV b, R1
MUL #60.0, R1
ADD c, R1

3. Lexical Analysis Deep Dive

Front: Define lexeme vs. token.


Back:

• Lexeme: Raw text (e.g., "if").


• Token: Categorized lexeme (e.g., keyword).

Example:

Lexeme "123" → token num.

Front: What are regular expressions used for in lexical analysis?

Back: To define patterns for tokens (e.g., [a-zA-Z]+ for identifiers).

Front: What is lookahead?

Back: Checking the next character to resolve ambiguities.

Example: == (one token) vs. = followed by = (two tokens).
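Lookahead can be sketched for the = vs. == case; scan_ops is a toy scanner (an assumption for illustration) that ignores everything except these two operators.

```python
def scan_ops(source):
    # Peek at the next character to decide whether '=' starts '=='.
    tokens = []
    i = 0
    while i < len(source):
        ch = source[i]
        if ch == "=" and i + 1 < len(source) and source[i + 1] == "=":
            tokens.append("==")   # maximal munch: prefer the longer token
            i += 2
        elif ch == "=":
            tokens.append("=")
            i += 1
        else:
            i += 1                # skip everything else in this sketch
    return tokens

print(scan_ops("a == b = c"))
# ['==', '=']
```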

Front: Name 2 lexical error recovery techniques.

Back:

1. Panic Mode: Skip input until a valid token.


2. Local Correction: Insert/delete/replace characters.

4. Syntax & Semantic Analysis

Front: What is a parse tree?

Back: A hierarchical representation of code structure.

Example:

          =
         / \
        a   +
           / \
          b   *
             / \
            c   60

Front: What is operator precedence?


Back: Rules defining which operations execute first (e.g., * before +).

Example: a + b * c → a + (b * c).

Front: What is type checking?

Back: Ensuring variables/operations use compatible types.

Example: int x = 5.5; → Error (float assigned to int).

5. Intermediate Code & Optimization

Front: What is three-address code?

Back: Intermediate code with ≤ 3 operands per instruction.

Example:

t1 = b * 60
t2 = a + t1

Front: What is common subexpression elimination?

Back: Reusing repeated calculations.

Example:

Original: t1 = b + c; t2 = b + c;
Optimized: t1 = b + c; t2 = t1;
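The same transformation can be sketched over (destination, expression) pairs; this local version assumes no operand is reassigned between the two computations.

```python
def eliminate_common_subexpressions(code):
    # If an expression was already computed, reuse the earlier temporary
    # instead of recomputing it.
    seen = {}   # expression -> first destination that computed it
    out = []
    for dest, expr in code:
        if expr in seen:
            out.append((dest, seen[expr]))   # e.g. t2 = t1
        else:
            seen[expr] = dest
            out.append((dest, expr))
    return out

print(eliminate_common_subexpressions([("t1", "b + c"), ("t2", "b + c")]))
# [('t1', 'b + c'), ('t2', 't1')]
```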

Front: What is loop optimization?

Back: Moving invariant code outside loops.

Example:

Before: for (i=0; i<n; i++) { x = y*z; sum += x + i; }

After: x = y*z; for (i=0; i<n; i++) { sum += x + i; }
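Loop-invariant code motion can be illustrated with two equivalent Python functions; the running total is an assumption added so the loop still has work to do after the invariant computation is hoisted.

```python
def before(n, y, z):
    total = 0
    for i in range(n):
        x = y * z          # invariant: recomputed on every iteration
        total += x + i
    return total

def after(n, y, z):
    x = y * z              # hoisted: computed once, outside the loop
    total = 0
    for i in range(n):
        total += x + i
    return total

print(before(5, 2, 3), after(5, 2, 3))
# 40 40
```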
6. Symbol Table & Error Handling

Front: What is the symbol table?

Back: A "dictionary" storing identifiers and their attributes (type, scope, memory address).

Example: rate: float, Address: 0x1000.
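A symbol table can be sketched as a Python dictionary; the insert/lookup helpers and the attribute values are illustrative, not from any particular compiler.

```python
# Identifier -> attributes, mirroring the rate: float, 0x1000 example above.
symbol_table = {}

def insert(name, typ, scope, address):
    symbol_table[name] = {"type": typ, "scope": scope, "address": address}

def lookup(name):
    return symbol_table.get(name)   # None if the identifier is undeclared

insert("rate", "float", "global", 0x1000)
print(lookup("rate"))
# {'type': 'float', 'scope': 'global', 'address': 4096}
```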

Front: Name 4 error recovery techniques.

Back:

1. Panic Mode
2. Phrase-Level Recovery
3. Error Productions
4. Global Correction

Front: What is error production?

Back: Adding grammar rules to handle common errors (e.g., missing semicolon).

7. Tools & Examples

Front: Tools for lexical and syntax analysis?

Back:

• Lexical: Lex/Flex
• Syntax: Yacc/Bison

Front: Trace position = initial + rate * 60 through compilation phases.

Back:

1. Lexical: Tokens → id, =, id, +, id, *, num.
2. Syntax: Builds the parse tree; * binds tighter than +.
3. Semantic: Converts 60 to the float 60.0.
4. Intermediate: temp1 = rate * 60.0; position = initial + temp1.
5. Optimization: No change needed in this example.
6. Code Generation: Assembly instructions.
8. Quick Comparisons

Front: Compiler vs. Interpreter

Back:

• Compiler: Faster execution, standalone executable.


• Interpreter: Slower, platform-independent.

Front: Lexical vs. Syntax Analyzer

Back:

• Lexical: Tokens.
• Syntax: Parse tree.

Front: Local vs. Loop Optimization

Back:

• Local: Redundant code removal.


• Loop: Move code outside loops.

9. Exam Hotspots

Front: What is relocatable code?

Back: Machine code with adjustable memory addresses (handled by the loader).

Front: What does the preprocessor do?

Back: Processes macros (#define), file inclusion (#include), and conditional compilation (#ifdef).

Front: What is the loader/linker?

Back: Combines object files into an executable and loads it into memory.
1. Core Concepts

• Compiler: Translates entire HLL code to machine code (e.g., C → .exe).
• Interpreter: Executes code line-by-line (e.g., Python).
• Assembler: Converts assembly code (e.g., MOV AX, 5) to machine code.
• Preprocessor: Handles macros (#define), file inclusion (#include), and conditional compilation.

2. Phases of Compilation

• Lexical Analysis: Characters → Tokens. Example: a = b + 5 → id, =, id, +, num; position → id, 60 → num.
• Syntax Analysis: Tokens → Parse Tree. Example: validates a + b * c as a + (b * c) (operator precedence).
• Semantic Analysis: Parse Tree → Annotated Tree. Example: int x = "text"; → type mismatch error.
• Intermediate Code Generation: Annotated Tree → Three-Address Code. Example: temp1 = rate * 60.0; position = initial + temp1.
• Code Optimization: Intermediate Code → Optimized Code. Example: replaces x = x + 0 → x.
• Code Generation: Optimized Code → Machine Code. Example: MOVF rate, R1; MULF #60.0, R1; ADDF initial, R1.
3. Lexical Analysis

• Lexeme: Raw sequence of characters (e.g., "if", "123"). Example: "return" → lexeme.
• Token: Categorized lexeme (e.g., keyword, identifier). Example: "return" → keyword token.
• Pattern: Rule (regex) defining valid lexemes (e.g., [a-z]+ for identifiers). Example: [0-9]+ → pattern for numbers.
• Lookahead: Checking the next character to resolve ambiguities (e.g., = vs. ==). Example: = followed by = → == token.

4. Error Handling

• Panic Mode: Skip input until a valid token (e.g., ;). Example: skip until ; after a = b +.
• Local Correction: Insert/delete/replace characters to fix errors. Example: replace fi with if.
• Error Productions: Add grammar rules to handle common errors (e.g., missing ;). Example: allow if (x) { ... } without ;.

5. Tools & Components

• Lexical Analyzer: Generates tokens from source code. Tools: Lex, Flex.
• Syntax Analyzer: Builds a parse tree using grammar rules. Tools: Yacc, Bison.
• Symbol Table: Stores identifiers (variables, functions) and attributes (type, address). Implementations: hash tables, tree structures.

6. Compiler vs. Interpreter

• Execution: Compiler translates the entire code at once into a standalone executable; interpreter executes line-by-line at runtime.
• Speed: Compiled code runs faster; interpreted code is slower due to runtime translation.
• Error Detection: Compiler reports errors early (during compilation); interpreter reports them immediately during execution.
• Examples: Compiler: C, C++, Rust. Interpreter: Python, JavaScript.

7. Key Optimization Techniques

• Common Subexpression Elimination: Reuse repeated calculations. Example: t1 = b + c reused in a = t1 + d.
• Loop Invariant Code Motion: Move computations outside loops. Example: x = y * z moved outside the loop.
• Dead Code Elimination: Remove unreachable code. Example: delete if (false) { ... }.
8. Mnemonics & Examples

• "Lazy Squirrels Sell Intermediate Coconuts Gracefully": Phases of compilation (Lexical → Syntax → Semantic → Intermediate → Code Optimization → Code Generation). Example: position = initial + rate * 60.
• Token vs. Lexeme: Lexeme = raw text, Token = categorized lexeme. Example: "123" → lexeme, num → token.

9. Quick Reference Table

• Symbol Table: Stores identifiers (e.g., rate: float, Address: 0x1000).
• Intermediate Code: Platform-independent (e.g., three-address code).
• Error Recovery: Panic mode, local correction, error productions.
• Lexical Errors: Misspelled keywords, invalid characters.

Structured and Corrected Summary: Lexical Analyzer vs. Parser

1. Comparison Table: Lexical Analyzer vs. Parser

• Input: The lexical analyzer scans the input program character stream; the parser performs syntax analysis on the token stream.
• Output: The lexical analyzer identifies tokens (e.g., keywords, identifiers); the parser generates an abstract syntax tree (AST) or parse tree.
• Symbol table: The lexical analyzer inserts tokens into the symbol table (basic entries); the parser updates it with semantic information (e.g., type, scope).
• Errors: The lexical analyzer generates lexical errors (e.g., invalid characters); the parser generates syntax errors (e.g., missing semicolons).

2. Why Separate Lexical and Syntax Analysis?

1. Simplicity of Design
a. Separating tokenization (lexical) from grammar validation (syntax) simplifies compiler
architecture.
2. Efficiency
a. Specialized buffering techniques in the lexical analyzer speed up tokenization.
3. Specialization
a. Lexical analyzers use regular expressions, while parsers use context-free grammars
(CFG).
4. Portability
a. Lexical analyzers handle platform-specific input (e.g., file encoding), isolating these
details from the parser.

3. Advantages of Lexical Analysis

1. Foundation for Parsing


a. Provides tokens for syntax and semantic analysis (e.g., compilers, interpreters).
2. Error Localization
a. Pinpoints lexical errors (e.g., @num → invalid character).
3. Reusability
a. Lexical rules (e.g., regex for identifiers) can be reused across projects.
4. Web Development
a. Used in browsers to parse HTML/CSS/JavaScript into tokens for rendering.
4. Disadvantages of Lexical Analysis

1. Time-Consuming
a. Requires careful design of regex patterns for token recognition.
2. Complex Regular Expressions
a. Some token patterns (e.g., floating-point literals) require intricate regular expressions that are harder to define than PEG or EBNF rules.
3. Debugging Overhead
a. Testing tokenization rules (e.g., edge cases like 0x1F vs. 0x1G) can be tedious.
4. Runtime Overhead
a. Generating token tables (e.g., DFA/NFA) adds initial compilation time.

5. Key Concepts

• Lexeme: Raw text matched to a token (e.g., "if", "123").


• Token: Categorized lexeme (e.g., keyword, number).
• Symbol Table: Stores identifiers with attributes (type, scope, memory address).

6. Example Workflow

Input Code:

int x = 42 + 5.3;

1. Lexical Analyzer Output:
a. Tokens: int (keyword), x (identifier), = (operator), 42 (integer), + (operator), 5.3 (float), ; (punctuation).
2. Parser Output:
a. AST: Declaration
├── Type: int
├── Variable: x
└── Initializer:
└── BinaryExpression (+)
├── 42
└── 5.3

b. Semantic Error: Mixing int and float in an expression (detected during semantic analysis, not by the parser).
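The AST above can be sketched with Python dataclasses; the class and field names are illustrative, not from any particular compiler framework.

```python
from dataclasses import dataclass

@dataclass
class BinaryExpression:
    op: str
    left: object
    right: object

@dataclass
class Declaration:
    type: str
    name: str
    init: object

# int x = 42 + 5.3; as a small tree of nodes.
tree = Declaration("int", "x", BinaryExpression("+", 42, 5.3))
print(tree.init.op, tree.init.left, tree.init.right)
# + 42 5.3
```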
7. Summary Table

• Input: character stream (lexical analyzer) vs. token stream (parser).
• Output: tokens vs. parse tree/AST.
• Tools: Lex, Flex vs. Yacc, Bison, ANTLR.
• Errors: invalid tokens vs. grammar violations.
• Key Role: tokenization vs. structure validation.

8. Mnemonics for Exam Prep

• "Lex Before Parse": Lexical analysis always precedes syntax analysis.


• "Regex for Tokens, CFG for Grammar": Lexical uses regex; parsing uses CFG.
