0% found this document useful (0 votes)

15 views35 pages

Syntax Analysis: COP5621 Compiler Construction

The document discusses the role of a parser in compiler construction, focusing on syntax analysis and error handling. It outlines various parsing techniques, including top-down and bottom-up methods, as well as the importance of grammars and error recovery strategies. Additionally, it covers concepts such as the viable-prefix property, FIRST and FOLLOW sets, and the characteristics of LL(1) grammars.

Uploaded by

sillymclaren6

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views35 pages

Syntax Analysis: COP5621 Compiler Construction

Uploaded by

sillymclaren6

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

1

Syntax Analysis
Part I
Chapter 4

COP5621 Compiler Construction

Position of a Parser in the

Compiler Model
Token,
Source tokenval Parser
Lexical Intermediate
Program and rest of
Analyzer representation
Get next front-end
token
Lexical error Syntax error
Semantic error

Symbol Table
3

The Parser
• The task of the parser is to check syntax
• The syntax-directed translation stage in the
compiler’s front-end checks static semantics and
produces an intermediate representation (IR) of
the source program
– Abstract syntax trees (ASTs)
– Control-flow graphs (CFGs) with triples, three-address
code, or register transfer lists
– WHIRL (SGI Pro64 compiler) has 5 IR levels!
4

Error Handling
• A good compiler should assist in identifying and
locating errors
– Lexical errors: important, compiler can easily recover
and continue
– Syntax errors: most important for compiler, can almost
always recover
– Static semantic errors: important, can sometimes
recover
– Dynamic semantic errors: hard or impossible to detect
at compile time, runtime checks are required
– Logical errors: hard or impossible to detect
5

Viable-Prefix Property
• The viable-prefix property of LL/LR parsers
allows early detection of syntax errors
– Goal: detection of an error as soon as possible
without consuming unnecessary input
– How: detect an error as soon as the prefix of the
input does not match a prefix of any string in
the language
Error is
Error is detected here
… detected here …
Prefix Prefix DO 10 I = 1;0
for (;)
… …
6

Error Recovery Strategies

• Panic mode
– Discard input until a token in a set of designated
synchronizing tokens is found
• Phrase-level recovery
– Perform local correction on the input to repair the error
• Error productions
– Augment grammar with productions for erroneous
constructs
• Global correction
– Choose a minimal sequence of changes to obtain a
global least-cost correction
7

Grammars (Recap)
• Context-free grammar is a 4-tuple
G=(N,T,P,S) where
– T is a finite set of tokens (terminal symbols)
– N is a finite set of nonterminals
– P is a finite set of productions of the form
→
where   (NT)* N (NT)*
and   (NT)*
– S is a designated start symbol S  N
8

Notational Conventions Used

• Terminals
a,b,c,…  T
specific terminals: 0, 1, id, +
• Nonterminals
A,B,C,…  N
specific nonterminals: expr, term, stmt
• Grammar symbols
X,Y,Z  (NT)
• Strings of terminals
u,v,w,x,y,z  T*
• Strings of grammar symbols
,,  (NT)*
9

Derivations (Recap)
• The one-step derivation is defined by
A
where A →  is a production in the grammar
• In addition, we define
–  is leftmost lm if  does not contain a nonterminal
–  is rightmost rm if  does not contain a nonterminal
– Transitive closure * (zero or more steps)
– Positive closure + (one or more steps)
• The language generated by G is defined by
L(G) = {w | S + w}
10

Derivation (Example)
E→E+E
E→E*E
E→(E)
E→-E
E → id

E  - E  - id
E rm E + E rm E + id rm id + id
E * E
E + id * id + id
11

Chomsky Hierarchy: Language

Classification
• A grammar G is said to be
– Regular if it is right linear where each production is of
the form
A→wB or A→w
or left linear where each production is of the form
A→Bw or A→w
– Context free if each production is of the form
A→
where A  N and   (NT)*
– Context sensitive if each production is of the form
A→
where A  N, ,,  (NT)*, || > 0
– Unrestricted
12

Chomsky Hierarchy

L(regular)  L(context free)  L(context sensitive)  L(unrestricted)

Where L(T) = { L(G) | G is of type T }

That is, the set of all languages
generated by grammars G of type T

Examples:
Every finite language is regular
L1 = { anbn | n  1 } is context free
L2 = { anbncn | n  1 } is context sensitive
13

Parsing
• Universal (any C-F grammar)
– Cocke-Younger-Kasimi
– Earley
• Top-down (C-F grammar with restrictions)
– Recursive descent (predictive parsing)
– LL (Left-to-right, Leftmost derivation) methods
• Bottom-up (C-F grammar with restrictions)
– Operator precedence parsing
– LR (Left-to-right, Rightmost derivation) methods
• SLR, canonical LR, LALR
14

Top-Down Parsing
• LL methods (Left-to-right, Leftmost
derivation) and recursive-descent parsing
Grammar: Leftmost derivation:
E→T+T E lm T + T
T→(E) lm id + T
T→-E
T → id
lm id + id
E E E E

T T T T T T

+ id + id + id
15

Left Recursion (Recap)

• Productions of the form
A→A
|
|
are left recursive
• When one of the productions in a grammar
is left recursive then a predictive parser may
loop forever
16

General Left Recursion

Elimination
Arrange the nonterminals in some order A1, A2, …, An
for i = 1, …, n do
for j = 1, …, i-1 do
replace each
Ai → Aj 
with
Ai → 1  | 2  | … | k 
where
Aj → 1 | 2 | … | k
enddo
eliminate the immediate left recursion in Ai
enddo
17

Immediate Left-Recursion
Elimination
Rewrite every left-recursive production
A→A
|
|
|A
into a right-recursive production:
A →  AR
|  AR
AR →  AR
|  AR
|
18

Example Left Rec. Elimination

A→BC|a
B→CA|Ab Choose arrangement: A, B, C
C→AB|CC|a

i = 1: nothing to do
i = 2, j = 1: B→CA|Ab
 B→CA|BCb|ab
(imm) B → C A BR | a b BR
BR → C b BR | 
i = 3, j = 1: C→AB|CC|a
 C→BCB|aB|CC|a
i = 3, j = 2: C→BCB|aB|CC|a
 C → C A BR C B | a b BR C B | a B | C C | a
(imm) C → a b BR C B CR | a B CR | a CR
CR → A BR C B CR | C CR | 
19

Left Factoring
• When a nonterminal has two or more productions
whose right-hand sides start with the same
grammar symbols, the grammar is not LL(1) and
cannot be used for predictive parsing
• Replace productions
A →  1 |  2 | … |  n | 
with
A →  AR | 
AR → 1 | 2 | … | n
20

Predictive Parsing
• Eliminate left recursion from grammar
• Left factor the grammar
• Compute FIRST and FOLLOW
• Two variants:
– Recursive (recursive calls)
– Non-recursive (table-driven)
21

FIRST
• FIRST() = the set of terminals that begin all strings
derived from 

FIRST(a) = {a} if a  T
FIRST() = {}
FIRST(A) = A→ FIRST() for A→  P
FIRST(X1X2…Xk) =
if for all j = 1, …, i-1 :   FIRST(Xj) then
add non- in FIRST(Xi) to FIRST(X1X2…Xk)
if for all j = 1, …, k :   FIRST(Xj) then
add  to FIRST(X1X2…Xk)
22

FOLLOW
• FOLLOW(A) = the set of terminals that can
immediately follow nonterminal A

FOLLOW(A) =
for all (B →  A )  P do
add FIRST()\{} to FOLLOW(A)
for all (B →  A )  P and   FIRST() do
add FOLLOW(B) to FOLLOW(A)
for all (B →  A)  P do
add FOLLOW(B) to FOLLOW(A)
if A is the start symbol S then
add $ to FOLLOW(A)
23

LL(1) Grammar
• A grammar G is LL(1) if for each collections of
productions
A → 1 | 2 | … | n
for nonterminal A the following holds:

1. FIRST(i)  FIRST(j) =  for all i  j

2. if i *  then
2.a. j *  for all i  j
2.b. FIRST(j)  FOLLOW(A) = 
for all i  j
24

Non-LL(1) Examples

Grammar Not LL(1) because

Recursive Descent Parsing

• Grammar must be LL(1)
• Every nonterminal has one (recursive) procedure
responsible for parsing the nonterminal’s syntactic
category of input tokens
• When a nonterminal has multiple productions,
each production is implemented in a branch of a
selection statement based on input look-ahead
information
26

Using FIRST and FOLLOW to

Write a Recursive Descent Parser
procedure rest();
begin
expr → term rest if lookahead in FIRST(+ term rest) then
rest → + term rest match(‘+’); term(); rest()
else if lookahead in FIRST(- term rest) then
| - term rest match(‘-’); term(); rest()
| else if lookahead in FOLLOW(rest) then
term → id return
else error()
end;

FIRST(+ term rest) = { + }

FIRST(- term rest) = { - }
FOLLOW(rest) = { $ }
27

Non-Recursive Predictive
Parsing
• Given an LL(1) grammar G=(N,T,P,S)
construct a table M[A,a] for A  N, a  T
and use a driver program with a stack
input a + b $

stack
Predictive parsing
X output
program (driver)
Y
Z Parsing table
$ M
28

Constructing a Predictive Parsing

Table
for each production A →  do
for each a  FIRST() do
add A →  to M[A,a]
enddo
if   FIRST() then
for each b  FOLLOW(A) do
add A →  to M[A,b]
enddo
endif
enddo
Mark each undefined entry in M error
29

Example Table A→ FIRST() FOLLOW(A)

E → T ER ( id $)
ER → + T ER + $)
E → T ER
ER → + T ER |  ER →  
T → F TR T → F TR ( id +$)
TR → * F TR |  TR → * F TR * +$)
F → ( E ) | id TR →  
F→(E) ( *+$)
F → id id

id + * ( ) $
E E → T ER E → T ER
ER ER → + T ER ER →  ER → 
T T → F TR T → F TR
TR TR →  TR → * F TR TR →  TR → 
F F → id F→(E)
30

LL(1) Grammars are

Unambiguous
Ambiguous grammar A→ FIRST() FOLLOW(A)
S → i E t S SR | a S → i E t S SR i e$
SR → e S |  S→a a
E→b SR → e S e e$
SR →  
E→b b t
Error: duplicate table entry
a b e i t $
S S→a S → i E t S SR
SR → 
SR SR → 
SR → e S
E E→b
31

Predictive Parsing Program

push($)
(Driver)
push(S)
a := lookahead
repeat
X := pop()
if X is a terminal or X = $ then
match(X) // move to next token, a := lookahead
else if M[X,a] = X → Y1Y2…Yk then
push(Yk, Yk-1, …, Y2, Y1) // such that Y1 is on top
produce output and/or invoke actions
else error()
endif
until X = $
32

Example Table-Driven Parsing

Stack Input Production applied
$E id+id*id$
$ERT id+id*id$ E → T ER
$ERTRF id+id*id$ T → F TR
$ERTRid id+id*id$ F → id
$ERTR +id*id$
$ER +id*id$ TR → 
$ERT+ +id*id$ ER → + T ER
$ERT id*id$
$ERTRF id*id$ T → F TR
$ERTRid id*id$ F → id
$ERTR *id$
$ERTRF* *id$ TR → * F TR
$ERTRF id$
$ERTRid id$ F → id
$ERTR $
$ER $ TR → 
$ $ ER → 
33

Panic Mode Recovery

FOLLOW(E) = { $ ) }
FOLLOW(ER) = { $ ) }
Add synchronizing actions to FOLLOW(T) = { + $ ) }
undefined entries based on FOLLOW FOLLOW(TR) = { + $ ) }
FOLLOW(F) = { * + $ ) }

id + * ( ) $
E E → T ER E → T ER synch synch
ER ER → + T ER ER →  ER → 
T T → F TR synch T → F TR synch synch
TR TR →  TR → * F TR TR →  TR → 
F F → id synch synch F→(E) synch synch
synch: pop A and skip input till synch token
or skip until FIRST(A) found
34

Phrase-Level Recovery
Change input stream by inserting missing *
For example: id id is changed into id * id

id + * ( ) $
E E → T ER E → T ER synch synch
ER ER → + T ER ER →  ER → 
T T → F TR synch T → F TR synch synch
TR insert * TR →  TR → * F TR TR →  TR → 
F F → id synch synch F→(E) synch synch

insert : insert missing and redo the production

Error Productions
E → T ER
Add error production:
ER → + T ER | 
TR → F TR
T → F TR
to ignore missing *, e.g.: id id
TR → * F TR | 
F → ( E ) | id

id + * ( ) $
E E → T ER E → T ER synch synch
ER ER → + T ER ER →  ER → 
T T → F TR synch T → F TR synch synch
TR TR → F T R TR →  TR → * F TR TR →  TR → 
F F → id synch synch F→(E) synch synch

Role of Parser in Compiler Design
No ratings yet
Role of Parser in Compiler Design
31 pages
Chapter 4 - Syntax Analysis Part 1
No ratings yet
Chapter 4 - Syntax Analysis Part 1
36 pages
Free Parser for Syntax Analysis
No ratings yet
Free Parser for Syntax Analysis
36 pages
Syntax Analysis in Compiler Design
No ratings yet
Syntax Analysis in Compiler Design
39 pages
Syntax Analysis: Parsing Techniques Overview
No ratings yet
Syntax Analysis: Parsing Techniques Overview
93 pages
Syntax Analysis: COP5621 Compiler Construction
No ratings yet
Syntax Analysis: COP5621 Compiler Construction
36 pages
Parser and Syntax Analysis in Compilers
No ratings yet
Parser and Syntax Analysis in Compilers
61 pages
Role of the Parser in Compilers
No ratings yet
Role of the Parser in Compilers
53 pages
Syntactic Analysis in Compiler Design
No ratings yet
Syntactic Analysis in Compiler Design
44 pages
Parsing Techniques Overview
No ratings yet
Parsing Techniques Overview
68 pages
Syntax Analysis and Parsing Techniques
No ratings yet
Syntax Analysis and Parsing Techniques
34 pages
Understanding Syntax Analysis in Parsing
No ratings yet
Understanding Syntax Analysis in Parsing
38 pages
Role of Parser in Compiler Design
No ratings yet
Role of Parser in Compiler Design
82 pages
Top-Down Parsing in Compiler Design
No ratings yet
Top-Down Parsing in Compiler Design
34 pages
Unit - II Top Down Parsing
No ratings yet
Unit - II Top Down Parsing
67 pages
Syntax Analysis: Parsing Techniques Explained
No ratings yet
Syntax Analysis: Parsing Techniques Explained
73 pages
Syntax Analysis and Parsing Techniques
No ratings yet
Syntax Analysis and Parsing Techniques
81 pages
Parsing Techniques: Top-Down vs Bottom-Up
No ratings yet
Parsing Techniques: Top-Down vs Bottom-Up
49 pages
Role of Parser in Compiler Design
No ratings yet
Role of Parser in Compiler Design
34 pages
Compiler Design: Syntax Analysis Overview
No ratings yet
Compiler Design: Syntax Analysis Overview
91 pages
Non-Recursive Predictive Parsing Explained
No ratings yet
Non-Recursive Predictive Parsing Explained
14 pages
Syntax Analyzer and Context-Free Grammars
No ratings yet
Syntax Analyzer and Context-Free Grammars
27 pages
Parser Classification in Syntax Analysis
No ratings yet
Parser Classification in Syntax Analysis
37 pages
Compiler Design: Syntax Analyzers & Parsing
No ratings yet
Compiler Design: Syntax Analyzers & Parsing
117 pages
Predictive Parsing and Left Recursion
No ratings yet
Predictive Parsing and Left Recursion
34 pages
Top Down Parsing Techniques Explained
No ratings yet
Top Down Parsing Techniques Explained
10 pages
Top-Down Parsing Techniques Explained
No ratings yet
Top-Down Parsing Techniques Explained
111 pages
Syntax Analysis in Compiler Design
No ratings yet
Syntax Analysis in Compiler Design
74 pages
Syntax Analysis in Parsing Techniques
No ratings yet
Syntax Analysis in Parsing Techniques
92 pages
Understanding Top-Down Parsing Techniques
No ratings yet
Understanding Top-Down Parsing Techniques
41 pages
Syntax Analysis and Parsing Techniques
No ratings yet
Syntax Analysis and Parsing Techniques
122 pages
Syntax Analysis in Compiler Design
No ratings yet
Syntax Analysis in Compiler Design
68 pages
Top-Down Parsing Techniques in Compiler Design
No ratings yet
Top-Down Parsing Techniques in Compiler Design
30 pages
Top-Down Parsing Techniques Explained
No ratings yet
Top-Down Parsing Techniques Explained
158 pages
Film Parsing Techniques Explained
No ratings yet
Film Parsing Techniques Explained
105 pages
Top-Down Parsing and Syntax Analysis
No ratings yet
Top-Down Parsing and Syntax Analysis
67 pages
Syntax Analysis and Parsing Techniques
No ratings yet
Syntax Analysis and Parsing Techniques
29 pages
Syntax Analysis and Parsing Techniques
No ratings yet
Syntax Analysis and Parsing Techniques
6 pages
Predictive Parsing Techniques Explained
No ratings yet
Predictive Parsing Techniques Explained
35 pages
Syntax Analysis and Parsing Techniques
No ratings yet
Syntax Analysis and Parsing Techniques
54 pages
Syntax Analysis in Compiler Design
No ratings yet
Syntax Analysis in Compiler Design
60 pages
Top-Down Parsing in Compiler Design
100% (1)
Top-Down Parsing in Compiler Design
60 pages
Understanding Compilers and Parsing Techniques
No ratings yet
Understanding Compilers and Parsing Techniques
96 pages
Top-Down Parsing Techniques Explained
No ratings yet
Top-Down Parsing Techniques Explained
45 pages
Understanding Context-Free Grammar and Parsing
No ratings yet
Understanding Context-Free Grammar and Parsing
90 pages
Elimination of Left Recursion in Grammars
No ratings yet
Elimination of Left Recursion in Grammars
32 pages
Syntax Analysis in Compiler Design
No ratings yet
Syntax Analysis in Compiler Design
39 pages
Unit II - Basic Parsing Techniques: 1. Introduction To Parsers
No ratings yet
Unit II - Basic Parsing Techniques: 1. Introduction To Parsers
10 pages
Ambiguity and Parsing Techniques Explained
No ratings yet
Ambiguity and Parsing Techniques Explained
24 pages
Understanding Syntax Analysis in Compilers
No ratings yet
Understanding Syntax Analysis in Compilers
168 pages
Understanding Syntax Analysis in Compilers
No ratings yet
Understanding Syntax Analysis in Compilers
75 pages
Context-Free Grammar and Parse Trees
No ratings yet
Context-Free Grammar and Parse Trees
10 pages
LL(1) Top-Down Parsing Techniques
No ratings yet
LL(1) Top-Down Parsing Techniques
45 pages
Syntax Analysis in Compiler Design
No ratings yet
Syntax Analysis in Compiler Design
51 pages
Compiler Design: Syntax Analysis Overview
No ratings yet
Compiler Design: Syntax Analysis Overview
67 pages
Predictive Parsing and CFG Overview
No ratings yet
Predictive Parsing and CFG Overview
7 pages
Syntax Analysis in Compiler Design
No ratings yet
Syntax Analysis in Compiler Design
180 pages
Understanding Parsing Techniques
No ratings yet
Understanding Parsing Techniques
24 pages
Parsing Techniques and CFG Overview
No ratings yet
Parsing Techniques and CFG Overview
180 pages
Understanding Idioms and Grammar Basics
No ratings yet
Understanding Idioms and Grammar Basics
14 pages
Anthropology: Subfields and Connections
No ratings yet
Anthropology: Subfields and Connections
10 pages
18th Century English Language Evolution
No ratings yet
18th Century English Language Evolution
4 pages
Whole-Class Repeated Reading for Fluency
No ratings yet
Whole-Class Repeated Reading for Fluency
22 pages
11th Grade English Test Paper
No ratings yet
11th Grade English Test Paper
2 pages
Expressing Preferences in English
No ratings yet
Expressing Preferences in English
1 page
Context-Free Grammar and Derivations
No ratings yet
Context-Free Grammar and Derivations
35 pages
Irregular Verbs List and Forms
No ratings yet
Irregular Verbs List and Forms
5 pages
Grade 9 Students' Pronunciation Skills
No ratings yet
Grade 9 Students' Pronunciation Skills
16 pages
Grade 11 Past Tense Study Guide
No ratings yet
Grade 11 Past Tense Study Guide
4 pages
Learning English as a Second Language
No ratings yet
Learning English as a Second Language
8 pages
Speech Evaluation Report for Articulation Disorder
No ratings yet
Speech Evaluation Report for Articulation Disorder
4 pages
Dyslexia: Cognitive Dysfunction Insights
No ratings yet
Dyslexia: Cognitive Dysfunction Insights
10 pages
EFL 25-Hour Revision Course for Kids
No ratings yet
EFL 25-Hour Revision Course for Kids
166 pages
Teaching Effective Pronunciation Techniques
No ratings yet
Teaching Effective Pronunciation Techniques
8 pages
STD 6, Worksheet - Term ICSE
No ratings yet
STD 6, Worksheet - Term ICSE
5 pages
Understanding Inchoative and Sense Verbs
No ratings yet
Understanding Inchoative and Sense Verbs
14 pages
Cantonese Language Programmes Overview
No ratings yet
Cantonese Language Programmes Overview
9 pages
Portuguese Language Learning Verification
No ratings yet
Portuguese Language Learning Verification
3 pages
Classroom Rules and English Basics
100% (1)
Classroom Rules and English Basics
30 pages
Global Test Common Core
No ratings yet
Global Test Common Core
2 pages
Past Simple Tense Exercises Guide
No ratings yet
Past Simple Tense Exercises Guide
2 pages
Comparative Adjectives and Luck Stories
No ratings yet
Comparative Adjectives and Luck Stories
3 pages
Grade 7 English Daily Lesson Log
No ratings yet
Grade 7 English Daily Lesson Log
7 pages
Linguistic Reinforcement Course Overview
No ratings yet
Linguistic Reinforcement Course Overview
2 pages
Grade 4 English Periodical Test Guide
No ratings yet
Grade 4 English Periodical Test Guide
6 pages
Sociolinguistics and Social Semiotics
No ratings yet
Sociolinguistics and Social Semiotics
10 pages
English/Sepedi Math Dictionary for Grades R-3
No ratings yet
English/Sepedi Math Dictionary for Grades R-3
64 pages
Samarkand State Institute Transcript
No ratings yet
Samarkand State Institute Transcript
2 pages
English Phonetics Test: Fill-in & Transcription
No ratings yet
English Phonetics Test: Fill-in & Transcription
2 pages

Syntax Analysis: COP5621 Compiler Construction

Uploaded by

Syntax Analysis: COP5621 Compiler Construction

Uploaded by

1

COP5621 Compiler Construction

Position of a Parser in the

Error Recovery Strategies

Notational Conventions Used

Chomsky Hierarchy: Language

L(regular)  L(context free)  L(context sensitive)  L(unrestricted)

Where L(T) = { L(G) | G is of type T }

Left Recursion (Recap)

General Left Recursion

Example Left Rec. Elimination

1. FIRST(i)  FIRST(j) =  for all i  j

Grammar Not LL(1) because

Recursive Descent Parsing

Using FIRST and FOLLOW to

FIRST(+ term rest) = { + }

Constructing a Predictive Parsing

Example Table A→ FIRST() FOLLOW(A)

LL(1) Grammars are

Predictive Parsing Program

Example Table-Driven Parsing

Panic Mode Recovery

insert *: insert missing * and redo the production

You might also like

insert : insert missing and redo the production