0% found this document useful (0 votes)

5 views

Parser

The document discusses context-free grammars and their role in programming language syntax, detailing various types of parsers including top-down and bottom-up methods. It addresses common programming errors, error recovery strategies, and the differences between context-free grammars and regular expressions. Additionally, it covers concepts like FIRST and FOLLOW sets, LL(1) grammars, and parsing techniques such as recursive-descent and shift-reduce parsing.

Uploaded by

paled27319

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

Parser

Uploaded by

paled27319

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 40

Context-Free Grammars

• Precise syntactic specifications of a

programming language
• For some classes, we can construct
automatically an efficient parser
• Allows a language to evolve
The Parser
The Parser

Three general types of parsers

Universal parsing methods:

• can parse any grammars
• too inefficient to use in production compilers
The Parser

Three general types of parsers

Top-down methods:
• Parse-trees built from root to leaves.
• Input to parser scanned from left to right one symbol at a time
The Parser

Three general types of parsers

Bottom-up methods:
• Start from leaves and work their way up to the root.
• Input to parser scanned from left to right one symbol at a time
Dealing With Errors
If compiler had to process only correct programs, its
design and implementation would be simplified greatly!

• Few languages have been designed with

error handling in mind.
• Error handling is left to compiler designer.
• Bugs caused about 50% of the total cost,
same as they used to be 50 years ago!
Common Programming Errors
• Lexical errors: misspellings of
identifiers, keywords, or operators
• Syntactic errors: misplaced semicolons,
extra or missing braces, case without
switch, … .
• Semantic errors: type mismatches
between operators and operands
• Logical errors: anything else!
Wish List
• Report the presence of errors clearly
and accurately
• Recover from each error quickly enough
to detect subsequent errors
• Add minimal overhead to the processing
of correct programs

Easier said than done!

Error-Recovery Strategies
• Simplest: quit with an informative error
message when detecting the first error
• Panic-mode Recovery: discards input
symbols one at a time until a designated
synchronizing tokens is found.
• Phrase-level Recovery: perform local
correction on the remaining input. The
choice of local correction is left to the
compiler designer.
• Error Production: production rules for
common errors.
Context-Free Grammar

Terminals Nonterminals
(token name)
Example:

Start Productions
Symbol
Derivations
• Starting with start symbol
• At each step: a nonterminal replaced
with the body of a production

Example:

Deriving: -(id + id)

More on Derivations
means derive in one step

means derive in zero or more steps

means derive in one or more steps

Leftmost derivations, the leftmost nonterminal in each sentential is always

chosen.

Rightmost derivations, the rightmost nonterminal in each sentential is

always chosen.
Example
For the context-free grammar:
Parse Trees
• What is the relationship between a
parse-tree and derivations?
– Parse tree is the graphical representation
of derivations
– Filters out order of nonterminal
replacement
– many-to-one relationship between
derivations and parse-tree
Context-Free Grammar Vs
Regular Expressions
• Grammars are more powerful notations than
regular expressions
– Every construct that can be described by a regular
expression can be described by a grammar, but not
vice-versa

Regular expression -> NFA then:

(a|b)*abb
Question Worth Asking
If grammars are much powerful than regular
expressions, why not using them in lexical
analysis too?
• Lexical rules are quite simple and do not
need notation as powerful as grammars
• Regular expressions are more concise and
easier to understand for tokens
• More efficient lexical analyzers can be
generated from regular expressions than
from grammars
How Can We Enhance Our
Grammar?
• Eliminating ambiguity
• Eliminating left-recursion
• Left factoring
Eliminating Ambiguity
Sometimes we can re-write grammar to
eliminate ambiguity
Eliminating Left-Recursion

How about something like:

Left-Factoring
• A way of delaying the decision until
more info is available

Example:

stmt -> EXP else stmt | EXP

EXP -> if expr then stmt
Top-Down Parsing
• Constructing a parse tree for an input
string starting from root
• Parse tree built in preorder (depth-first)
• Finding left-most derivation
• At each step of a top-down parse:
– determine the production to be applied
– matching terminal symbols in production
body with input string
Given: and:
Recursive-Descent Parsing
How?
Example of Backtracking
and input
Important Concepts:
FIRST and FOLLOW
Example
FIRST FOLLOW

( id )$
+ε )$

( id +)$

*ε +)$

( id *+)$
LL(1) Grammars
• For recursive-descent parsers with no
backtracking
• L = scan from left to right
• L = left-most derivation
• 1 symbol lookahead
• Cannot be left-recursive or ambiguous
• If A-> F | T
– FIRST(F) and FIRST(T) are disjoint
– if ε is in FIRST(T) then FIRST(F) and FOLLOW(A)
are disjoint … and likewise when ε is in FIRST(F)
Parsing Table
Parsing Table
• Two dimensional array
– Rows: nonterminals Columns: input symbols
• M[A,a] where A is nonterminal and a is terminal
or $
• Gives the production rule to use.
First Follow
( id )$
+ε )$
( id +)$
*ε +)$
( id *+)$
Exercise
For the following productions:

S-> +SS | * SS | a

• Write predictive parser

• Write parsing table
• Show how to parse: +*aaa
Bottom-Up Parsing
• Given a string of terminals
• Build parse tree starting from leaves
and working up toward the root
• reverse of right-most derivation
• Used for type of grammars called LR
• LR parsers are difficult to build by hand
• We use automatic parser generators for
LR grammars
Given: and the string:
Shift-Reduce Parsing
• Form of bottom-up parsing
• Consists of:
– Stack: holds grammar symbols
– input buffer: holds the rest of the string to be
parsed
• Handle always appears on the top of the stack

Initial position: Final position (success)

Actions: shift, reduce, accept, error

Exercise
Let’s apply shift-reduce to the following
input: 00S11
and the following productions:
S-> 0S1 | 01

Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
Fundamental Nahw
100% (3)
Fundamental Nahw
146 pages
Chapter 3
No ratings yet
Chapter 3
96 pages
CD Unit-2
100% (1)
CD Unit-2
60 pages
Compiler Design - Syntax Analysis
No ratings yet
Compiler Design - Syntax Analysis
14 pages
Compiler Design Unit II-1
No ratings yet
Compiler Design Unit II-1
46 pages
2.2 - Syntax Analysis (Upto Top-down Parsing)
No ratings yet
2.2 - Syntax Analysis (Upto Top-down Parsing)
91 pages
Grammars
No ratings yet
Grammars
34 pages
Unit Iii
No ratings yet
Unit Iii
95 pages
Module 2 C D Notes
No ratings yet
Module 2 C D Notes
21 pages
CD UNIT-2
No ratings yet
CD UNIT-2
107 pages
Unit - Ii Topdown Parsing 1. Context-Free Grammars: Definition
No ratings yet
Unit - Ii Topdown Parsing 1. Context-Free Grammars: Definition
26 pages
CD UNIT 3
No ratings yet
CD UNIT 3
76 pages
CD Unit-Ii
No ratings yet
CD Unit-Ii
56 pages
Cs1622 Parsing Part2 Bun
No ratings yet
Cs1622 Parsing Part2 Bun
5 pages
CD Unit 2
No ratings yet
CD Unit 2
19 pages
3-Module 2 - Role of Parser - Parse Tree-02-08-2024
No ratings yet
3-Module 2 - Role of Parser - Parse Tree-02-08-2024
76 pages
CD UNIT II
No ratings yet
CD UNIT II
11 pages
CH03
No ratings yet
CH03
57 pages
Chapter 3 Syntax Analyzer
No ratings yet
Chapter 3 Syntax Analyzer
46 pages
CD Chapter-3
No ratings yet
CD Chapter-3
105 pages
ACD-UNIT-4 Notes
No ratings yet
ACD-UNIT-4 Notes
32 pages
Chapter 3 Syntax Analysis
No ratings yet
Chapter 3 Syntax Analysis
54 pages
Chapter-3 so far
No ratings yet
Chapter-3 so far
50 pages
APznzaYtAWjYy0s_GBEoizaF1ROv5e2pS_Nl6BcNYabrBN8gt4KeYj7LFiXdkYVxT_V92vXdgLmWE0ZcbyVltch5fozoqQQ4KdG766DLjO8aJsMIPKjEjniZOjL0qtNhMykCRh_ohPtDpZvrHNBAvbbZBhvxDpVEqpjDluyzuJGi-VI3NuG46DY_24QwGBEoRdfQYjfevW6tvweeRG (1)
No ratings yet
APznzaYtAWjYy0s_GBEoizaF1ROv5e2pS_Nl6BcNYabrBN8gt4KeYj7LFiXdkYVxT_V92vXdgLmWE0ZcbyVltch5fozoqQQ4KdG766DLjO8aJsMIPKjEjniZOjL0qtNhMykCRh_ohPtDpZvrHNBAvbbZBhvxDpVEqpjDluyzuJGi-VI3NuG46DY_24QwGBEoRdfQYjfevW6tvweeRG (1)
100 pages
Chapter 3
No ratings yet
Chapter 3
180 pages
UNIT 3 Syntax Analysis-Part1: Harshita Sharma
No ratings yet
UNIT 3 Syntax Analysis-Part1: Harshita Sharma
70 pages
Syntax Analysis
No ratings yet
Syntax Analysis
58 pages
Parsing
No ratings yet
Parsing
33 pages
Chapter 3 Syntax Analyzer1
No ratings yet
Chapter 3 Syntax Analyzer1
58 pages
Atcd Unit 2
No ratings yet
Atcd Unit 2
49 pages
Chapter – three
No ratings yet
Chapter – three
139 pages
Compiler Design Lec-Three Syntax Analysis
No ratings yet
Compiler Design Lec-Three Syntax Analysis
60 pages
CD Unit 2
No ratings yet
CD Unit 2
15 pages
CD Unit2
No ratings yet
CD Unit2
73 pages
compiler_design- Module3
No ratings yet
compiler_design- Module3
19 pages
Unit 2 Basic Parsing Techniques
No ratings yet
Unit 2 Basic Parsing Techniques
34 pages
Role of Parse1
No ratings yet
Role of Parse1
20 pages
CD Unit Ii
No ratings yet
CD Unit Ii
38 pages
Chapter – 3
No ratings yet
Chapter – 3
46 pages
UNIT-2(CD)
No ratings yet
UNIT-2(CD)
12 pages
Chapter-3-Syntax Analysis
No ratings yet
Chapter-3-Syntax Analysis
126 pages
Chapter 3 (2)
No ratings yet
Chapter 3 (2)
41 pages
CD Unit-Ii
No ratings yet
CD Unit-Ii
37 pages
Chapter-4 - CS-411 Compiler Construction
No ratings yet
Chapter-4 - CS-411 Compiler Construction
8 pages
Syntax Analysis (Part-I)
No ratings yet
Syntax Analysis (Part-I)
88 pages
Session 3
No ratings yet
Session 3
18 pages
CC_unit_3
No ratings yet
CC_unit_3
51 pages
Compiler Design Chapter-3
0% (1)
Compiler Design Chapter-3
177 pages
MODULE 3 Syntax Analysis
100% (1)
MODULE 3 Syntax Analysis
182 pages
CD Unit-Ii
No ratings yet
CD Unit-Ii
34 pages
CSE 4102 Syntax Analysis or Parsing
No ratings yet
CSE 4102 Syntax Analysis or Parsing
73 pages
CC 3
No ratings yet
CC 3
29 pages
Syntax Analyzer
No ratings yet
Syntax Analyzer
38 pages
Unit - 3 Syntax Analyzer
No ratings yet
Unit - 3 Syntax Analyzer
43 pages
KCA015 Unit2
No ratings yet
KCA015 Unit2
29 pages
Syntax Analysis Parsing (1)
No ratings yet
Syntax Analysis Parsing (1)
9 pages
Chapter 3 Syntax Analysis
No ratings yet
Chapter 3 Syntax Analysis
78 pages
PCD 1.4 Syntax Analysis
No ratings yet
PCD 1.4 Syntax Analysis
33 pages
C Depart
No ratings yet
C Depart
7 pages
Unit-2 F&CD
No ratings yet
Unit-2 F&CD
31 pages
Notes on the Verb to BE
No ratings yet
Notes on the Verb to BE
8 pages
Pragmatic Competence (Mouton Series in Pragmatics) (Naoko Taguchi)
No ratings yet
Pragmatic Competence (Mouton Series in Pragmatics) (Naoko Taguchi)
386 pages
Assistant Customer 1 Assistant Customer 2 Assistant Customer 3 Assistant Customer 4 Assistant Customer 5
No ratings yet
Assistant Customer 1 Assistant Customer 2 Assistant Customer 3 Assistant Customer 4 Assistant Customer 5
9 pages
Determine Rs
No ratings yet
Determine Rs
33 pages
Verb To Be - Theory
No ratings yet
Verb To Be - Theory
10 pages
#Growingforyou: Workbook
No ratings yet
#Growingforyou: Workbook
8 pages
Prosody in Relation To Paralinguistic PH
No ratings yet
Prosody in Relation To Paralinguistic PH
15 pages
2.1 Preterite
No ratings yet
2.1 Preterite
11 pages
EnglishFile4e Pre-Intermediate TG PCM Grammar 6A
No ratings yet
EnglishFile4e Pre-Intermediate TG PCM Grammar 6A
1 page
Lat English Grammar
No ratings yet
Lat English Grammar
32 pages
Future Tense
No ratings yet
Future Tense
12 pages
Ambiguity in grammar 1
No ratings yet
Ambiguity in grammar 1
23 pages
Syllabus of Sociolinguistics Now
No ratings yet
Syllabus of Sociolinguistics Now
1 page
Language Barrier Exploring The Lived Experiences o
No ratings yet
Language Barrier Exploring The Lived Experiences o
8 pages
08teamwork1 Extension Unit8
No ratings yet
08teamwork1 Extension Unit8
2 pages
Unit 3 past perfect
No ratings yet
Unit 3 past perfect
2 pages
Gateway 1 Term 1 Test 1 B (Answer Key)
No ratings yet
Gateway 1 Term 1 Test 1 B (Answer Key)
2 pages
Teaching Writing Skills
No ratings yet
Teaching Writing Skills
31 pages
Uses:: Modal Verb:Would
No ratings yet
Uses:: Modal Verb:Would
5 pages
Conceptual Metaphors in Mylo Xyloto Album by Coldplay: Selvia Neilil Kamaliah
No ratings yet
Conceptual Metaphors in Mylo Xyloto Album by Coldplay: Selvia Neilil Kamaliah
10 pages
Revisao 2 Anos
No ratings yet
Revisao 2 Anos
2 pages
1-GRAMMAR FOR 1º-2º ESO - Elementary (Cae)
No ratings yet
1-GRAMMAR FOR 1º-2º ESO - Elementary (Cae)
2 pages
Irregular Verbs List (1)
No ratings yet
Irregular Verbs List (1)
3 pages
1 - M26 - S4, Translation, Pr. Agliz
No ratings yet
1 - M26 - S4, Translation, Pr. Agliz
19 pages
EPREUVE ANGLAIS 4eme
100% (9)
EPREUVE ANGLAIS 4eme
2 pages
Text Analysis Based On Natural Language Processing NLP
No ratings yet
Text Analysis Based On Natural Language Processing NLP
10 pages
Diagramming Sentences Reference
No ratings yet
Diagramming Sentences Reference
2 pages
The Functional Analysis of English 3rd Edition by Thomas Bloor, Meriel Bloor ISBN 1444156659 9781444156652 - Download the ebook now for instant access to all chapters
100% (19)
The Functional Analysis of English 3rd Edition by Thomas Bloor, Meriel Bloor ISBN 1444156659 9781444156652 - Download the ebook now for instant access to all chapters
85 pages
Infinitive Simple Past Past Participle: English
No ratings yet
Infinitive Simple Past Past Participle: English
11 pages

Uploaded by

Uploaded by

Context-Free Grammars

• Precise syntactic specifications of a

Three general types of parsers

Universal parsing methods:

Three general types of parsers

Three general types of parsers

• Few languages have been designed with

Easier said than done!

Deriving: -(id + id)

means derive in zero or more steps

means derive in one or more steps

Leftmost derivations, the leftmost nonterminal in each sentential is always

Rightmost derivations, the rightmost nonterminal in each sentential is

Regular expression -> NFA then:

How about something like:

stmt -> EXP else stmt | EXP

• Write predictive parser

Initial position: Final position (success)

Actions: shift, reduce, accept, error

You might also like