0% found this document useful (0 votes)

74 views

Assignment 2 04042021 045308pm

The document discusses writing a lexical analyzer in C++. It describes tokenizing keywords, special characters, operators and numbers from a given input. It also discusses implementing a symbol table to store identifiers and their attributes.

Uploaded by

atif

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

74 views

Assignment 2 04042021 045308pm

Uploaded by

atif

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

Bahria University, Islamabad Campus

Department of Computer Science

Assignment # 2
Class: BS(CS) -5A/B
(Fall 2019 Semester)

Course: Compiler Construction Date: / /2021

Deadline: / /2021 Total Marks: 20

Name:saif ali khan enrollmenr:01-134191-092

A scanner needs to compute as many attributes as are necessary to allow further processing.
Since the scanner will have to compute possibly several attributes for each token. It is often
helpful to collect all the attributes into a single structured data type which we could call a token
record. Such a record could be declared in C as
typedef struct {
TokenType tokenval ;
char * stringval;
Int numval;
} TokenRecord ;
A common arrangement is for the scanner to return the token value only and place the other
attributes in global variables where they can be accessed by other parts of compiler. Although the
task of scanner is to convert the entire source program into a sequence of tokens, the scanner will
rarely do this all at once. Instead the scanner will operate under the control of parser , returning
the single next token from the input on demand. So the scanner will be declared as a function
such as
TokenType getToken(void);
Regular expressions: Regular expressions represent patterns of string of characters. Patterns
recognized by scanner are defined by Regular expressions.
Reserved words and identifiers: Reserved words are the simplest to write a regular expression:
they are represented by their fixed sequence of characters. If we wanted to collect the reserved
words into one definition, we could write something like
Reserved = if | while | do | ….
Identifiers: identifiers are strings of characters which are not fixed. Typically an identifier must
begin with a letter and contain only letters and digits. We can express this in terms of regular
definitions as letter = [a-zA-Z] digit = [0-9]
identifier = letter(letter|digit)*
Numbers: Numbers can be just sequence of digits (natural numbers), or decimal numbers, or
numbers with an exponent. We can write regular definitions for these numbers as follows:
nat = [0-9]+ signedNat = (+|-)? nat number =
signedNat (“.” nat) ? (E signedNat) ?
Finite automate: Finite automata or finite-state machines are a mathematical way of describing
particular kind of algorithms. In particular, finite automata can be used to describe the process of
recognizing patterns in input strings, and so can be used to construct scanners. Finite automata
can be described using transition diagrams the following example illustrates this. The transition
diagram makes it easy to visualize the scanner algorithm and code can be written easily by hand
if the scanner it to process simple language. consider the following operators. < , <= , <> , > ,
>= , =
We can represent them as a transition diagram. Then we can write code using the transition
diagram.

Transition Table: In the above code example, the Finite automata has been hardwired right into
the code. It is also possible to express the DFA as a data structure and then write “generic” code
that will take its actions from the data structure.
A simple data structure that is adequate for this purpose is a transition table. A two dimensional
array, indexed by state and input character that expresses the values of the transition function T.
Consider the DFA for identifier:

The DFA can be represented by the following transition table.

Input char Letter Digit other
State
1 2 Error Error
2 2 2 3
3 Yes Accept
Then the generic code can be expressed as:
State = 1
ch = next input character ;
while not Accept [ state ] and not error state do
new state = T[state ,ch]
if Advance[state,ch] then ch = next input
char state = new state end while if
Accept[state] then accept ;
Lexical Analyzer in C++
Now you have studied the basic theory to code for a Finite Automata. Using the suggested style
of coding write code to recognize the following key words and language constructs.
Keywords.:
If, do, for, while, begin , end , switch , else , break.
All Special Characters in C++:
; , [, ] , ( , ) , { , } ,
Operators:
The table is given below. The precedence at the top is maximum, at the bottom minimum.

Operators associativity
* / % multiply, divide, mod Left to right
+ - add, subtract Left to right
<< >> shift left, shift right Left to right
< <= > >= less than, less or equal, greater, greater or equal Left to right
== != equal, not equal Left to right
&& logical and Left to right
|| logical or Left to right
= += -= *= /= %= >>= <<= assignments Right to left

Tasks

1- Read file character by character in a char variable.

2- Check whether the character read is a space, tab, newline, if so skip and read next
character.
3- Consume C style comments starting with // consume all character until newline char
found.
4- If the character just read is not a space character, then proceed as follows. You can
divide and handle tokens in the following categories.
5- Tokens comprising of a single character only – like comma, semicolon, parenthesis,
brackets, arithmetic operators etc. --- so handle them first.
6- Task 5. Detect key words return corresponding tokens. 7- Task 6. Detect
identifiers return corresponding tokens.
8- Make a symbol table for identifiers with the following functions: entry
*Search(string). entry * make_entry(string).

#include <iostream>
#include<fstream>
#include<string>
#include<cctype>
using namespace std;
ifstream f("symbol.txt",ios::in);
char ch;
int i=0;
typedef enum {IF=1,ELSE,WHILE,DO,FOR,LPRN,RPRN,OPR,Break, Switch, Auto, Cin,Goto,
True,
False,UID,FLOAT,LCRBR,RCRBR,LSQRBR,RSQRBR,PLUSPLUS,PLUS,PLUSEQ,EQEQ,E
QUAL,Div,Not,
Mul,SEMICLN,CLN,COMMA,COMMENT,LESS,LESSEQ,GREATER,GREATEREQ,INT,ST
RING,CHAR,DOUBLE}TokenType;

struct token{
TokenType TknType;
int no;
string name;
string entry_no;
};
token tk[30];
token t;
static int a=0;

token makeToken(){
while(!f.eof()){
ch=f.get();//a
if (isalpha(ch)|| ch=='_'){
string s=" ";
i++;
do{
s=s+ch;//a
ch=f.get();//b
if(ch==' '||ch=='\n'||ch=='\t'){
t.name =s;
t.entry_no="Varriable"<<endl;
t.no=i;
tk[a]=t;
a++;
}
}
while(isalpha(ch)|| ch=='_'||isdigit(ch));
}
else if(isdigit(ch)){
string s =" ";
i++;
do{
s=s+ch;
ch=f.get();

if(ch==' '||ch=='\n'||ch=='\t'){
t.name=s;
t.entry_no="NUM";
t.no==i;
return t;
}
}
while(isdigit(ch));
}

else if(ch=='(') {
t.TknType=LPRN;
t.entry_no="LPRN";
returntokn;}
elseif(ch==')'){
t.TknType=RPRN;
t.entry_no="RPRN";
return t;
}
elseif(ch=='['){
t.TknType=LSQRBR;
t.entry_no="LSQRBR";
return t;
}
elseif(ch==']')
{
t.TknType=RSQRBR;
t.entry_no="RSQRBR";
return t;
}
elseif(ch=='{'){
t.TknType=LCRBR;
t.entry_no="LCRBR";
return t;
}
elseif(ch=='}'){
t.TknType=RCRBR;
t.entry_no="RCRBR";
return t;
}
elseif(ch==';'){
t.TknType=CLN;
t.entry_no="CLN";
return t;
}
elseif(ch==':'){
t.TknType=SEMICLN;
t.entry_no="SEMICLN";
return t;
}elseif(ch==','){
t.TknType=COMMA;
t.entry_no="COMMA";
return t;
}
elseif( ch=='/'){
if(ch=='/')
{
t.TknType=COMMENT;
t.entry_no="COMMENT";
return t;
}
elseif(ch=='*'){
t.TknType=COMMENT;
t.entry_no="COMMENT";
return t;
}
}
elseif(ch=='*'){
if( ch=='/'){
t.TknType=COMMENT;
t.entry_no="COMMENT";
return t;
}
}
elseif(ch=='+'){
if(ch=='=')
{
t.TknType=PLUSEQ;
t.entry_no="PLUSEQ";
return t;
}
elseif(ch=='+'){
t.TknType=PLUSPLUS;
t.entry_no="PLUSPLUS";
return t;
}
else{
t.TknType=PLUS;
t.entry_no="PLUS";
return t;
}
}
elseif(ch=='='){
if(ch=='='){
t.TknType=EQEQ;
t.entry_no="EQEQ";
return t;
}
else{
t.TknType=EQUAL;
t.entry_no="EQAUL";
return t;
}
}
elseif(ch=='<'){
if(ch=='='){
t.TknType=LESSEQ;
t.entry_no="LESSEQ";
return t;
}
else
{
t.TknType=EQUAL;
t.entry_no="EQUAL";
return t;
}
}
elseif(ch=='>'){
if(ch=='='){
t.TknType=GREATEREQ;
t.entry_no="GREATEREQ";
return t;
}
else{
t.TknType=GREATER;
t.entry_no="GREATER";
return t;
}
if(s=="int"){
t.TknType=INT;
t.entry_no="NULL";
return t;
}
elseif(s=="string"){
t.TknType=STRING;
t.entry_no="NULL";
return t;
}
elseif(s=="char"){
t.TknType=CHAR;
t.entry_no="NULL";
return t;
}
elseif(s=="for")
{
t.TknType=FOR;
t.entry_no="NULL";
return t;
}
elseif(s=="while"){
t.TknType=WHILE;
t.entry_no="NULL";
return t;
}
elseif(s=="do"){
t.TknType=DO;
t.entry_no="NULL";
return t;
}
elseif(s=="if"){
t.TknType=IF;
t.entry_no="NULL";
return t;
}
elseif(s=="else"){
t.TknType=ELSE;
t.entry_no="NULL";
return t;
}
elseif(s=="float"){
t.TknType=FLOAT;
t.entry_no="NULL";
return t;
}
elseif(s=="double"){
t.TknType=DOUBLE;
t.entry_no="NULL";
return t;
}

}
}
}

}
int main(){
makeToken();
for(int i=0;i<30;i++){
cout<<tk[i].TknType<<" "<<tk[i].name<<" "<<tk[i].no<<" "<<tk[i].entry_no<<endl;
}
return 0;
}

Binary Trading
91% (64)
Binary Trading
120 pages
Grade5 Large Numbers PDF
82% (33)
Grade5 Large Numbers PDF
3 pages
Evangelism by Fire PDF
25% (4)
Evangelism by Fire PDF
2 pages
Invoice PDF
No ratings yet
Invoice PDF
1 page
Technical Service Manual: Prismasync V3.2
No ratings yet
Technical Service Manual: Prismasync V3.2
164 pages
Mentcare Case Study 05042021 072850pm
No ratings yet
Mentcare Case Study 05042021 072850pm
3 pages
Bed Making
No ratings yet
Bed Making
24 pages
Top 50 AWS Interview Questions & Answers
No ratings yet
Top 50 AWS Interview Questions & Answers
10 pages
Assignment 2 04042021 045308pm
No ratings yet
Assignment 2 04042021 045308pm
14 pages
CC Assignment
No ratings yet
CC Assignment
11 pages
01 134201 011 9556776808 04042022 115152pm
No ratings yet
01 134201 011 9556776808 04042022 115152pm
13 pages
Chapter 33
No ratings yet
Chapter 33
107 pages
Applications of FA
No ratings yet
Applications of FA
29 pages
unit5
No ratings yet
unit5
43 pages
2.00222E+12 Compiler Design Lab Record 2023 12 20
No ratings yet
2.00222E+12 Compiler Design Lab Record 2023 12 20
173 pages
lect03
No ratings yet
lect03
19 pages
Lexical Analysis: Textbook:Modern Compiler Design
No ratings yet
Lexical Analysis: Textbook:Modern Compiler Design
43 pages
1st Phase Lexical Analyzer
No ratings yet
1st Phase Lexical Analyzer
33 pages
Chapter 3 - Lexical Analysis
100% (1)
Chapter 3 - Lexical Analysis
51 pages
Compiler Lab
No ratings yet
Compiler Lab
28 pages
Assignment 1 (Lexical Analyzer)
No ratings yet
Assignment 1 (Lexical Analyzer)
17 pages
3 - Lexical Analysis (Compatibility Mode) PDF
No ratings yet
3 - Lexical Analysis (Compatibility Mode) PDF
28 pages
LA Using Transition Table
No ratings yet
LA Using Transition Table
5 pages
2 Scan 1
No ratings yet
2 Scan 1
24 pages
UNIT-I - Lexical Analysis
No ratings yet
UNIT-I - Lexical Analysis
51 pages
CD File 380
No ratings yet
CD File 380
42 pages
Implementation of Finite Automat in Code
No ratings yet
Implementation of Finite Automat in Code
18 pages
Rajat Prasad CD File
No ratings yet
Rajat Prasad CD File
39 pages
Compiler Lab Print Merged
No ratings yet
Compiler Lab Print Merged
45 pages
Practical File: Be (Cse) 6 Semester
No ratings yet
Practical File: Be (Cse) 6 Semester
54 pages
02. Chapter 3 - Lexical Analysis
No ratings yet
02. Chapter 3 - Lexical Analysis
51 pages
2 - Scanner
No ratings yet
2 - Scanner
49 pages
Chapter 3 - Lexical Analysis
No ratings yet
Chapter 3 - Lexical Analysis
51 pages
unit2
No ratings yet
unit2
93 pages
Compiler Course: Lexical Analysis
No ratings yet
Compiler Course: Lexical Analysis
50 pages
Ch2-CC
No ratings yet
Ch2-CC
47 pages
Programming Languaged Scanning Week 1-2
No ratings yet
Programming Languaged Scanning Week 1-2
7 pages
Cmp 335 Regular Expression Exercises Note
No ratings yet
Cmp 335 Regular Expression Exercises Note
18 pages
Lecture 3
No ratings yet
Lecture 3
22 pages
CD Dhruv Tyagi 167.PDF
No ratings yet
CD Dhruv Tyagi 167.PDF
36 pages
Chapter 2 - 1 Lexical Analysis
No ratings yet
Chapter 2 - 1 Lexical Analysis
30 pages
Chapter 3 - Lexical Analysis
100% (3)
Chapter 3 - Lexical Analysis
51 pages
Chapter 3 - Lexical Analysis
No ratings yet
Chapter 3 - Lexical Analysis
34 pages
CD Record
No ratings yet
CD Record
84 pages
Compiler all practicals
No ratings yet
Compiler all practicals
38 pages
CD_Unit II_Notes
No ratings yet
CD_Unit II_Notes
20 pages
Compiler Construction Notes
No ratings yet
Compiler Construction Notes
21 pages
2024 CSN352 Lec 8
No ratings yet
2024 CSN352 Lec 8
48 pages
Lecture 3-4 Updated
No ratings yet
Lecture 3-4 Updated
26 pages
CD PPTS 2
No ratings yet
CD PPTS 2
27 pages
slides chp 3 and 4
No ratings yet
slides chp 3 and 4
21 pages
Lecture 2.76
No ratings yet
Lecture 2.76
31 pages
Delhi Technological University Department of Computer Science and Engineering
No ratings yet
Delhi Technological University Department of Computer Science and Engineering
57 pages
Compiler Construction: Lexical Analysis
No ratings yet
Compiler Construction: Lexical Analysis
37 pages
unit1
No ratings yet
unit1
34 pages
Compiler Lab Manual
No ratings yet
Compiler Lab Manual
88 pages
Lexical Analysis
No ratings yet
Lexical Analysis
29 pages
Unit 2-Introduction to Compilers
No ratings yet
Unit 2-Introduction to Compilers
51 pages
SPCC Practicalss
No ratings yet
SPCC Practicalss
6 pages
CD Lab File 181
No ratings yet
CD Lab File 181
42 pages
Compiler Design Practical File PDF
No ratings yet
Compiler Design Practical File PDF
33 pages
Lecture08 4up
No ratings yet
Lecture08 4up
5 pages
Lecture II - Lexical Analysis.handouts
No ratings yet
Lecture II - Lexical Analysis.handouts
71 pages
compiler construction Lecture 3-4
No ratings yet
compiler construction Lecture 3-4
78 pages
Chapter 3 - Scanning: 3.1 Kinds of Tokens
No ratings yet
Chapter 3 - Scanning: 3.1 Kinds of Tokens
17 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
C Programming
From Everand
C Programming
Netra
No ratings yet
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
CC Ass 2
No ratings yet
CC Ass 2
9 pages
Name:atif Ali Enrollment: (01-134191-008)
No ratings yet
Name:atif Ali Enrollment: (01-134191-008)
15 pages
Assignment No. 2: Instructions
No ratings yet
Assignment No. 2: Instructions
2 pages
2 Assignment No 2 Chap 4 03042021 020349am
No ratings yet
2 Assignment No 2 Chap 4 03042021 020349am
3 pages
Feb 2024 Timesheet Surya
No ratings yet
Feb 2024 Timesheet Surya
6 pages
Male Sterilization: Vasectomy
No ratings yet
Male Sterilization: Vasectomy
1 page
Screw Thread Measurement
No ratings yet
Screw Thread Measurement
55 pages
All-Terrain Wheel-Legged Robot
No ratings yet
All-Terrain Wheel-Legged Robot
18 pages
SHIPOWNERS Sampling Procedures For Tankers
No ratings yet
SHIPOWNERS Sampling Procedures For Tankers
5 pages
Yash C++ Project
No ratings yet
Yash C++ Project
28 pages
WELTECH PQ (20210203)
No ratings yet
WELTECH PQ (20210203)
52 pages
Solidworks Flow Simulation Instructor Guide: Presenter Date
No ratings yet
Solidworks Flow Simulation Instructor Guide: Presenter Date
19 pages
STRIDE BP Home BP Monitors 01 Jul 2024
No ratings yet
STRIDE BP Home BP Monitors 01 Jul 2024
4 pages
Instrucciones Uso Tyvek 800 J
No ratings yet
Instrucciones Uso Tyvek 800 J
24 pages
The Simplest Radio Locator
No ratings yet
The Simplest Radio Locator
15 pages
Adeeb Ansari: Cell No: 9168557876 Address: Bhivandi, Kalyan, Mumbai. Skype Id: Adeebansari
No ratings yet
Adeeb Ansari: Cell No: 9168557876 Address: Bhivandi, Kalyan, Mumbai. Skype Id: Adeebansari
2 pages
MM ZG512 Manufacturing Strategy: Rajiv Gupta BITS Pilani Live Lecture 6
No ratings yet
MM ZG512 Manufacturing Strategy: Rajiv Gupta BITS Pilani Live Lecture 6
26 pages
Qualitative Research Method
100% (2)
Qualitative Research Method
22 pages
BBA Book List Christ PDF
No ratings yet
BBA Book List Christ PDF
169 pages
Linux Practicals Final Updated 14-10-2011
No ratings yet
Linux Practicals Final Updated 14-10-2011
227 pages
Cats vs. Gravity: Quiz By: RMD
No ratings yet
Cats vs. Gravity: Quiz By: RMD
2 pages
2024 Academic Handbook
No ratings yet
2024 Academic Handbook
94 pages
Atitude test section 2 B
No ratings yet
Atitude test section 2 B
4 pages
PSI ASTMMaterialCodes
No ratings yet
PSI ASTMMaterialCodes
1 page
Real Estate Advertising Guidelines
No ratings yet
Real Estate Advertising Guidelines
26 pages
CarlyJohnson AntoniaSousa LoveBeautyandPlanet
No ratings yet
CarlyJohnson AntoniaSousa LoveBeautyandPlanet
145 pages
CSC103 - Programming Fundamentals: Program Development Lecture 3-4
No ratings yet
CSC103 - Programming Fundamentals: Program Development Lecture 3-4
46 pages

Uploaded by

Uploaded by

Bahria University, Islamabad Campus

Department of Computer Science

Course: Compiler Construction Date: / /2021

Name:saif ali khan enrollmenr:01-134191-092

The DFA can be represented by the following transition table.

1- Read file character by character in a char variable.

You might also like