SS Unit-2

The document provides an overview of language processors and assemblers, detailing the functions and activities involved in language processing, including lexical, syntax, and semantic analysis, as well as code generation and optimization. It also discusses tools like LEX and YACC for lexical analysis and parsing, and the structure and types of assemblers, including one-pass and two-pass assemblers. Additionally, it covers the symbol table's role in storing identifier information and the algorithm for a single-pass assembler in x86 architecture.

Uploaded by

harshkhokhariya10

Software Engineering – UNIT-2

Language Processors and Assemblers

1. Language Processors and Tools

1.1 What is a Language Processor?

A language processor is system software that translates a program written in one language
into another language.

Commonly:

Source Language        Target Language     Processor
High-level language    Machine language    Compiler / Interpreter
Assembly language      Machine language    Assembler

1.2 Language Processing Activities

When a source program is translated, the language processor performs the following main
activities:

1. Lexical Analysis

• Reads the source program character by character.
• Groups characters into meaningful units called tokens.

Example:

int a = 10;

Tokens:

• int → keyword
• a → identifier
• = → operator
• 10 → constant
• ; → separator
2. Syntax Analysis

• Checks whether the sequence of tokens follows the grammar rules.
• Builds a parse tree or syntax structure.

Example:

a = 10;

is syntactically correct.

3. Semantic Analysis

• Checks the meaning of statements, including:
  ◦ type compatibility
  ◦ undeclared variables
  ◦ multiple declarations

Example:

int a;
a = "hello";   /* error: type mismatch */
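The three checks listed above can be sketched over a very small statement format. The tuple layout (`"decl"`, `"assign"`) and the `check` function are hypothetical, chosen only to keep the example short.

```python
def check(statements):
    """Tiny semantic checker.

    Statements are hypothetical tuples:
      ("decl", type, name)   or   ("assign", name, value)
    """
    declared = {}   # name -> declared type
    errors = []
    for stmt in statements:
        if stmt[0] == "decl":
            _, typ, name = stmt
            if name in declared:
                errors.append(f"multiple declaration of {name}")
            else:
                declared[name] = typ
        elif stmt[0] == "assign":
            _, name, value = stmt
            if name not in declared:
                errors.append(f"undeclared variable {name}")
            elif declared[name] == "int" and not isinstance(value, int):
                errors.append(
                    f"type mismatch: cannot assign {type(value).__name__} to int {name}"
                )
    return errors

program = [
    ("decl", "int", "a"),
    ("assign", "a", "hello"),   # type mismatch
    ("assign", "b", 1),         # undeclared variable
    ("decl", "int", "a"),       # multiple declaration
]
for message in check(program):
    print(message)
```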

4. Intermediate Code Generation

• Generates machine-independent intermediate code, typically three-address code that uses compiler-generated temporaries such as t1.

Example:

t1 = 10
a = t1
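As a rough sketch of how such code could be produced, the function below walks an expression tree and emits one three-address statement per node. The tuple representation `(op, left, right)` and the name `gen_tac` are assumptions for illustration; real compilers use richer intermediate forms.

```python
import itertools

def gen_tac(target, expr):
    """Emit three-address code for `target = expr`.

    expr is a nested tuple (op, left, right) or a leaf (a name or a number).
    """
    code = []
    temps = (f"t{i}" for i in itertools.count(1))   # t1, t2, t3, ...

    def walk(node):
        if not isinstance(node, tuple):             # leaf: copy into a fresh temporary
            t = next(temps)
            code.append(f"{t} = {node}")
            return t
        op, left, right = node
        l = walk(left)
        r = walk(right)
        t = next(temps)
        code.append(f"{t} = {l} {op} {r}")
        return t

    code.append(f"{target} = {walk(expr)}")
    return code

print(gen_tac("a", 10))                        # ['t1 = 10', 'a = t1']
print(gen_tac("a", ("+", "b", ("*", "c", 2))))
```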

5. Code Optimization

• Rewrites the intermediate code so that it runs faster or uses fewer resources, without changing its meaning.

Example:

a = 2 * 4

Optimized to:

a = 8
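This transformation is known as constant folding. A minimal sketch, assuming expressions are stored as hypothetical `(op, left, right)` tuples and only integer constants are folded:

```python
import operator

OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul}

def fold(node):
    """Replace constant sub-expressions by their value, bottom-up."""
    if not isinstance(node, tuple):
        return node                       # a name or a constant: nothing to fold
    op, left, right = node
    left, right = fold(left), fold(right)
    if isinstance(left, int) and isinstance(right, int):
        return OPS[op](left, right)       # both operands known at compile time
    return (op, left, right)

print(fold(("*", 2, 4)))                  # 8
print(fold(("+", "x", ("*", 2, 4))))      # ('+', 'x', 8)
```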
6. Target Code Generation

• Generates machine-dependent code (assembly or machine code) for the target processor.

Summary of Language Processing Activities

Source Program
↓
Lexical Analysis
↓
Syntax Analysis
↓
Semantic Analysis
↓
Intermediate Code
↓
Optimization
↓
Target Code

1.3 Language Processing Tools

Some important tools used during language processing:

• Lexical analyzer generators → LEX
• Parser generators → YACC
• Code generators
• Debuggers
• Profilers

Each of these tools is summarised below.

1. Lexical Analyzer Generators → LEX

LEX is a tool used to automatically generate a lexical analyzer (scanner).

Meaning

A lexical analyzer generator produces a program that can recognize tokens such as keywords,
identifiers, operators, numbers, etc.

Role of LEX

• Takes regular expressions as input
• Generates a C program for the scanner
• The generated scanner:
  ◦ reads the source program
  ◦ groups characters into tokens
  ◦ sends tokens to the parser

Working idea
Input (patterns in LEX file)
↓
LEX tool
↓
Lexical analyzer program

Main features

• Easy specification of tokens
• Automatically handles pattern matching
• Works together with YACC

Example use

LEX is used to detect:

• identifiers
• keywords
• constants
• operators

2. Parser Generators → YACC

YACC (Yet Another Compiler Compiler) is a tool used to generate a syntax analyzer
(parser).

Meaning

A parser generator creates a parser automatically from a grammar specification.

Role of YACC

• Takes context-free grammar rules as input
• Produces a C program for the parser
• The parser checks whether the program follows grammar rules

Working idea
Grammar rules
↓
YACC
↓
Parser program

Important points

• YACC generates LALR(1) parsers
• Works with tokens generated by LEX
• Semantic actions attached to the grammar rules can build a parse tree or other syntax structure

Common use

LEX + YACC are commonly used together:

LEX → tokens
YACC → syntax checking

3. Code Generators
A code generator is a tool or compiler phase that produces the target code from the
intermediate representation.

Meaning

It converts:

Intermediate code → machine code / assembly code

Main responsibilities

• Select appropriate machine instructions
• Assign registers
• Produce efficient target code

Output of code generator

• Assembly code, or
• Object code

Importance

• Affects performance of final program
• Ensures correct mapping of operations to hardware

4. Debuggers
A debugger is a tool used to find and fix errors in a program.

Purpose

• Help programmers observe program execution
• Locate logical and runtime errors

Main functions

• Set breakpoints
• Execute program step by step
• Inspect variable values
• Trace program flow

Examples of errors handled

• wrong output
• runtime crashes
• incorrect logic

5. Profilers
A profiler is a performance analysis tool.

Meaning

It measures how a program behaves during execution.

Main objectives

• Find time-consuming functions
• Identify performance bottlenecks
• Measure memory usage

Typical information given by a profiler

• function execution time
• number of function calls
• CPU usage
• memory consumption

Importance

Profilers are mainly used for:

• program optimization
• improving execution speed

2. Symbol Table

2.1 Definition

A symbol table is a data structure used by a compiler or assembler to store information about
identifiers.

2.2 Information Stored in a Symbol Table

For each symbol (identifier), it stores:

• name
• type
• scope
• memory location (address)
• size
• parameter information (for functions)

2.3 Why Symbol Table is Needed

• To check declarations
• To support type checking
• To generate correct addresses
• To support scope handling

2.4 Operations on Symbol Table

Main operations:

• insert(symbol, information)
• lookup(symbol)
• update(symbol)

2.5 Example

Name    Type        Scope     Address
a       int         local     1000
sum     function    global    2000
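The insert, lookup, and update operations can be sketched on top of a dictionary, using the two entries from the example table. The class name `SymbolTable` and the keyword-argument layout are assumptions for illustration; scope handling is reduced to a plain attribute here.

```python
class SymbolTable:
    """Minimal symbol table: one flat dictionary keyed by symbol name."""

    def __init__(self):
        self._entries = {}

    def insert(self, name, **info):
        if name in self._entries:
            raise KeyError(f"duplicate symbol: {name}")
        self._entries[name] = dict(info)

    def lookup(self, name):
        return self._entries.get(name)      # None if the symbol is not declared

    def update(self, name, **info):
        if name not in self._entries:
            raise KeyError(f"undeclared symbol: {name}")
        self._entries[name].update(info)

symtab = SymbolTable()
symtab.insert("a", type="int", scope="local", address=1000)
symtab.insert("sum", type="function", scope="global", address=2000)
symtab.update("a", address=1004)            # hypothetical relocation of a
print(symtab.lookup("a"))
print(symtab.lookup("b"))                   # None
```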

3. Search and Allocation Data Structures

These data structures are used mainly to implement:

• symbol tables
• literal tables
• label tables

3.1 Search Data Structures

Used for fast lookup.

(a) Linear List

• Simple list of symbols.
• Search time is O(n).

(b) Binary Search Tree

• Faster search than a linear list.
• Average search time is O(log n); an unbalanced tree degrades to O(n).

(c) Hash Table (most commonly used)

• Key → hash function → table index
• Very fast lookup.
• Average time is O(1).
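The key → hash function → table index path can be sketched with a small chained hash table. The hash function (sum of character codes) and the table size of 8 are deliberately simple assumptions; production symbol tables use stronger hash functions and resize as they fill.

```python
class HashTable:
    """Hash table with separate chaining: each bucket is a list of (key, value)."""

    def __init__(self, size=8):
        self.buckets = [[] for _ in range(size)]

    def _index(self, key):
        # Illustrative hash: sum of character codes modulo the table size.
        return sum(ord(c) for c in key) % len(self.buckets)

    def put(self, key, value):
        bucket = self.buckets[self._index(key)]
        for i, (k, _) in enumerate(bucket):
            if k == key:                     # key already present: overwrite
                bucket[i] = (key, value)
                return
        bucket.append((key, value))

    def get(self, key):
        for k, v in self.buckets[self._index(key)]:
            if k == key:
                return v
        return None

table = HashTable()
table.put("ab", 1000)
table.put("ba", 2000)    # same character sum as "ab", so it shares a bucket
print(table.get("ab"), table.get("ba"), table.get("zz"))   # 1000 2000 None
```

Lookup stays O(1) on average only while the chains remain short, which is why the choice of hash function matters.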

3.2 Allocation Data Structures

Used for memory allocation and management.

(a) Stack Allocation

• Used for local variables and parameters.
• Follows LIFO.

(b) Heap Allocation

• Used for dynamic memory.
• Memory can be allocated and freed in any order.

(c) Free List

• Keeps track of free memory blocks.
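A free list can be sketched as a sorted list of (start, length) blocks. The first-fit policy and the coalescing of adjacent blocks shown here are one common choice among several; the class name `FreeList` and the 100-unit memory are assumptions for the example.

```python
class FreeList:
    """First-fit allocator over a hypothetical memory of `size` units."""

    def __init__(self, size):
        self.free = [(0, size)]              # (start, length) of each free block

    def alloc(self, n):
        for i, (start, length) in enumerate(self.free):
            if length >= n:                  # first block that is large enough
                if length == n:
                    del self.free[i]
                else:
                    self.free[i] = (start + n, length - n)
                return start
        return None                          # no block large enough

    def release(self, start, n):
        self.free.append((start, n))
        self.free.sort()
        merged = [self.free[0]]
        for s, l in self.free[1:]:
            prev_start, prev_len = merged[-1]
            if prev_start + prev_len == s:   # adjacent blocks: coalesce
                merged[-1] = (prev_start, prev_len + l)
            else:
                merged.append((s, l))
        self.free = merged

mem = FreeList(100)
a = mem.alloc(30)        # returns 0
b = mem.alloc(20)        # returns 30
mem.release(a, 30)
print(mem.free)          # [(0, 30), (50, 50)]
mem.release(b, 20)
print(mem.free)          # [(0, 100)]
```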

4. LEX and YACC – Overview

4.1 LEX
LEX is a lexical analyzer generator.

Purpose

• Automatically generates a lexical analyzer.

Input

• Token specifications using regular expressions.

Output

• C program implementing yylex().

Example rule in LEX

[0-9]+ { return NUMBER; }

Role of LEX

• Converts input stream into tokens.
• Passes tokens to the parser.

4.2 YACC
YACC stands for Yet Another Compiler Compiler.

Purpose

• Generates a parser.

Input

• Grammar rules written in BNF-like format.

Output

• C program implementing yyparse().

Example grammar
E : E '+' T
| T
;
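To see what this rule accepts, here is a hand-written sketch in Python. It is not the table-driven LALR parser that YACC generates: the left-recursive rule is rewritten as a loop, T ('+' T)*, and T is assumed to be a single NUMBER token because the grammar above does not define it. Returning the sum stands in for a semantic action.

```python
def parse_E(tokens):
    """Parse  E : E '+' T | T  with T assumed to be a NUMBER token."""
    pos = 0

    def parse_T():
        nonlocal pos
        if pos < len(tokens) and tokens[pos].isdigit():
            value = int(tokens[pos])
            pos += 1
            return value
        raise SyntaxError(f"NUMBER expected at token {pos}")

    value = parse_T()
    while pos < len(tokens) and tokens[pos] == "+":
        pos += 1                 # consume '+'
        value += parse_T()       # left-associative, as in the grammar
    if pos != len(tokens):
        raise SyntaxError(f"unexpected token {tokens[pos]!r}")
    return value

print(parse_E(["1", "+", "2", "+", "3"]))   # 6
```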

Role of YACC

• Performs syntax analysis.
• Can build a parse tree through semantic actions attached to the rules.

Relationship between LEX and YACC

Input Program
↓
LEX → produces tokens
↓
YACC → checks grammar and structure

5. Assemblers

5.1 Elements of Assembly Language Programming

Main elements:

(1) Mnemonics

Symbolic names of machine instructions.

Examples:

• MOV
• ADD
• SUB
• JMP

(2) Operands

Registers, memory locations, or constants.

Example:

MOV AX, BX

AX and BX are operands.

(3) Labels

Used to name addresses.

Example:
LOOP1:
ADD AX, BX

(4) Directives (Pseudo-ops)

They guide the assembler and are not translated into machine instructions, although data directives such as DB and DW do place data bytes in the object code.

Examples:

• DB
• DW
• EQU
• ORG
• END

(5) Comments

Used for readability.

5.2 Assembler Design

An assembler mainly performs the following functions:

1. Scan source program

2. Maintain symbol table
3. Translate mnemonics into opcodes
4. Assign addresses to labels
5. Produce object code

General structure of an assembler:

Source Program
↓
Lexical processing
↓
Symbol table handling
↓
Instruction translation
↓
Object code generation

5.3 Types of Assemblers

5.3.1 One-Pass Assembler

Definition

A one-pass assembler processes the source program only once.

Problem

Forward references cannot be resolved immediately.

Example:

JMP NEXT
...
NEXT: MOV AX, BX

Technique used

• Backpatching
• Forward reference lists

Advantages

• Faster
• Less memory usage

Disadvantages

• More complex
• Difficult symbol handling

5.3.2 Two-Pass Assembler

Definition

The source program is processed two times.

Pass 1

• Assign addresses
• Build symbol table
• No object code is generated

Pass 2

• Generate object code
• Resolve all symbol references
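The two passes can be sketched for a toy instruction format. Everything specific here is an assumption made for illustration: the opcode values in `OPTAB` are invented and are not x86 encodings, every instruction is two units long (opcode, operand), and an operand is either a label or a decimal number.

```python
OPTAB = {"LOAD": 1, "ADD": 2, "JMP": 3, "HALT": 4}   # hypothetical opcodes
INSTR_LEN = 2                                         # toy format: [opcode, operand]

def split_label(line):
    """Split 'LABEL: MNEMONIC OPERAND' into (label or None, [fields])."""
    label, _, rest = line.rpartition(":")
    return label.strip() or None, rest.split()

def two_pass(lines, start=0):
    # Pass 1: assign addresses and build the symbol table; no code is emitted.
    symtab, locctr = {}, start
    for line in lines:
        label, fields = split_label(line)
        if label:
            if label in symtab:
                raise ValueError(f"duplicate symbol {label}")
            symtab[label] = locctr
        if fields:
            locctr += INSTR_LEN

    # Pass 2: generate code; every label is already in the symbol table,
    # so forward references need no special handling.
    code = []
    for line in lines:
        _, fields = split_label(line)
        if not fields:
            continue
        mnemonic = fields[0]
        operand = fields[1] if len(fields) > 1 else "0"
        # int() raises ValueError for a symbol that was never defined.
        value = symtab[operand] if operand in symtab else int(operand)
        code += [OPTAB[mnemonic], value]
    return symtab, code

program = ["JMP NEXT", "ADD 5", "NEXT: LOAD 7", "HALT"]
print(two_pass(program))   # ({'NEXT': 4}, [3, 4, 2, 5, 1, 7, 4, 0])
```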

Advantages

• Simple design
• Easy handling of forward references

Disadvantages

• Requires two scans
• Slightly slower

Comparison

Feature                       One-Pass     Two-Pass
Number of scans               1            2
Forward reference handling    Complex      Easy
Implementation                Difficult    Simple
Speed                         Faster       Slower

6. x86 Single-Pass Assembler Algorithm

This is an algorithmic view of a single-pass assembler for an x86-like architecture.

Data Structures Used

• SYMTAB: symbol table
• FWDREF list: list of unresolved references
• LOCCTR: location counter
• OPTAB: opcode table

Algorithm: x86 Single-Pass Assembler

Step 1

Initialize:

• LOCCTR = starting address
• SYMTAB = empty
• FWDREF = empty

Step 2

Read the next source statement.

Step 3

If statement contains a label:

• If label is not in SYMTAB, or is in SYMTAB with an undefined address:
  ◦ Enter label with current LOCCTR
  ◦ Resolve any pending forward references for this label
• Else:
  ◦ Report duplicate symbol error
Step 4

If statement is an instruction:

• Search mnemonic in OPTAB
• Generate opcode
• For each operand:
  ◦ If symbol is in SYMTAB with a defined address:
    ▪ Use its address
  ◦ Else:
    ▪ Create an entry in SYMTAB with undefined address, if none exists yet
    ▪ Add current object code position to FWDREF list

Step 5

If statement is a directive:

• Process directive
• Update LOCCTR accordingly

Step 6

Store generated machine code with placeholder zeros for unresolved addresses.

Step 7

Update LOCCTR by instruction length.

Step 8

Repeat steps 2 to 7 until END statement is found.

Step 9

After end of program:

• If any unresolved forward references remain:
  ◦ Report undefined symbol errors

Key idea of Single-Pass x86 assembler

• Address resolution and code generation are done together
• Forward references are handled using:
  ◦ lists of incomplete address fields
  ◦ backpatching when the label is defined
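The algorithm can be sketched in a few lines for a toy instruction format. The assumptions are loud ones: the opcode values in `OPTAB` are invented and are not real x86 encodings, every instruction is two units long (opcode, operand), directives are omitted, and undefined labels are kept only in the `fwdref` dictionary rather than as undefined entries in SYMTAB.

```python
OPTAB = {"LOAD": 1, "ADD": 2, "JMP": 3, "HALT": 4}   # hypothetical opcodes

def one_pass(lines, start=0):
    symtab = {}     # label -> address
    fwdref = {}     # undefined label -> code positions waiting for its address
    code = []       # LOCCTR is start + len(code)
    for line in lines:
        label, _, rest = line.rpartition(":")
        label, fields = label.strip(), rest.split()
        if label:
            if label in symtab:
                raise ValueError(f"duplicate symbol {label}")
            symtab[label] = start + len(code)
            for pos in fwdref.pop(label, []):        # backpatch pending references
                code[pos] = symtab[label]
        if not fields:
            continue
        code.append(OPTAB[fields[0]])
        operand = fields[1] if len(fields) > 1 else "0"
        if operand.isdigit():
            code.append(int(operand))
        elif operand in symtab:
            code.append(symtab[operand])
        else:
            fwdref.setdefault(operand, []).append(len(code))
            code.append(0)                           # placeholder until the label is defined
    if fwdref:
        raise ValueError(f"undefined symbols: {sorted(fwdref)}")
    return symtab, code

program = ["JMP NEXT", "ADD 5", "NEXT: LOAD 7", "HALT"]
print(one_pass(program))   # ({'NEXT': 4}, [3, 4, 2, 5, 1, 7, 4, 0])
```

When `JMP NEXT` is read, `NEXT` is still unknown, so a zero is stored and position 1 is remembered; the zero is overwritten with 4 as soon as the label `NEXT` is reached.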
