DESKTOP

Algorithms for Compiler Design: PREDICTIVE PARSING ERROR RECOVERY

7/24/2010 8:06:00 PM

9.6 PREDICTIVE PARSING ERROR RECOVERY
An error is detected during predictive parsing when the terminal on the top of the stack does not match the next input symbol, or when nonterminal A is on top of the stack and a is the next input symbol. M[A,a] is the error entry used to for recovery. Panic mode recovery can be used to recover from an error detected by the LL parser. The effectiveness of panic mode recovery depends on the choice of the synchronizing token. Several heuristics can be used when selecting the synchronizing token in order to ensure quick recovery from common errors:

All the symbols in the FOLLOW(A) must be kept in the set of synchronizing tokens, because if we skip until an a symbol in FOLLOW(A) is read, and we pop A from the stack, it is likely that the parsing can continue.

Since the syntactic structure of a language is very often hierarchical, we add the symbols that begin higher constructs to the synchronizing set of lower constructs. For example, we add keywords to the synchronizing sets of nonterminals that generate expressions.

We also add the symbols in FIRST(A) to the synchronizing set of nonterminal A. This provides for a resumption of parsing according to A if a symbol in FIRST(A) appears in the input.

A derivation by an ∈-production can be used as a default. Error detection will be postponed, but the error will still be captured. This method reduces the number of nonterminals that must be considered during error recovery.

Note�
Another method of error recovery that can be implemented is called "phrase level recovery". In phrase level recovery, each error entry in the LL parsing table is examined, and based on language usage, an appropriate error-recovery procedure is constructed. For example, to recover from a construct error that starts with an operator, the error-recovery routine will insert an imaginary id into the input. Then, if some state terminal symbols are derived using an ∈-production, the error entries in that state are replaced by the derivation using the imaginary-id ∈-production. This has the effect of postponing error detection.

A phrase level error-recovery implementation for an LR parser is shown in Tables 1 and 2. The parsing table is constructed for the following grammar:

Table 1: LR Parsing Table

�
id

+

*

$

E

E → TE₁
� � �

T

T → FT₁
� � �

F

F → id
� � �

E₁
�
E₁→ +TE₁
�
E₁→ ∈

T₁
�
T₁ → ∈

T₁ → * FT₁

T₁ → ∈

id

pop
� � �

+
�
pop
� �

*
� �
pop
�

$
� � �
accept

The modified table is shown in Table 2. Routine e₁, when called, pushes an imaginary id into the input; and routine e₂, when called, removes all the remaining symbols from the input.

Table 2: Phrase Level Error-Recovery Implementation

�
id

+

*

$

E

E → TE₁

e₁

e₁

e₁

T

T → FT₁

e₁

e₁

e₁

�
F → id

e₁

e₁

e₁

E₁

E₁→ ∈

E₁→ +TE₁

E₁→ ∈

E₁→ ∈

T₁

T₁→ ∈

T₁→ ∈

T₁ → FT₁

T₁→ ∈

id

pop
� � �

+
�
pop
� �

� �
pop
�

$

e₂

e₂

e₂

accept

For example, if we trace the behavior of the parser shown in Table 2 for the input id + id $:

Stack Contents

Unspent Input

Moves

$E

id+id$

derive using E →TE₁

$E₁T

id+id$

derive using T →FT₁

$E₁T₁F

id+id$

derive using F → id

$E₁T₁id

id+id$

pop

$E₁T₁

+id$

derive using T₁→ ∈

$E₁

+id$

derive using E₁→ +TE₁

$E₁T+

+id$

pop

$E₁T

id$

call error routine e1

$E₁T

idid$

derive using T → FT₁

(imaginary id is pushed by e₁)

$E₁T₁F

idid$

derive using F → id

$E₁T₁id

idid$

pop

$E₁T₁

id$

derive using T₁ → FT₁

$E₁T₁F

id$

derive using F → id

$E₁T₁id

id$

pop

$E₁T₁

$

derive using T₁ → ∈

$E₁

$

derive using E₁ → ∈

$

$

accept

Similarly, if we trace the behavior for the input id idid $:

Stack Contents

Unspent Input

Moves

$E

id idid$

derive using E → TE₁

$E₁T

id+id$

derive using T → FT₁

$E₁T₁F

id+id$

derive using F → id

$E₁T₁id

id+id$

pop

$E₁T₁

idid$

derive using T₁→ ∈

$E₁

id*id$

derive using E₁→ ∈

$

**idid$

call error routine e₂

(idid$ is removed by e₂)**

$

$

accept

Other

Algorithms for Compiler Design: LOOP OPTIMIZATION

Algorithms for Compiler Design: ELIMINATING INDUCTION VARIABLES

Algorithms for Compiler Design: ELIMINATING LOCAL COMMON SUBEXPRESSIONS

Algorithms for Compiler Design:

Algorithms for Compiler Design

Algorithms for Compiler Design: THE MACHINE MODEL

Algorithms for Compiler Design: STRAIGHTFORWARD CODE GENERATION

Algorithms for Compiler Design: USING DAG FOR CODE GENERATION

Algorithms for Compiler Design: USING ALGEBRAIC PROPERTIES TO REDUCE THE REGISTER REQUIREMENT

Algorithms for Compiler Design: PEEPHOLE OPTIMIZATION

Algorithms for Compiler Design: IMPLEMENTATION OF THE TRANSLATIONS SPECIFIED BY SYNTAX-DIRECTED DEFINITIONS

Algorithms for Compiler Design: L-ATTRIBUTED DEFINITIONS

Algorithms for Compiler Design: SYNTAX-DIRECTED TRANSLATION SCHEMES

Algorithms for Compiler Design: INTERMEDIATE CODE GENERATION

Algorithms for Compiler Design: REPRESENTING THREE-ADDRESS STATEMENTS

Algorithms for Compiler Design: SYNTAX-DIRECTED TRANSLATION SCHEMES TO SPECIFY THE TRANSLATION OF VARIOUS PROGRAMMING LANGUAGE CONSTRUCTS

Algorithms for Compiler Design: THE ARRAY REFERENCE

Algorithms for Compiler Design: SWITCH/CASE

EXAMPLES for Syntax-Directed Definitions and Translations

What is a cross-compiler?

Top 10

- Understanding and Using Windows Server 2008 R2 UNIX Integration Components (part 1)

- Combine Multiple Events into a Single Event

- Connect-Back Shellcode

- Windows Server 2008 : Transport-Level Security - Active Directory Rights Management Services

- IIS 7.0 : Performance and Tuning - Configuring for Performance

- Troubleshooting and Testing Network Settings

- Network Programming with Windows Sockets : A Socket-Based Client

- Exchange Server 2010 server roles (part 3) - Edge Transport Server role

- IIS 7.0 : Implementing Access Control - IP and Domain Restrictions

- Using SharePoint Central Administration for Backup and Restore

Most View

- Windows Azure : Static reference data (part 2) - Performance disadvantages of a chatty interface & Caching static data

- How an Access Control List Is Used

- Working with the Automated Help System

- The Basics of the Offline Application Cache

- Windows 7 : Windows Driver Foundation Architecture (part 3) - Driver Frameworks

- Reporting Services with SQL Azure : Deploying the Report & Creating a Subreport

- Exchange Server 2007 : Configure the Client Access Server - Enable POP3 and IMAP4

- SQL Server 2008 : Managing Query Performance - Finding Similar Queries

- Leveraging and Optimizing Search in SharePoint 2010 : Uninstalling FAST Search Server 2010 for SharePoint

- Leverage and Locate Controls and Classes on Silverlight 4

- Advanced ASP.NET : Data-Access Components (part 2) - Using the Data-Access Component

- iPhone Application Developmen : Using the View-Based Application Template (part 1)

- Programming the Mobile Web : WebKit CSS Extensions (part 2) - Reflection Effects & Masked Images

- Getting the Most Out of the Microsoft Outlook Client : Implementing Outlook Anywhere

- SQL Server 2008 : Auditing SQL Server - Creating Server Audit Specifications

- Server-Side Browser Detection and Content Delivery : Mobile Detection (part 2) - Detecting the Context

- Working with Disks, Partitions, and Volumes in Vista

- Blind SQL Injection Exploitation : Using Time-Based Techniques

- Programming the Mobile Web : Mobilizing WordPress and Other CMSs

- SQL Server 2008 : Explaining XML - XML Schema

site
stats