Compiler Construction Using Java, JavaCC, and Yacc - Anthony J. Dos Reis

Compiler Construction Using Java, JavaCC, and Yacc

Buch | Hardcover
664 Seiten
2012
John Wiley & Sons Inc (Verlag)
978-0-470-94959-7 (ISBN)
114,44 inkl. MwSt
* Strong coverage of formal languages. The target languages (one is stack-oriented like Java's bytecode; the other is register oriented) are very easy to learn but are sufficiently powerful to support advanced compiler projects * Provides numerous, well-defined projects along with test cases.
Broad in scope, involving theory, the application of that theory, and programming technology, compiler construction is a moving target, with constant advances in compiler technology taking place. Today, a renewed focus on do-it-yourself programming makes a quality textbook on compilers, that both students and instructors will enjoy using, of even more vital importance. This book covers every topic essential to learning compilers from the ground up and is accompanied by a powerful and flexible software package for evaluating projects, as well as several tutorials, well-defined projects, and test cases.

ANTHONY J. DOS REIS is Associate Professor of Computer Science at the State University of New York at New Paltz. Before becoming a professor, Dr. Dos Reis worked at IBM as a systems programmer, creating IBM operating systems and compilers. His teaching interests include computer engineering, program translation, Java, and formal languages.

Preface xv

Chapter 1 Strings, Languages, and Compilers 1

1.1 Introduction 1

1.2 Basic Language Concepts 1

1.3 Basic Compiler Concepts 3

1.4 Basic Set Theory 4

1.5 Null String 6

1.6 Concatenation 7

1.7 Exponent Notation 7

1.8 Star Operator 8

1.9 Concatenation of Sets of Strings 9

1.10 Plus Operator 11

1.11 Question Mark Operator 11

1.12 Shorthand Notation for a Set Containing a Single String 12

1.13 Operator Precedence 12

1.14 Regular Expressions 13

1.15 Limitations of Regular Expressions 15

Problems 16

Chapter 2 Context-Free Grammars, Part 1 19

2.1 Introduction 19

2.2 What is a Context-Free Grammar? 20

2.3 Derivations Using a Context-Free Grammar 21

2.4 Language Defined by a Context-Free Grammar 23

2.5 Different Ways of Representing Contet-Free Grammars 25

2.6 Some Simple Grammars 26

2.7 Techniques for Generating Languages with Context-Free Grammars 29

2.8 Regular and Right Linear Grammars 35

2.9 Counting with Regular Grammars 37

2.0 Grammars for Lists 39

2.10 An Important Language that is Not Context Free 44

Problems 45

Chapter 3 Context-Free Grammars, Part 2 49

3.1 Introduction 49

3.2 Parse Trees 49

3.3 Leftmost and Rightmost Derivations 51

3.4 Substitution 52

3.5 Ambiguous Grammars 54

3.6 Determining Nullable Nonterminals 59

3.7 Eliminating Lambda Productions 60

3.8 Eliminating Unit Productions 64

3.9 Eliminating Useless Nonterminals 66

3.10 Recursion Conversions 71

3.11 Adding the Null String to a Language 76

Problems 77

Chapter 4 Context-Free Grammars, Part 3 83

4.1 Introduction 83

4.2 Grammars for Arithmetic Expressions 83

4.3 Specifying Associativity and Precedence in Grammars 90

4.4 Backus-Naur Form 92

4.5 Syntax Diagrams 94

4.6 Abstract Syntax Trees and Three-Address Code 96

4.7 Noncontracting Grammars 97

4.8 Essentially Noncontracting Grammars 97

4.9 Converting a Context-Free Grammar to an Essentially Noncontracting Grammar 98

4.10 Pumping Property of Context-Free Languages 101

Problems 104

Chapter 5 Chomsky’s Hierarchy 107

5.1 Introduction 107

5.2 Context-Sensitive Productions 107

5.3 Context-Sensitive Grammars no

5.4 Unrestricted Grammars 111

Problems 112

Chapter 6 Top-Down Parsing 115

6.1 Introduction 115

6.2 Top-Down Construction of a Parse Tree 115

6.3 Parses that Fail 117

6.4 A Bad Grammar for Top-Down Parsing 118

6.5 Deterministic Parsers 119

6.6 A Parser that Uses a Stack 120

6.7 Table Representation of a Stack Parser 124

6.8 Handling Productions with Nonleading Terminal 126

6.9 Writing a Stack Parser in Java 127

Problems 134

Chapter 7 LL(1) Grammars 137

7.1 Introduction 137

7.2 FIRST Set of the Right Side of a Production 137

7.3 Determining Operation Sequences 140

7.4 Determining Selection Sets of Lambda Productions 142

7.5 Whatever-Follows-Left-Follows-Rightmost Rule 145

7.6 Selection Sets for Productions with Nullable Right Sides 147

7.7 Selection Sets Containing End-of-Input Symbol 149

7.8 A Stack Parser for a Grammar with Lambda Productions 152

7.9 Converting a Non-LL( 1) Grammar to an LL( 1) Grammar 153

7.10 Parsing with an Ambiguous Grammar 160

7.11 Computing FIRST and FOLLOW Sets 163

Problems 165

Chapter 8 Table-Driven Stack Parser 171

8.1 Introduction 171

8.2 Unifying the Operations of a Stack Parser 172

8.3 Implementing a Table-Driven Stack Parser 175

8.4 Improving Our Table-Driven Stack Parser 180

8.5 Parsers that are Not Deterministic—A Digression on Theory 181

Problems 183

Chapter 9 Recursive-Descent Parsing 185

9.1 Introduction 185

9.2 Simple Recursive-Descent Parser 185

9.3 Handling Lambda Productions 192

9.4 A Common Error 197

9.5 Java Code for Productions 198

9.6 Left Factoring in a Recursive-Descent Parser 199

9.7 Eliminating Tail Recursion 204

9.8 Translating the Star, Plus, and Question Mark Operators 108

9.9 Doing Things Backward 210

Problems 211

Chapter 10 Recursive-Descent Translation 215

10.1 introduction 215

10.2 A Simple Translation Grammar 215

10.3 Converting a Translation Grammar to Java Code 217

10.4 Specifications for a Translation Grammar 218

10.5 Passing Information During a Parse 231

10.6 L-Attributed Grammars 236

10.7 New Token Manager 238

10.8 Solving the Token Lookahead Problem 241

10.9 Code for the New Token Manager 241

10.10 Translation Grammar for Prefix Expression Compiler 253

10.11 An Interesting Use of Recursion 257

Problems 261

Chapter 11 Assembly Language 265

11.1 Introduction 265

11.2 Structure of the J1 Computer 265

11.3 Machine Language Instructions 266

11.4 Assembly Language Instructions 268

11.5 Pushing Characters 269

11.6 aout Instruction 270

11.7 Using Labels 270

11.8 Using the Assembler 272

11.9 stav Instruction 275

11.10 Compiling an Assignment Statement 277

11.11 Compiling print and printin 280

11.12 Outputting Strings 28,

11.13 Inputting Decimal Numbers 283

11.14 Entry Directive 284

11.15 More Assembly Language 285

Problems 285

Chapter 12 SI—A Simple Compiler 289

12.1 Introduction 289

12.2 The Source Language 289

12.3 Grammar for Source Language 290

12.4 The Target Language 291

12.5 Symbol Table 292

12.6 Code Generator 293

12.7 Token Class 293

12.8 Writing the Translation Grammar 294

12.9 Implementing the SI Compiler 299

12.10 Trying Out SI 315

12.11 Advice on Extending the SI Compiler 318

12.12 Specifications for S2 320

Problems 324

Chapter 13 JavaCC 331

13.1 Introduction 331

13.2 JavaCC Extended Regular Expressions 333

13.3 JavaCC Input File 337

13.4 Specifying Actions for Regular Expressions 344

13.5 JavaCC Input File for Slj 348

13.6 Files Produced by JavaCC 355

13.7 Using the Star and Plus Operators 359

13.8 Choice Points and the Lookahead Directive 362

13.9 JavaCC’s Choice Algorithm 367

13.10 Syntactic and Semantic Lookahead 371

13.11 Using JavaCC to Create a Token Manager Only 372

13.12 Using the Token Chain 373

13.13 Suppressing Warning Messages 377

Problems 387

Chapter 14 Building on S2 383

14.1 Introduction 383

14.2 Extending println and print 383

14.3 Cascaded Assignment Statement 388

14.4 Unary Plus and Minus 313

14.5 readint Statement 393

14.6 Controlling the Token Trace from the Command Line 395

14.7 Specifications for S3 396

Problems 396

Chapter 15 Compiling Control Structures 399

15.1 Introduction 399

15.2 while Statement 399

15.3 if Statement 403

15.4 do-while Statement 407

15.5 Range Checking of Numerical Constants 408

15.6 Handling Backslash-Quote in a String 410

15.7 Handling Backslash-Quote with JavaCC 411

15.8 Universal Blocks in JavaCC 416

15.9 Handling Strings that Span Lines 418

15.10 Handling Strings that Span Lines Using JavaCC 419

15.11 SPECIAL_TOKEN Block in JavaCC 422

15.12 Error Recovery 424

15.13 Error Recovery in JavaCC 429

15.14 Specifications for S4 430

Problems 431

Chapter 16 Compiling Programs in Functional Form 435

16.1 Introduction 435

16.2 Separate Assembly and Linking 435

16.3 Calling and Returning from Fuctions 439

16.4 Source Language for S5 443

16.5 Symbol Table for S5 445

16.6 Code Generator for S5 446

16.7 Translation Grammar forS5 447

16.8 Linking with a Library 457

16.9 Specifications for S5 458

16.10 Extending S5 458

Problems 461

Chapter 17 Finite Automata 465

17.1 Introduction 465

17.2 Deterministic Finite Automata 466

17.3 Converting a DFA to a Regular Expression 468

17.4 Java Code for a DFA 472

17.5 Nondeterministic Finite Automata 474

17.6 Using an NFA as an Algorithm 476

17.7 Converting an NFA to a DFA with the Subset Algorithm 478

17.8 Converting a DFA to a Regular Grammar 479

17.9 Converting a Regular Grammar to an NFA 482 

17.10 Converting a Regular Expression to an NF A 484

17.11 Finding the Minimal DFA 488

17.12 Pumping Property of Regular Languages 493

Problems 495

Chapter 18 Capstone Project: Implementing Grep Using Compiler Technology 499

18.1 Introduction 499

18.2 Regular Expressions for Our Grep Program 501

18.3 Token Manager for Regular Expression 501

18.4 Grammar for Regular Expressions 503

18.5 Target Language for Our Regular Expression Compiler 503

18.6 Using an NFA for Pattern Matching 508

Problems 513

Chapter 19 Compiling to a Register-Oriented Architecture 515

19.1 Introduction 515

19.2 Using the Register Instruction Set 516

19.3 Modifications to the Symbol Table for R1 517

19.4 Parser and Code Generator for R1 518

Problems 526

Chapter 20 Optimization 529

20.1 Introduction 529

20.2 Using the ldc Instruction 531

20.3 Reusing Temporary Variables 532

20.4 Constant Folding 535

20.5 Register Allocation 537

20.6 Peephole Optimization 540

Problems 543

Chapter 21 Interpreters 547

21.1 Introduction 547

21.2 Converting SI to 11 549

21.3 Interpreting Statements that Transfer Control 552

21.4 Implementing the Compiler-Interpreter Cl 1 553

21.5 Advantages of Interpreters 558

Problems 559

Chapter 22 Bottom-Up Parsing 561

22.1 Introduction 561

22.2 Principles of Bottom-Up Parsing 561

22.3 Parsing with Right- versus Left-Recursive Grammars 565

22.4 Bottom-Up Parsing with an Ambiguous Grammar 566

22.5 Do-Not-Reduce Rule 569

22.6 SLR(l) Parsing 570

22.7 Shift/Reduce Conflicts 577

22.8 Reduce/Reduce Conflicts 579

22.9 LR(1) Parsing 579

Problems 584

Chapter 23 yacc 587

23.1 Introduction 587

23.2 yacc Input and Output Files 587

23.3 A Simple yacc-Generated Parser 588

23.4 Passing Values Using the Value Stack 596

23.5 Using yacc With an Ambiguous Grammar 602

23.6 Passing Values down the Parse Tree 604

23.7 Implementing Sly 606

23.8 jflex 612

Problems 618

Appendix A Stack Instruction Set 621

Appendix B Register Instruction Set 625

References 629

Index

Erscheint lt. Verlag 20.1.2012
Zusatzinfo Drawings: 10 B&W, 0 Color
Verlagsort New York
Sprache englisch
Maße 183 x 260 mm
Gewicht 1220 g
Themenwelt Mathematik / Informatik Informatik Programmiersprachen / -werkzeuge
Informatik Theorie / Studium Compilerbau
Technik Elektrotechnik / Energietechnik
ISBN-10 0-470-94959-7 / 0470949597
ISBN-13 978-0-470-94959-7 / 9780470949597
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich