A compiler for a simple dialect of Scheme written in OCaml and x86 Assembly. It supports tail-call optimization, boxing and lexical addressing.
This was built as a continous semester project divided to multiple parts, each part was an assignment. The compiler was almost entierly built by us. However the staff did gave us some base files. The files contain a parsing combinator library and they covered much of the run-time support implementation.
The files we were provided with are found in the course git repository.
To clone it, run git clone https://www.cs.bgu.ac.il/~comp211/compiler
.
Course assignments descriptions
The following sub-sections list the different parts we implemented (in order of the assignments):
-
Reader: A parser which translates the concrete syntax of scheme to an AST of S-Expressions.
Implemented inreader.ml
.
It uses the parsing combinator library the staff gave us, which is implemented inpc.ml
. -
Tag parser: A parser which translates the S-Expressions AST to a Scheme AST. It is also responsible for expanding macros.
Implemented intag-parser.ml
. -
Sementic analysis: Performs semantic analysis and converts the Scheme AST to an internal implementation specific AST to help tag nodes with specific optimization and semantic analysis information such as the lexical address of a variable.
The analyses it performs are: Lexical addressing, Tail call annotation, Boxing.
Implemented insemantic-analyser.ml
. -
Code generation and run-time support: Responsible for generating the assembly code of the program and provide the standard scheme library. This part is implemented by the rest of the files. An explaination for them can be found in the
Final project
assignment description undersection 3
.This part constructs the constants table and free variable table and then generates the code.
- The entry function of the compiler is found at
compiler.ml
and the high-level flow of the compiler is implemented in the functiongenerate_code_from_files
.
Under it, at the very buttom of the file, is the entry code for the file which calls the above function with the input file. - The hard parts of the code generation were handling and manipulating the stack for tail-call optimizations, optional arguments and the
apply
procedure (which uses a run-time dynamic tail-call).