Complect is a toy compiler developed in Node.js. It operates as a stream-based compiler, processing code as a continuous stream of data. Currently, Complect functions primarily as a transpiler, converting Complect code into JavaScript using the Babel Abstract Syntax Tree (AST). Future enhancements include support for switchable AST backends, enabling compilation to WebAssembly (WASM), custom interpreters, and other targets.
The initial implementation of this compiler was created to support a talk I presented at OpenJS World 2022. You can find the contents of this talk here. Slides Video
- Preprocessor: Transforms an input stream into a stream of preprocessing tokens.
- Tokenizer: Converts the stream of preprocessing tokens into a stream of tokens.
- Abstract Syntax Tree (AST): Generates an AST from the stream of tokens, currently utilizing Babel.
- Output: The Babel AST outputs code in JavaScript.
Complect utilizes a handcrafted parser and lexer to give developers fine-grained control over the compilation process. The parser in Complect is a top-down parser of the LL(1) type, chosen for its simplicity and efficiency in parsing.
-
Handcrafted Parser and Lexer: Complect's components are manually written, unlike tools-generated parsers and lexers. This allows for greater customization and optimization specific to the language's needs.
-
Top-Down Parsing (LL(1)): Complect's choice of a top-down parser of the LL(1) type is not arbitrary. This parsing method, which proceeds from left to right and produces a leftmost derivation with a single token lookahead, is known for its efficiency and simplicity. The grammar of the Complect language has been meticulously designed to facilitate this type of parsing, ensuring that parsing decisions can be made with minimal lookahead.
-
Stream-Based Compilation: The compiler processes code as a data stream, enhancing efficiency and memory usage for large inputs. This design also allows easier insertion of future optimization layers, though synchronous data passing may be considered for simplicity in a toy compiler.
-
Babel AST Integration: Complect currently generates an Abstract Syntax Tree (AST) using Babel, which outputs JavaScript code. This integration with Babel allows leveraging its robust ecosystem to further process and transform the generated JavaScript code.
-
Modular AST Generation: The architecture supports pluggable backends, enabling output to JavaScript, WebAssembly, or custom interpreters.
- Support for switchable AST backends, including Binaryen for WebAssembly output and a custom AST for interpretation
- Modular AST generation to enable pluggable backends for JavaScript, WebAssembly, or custom interpreters
- Explore compiler optimization passes
You can use Complect as a command-line tool. The entry point of the application is cli.js. It reads from the standard input, compiles the input, and writes the output to the standard output.
Tests are written using Node's built-in test module.
npm run test
Linting is done using ESLint.
npm run lint
Complect supports basic programming constructs including variables (make), assignments (assign for simple values, = for expressions), conditionals (if/endif), loops (as/repeat), and output (print). The language is evolving as the parser develops—expect changes.
Expressions are currently limited to binary operations; simple assignments (e.g., f = 5) are not yet supported and require assign. The grammar is designed for LL(1) parsing with minimal lookahead.
make i 1
make f 0
make b 0
make output ''
as i <= 16
f = i % 3
b = i % 5
assign output ''
if f == 0
output = output + 'Fizz'
endif
if b == 0
output = output + 'Buzz'
endif
if output == ''
output = output + i
endif
print output
i = i + 1
repeat
Complect is created by Jarrod Connolly.
MIT License
See COPYING for the full license text.
