N2T/README.md

194 lines
9.2 KiB
Markdown
Raw Normal View History

2020-11-15 19:55:42 +01:00
# N2T
This repository contains my solutions for Nand to Tetris.
[N2T](https://www.nand2tetris.org/) is a course that teaches how
to build a fully functioning general-purpose computer from first
principles. It starts with the hardware design, including its own
instruction set for which you program an assembler. You finish the
course with a two-step compiler that translates a high-level programming
language called Jack into assembly code.
Even though I have a solid understanding of microcontrollers based on my
experience in the embedded industry, I still learned a couple of new concepts
and improved my general knowledge of how a computer works. Going through all
the steps to build a computer starting from first principles helps to facilitate
a deep understanding. If you are a person that needs hands-on examples to grasp
a concept, you will love this course as much as I do.
In the following, I explain how to use my solutions, mainly if I want to revisit
this class later. If you haven't done the course yet, you should not look at the
answers, but you can try to play the game I wrote in the Jack programming
language. It is not a game but a 1D cellular automaton simulator.
## Project 1: Boolean Logic
In this project, we start building the basic gates required for the computer.
Our basic building block is a Nand gate from which we make 15 additional gates.
We implement the gates in a simplified hardware description language (HDL). To
test the HDL files, run `./tools/HardwareSimulator.sh` and open one of the test
scripts located in `./projects/01`.
## Project 2: Boolean Arithmetic
Based on the previous projects' basic gates, we build arithmetic chips:
a half-adder, a full adder, a 16-bit adder, and a 16-bit incrementer
based on the simple adders. Finally, we create the ALU (arithmetic-logic
unit), which is the heart of the CPU that we make in the later projects.
The ALU takes two 16-bit inputs and computes an output depending on a
couple of control-bits' status. To test the arithmetic chips, use the
hardware simulator equally to the first project.
## Project 3: Memory
Till this point, all gates are stateless. To build a computer, we need
memory. For this purpose, the course introduces a DFF (data flip-flop).
We can create a one-bit register and build up from there to a 16k chip
with the DFF.
It found it rewarding to build memory from first principles, but even more
rewarding was how easy Vim makes it to write the HDL code for these chips. The
following picture shows how I create a 64-bit register from 8-bit registers in a
matter of seconds.
2020-11-15 19:57:48 +01:00
![Create HDL for 64-bit RAM in Vim](gifs/vim_ram64.gif)
## Project 4: Machine Language Programming
In project 4, we get familiar with the Hack machine language - our
computer's assembly language. We write two basic projects: fill the
screen when the user presses a button, and a second one that
multiplicates two input arguments. To try the scripts, start the CPU
emulator by executing `./tools/CPUEmulator.sh`. You can then open the
script located in `./projects/04/fill` or `./projects/04/mult`.
I have created a Vim syntax file for the Hack assembly language. Copy the file
2020-11-15 20:42:49 +01:00
[hackasm](vim/hackasm.vim) from the vim directory into your Vim installation's syntax
directory. You can then set the filetype to hackasm by running `:set ft=hackasm`
from within Vim, and you should see highlighting as shown on the following
screenshot.
![Hack Asm syntax highlighting in Vim](./gifs/vim_hack_syntax.png)
## Project 5: Computer Architecture
In this project, we assemble all prior building blocks into the main memory,
CPU, and finally into the full hack computer. There are test scripts similar to
project one to three to validate that the computer works as designed. Seeing it
all come together is incredibly rewarding. Even if you stop the course at this
point, you have developed a great intuition of how a computer works.
## Project 6: The Assembler
With the computer working, we now need a way to assemble the hackasm code into
machine instructions. The purpose of this project is to build the assembler in
the programming language of our choice.
My Python version has 203 lines of code and relies on Python 3.8 features. We
can test the assembler by changing the directory to `./projects/06` and then
running `python assembler.py pong/*.asm`. Note that my assembler can only
translate individual asm-files and does not search a directory.
Load the resulting hack file into the CPU emulator to verify that the
assembler works correctly.
2020-11-15 22:52:02 +01:00
## Project 7/8: VM Translator
At this point, we have finished the hardware part of our computer and can
program it via the Hack assembly language. Of course, a higher-level programming
language is desirable to write programs more comfortably. Therefore, our goal
is to implement the high-level programming language *Jack* and compile it down
into Hack assembly.
We use a two-step compilation process. First, we translate the Jack code into VM
code, which we then translate further into Hack assembly. The goal of projects 7
and 8 is to implement the VM translator in a programming language of our choice.
Even though Python is my primary language, and I would have been faster using
it, I decided to opt for Rust, a language that I hadn't used before. Rust comes
with a flourishing eco-system and static typing that came in handy for this
project.
My VM translator ended up with just under 900 lines of code. A considerable part
of that is the template Hack assembly code that the translator fills with
registers and addresses. I would have opted for the Jinja2 templating language
in Python, but instead, I used in-code strings combined with "format!\". While
this looks ugly at times, it avoids external dependencies. The approach is okay
for this course, but a template engine would have been desirable for a
real-world project.
To run the VM translator, switch into the `./vm_translator directory` and
execute `cargo build`. You can then execute
`./vm_translator/target/debug/vm_translator` on one of the example projects in
`./projects/08`.
I was happy with my choice to learn Rust even though I spent too much time
figuring out trivial things like how to read a file.
## Project 9: High-Level Language
With the VM translator in place, it is now time to understand the high-level
Jack languaged by implementing a project. I decided to implement a 1D cellular
automation in `./projects/09/CellAutomaton1D`. To run the program first compile
the project by running `./tool/JackCompiler.sh ./projects/09/CellAutomaton1D`.
Then, start the Virtual Machine Emulator and open the project. Make sure to
select the directory `CellAutomaton1D` and not one of the VM files.
![A 1D cellular automaton on the Hack computer](gifs/automaton_drawing.png)
The program consists of two parts. First, you configure the initial population
using Vim key bindings, and then the program simulates the selected rule. You
have the option to pause and single step the simulation.
![The menu for the automaton](gifs/automaton_menu.png)
I have also implemented Vim syntax highlighting for the jack programming
language. The best thing is that Vim yells at you when you forgot one of
the keywords, like let or do. And you will forget them all the time.
Simple put [jack.vim](vim/jack.vim) into your Vim's syntax
directory, and you should be good to go.
![Jack syntax highlighting in Vim](gifs/vim_jack_syntax.png)
## Project 10/11: Compiler
With a solid understanding of the Jack programming language, we are now ready to
implement the compiler. I opted for Rust again and had a great time. The
directory `jack_analyzer` contains the tokenizer and parser, which outputs an
XML syntax tree. In project 11, we use the analyzer to generate VM code. The
resulting Rust project is in the directory `jack_compiler`. To compile the
project, change into the directory and execute `cargo build`.
You can now recompile the cellular automaton by running the following command:
~~~
./jack_compiler/target/debug/jack_compiler projects/09/CellAutomaton1D
~~~
I honestly was happy with my decision to opt for Rust again. While the project
would undoubtedly have been manageable in Python, static typing made it so much
easier to cover all cases for keywords and tokens. The compiler has a little
over 1000 lines of code. That sounds pretty decent.
![Number of source code lines for the compiler](gifs/compiler_loc.png)
## Project 12: The Operating System
With the compiler in place, the last step is implementing the OS functionality
that previously came from the Virtual Machine Emulator. In particular, there is
a library for string handling, memory management, array handling, math
(multiplication and division), and display routines to draw characters and
geometric forms.
I particularly enjoy seeing how much effort goes into simple routines like
multiplication and division. Of course, as a high-level programmer, you take
these operations for granted but getting into the details of how to implement
them is rewarding. I still learned a lot, even for this last project.
## Conclusion
Nand to Tetris is one of my favorite classes of all time. I highly recommend it
to everybody who works in computer science and feels like they only partially
understand how a computer works. I certainly got many new insights throughout
the course, and I learned a new programming language, Rust, which I should use
more often over Python.