In Part 4, we introduced named words and a
dictionary that allowed our VM to call subroutines by name. But there was still one major gap:
We couldn’t write Forth-like code.
You still had to manually build vec![Instruction::Push(5), ...] in Rust. That changes now.
In this post, we’ll add a hand-rolled parser that understands simple Forth syntax — including word definitions like
: square dup * ; — and emits instructions automatically.
By the end of this part, you’ll be able to write:
5 square : square dup * ;
And run it on your virtual machine with no hardcoded addresses or manual instruction building.
The Goal
Our parser will:
Tokenize simple Forth input
Track whether we’re inside a : definition
Split instructions into main and definitions
Insert a Halt after top-level code to prevent fall-through
Track the correct addresses for word definitions
Build a final list of instructions ready to run
Here’s the full updated code.
Parser Support
First of all, we define our Parser struct — this separates the parsing logic from the VM runtime.
The heart of the parser is the parse method. We split the input on whitespace and interpret each token in turn.
fnparse(&mutself,input:&str){letmuttokens=input.split_whitespace().peekable();letmutdefining:Option<String>=None;letmutbuffer:Vec<Instruction>=Vec::new();whileletSome(token)=tokens.next(){matchtoken{":"=>{// Beginning of a word definitionletname=tokens.next().expect("Expected word name after ':'");defining=Some(name.to_string());buffer.clear();}";"=>{// End of word definitionifletSome(name)=defining.take(){buffer.push(Instruction::Return);letaddr=self.main.len()+self.definitions.len()+1;// +1 for HALTself.dictionary.insert(name,addr);self.definitions.extend(buffer.drain(..));}else{panic!("Unexpected ';' outside of word definition");}}word=>{// Otherwise, parse an instructionletinstr=ifletOk(n)=word.parse::<i32>(){Instruction::Push(n)}else{matchword{"dup"=>Instruction::Dup,"drop"=>Instruction::Drop,"swap"=>Instruction::Swap,"over"=>Instruction::Over,"+"=>Instruction::Add,"*"=>Instruction::Mul,"depth"=>Instruction::Depth,_=>Instruction::CallWord(word.to_string()),}};// Add to appropriate sectionifdefining.is_some(){buffer.push(instr);}else{self.main.push(instr);}}}}}
Breakdown of the cases:
: begins a new named word definition.
; ends the definition, emits a Return, and stores the word’s starting address in the dictionary.
A number becomes a Push(n) instruction.
Built-in words like + and * become direct Instruction variants.
Any unknown token is assumed to be a user-defined word, and gets translated to CallWord("name").
Finalizing the Program
Once parsing is complete, we combine the main program with definitions — separated by a Halt to ensure we don’t fall through.
fnfinalize(self)->(Vec<Instruction>,HashMap<String,usize>){letmutinstructions=self.main;instructions.push(Instruction::Halt);// Halts after main programinstructions.extend(self.definitions);(instructions,self.dictionary)}
Main Program
Our main() function now uses the parser to construct the program from a Forth-style string.
In Part 3, we introduced control flow and
subroutines into our virtual machine. That gave us branching logic and reusable code blocks — a huge step forward.
But one core Forth idea is still missing: the ability to define and name new words.
In this part, we’ll add a dictionary to our VM and support calling reusable routines by name. This will allow us to
define Forth-style words like:
: square dup * ;
5 square
Let’s get into it.
The Concept of a “Word”
In Forth, a word is any named function — even built-ins like + and * are just words. User-defined words are
created using : and ;, and then they behave just like native instructions.
To support this, we need:
A dictionary mapping word names to addresses
An instruction that can call a word by name
A way to define new words at specific locations in the program
Extending the Instruction Set
First, we extend our enum to support calling named words:
This works just like Call, but performs a dictionary lookup first.
Example: Defining square
Here’s a complete program that defines and calls a square word:
letprogram=vec![Instruction::Push(5),Instruction::CallWord("square".to_string()),Instruction::Halt,// : square dup * ;Instruction::Dup,Instruction::Mul,Instruction::Return,];letmutvm=VM::new(program);vm.add_word("square",3);// definition starts at index 3vm.run();println!("Final stack: {:?}",vm.stack);
Output:
[25]
We’ve now made it possible to extend the language from within the language — a hallmark of Forth.
Optional: Parsing : square dup * ;
Currently we define words manually by inserting them into the dictionary, but in true Forth style we’d like to write:
: square dup * ;
5 square
To support that, we’ll need a minimal parser or macro-assembler to convert high-level Forth code into VM instructions.
This will be the focus of a future post.
Conclusion
In this post, we gave our VM the ability to define and call named words, which turns our stack machine into
something far more expressive and composable.
Our VM now supports:
Arithmetic
Stack manipulation
Control flow and subroutines
A dictionary of named routines
In Part 5, we’ll push even further — implementing a simple parser that can read actual Forth-like text,
resolve words, and build programs dynamically.
We’re getting very close to having a minimal, working Forth interpreter — and it’s all built in Rust.
In Part 2, we extended our Forth-style virtual
machine with a bunch of classic stack manipulation words — from OVER and ROT to 2DUP, 2SWAP, and more.
This gave our machine more expressive power, but it still lacked something crucial: control flow. In this part,
we’ll fix that.
By adding branching and subroutine support, we allow our VM to make decisions and reuse logic — two
foundational ideas in all real programming languages.
Control Flow in Stack Machines
Stack machines like Forth typically handle control flow through explicit instruction manipulation — that is,
jumping to new parts of the program and returning when done.
We’ll implement:
Instruction
Stack Effect
Description
IfZero(offset)
( n -- )
Jumps offset if top is zero
Jump(offset)
( -- )
Always jumps offset
Call(addr)
( -- )
Saves return address and jumps
Return
( -- )
Pops return address and jumps to it
These instructions give us the power to create conditionals and function-like routines.
In order to support our ability to call subroutines, our virtual machine needs another stack. This stack is in charge
of remembering where we came from so that we can return back to the correct place. The return stack is just another
piece of state management for the virtual machine:
We use continue here because we don’t want to execute the usual ip += 1 after a jump.
IFZERO
Conditionally jumps based on the top stack value.
Stack effect: ( n -- )
Instruction::IfZero(offset)=>{letcond=self.stack.pop().expect("Stack underflow on IFZERO");ifcond==0{self.ip=((self.ipasisize)+offset)asusize;continue;}}
If the value is zero, we adjust ip by the offset. If not, we let the loop continue as normal.
CALL
Pushes the current instruction pointer onto the return stack and jumps to the absolute address.
If you accidentally used Call(5), you’d be jumping to Return, skipping your routine completely — a classic off-by-one bug that’s easy to spot once you think in terms of instruction addresses.
Conclusion
With these new control flow instructions, we’ve unlocked a huge amount of expressive power. Our VM can now:
Execute conditional logic
Jump forwards and backwards
Encapsulate and reuse stack behavior with subroutines
In the next part, we’ll take the leap into defining named words, allowing us to simulate real Forth syntax like:
: square dup * ;
5 square
We’ll build a dictionary, wire up some simple parsing, and move closer to an interactive REPL.
In Part 1, we built the foundation of a
Forth-inspired stack-based virtual machine in Rust. It could execute arithmetic expressions using a simple data stack,
with support for operations like PUSH, ADD, MUL, and basic stack manipulation like DUP, DROP, and SWAP.
In this post, we’re going to extend our instruction set with a broader set of stack manipulation words, modeled
after standard Forth operations.
Why focus on stack operations? Because in a language like Forth, the stack is everything. Understanding and
manipulating it precisely is key to building complex programs — without variables, parentheses, or traditional
control structures.
Stack Operations in Forth
Let’s take a look at some of the classic stack words used in Forth and what they do:
Word
Stack Effect
Description
OVER
( a b -- a b a )
Copies the second value to the top
ROT
( a b c -- b c a )
Rotates the third value to the top
NIP
( a b -- b )
Removes the second item
TUCK
( a b -- b a b )
Duplicates the top item under the second
2DUP
( a b -- a b a b )
Duplicates the top two items
2DROP
( a b -- )
Drops the top two items
2SWAP
( a b c d -- c d a b )
Swaps the top two pairs
DEPTH
( -- n )
Pushes the current stack depth
These tiny instructions are the building blocks for everything from loops and conditionals to data structures and
control flow. Let’s implement them.
Extending the Instruction Set
First, we add new variants to our Instruction enum:
Each of these stack operations is implemented as a new match arm in our run() method. Here’s the complete method
with all new instructions included:
OVER
Copies the second value from the top and pushes it to the top.
Stack effect: ( a b -- a b a )
Instruction::Over=>{ifself.stack.len()<2{panic!("Stack underflow on OVER");}letval=self.stack[self.stack.len()-2];self.stack.push(val);}
This implementation uses indexing to read the second-to-top value without popping. It’s a clean operation that doesn’t disturb the existing stack order — a very common primitive in Forth.
ROT
Rotates the third item to the top of the stack.
Stack effect: ( a b c -- b c a )
Instruction::Rot=>{ifself.stack.len()<3{panic!("Stack underflow on ROT");}letc=self.stack.pop().unwrap();letb=self.stack.pop().unwrap();leta=self.stack.pop().unwrap();self.stack.push(b);self.stack.push(c);self.stack.push(a);}
We pop all three values, then push them back in rotated order. It’s a destructive operation — it reshuffles the top 3 items completely.
NIP
Removes the second item, leaving the top item alone.
Stack effect: ( a b -- b )
Instruction::Nip=>{ifself.stack.len()<2{panic!("Stack underflow on NIP");}lettop=self.stack.pop().unwrap();self.stack.pop();// discard secondself.stack.push(top);}
Here we temporarily save the top, discard the second, then restore the top. This is essentially “keep the top, ignore the rest.”
TUCK
Duplicates the top item and inserts it beneath the second.
Stack effect: ( a b -- b a b )
Instruction::Tuck=>{ifself.stack.len()<2{panic!("Stack underflow on TUCK");}lettop=*self.stack.last().unwrap();self.stack.insert(self.stack.len()-2,top);}
We avoid popping by using last() and insert(). Inserting at len() - 2 puts the copy just beneath the second item, preserving the original order.
2DUP
Duplicates the top two stack items.
Stack effect: ( a b -- a b a b )
Instruction::TwoDup=>{ifself.stack.len()<2{panic!("Stack underflow on 2DUP");}letlen=self.stack.len();self.stack.push(self.stack[len-2]);self.stack.push(self.stack[len-1]);}
We peek at the last two items and push duplicates in-place. It’s a straightforward double copy.
2DROP
Removes the top two items from the stack.
Stack effect: ( a b -- )
Instruction::TwoDrop=>{ifself.stack.len()<2{panic!("Stack underflow on 2DROP");}self.stack.pop();self.stack.pop();}
Just two pops in a row. Very simple and direct.
2SWAP
Swaps the top two pairs on the stack.
Stack effect: ( a b c d -- c d a b )
Instruction::TwoSwap=>{ifself.stack.len()<4{panic!("Stack underflow on 2SWAP");}letd=self.stack.pop().unwrap();letc=self.stack.pop().unwrap();letb=self.stack.pop().unwrap();leta=self.stack.pop().unwrap();self.stack.push(c);self.stack.push(d);self.stack.push(a);self.stack.push(b);}
This is the most complex so far. We destructure two pairs from the stack, then push them back in swapped order.
DEPTH
Pushes the number of elements currently on the stack.
That last 3 is the result of DEPTH, reporting how many values were on the stack before it was called.
Conclusion
With just a few additional instructions, our little VM has become much more expressive. We’ve added powerful new tools
to inspect, duplicate, and reorder values on the stack — just like a real Forth environment.
This kind of “stack choreography” might feel alien at first, but it’s deeply intuitive once you start thinking in
terms of data flow. It’s the perfect foundation for:
Most of the code we write is eventually executed by some kind of virtual machine — whether it’s the
JVM, the CLR,
or the many interpreters embedded in your browser or shell.
But how do these machines actually work?
To understand this from the ground up, we’re going to build a stack-based virtual machine — the simplest kind of VM
there is.
Stack Machines and Reverse Polish Notation
Unlike register-based architectures (like x86 or ARM),
stack-based machines use a single stack for passing arguments and storing temporary values. Instructions operate by
pushing and popping values to and from this stack.
This is not just a novelty — it’s how many early languages and calculators (like HP RPN calculators) worked. It
eliminates the need for parentheses and operator precedence, making parsing trivial.
Enter Forth
Forth is a language built entirely on this stack-based model. It’s terse, powerful, and famously minimalist. Every
Forth program is a sequence of words (commands) that manipulate the data stack. New words can be defined at runtime,
giving Forth a unique mix of interactivity and extensibility.
Despite being decades old, the design of Forth still holds up as a brilliant way to think about interpreters, minimal
systems, and direct computing.
Here’s an example of a simple Forth snippet:
: square ( n -- n^2 ) dup * ;
5 square
This defines a word square that duplicates the top of the stack and multiplies it by itself. Then it pushes 5 and
runs square, leaving 25 on the stack.
Why Rust?
Rust gives us a perfect platform for building this kind of system:
It’s low-level enough to model memory and data structures precisely.
It’s safe and expressive, letting us move fast without segmentation faults.
It encourages clean architecture and high-performance design.
Over the next few posts, we’ll build a small but functional Forth-inspired virtual machine in Rust. In this first part, we’ll get a simple instruction set up and running — enough to perform arithmetic with a data stack.
Let’s get started.
Defining a Machine
Let’s start by defining the fundamental pieces of our stack-based virtual machine.
Our machine is going to be made up of some basic building blocks such as:
An instruction set (things to execute)
A stack (to hold our state)
A machine structure (something to bundle our pieces together)
The Instruction Set
First, we need a basic set of instructions. These represent the operations our VM knows how to perform. We’ll keep
it simple to begin with:
That’s the start of what our machine will be capable of executing. As we move through this series, this enum will
gather more and more complex operations that we can execute. For now though, these basic arithmetic operations will
be a good start.
The Machine
Now let’s define the structure of the virtual machine itself. Our VM will contain:
A stack (Vec<i32>) for evaluating instructions
A program (Vec<Instruction>) which is just a list of instructions to run
An instruction pointer (ip) to keep track of where we are in the program
#[derive(Debug)]structVM{stack:Vec<i32>,program:Vec<Instruction>,ip:usize,// instruction pointer}implVM{fnnew(program:Vec<Instruction>)->Self{Self{stack:Vec::new(),program,ip:0,}}// We'll implement `run()` in the next section...}
This lays the foundation for our virtual machine. In the next section, we’ll bring it to life by writing the dispatch
loop that runs our program.
run(): Getting Things Done
Now that we have a structure for our VM, it’s time to give it life — with a run() function.
This will be our dispatch loop — the engine that drives our machine. It will:
Read the instruction at the current position (ip)
Execute it by manipulating the stack
Move to the next instruction
Halt when we encounter the Halt instruction
Let’s add this to our impl VM block:
fnrun(&mutself){whileself.ip<self.program.len(){match&self.program[self.ip]{Instruction::Push(value)=>{self.stack.push(*value);}Instruction::Add=>{letb=self.stack.pop().expect("Stack underflow on ADD");leta=self.stack.pop().expect("Stack underflow on ADD");self.stack.push(a+b);}Instruction::Mul=>{letb=self.stack.pop().expect("Stack underflow on MUL");leta=self.stack.pop().expect("Stack underflow on MUL");self.stack.push(a*b);}Instruction::Dup=>{lettop=*self.stack.last().expect("Stack underflow on DUP");self.stack.push(top);}Instruction::Drop=>{self.stack.pop().expect("Stack underflow on DROP");}Instruction::Swap=>{letb=self.stack.pop().expect("Stack underflow on SWAP");leta=self.stack.pop().expect("Stack underflow on SWAP");self.stack.push(b);self.stack.push(a);}Instruction::Halt=>break,}self.ip+=1;}}
This loop is dead simple — and that’s exactly what makes it elegant. There are no registers, no heap, no branches just
yet — just a list of instructions and a stack to evaluate them on.
The use of expect on each of our pop operations is a small insurance policy. This allows us to report out and
invalid state on the stack. If we’re already at the top of stack (TOS) then we can’t pop more values.
In future parts, we’ll introduce new instructions to handle control flow, user-defined words, and maybe even a return
stack — all inspired by Forth.
But before we get ahead of ourselves, let’s write a small program and run it.
Running
We don’t have a parser or compiler yet, so we need to write our Forth program directly inside the Rust code. This will
take the form of a vector of instructions:
If you squint a little, you’ll notice this is equivalent to the following Forth-style program:
2 3 + 4 *
This is exactly the kind of thing you’d see in a Reverse Polish or Forth-based environment — values and operations in
sequence, evaluated by a stack machine.
Now, let’s run our program and inspect the result:
If everything has gone to plan, you should see this output in your terminal:
[20]
Giving us the final answer of 20. That confirms our machine is working — it’s reading instructions, performing
arithmetic, and leaving the result on the stack. A tiny virtual computer, built from scratch.
Conclusion
We’ve built the foundation of a working virtual machine — one that can evaluate simple arithmetic using a stack, just
like a classic Forth system. It’s small, simple, and powerful enough to demonstrate key ideas behind interpreters,
instruction dispatch, and virtual machines.