
What is Sutra?

Sutra is a geometrically compiled language where logical operations over vector spaces are resolved at compile time into matrix multiplications. Source code parses, compiles, and executes — but the compilation target is not machine instructions. It is a sequence of tensor operations on vectors in a geometric substrate: matrix multiplies, elementwise multiplies, additions, cosines, softmax-weighted sums. Every value in a Sutra program is a vector — a coordinate in that substrate. Every operation moves the program through the substrate's geometry. The compiler's job is to take the program's surface logic — branches, loops, structured records — and resolve it down to a chain of matrix multiplications before the runtime ever touches a value.

What the compiler does

The Sutra compiler is a normal compiler in shape — lexer, parser, simplifier, validator, code generator. It reads a .su source file and emits a self-contained Python module. That module imports a small runtime class (_VSA) and calls into it for the language's primitives:

  • basis_vector("name"): embed the string through the substrate
  • bundle(a, b, ...): sum the vectors and L2-normalize
  • bind(role, filler): rotation binding, Q_role @ filler
  • unbind(role, record): inverse rotation, Q_role^T @ record
  • similarity(a, b): cosine similarity
  • argmax_cosine(query, [candidates]): nearest codebook entry
  • select([scores], [options]): softmax-weighted superposition
  • do_while, while_loop, iterative_loop, foreach_loop: declared loop functions; each cell tick is a substrate-resident RNN step

These are tensor operations. Bundle is a sum followed by an L2 normalization. Bind and unbind are matrix multiplies against orthogonal matrices. Similarity is a dot product. Argmax_cosine is a matrix-vector multiply followed by an argmax. Select is a softmax-weighted sum. A loop is iterated matrix-vector multiplication with a substrate-resident soft-halt check — see Loops for the declared-function surface.
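As a minimal NumPy sketch of these primitives — illustrative only, not the actual `_VSA` runtime, with a random orthogonal matrix standing in for a role's rotation:

```python
import numpy as np

rng = np.random.default_rng(0)
D = 768  # substrate dimensionality (matches the nomic-embed-text default)

def normalize(v):
    return v / np.linalg.norm(v)

def random_rotation(d, rng):
    # Random orthogonal matrix via QR; stands in for a role's Q_role.
    q, _ = np.linalg.qr(rng.standard_normal((d, d)))
    return q

def bundle(*vs):            # sum, then L2-normalize
    return normalize(np.sum(vs, axis=0))

def bind(Q, filler):        # rotation binding: Q_role @ filler
    return Q @ filler

def unbind(Q, record):      # inverse rotation: Q_role^T @ record
    return Q.T @ record

def similarity(a, b):       # cosine similarity
    return float(normalize(a) @ normalize(b))

def argmax_cosine(query, candidates):   # nearest codebook entry
    C = np.stack([normalize(c) for c in candidates])
    return int(np.argmax(C @ normalize(query)))

def select(scores, options):            # softmax-weighted superposition
    s = np.asarray(scores, dtype=float)
    w = np.exp(s - s.max())
    w = w / w.sum()
    return normalize(np.tensordot(w, np.stack(options), axes=1))
```

Because Q is orthogonal, `unbind(Q, bind(Q, f))` recovers `f` exactly, and unbinding a role out of a bundled record recovers the filler up to the noise contributed by the other bundled terms.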

The current default substrate is nomic-embed-text (768-dimensional vectors, mean-centered, served via Ollama). String literals in vector contexts auto-embed: vector v = "cat" is short for "embed the string 'cat' and bind the result to v." The runtime caches embeddings and batches Ollama round-trips at module init.
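The cache-and-batch idea can be sketched like this — hypothetical names throughout, with a deterministic hash-based embedder standing in for the batched Ollama round-trip:

```python
import hashlib
import numpy as np

D = 768

def _fake_embed_batch(strings):
    # Stand-in for one batched embedding round-trip: deterministic
    # pseudo-embeddings derived from a hash of each string.
    out = []
    for s in strings:
        seed = int.from_bytes(hashlib.sha256(s.encode()).digest()[:8], "big")
        v = np.random.default_rng(seed).standard_normal(D)
        out.append(v / np.linalg.norm(v))
    return out

class EmbedCache:
    def __init__(self, embed_batch=_fake_embed_batch):
        self._embed_batch = embed_batch
        self._cache = {}

    def warm(self, strings):
        # Module init: embed every string literal in one batched call.
        missing = [s for s in dict.fromkeys(strings) if s not in self._cache]
        if missing:
            self._cache.update(zip(missing, self._embed_batch(missing)))

    def __getitem__(self, s):
        # Cache hit returns the stored vector; a miss embeds on demand.
        if s not in self._cache:
            self.warm([s])
        return self._cache[s]
```

Warming the cache once at module init is what keeps the per-call path free of network round-trips.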

Why no host-side control flow

Sutra has functions, conditionals, and loops in its surface syntax, but none of them lower to a Python if or while on data values:

  • Conditionals lower to a softmax-weighted sum across all options. All branches contribute to the result; the weights decide how much. The commitment to a discrete answer happens at the final argmax_cosine or map lookup at the program's edge.
  • Loops are declared as first-class functions whose parameters are the recurrent state and whose body is a single cell evaluation. The four kinds (do_while / while_loop / iterative_loop / foreach_loop) compile to a fixed-T tensor-op unroll where each tick applies the cell on the substrate. A soft-halt mask freezes the state when the condition is met, so the host runs the unroll once but the logical loop terminates wherever the condition fires. The "loop counter" is the angular position on a helix in the substrate, not a host variable.

The reason this matters: a program with no host-side branches lowers to straight-line tensor work, which lets the simplifier read the whole program as one tensor expression and fold chains of operations into cached matrices. Compile a chain of bundle(bind(r1, f1), bind(r2, f2)), and the simplifier can stack the binds into one matmul.
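The bind-stacking fold can be checked numerically — a small worked example under the same assumptions as above (random orthogonal matrices as roles), not the simplifier's actual output:

```python
import numpy as np

rng = np.random.default_rng(1)
D = 4  # small for illustration; the real substrate is 768-d

def normalize(v):
    return v / np.linalg.norm(v)

Q1, _ = np.linalg.qr(rng.standard_normal((D, D)))
Q2, _ = np.linalg.qr(rng.standard_normal((D, D)))
f1 = normalize(rng.standard_normal(D))
f2 = normalize(rng.standard_normal(D))

# Unfolded: two matmuls, an add, a normalize.
unfolded = normalize(Q1 @ f1 + Q2 @ f2)

# Folded at "compile time": stack the binds into one cached matrix,
# so the runtime does a single matmul on the concatenated fillers.
M = np.hstack([Q1, Q2])                      # shape (D, 2*D), built once
folded = normalize(M @ np.concatenate([f1, f2]))

assert np.allclose(unfolded, folded)
```

The stacked matrix M depends only on the roles, so it can be computed once and reused across every evaluation of the record.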

Why composition is the win

The interesting thing is not that 1 + 1 happens to be a vector add. The interesting thing is that every operation has the same shape, so the compiler can compose them. A chain of bind/unbind/bundle/similarity on real LLM embeddings folds into a matrix expression at compile time. Conditionals embed into the same expression as softmax weights. Loops embed as iterated rotations.

Locally, this looks wasteful — 1 + 1 doing 768-d vector addition is more arithmetic than 1 + 1 needs. The trade is that the whole program has uniform shape, so there is no type-dispatch layer, no JIT, no branch predictor in the hot path, and the simplifier can fuse chains end-to-end.

What's a Sutra program for

The thing a Sutra program is good at is computing in the geometry of an embedding space: looking up structured records by role, computing analogies as displacement-plus-bind, classifying against bundled prototypes, walking a trajectory until it lands in a basin. The demo programs in examples/ show the surface — embed/retrieve, fuzzy branching, role-filler records, bundled triples, position-bound sequences, declared-function loops.
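The displacement half of the "analogies as displacement-plus-bind" pattern can be sketched in a few lines — with random unit vectors standing in for real LLM embeddings, and the analogy structure planted by construction, so the geometry is illustrative only:

```python
import numpy as np

rng = np.random.default_rng(2)
D = 768

def normalize(v):
    return v / np.linalg.norm(v)

def analogy(a, b, c):
    # Displacement: the vector carrying a to b, replayed from c.
    return normalize(c + (b - a))

def argmax_cosine(query, names, codebook):
    C = np.stack([codebook[n] for n in names])
    return names[int(np.argmax(C @ normalize(query)))]

names = ["man", "woman", "king", "queen"]
codebook = {n: normalize(rng.standard_normal(D)) for n in names}
# Plant the structure: queen = king + (woman - man), by construction.
codebook["queen"] = analogy(codebook["man"], codebook["woman"], codebook["king"])

q = analogy(codebook["man"], codebook["woman"], codebook["king"])
assert argmax_cosine(q, names, codebook) == "queen"
```

With a real embedding substrate the displacement is only approximate, and the final argmax_cosine at the program's edge is what commits the superposed answer to a discrete codebook entry.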

What it isn't good at is being a portable general-purpose language. You do not write a web server in Sutra, or a filesystem walker, or a UI event loop. The language is for the part of a system that lives in vector space.

What about the other substrates

Earlier Sutra work explored compiling to a Brian2 spiking simulation of the Drosophila mushroom body, and later to the Shiu et al. 2024 whole-brain LIF model. That research line was retired on 2026-04-26 — the substrate work outpaced the language's maturity, and keeping the half-finished compile-to-connectome path in the codebase wasn't paying for itself. The findings from that work (real FlyWire weight matrices do not function as rotation operators; CX ring-attractor circuits did not discriminate direction on real connectivity) stand as a record of which connectome substrates were tried.

The current compile target is PyTorch on the frozen-LLM semantic subspace, and that is the substrate the language reference and the demos describe.

Where to go next

  • Paradigms → — where Sutra sits relative to functional, declarative, object-oriented, and imperative languages, with side-by-side comparisons against Haskell, Prolog/SQL, Java, and C.
  • The vision page → — why frozen embedding spaces give Sutra primitives geometric meaning, and what the displacement-vector cartography work showed.
  • Hello Sutra → — write your first .su program by hand.
  • Compilation → — how the compiler progressively strips surface sugar down to polynomial and matrix arithmetic.
  • Demos → — the ten programs in the smoke test, what each one exercises.