
Peeling back the layers of artificial minds β one story, one system, one week at a time.
Welcome to my blog, a storytelling-engineering journal that explores the strange, emotional, and sometimes unpredictable world of simulated intelligence. This blog documents the design of increasingly complex AI agents β not as models in a paper, but as characters in a world. You will hopefully see my successes, but more often you will see my failures.
Each post follows a weekly build cycle with:
- new cognitive capabilities (memory, planning, belief, etc.)
- playable sandbox simulations (built in Unity)
- reflective writing that blends code, metaphor, and narrative
Latest Post
π© Week 1 β Tracing the Mind: How Glassbox Began
βWhen a mind becomes visible, it ceases to be a black box β and becomes a mirror.β
This week, I started the journey into Phase 2 β Glassbox, an interactive debugger for transformer models. The goal is simple: make attention visible. In practice? Not so simple.
Glassbox is a visual tool that lets you trace what a language model is paying attention to as it generates text. Itβs not interpretability in the abstract. Itβs literally watching what it thinks.
What works so far: * β Backend powered by HuggingFace + FastAPI * β Traces attention matrices from all layers and heads * β Frontend force-directed graph of token-to-token attention * β Full-stack communication via REST API
π§ What the Onion Means
Wherever you see this icon β π§ That marks a layered word: something that needs peeling.
Hover over the onion icon after a word to see its layered meaning!
These terms will have: - The official definition - And my AILO-style version β honest, funny, and functional
Example: Belief π§
AILO-style: A hunch the agent will probably act onβ¦ even if itβs wrong.
These definitions live in the glossary and appear in hover-tooltips throughout posts.
π½ Who Is the Martian?
Back in middle school, my Physics teacher told me the only way to understand something was to pretend you were explaining it to a man from Mars.
Iβve never forgotten that.
At the end of every post youβll find a Martian π½. This is a plain-language TLDR written for someone from another world (or another field).
It breaks down: - What I built - Why it matters - How to explain it to your obtuse martian friend
Because interpretability isnβt just for the models β itβs for the humans, too.