AILO Logo

Peeling back the layers of artificial minds β€” one story, one system, one week at a time.

Welcome to my blog, a storytelling-engineering journal that explores the strange, emotional, and sometimes unpredictable world of simulated intelligence. This blog documents the design of increasingly complex AI agents β€” not as models in a paper, but as characters in a world. You will hopefully see my successes, but more often you will see my failures.

Each post follows a weekly build cycle with:

Latest Post

🟩 Week 1 – Tracing the Mind: How Glassbox Began

β€œWhen a mind becomes visible, it ceases to be a black box β€” and becomes a mirror.”

This week, I started the journey into Phase 2 β€” Glassbox, an interactive debugger for transformer models. The goal is simple: make attention visible. In practice? Not so simple.

Glassbox is a visual tool that lets you trace what a language model is paying attention to as it generates text. It’s not interpretability in the abstract. It’s literally watching what it thinks.

What works so far: * βœ… Backend powered by HuggingFace + FastAPI * βœ… Traces attention matrices from all layers and heads * βœ… Frontend force-directed graph of token-to-token attention * βœ… Full-stack communication via REST API

Read the full post β†’


πŸ§… What the Onion Means

Wherever you see this icon β†’ πŸ§… That marks a layered word: something that needs peeling.

Hover over the onion icon after a word to see its layered meaning!

These terms will have: - The official definition - And my AILO-style version β€” honest, funny, and functional

Example: Belief πŸ§…
AILO-style: A hunch the agent will probably act on… even if it’s wrong.

These definitions live in the glossary and appear in hover-tooltips throughout posts.

πŸ‘½ Who Is the Martian?

Back in middle school, my Physics teacher told me the only way to understand something was to pretend you were explaining it to a man from Mars.

I’ve never forgotten that.

At the end of every post you’ll find a Martian πŸ‘½. This is a plain-language TLDR written for someone from another world (or another field).

It breaks down: - What I built - Why it matters - How to explain it to your obtuse martian friend

Because interpretability isn’t just for the models β€” it’s for the humans, too.

🌞