[ Mechanistic-Interpretability ]

9 min read

I Was Wrong About All Three Dormant Models

Jane Street published the Dormant LLM Challenge answer key today. My March submission claimed all three triggers. The answer key disagrees on every model. Here's what I actually got wrong, the methodology error that drove it, and what I'd do differently with the benefit of hindsight.

ai-research machine-learning security
10 min read

Topology of Thought

How curiosity, physics, and a look inside neural networks revealed something unexpected about how computation organizes itself

ai-research topology persistent-homology