Sign In
Access to Author tools and Claude Code Assistant requires authentication.

[ Blog ]

Homelab adventures, infrastructure deep-dives, and lessons learned building enterprise-grade systems on a budget

by Adam

I Was Wrong About Self-Improving Models: Here's What I Actually Found

A follow-up to my retracted post on self-improving models. The fidelity metric improved 18% but actual performance dropped 11%. Here's what went wrong, what I learned about Goodhart's Law, and why reproducibility filtering might be the answer.

self-improvement machine-learning goodharts-law reproducibility
Read more