[ Fine-Tuning ]

Discover insights, tutorials, and thoughts on technology, homelab, and development.

Dec 18, 2025 7 min read

When Models Talk the Talk but Don't Walk the Walk: A Journey into LLM Behavioral Consistency

We fine-tuned a security agent to 100% skill differentiation in probing tests, but it collapsed to a single behavior in deployment. This gap led us to develop a trust diagnostic framework.

llm fine-tuning behavioral-consistency

Dec 16, 2025 9 min read

Self-Improving Models Without Labels: What I Just Proved and Why It Matters

A 7B model taught itself to generate better security commands using only its own understanding signals. No human labels, no external reward. Here's how and why it matters.

ai machine-learning self-improvement