5 min read
Implementing Pi-Attention: 50% Memory Reduction for Long-Context LLMs
How periodic sparse attention achieves O(n) complexity while maintaining model quality
Tags: transformers, attention, memory-optimization
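The full article body is not included in this excerpt, so as a rough illustration of the idea in the subtitle, here is a minimal sketch of one way a periodic sparse attention pattern can bound per-query work: each query attends only to a local window plus every `period`-th earlier token, so attended keys per query stay constant and memory grows O(n) instead of O(n^2). The names `periodic_sparse_mask`, `window`, and `period` are hypothetical and need not match Pi-Attention's actual scheme.

```python
# Illustrative sketch only: Pi-Attention's real pattern is not shown in this
# excerpt. Assumes a periodic scheme (local window + periodic "anchor" keys).
import torch

def periodic_sparse_mask(seq_len: int, window: int = 64, period: int = 64) -> torch.Tensor:
    """Boolean causal mask: True marks (query, key) pairs that are attended."""
    q_idx = torch.arange(seq_len).unsqueeze(1)   # (seq_len, 1)
    k_idx = torch.arange(seq_len).unsqueeze(0)   # (1, seq_len)
    causal = k_idx <= q_idx                      # no attention to future tokens
    local = (q_idx - k_idx) < window             # sliding local window
    periodic = (k_idx % period) == 0             # every period-th key is an anchor
    return causal & (local | periodic)

def sparse_attention(q, k, v, window: int = 64, period: int = 64) -> torch.Tensor:
    """Dense scores with the periodic sparse mask applied. A memory-efficient
    kernel would avoid materialising the masked-out scores entirely."""
    scale = q.shape[-1] ** -0.5
    scores = (q @ k.transpose(-2, -1)) * scale                     # (..., n, n)
    mask = periodic_sparse_mask(q.shape[-2], window, period).to(q.device)
    scores = scores.masked_fill(~mask, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v

# Usage: batch of 2, 8 heads, 512 tokens, head dim 64
q = k = v = torch.randn(2, 8, 512, 64)
out = sparse_attention(q, k, v)
print(out.shape)  # torch.Size([2, 8, 512, 64])
```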