Mps on Rick Lamers' blog

Recent content in Mps on Rick Lamers' blog: https://ricklamers.io/tags/mps/

KV Cache, Made Visible: Qwen3-0.6B on Apple Silicon
https://ricklamers.io/posts/kvcache-exploration/
Mon, 20 Apr 2026 12:00:00 +0200

A minimal Qwen3-0.6B in pure PyTorch, running on Apple Silicon, with a live UI that makes the memory and latency trade-offs of KV cache physically visible. Toggle the cache off and watch attention go quadratic in real time.
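
To make the trade-off in that summary concrete, here is a rough back-of-the-envelope sketch in plain Python. It is not code from the post. The function names and the example dimensions are hypothetical, and it assumes standard causal attention in which a cached decode step scores only the newest query against all stored keys, while an uncached step re-runs every position.

```python
def attention_scores_per_step(seq_len: int, use_cache: bool) -> int:
    """Query-key dot products per head needed to emit one token at length seq_len."""
    if use_cache:
        # Only the newest token's query is scored against the stored keys: linear.
        return seq_len
    # Without a cache every position is recomputed; query i attends to keys 0..i,
    # so the work per step grows quadratically with sequence length.
    return seq_len * (seq_len + 1) // 2


def kv_cache_bytes(seq_len: int, layers: int, kv_heads: int, head_dim: int,
                   bytes_per_value: int = 2) -> int:
    """Memory held by the cache: one key and one value vector per layer, head, token."""
    return 2 * layers * kv_heads * head_dim * seq_len * bytes_per_value


if __name__ == "__main__":
    # Illustrative dimensions only; they are assumptions, not read from any model config.
    layers, kv_heads, head_dim = 24, 8, 128
    for n in (128, 512, 2048):
        cached = attention_scores_per_step(n, use_cache=True)
        uncached = attention_scores_per_step(n, use_cache=False)
        mem_mib = kv_cache_bytes(n, layers, kv_heads, head_dim) / 2**20
        print(f"len={n}: cached={cached} uncached={uncached} cache={mem_mib:.1f} MiB")
```

Under these assumptions the cache trades memory that grows linearly with sequence length for per-step attention work that stays linear instead of quadratic.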