Kv-Cache on Pavel Nasovich's Blog

Kv-Cache on Pavel Nasovich's Bloghttps://forcewake.me/tags/kv-cache/Recent content in Kv-Cache on Pavel Nasovich's BlogHugo -- 0.157.0en-usCopyright 2026Sun, 05 Apr 2026 23:39:32 +0200TurboQuant Under the Hood: Google's 3-Bit Attack on the LLM Memory Wallhttps://forcewake.me/turboquant-kv-cache-compression/Thu, 26 Mar 2026 10:53:00 +0000https://forcewake.me/turboquant-kv-cache-compression/A deep technical walkthrough of TurboQuant, PolarQuant, and QJL: how Google turns random rotations, optimal scalar quantization, and a 1-bit residual sketch into a practical attack on the KV-cache memory wall without pretending every model problem is solved.