High-performance CLI proxy that sits between your AI client and any LLM API. Strips whitespace, deduplicates repeated context, compresses prompts — cutting token usage by 60–90% before the request ever reaches the model. Built in Rust for near-zero overhead.