We're launching Wafer Pass, a monthly subscription that gives you access to the fastest LLMs for use in personal agentic coding harnesses like OpenClaw, Claude Code, OpenCode, Cline, Kilo Code, with no per-token charges.
The first 2 LLMs we're supporting is GLM5.1-Turbo and Qwen3.5-397B-A17B-Turbo, two LLMs our team optimized from the original base models to 1.5-3x the speed SGLang/vLLM give you out of the box.
More Turbo models coming soon, included with all plans.
Wafer is the GPU development stack that lives inside your editor: profiling, compiler explorer, and GPU docs—all in one place.
Writing GPU kernels means jumping between fragmented tools: editing here, profiling there, docs in a browser, compiler explorer in another tab. Wafer pulls everything back into your IDE.