Wafer Pass - Flat rate to the best LLMs for OpenClaw, Hermes Agent, etc.
We're launching Wafer Pass, a monthly subscription that gives you access to the fastest LLMs for use in personal agentic coding harnesses like OpenClaw, Claude Code, OpenCode, Cline, and Kilo Code, with no per-token charges.
The first two LLMs we're supporting are GLM5.1-Turbo and Qwen3.5-397B-A17B-Turbo, which our team optimized from the original base models to run at 1.5-3x the speed SGLang/vLLM give you out of the box.
More Turbo models coming soon, included with all plans.


Replies
I hope we get access to all the models with this flat rate, with some sort of token limits. Or are you not doing that, and instead giving a flat rate on all kinds of models?
Wafer
@nayan_surya98 hey! yes all models will be available at the flat rate, with generous request limits.
I understand there's a cost constraint here, but I wish I had an avenue to try this in my workflows to evaluate whether it would actually work for me. That said, $10 for a week of experimentation is helpful, as long as I've got an offramp. I just don't know whether my needs are going to be satisfied or not. Sharing because it might be informative for your pitch. I'm sitting here thinking: "OK, but I don't know whether I'll be dramatically under the usage limit, so I'm overspending, or over it, so it's DOA for me."