Edgee compresses tokens before they reach LLM providers, reducing the token cost by up to 50%. Same code, fewer tokens, lower bills.
This is the 5th launch from Edgee. View more
Edgee Fallback Models
Launched this week
Your Claude Code session shouldn't die when Anthropic goes down or your plan runs out. Edgee Fallback Models keeps coding assistants running by routing to alternative models like Kimi K2.6, Gemma, GLM, or Qwen when Claude is unavailable, rate-limited, or just too expensive. Or one-click fallback to your own Bedrock, Vertex, or Azure account. Same Claude Code, different backend, zero code changes. Built for teams that can't afford to stop shipping.




Free Options
Launch Team



The auto-fallback when rate limits kick in is the part I always end up wiring by hand. Good luck with the launch!
Edgee
Kilo Code
@sachamorard love it!
The fallback angle is practical for agent workflows, especially when a coding session is mid-task and the provider limit hits. I’d be curious how you surface model switches in logs, since silent fallbacks can make debugging output differences harder.
Hey Sacha, went through Edgee Fallback's page and the "your Claude Code session shouldn't die when Anthropic goes down" framing is exactly the pain I've been living with this month. one thing I wanted to ask, when you fall back to Kimi or GLM mid-session, are you replaying the full context or doing a smarter summarization handoff? the model switch is the part I'd want to understand for long sessions.
mailX by mailwarm
Congrats on the launch!! This solves a real issue for developers who can’t afford downtime when Claude is rate limited or down. Keeping coding running with simple fallback models will make workflow feel more stable.