Who on your team shipped more with less? Which repo is silently eating your AI budget? Edgee Teams gives coding assistants their missing dashboard. Invite your team, connect GitHub, and every session gets tracked, attributed, and ranked. Compare compression ratios across developers. Share session stats publicly or keep them private. Climb the monthly leaderboard and claim the title of biggest token spender. Built for teams shipping with Claude Code, Codex, and other agents.
We benchmarked Codex alone against Codex routed through Edgee's compression gateway on the same repo, with the same model, under the same workflow.
The result: Codex + Edgee used 49.5% fewer input tokens, improved cache hit rate from 76.1% to 85.4%, and reduced total session cost by 35.6%.
This post breaks down why context compression makes Codex more token-efficient and materially cheaper to run, without sacrificing useful output.
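To see how the two savings levers interact, here is a hypothetical back-of-envelope for the input side only. The per-token prices and the 10M-token baseline are illustrative assumptions, not Edgee's measurements or any provider's real rates, and the 35.6% figure above also reflects output-token costs that this sketch ignores:

```python
# Hypothetical illustration: how fewer input tokens plus a higher cache hit
# rate combine into lower input-side cost. Prices are invented for the example.
PRICE_UNCACHED = 1.0   # $ per 1M uncached input tokens (assumed)
PRICE_CACHED = 0.1     # $ per 1M cached input tokens (assumed)

def input_cost(total_tokens, cache_hit_rate):
    """Blend cached and uncached token prices for a session's input tokens."""
    cached = total_tokens * cache_hit_rate
    uncached = total_tokens - cached
    return (uncached * PRICE_UNCACHED + cached * PRICE_CACHED) / 1e6

# Baseline session: an assumed 10M input tokens at the measured 76.1% cache hit rate.
baseline = input_cost(10_000_000, 0.761)

# Compressed session: 49.5% fewer input tokens, 85.4% cache hit rate.
with_gateway = input_cost(5_050_000, 0.854)

savings = 1 - with_gateway / baseline  # fractional input-cost reduction
```

The point of the sketch is that the two effects compound: compression cuts the token count, and the higher cache hit rate cuts the average price per remaining token.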
You're mid-task. Claude is in flow. Then the plan limit hits and everything stops. You know the feeling — the session cuts out, the context is gone, and you're starting over. For heavy Claude Code users, this isn't an occasional annoyance. It's a regular ceiling on what you can get done in a day.
We built Edgee's Claude Code Compressor to push that ceiling back.
Over the last few months, we've been working on a problem we kept seeing in production AI systems:
LLM costs don't scale linearly with usage; they scale with context. As teams add RAG, tool calls, long chat histories, memory, and guardrails, prompts balloon, and token spend quickly becomes the main cost bottleneck.
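The "costs scale with context" point is easy to make concrete: agent loops typically resend the full history on every call, so billed input tokens grow with the square of the conversation length. A toy sketch, with an assumed 200 tokens added per turn:

```python
# Hypothetical illustration: billed input tokens when the full history is
# resent on every model call. Numbers are invented for the example.
history = []
tokens_per_turn = []
for turn in range(5):
    history.append(200)                  # each turn adds ~200 tokens of new context
    tokens_per_turn.append(sum(history)) # the whole history is sent again each call

total_billed = sum(tokens_per_turn)
# tokens_per_turn grows 200, 400, 600, ... even though each turn only adds 200,
# so total billed input grows quadratically with the number of turns.
```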
So we built a token compression layer designed to run before inference.
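To show where such a layer sits in the pipeline, here is a minimal sketch of a pre-inference compressor. This is not Edgee's actual algorithm (real systems use summarization, deduplication, and smarter retention policies); the function name and parameters are hypothetical, and the strategy shown, keeping recent turns verbatim while truncating older ones, is just one simple policy:

```python
def compress_context(messages, keep_last=2, max_chars=80):
    """Hypothetical pre-inference compressor: keep the most recent `keep_last`
    messages verbatim and truncate older ones to `max_chars` characters."""
    compressed = []
    cutoff = len(messages) - keep_last
    for i, msg in enumerate(messages):
        if i < cutoff:
            # Older turn: truncate (a real compressor might summarize instead)
            compressed.append({**msg, "content": msg["content"][:max_chars]})
        else:
            # Recent turn: pass through untouched
            compressed.append(msg)
    return compressed

# Usage: five long messages; the compressor runs before the inference call.
messages = [{"role": "user", "content": "x" * 500} for _ in range(5)]
compressed = compress_context(messages)
```

The key design point is that compression happens before the request reaches the model, so every downstream call is billed on the smaller context.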