Hey, I'm Sacha, co-founder at @Edgee
Over the last few months, we've been working on a problem we kept seeing in production AI systems:
LLM costs don't scale linearly with usage; they scale with context.
As teams add RAG, tool calls, long chat histories, memory, and guardrails, prompts become huge and token spend quickly becomes the main bottleneck.
So we built a token compression layer designed to run before inference.
Edgee
Hey PH 👋
We're launching the Codex Compressor today.
But first, what is Edgee?
Edgee is an AI Gateway for Coding Agents, and it helps you save tokens. It's really simple to use: you only need two commands:
The first installs the Edgee CLI with curl or brew
The second is a simple edgee launch codex
That's it! And it works the same with Claude Code.
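For reference, the whole setup looks like this (the exact install URL, brew formula, and Claude Code subcommand are illustrative, check the Edgee docs for the real ones):

```shell
# 1. Install the Edgee CLI (via brew or curl; formula/URL illustrative)
brew install edgee

# 2. Launch Codex through the Edgee gateway
edgee launch codex

# Claude Code works the same way (subcommand name assumed):
edgee launch claude
```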
The results:
As a gateway, Edgee can optimize the requests that are sent to OpenAI, remove noise and waste, and cut input token usage almost in half.
We ran a controlled benchmark (see the video): same repo, same model (gpt-5.4), same task sequence.
One session with plain Codex, one with Codex routed through Edgee.
Input tokens: −49.5%
Total cost: −35.6%
Cache hit rate: from 76.1% to 85.4%
The cache hit rate improvement is the part I find most interesting. By sending leaner prompts, @OpenAI cache is hit more often, so the savings compound beyond just the compression ratio.
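To make the compounding concrete, here's a rough back-of-the-envelope model (not Edgee's accounting: the cached-token discount is an assumption — OpenAI bills cached input at a reduced, model-dependent rate, and 75% off is used here purely for illustration):

```python
# Back-of-the-envelope: leaner prompts + a higher cache hit rate compound.
# ASSUMPTION: cached input tokens are billed at 0.25x full price; the real
# discount depends on the model's pricing.

def effective_input_cost(tokens: float, hit_rate: float, cached_multiplier: float = 0.25) -> float:
    """Cost in 'full-price token' units: cached tokens cost less, the rest cost full price."""
    return tokens * (hit_rate * cached_multiplier + (1 - hit_rate))

baseline = effective_input_cost(1_000_000, 0.761)    # plain Codex: 76.1% cache hits
compressed = effective_input_cost(505_000, 0.854)    # -49.5% tokens, 85.4% cache hits

savings = 1 - compressed / baseline
print(f"effective input-cost reduction: {savings:.1%}")  # larger than the raw 49.5% token cut
```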
Here's what makes this different from other token compression tools: we pull token counts directly from the OpenAI API usage fields. No character-based estimates. The numbers are what you're actually billed for.
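If you want to sanity-check the numbers yourself, the billed counts are right there on the response. A minimal sketch, assuming the OpenAI Chat Completions usage schema (`prompt_tokens`, `completion_tokens`, `prompt_tokens_details.cached_tokens`); the sample payload below is made up:

```python
# Read billed token counts from the API response's usage object instead of
# estimating from character counts. Sample payload is illustrative only.
sample_usage = {
    "prompt_tokens": 12_480,
    "completion_tokens": 310,
    "total_tokens": 12_790,
    "prompt_tokens_details": {"cached_tokens": 10_240},
}

def billed_breakdown(usage: dict) -> dict:
    cached = usage.get("prompt_tokens_details", {}).get("cached_tokens", 0)
    return {
        "input": usage["prompt_tokens"],
        "input_cached": cached,
        "input_uncached": usage["prompt_tokens"] - cached,
        "output": usage["completion_tokens"],
    }

print(billed_breakdown(sample_usage))
```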
⭐️ Please give our brand-new OSS repository a star, we need the support ;)
And don't hesitate to try it, it's free!
Happy to answer any questions here all day. 🙏
@sachamorard s/o for this new launch -- keep up the great work
Coolest launch of the day!! Btw what kinds of transformations are you applying like semantic compression, deduplication, summarization or something else???
Edgee
@lak7 We're doing token-level compression, not semantic. Concretely, we clean the tool results: smart filtering (strip ANSI codes, progress bars, whitespace noise), deduplication (collapse repeated log lines with counts), grouping (aggregate similar items), and truncation (keep the signal, cut the redundancy).
No summarization, no embedding-space compression. The approach stays fully transparent and deterministic: what gets sent to the model is readable and debuggable, just leaner.
The biggest gains come from tool outputs like cargo build, git log, go test... formats designed for humans, not for models. That's where the −93% on cargo comes from.
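To illustrate the flavor of those transformations, here's a toy sketch (my own simplification, not Edgee's actual pipeline): strip ANSI escapes, collapse consecutive duplicate lines with a count, and truncate long output while keeping head and tail.

```python
import re
from itertools import groupby

ANSI_RE = re.compile(r"\x1b\[[0-9;]*[A-Za-z]")  # CSI escapes: colors, cursor moves

def clean_tool_output(text: str, max_lines: int = 200) -> str:
    """Toy token-level cleanup: strip ANSI codes and trailing whitespace,
    collapse consecutive duplicate lines with a count, truncate long output."""
    lines = [ANSI_RE.sub("", ln).rstrip() for ln in text.splitlines()]
    out = []
    for line, run in groupby(lines):  # deduplicate consecutive repeats
        n = sum(1 for _ in run)
        out.append(line if n == 1 else f"{line}  (x{n})")
    if len(out) > max_lines:  # keep the head and tail, cut the middle
        keep = max_lines // 2
        omitted = len(out) - 2 * keep
        out = out[:keep] + [f"... [{omitted} lines omitted] ..."] + out[-keep:]
    return "\n".join(out)
```

Deterministic rules like these keep the compressed prompt human-readable, which is the property the reply above is describing.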
100%, @Edgee is underrated IMHO
That's a very specific figure — I like that. Is that an average across user sessions or a median? And what does the distribution look like — are most users clustered around that number or is it more bimodal between light and heavy usage?
Edgee
@ryanwmcc1 Great question, and I want to be upfront: this is a single benchmark run, not an aggregate across user sessions. The −49.5% figure comes from one controlled test, same repo, same model, same task sequence, so it's a point measurement rather than a statistical distribution.
That said, the compression ratio in our architecture isn't random. It tracks directly with how much redundant context accumulates in a session, which is a function of session length, tool call frequency, and how repetitive the tool outputs are. Cargo build output, for example, is extremely compressible (−93% in this run) because it's verbose and structurally repetitive. File reads are less so (−34%).
If we look at the average compression rate across Codex sessions, we're closer to −40% on input tokens, since it depends heavily on how the developer uses Codex.
35.6% is an oddly specific number, which makes me trust it more than "save up to 50%." What's actually being compressed, prompt-side context pruning, response caching, or something closer to semantic dedup across a session? Asking because I've been eyeing my own API bill lately and the honest breakdown matters.
Very cool product!
I've been using it for 3 weeks and it's very efficient. A game changer to control API costs.
Edgee
@picsoung haha, thanks. I would do anything to help friends save tokens.
@picsoung @sachamorard "friends don't let friends waste tokens."
UXPin Merge
Really interesting concept. Token compression plus routing in one layer feels powerful. How do you decide what gets compressed without affecting output quality?