Khaled Maâmra

Khaled Maâmra

Research Scientist working on AI

Badges

Tastemaker
Tastemaker
Gone streaking
Gone streaking
Gone streaking 5
Gone streaking 5

Maker History

  • Edgee
    EdgeeThe AI Gateway that TL;DR tokens
    Feb 2026
  • 🎉
    Joined Product HuntJanuary 8th, 2026

Forums

Token Compression for LLMs: How to reduce context size without losing accuracy

Hey, I'm Sacha, co-founder at @Edgee

Over the last few months, we've been working on a problem we kept seeing in production AI systems:

LLM costs don't scale linearly with usage, they scale with context.
As teams add RAG, tool calls, long chat histories, memory, and guardrails, prompts become huge and token spend quickly becomes the main bottleneck.

So we built a token compression layer designed to run before inference.

View more