Edgee compresses prompts before they reach LLM providers and reduces token costs by up to 50%.
Same code, fewer tokens, lower bills.

Edgee: The AI Gateway that TL;DRs tokens
Khaled Maâmra left a comment
Hi everyone 👋 Super excited to see the discussion around this. We’ve been digging deep into hard vs. soft compression, token scoring, and meta-tokenization, especially into what actually survives compression in production settings. One major challenge is that it’s not just about reducing tokens, but about retaining evaluation scores, alignment, and tool-calling reliability after compression...
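To make the "hard compression via token scoring" idea concrete, here is a minimal, hypothetical sketch: it scores each token by rarity and keeps only the highest-scoring fraction. Production systems score tokens with a model (e.g. perplexity-based approaches) rather than raw frequency, so treat this purely as an illustration of the mechanism, not Edgee's actual method.

```python
from collections import Counter

def compress_prompt(text: str, keep_ratio: float = 0.7) -> str:
    """Toy 'hard' compression: score tokens by inverse frequency
    (rarer tokens score higher) and keep only the top share.
    Illustrative only; real gateways use model-based scores."""
    tokens = text.split()
    counts = Counter(t.lower() for t in tokens)
    # (score, original position, token) so we can restore word order.
    scored = [(1.0 / counts[t.lower()], i, t) for i, t in enumerate(tokens)]
    keep = max(1, int(len(tokens) * keep_ratio))
    top = sorted(scored, reverse=True)[:keep]
    # Re-emit the surviving tokens in their original order.
    return " ".join(t for _, _, t in sorted(top, key=lambda x: x[1]))

prompt = "please please kindly summarize the quarterly revenue report"
short = compress_prompt(prompt, keep_ratio=0.5)
```

The repeated filler word ("please") scores lowest and is dropped first, which is the intuition behind scoring-based compression: spend the token budget on the informative parts of the prompt.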
Token Compression for LLMs: How to reduce context size without losing accuracy
Sacha MORARD
