Launching today

Promptly
An AI Cost Optimization Infrastructure for LLM Applications
30 followers
An AI Cost Optimization Infrastructure for LLM Applications
30 followers
Promptly is an OpenAI-compatible proxy that cuts your LLM spend by up to 60% with smart routing, prompt optimization, semantic caching, and context pruning. Works with OpenAI, Anthropic, and Google.










Really excited to finally share Promptly 🚀
If you’re working with LLMs, you’ve probably seen how quickly costs can scale in production.
Promptly helps optimize requests, reduce unnecessary tokens, and make AI systems more efficient - without changing your existing setup.
Would love to hear your thoughts and feedback 🙌