Hyperion is a high-performance LLM gateway with ~5µs overhead (~250× lower than LiteLLM).
It handles smart routing, semantic + exact-match caching, rate limiting, budget control, PII redaction, and failover through a single OpenAI-compatible API.
Built in Go with Redis, Qdrant, and ClickHouse.