Launched this week

GitHub

Transparent semantic cache for LLM API calls on Redis VS

Khazad is a transparent semantic cache for LLM API calls. It intercepts LLM HTTP traffic at the httpx transport layer and serves semantically equivalent requests from a Redis 8 vector cache, with zero code changes. It works with OpenAI, Anthropic, Gemini, Azure OpenAI, and Mistral, and offers model-aware and conversation-aware caching, full streaming support, TTL, and tunable similarity thresholds. Stop paying for the same prompt twice in dev, CI, demos, or production. Open source (MIT).
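To make the idea concrete, here is a minimal sketch of a model-aware semantic cache lookup with a TTL and a tunable similarity threshold. It is not Khazad's API: the `SemanticCache` class, the `toy_embed` bag-of-words embedding, and the in-memory dictionary are all hypothetical stand-ins. A real deployment would presumably use an embedding model and a Redis vector index instead.

```python
import math
import time
from collections import Counter


def toy_embed(text):
    """Hypothetical stand-in embedding: lowercase bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class SemanticCache:
    """In-memory illustration only; entries are partitioned by model name."""

    def __init__(self, threshold=0.8, ttl_seconds=3600.0, clock=time.monotonic):
        self.threshold = threshold      # minimum similarity counted as a hit
        self.ttl_seconds = ttl_seconds  # lifetime of each cached response
        self.clock = clock              # injectable so expiry can be tested
        self.entries = {}               # model -> [(embedding, response, expires_at)]

    def put(self, model, prompt, response):
        expires_at = self.clock() + self.ttl_seconds
        self.entries.setdefault(model, []).append(
            (toy_embed(prompt), response, expires_at)
        )

    def get(self, model, prompt):
        now = self.clock()
        # Drop expired entries for this model before searching.
        live = [e for e in self.entries.get(model, []) if e[2] > now]
        self.entries[model] = live
        query = toy_embed(prompt)
        best, best_score = None, self.threshold
        for embedding, response, _ in live:
            score = cosine(query, embedding)
            if score >= best_score:
                best, best_score = response, score
        return best  # None means a miss: forward the request upstream
```

Keying entries by model keeps a response generated by one model from being served for another, and raising `threshold` trades hit rate for a lower chance of returning an answer to a prompt that only looks similar.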
