Launching today
OpenInterpretability

OpenInterpretability

Open-source toolkit to audit what your LLM knows

3 followers

The first mech interp toolkit that runs inside Claude Code, Cursor, and Cline via MCP. Production probes (FabricationGuard, agent-probe-guard) catch hallucinations + agent failures. ProbeBench leaderboard, SAE training from 30-min free Colab to paper-grade. Apache-2.0.

OpenInterpretability makers

Here are the founders, developers, designers and product people who worked on OpenInterpretability