Launching today
OpenInterpretability

OpenInterpretability

Open-source toolkit to audit what your LLM knows

3 followers

The first mech interp toolkit that runs inside Claude Code, Cursor, and Cline via MCP. Production probes (FabricationGuard, agent-probe-guard) catch hallucinations + agent failures. ProbeBench leaderboard, SAE training from 30-min free Colab to paper-grade. Apache-2.0.

OpenInterpretability Reviews

AppSignal
AppSignal
Promoted
Reviews
Most Informative