I built Agent-Blackbox after losing a week debugging an 8-agent pipeline failure. Every agent pointed fingers at the next one, and logs were useless for tracing the actual decision chain.
What it does:
- Traces failures to the exact agent and decision point
- Reconstructs the full "task based on" chain across agents (which upstream task each task was derived from)
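
To make the two points above concrete, here is a minimal sketch of the idea, not Agent-Blackbox's actual API. It assumes each agent emits a record with a hypothetical `based_on` field naming the upstream task it consumed and an `ok` flag. The names `TaskRecord`, `reconstruct_chain`, and `first_failure` are made up for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TaskRecord:
    # Hypothetical record shape; the real tool's schema may differ.
    task_id: str
    agent: str
    based_on: Optional[str]  # id of the upstream task this one consumed, if any
    ok: bool                 # whether this step's output passed its check


def reconstruct_chain(records, failed_task_id):
    """Walk based_on links backwards from the failed task, return root-first."""
    by_id = {r.task_id: r for r in records}
    chain = []
    seen = set()  # guards against accidental cycles in the links
    current = failed_task_id
    while current is not None and current in by_id and current not in seen:
        seen.add(current)
        rec = by_id[current]
        chain.append(rec)
        current = rec.based_on
    chain.reverse()
    return chain


def first_failure(chain):
    """Return the earliest record in the chain that failed, or None."""
    for rec in chain:
        if not rec.ok:
            return rec
    return None


# Toy data: the writer failed, but only because the retriever failed first.
records = [
    TaskRecord("t1", "planner", None, True),
    TaskRecord("t2", "retriever", "t1", False),
    TaskRecord("t3", "writer", "t2", False),
]
chain = reconstruct_chain(records, "t3")
culprit = first_failure(chain)
```

In this toy run the visible failure is at `writer`, but walking the chain root-first points at `retriever` as the earliest bad step, which is the kind of attribution plain per-agent logs make hard to see.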