Launching today

Tracea
Datadog for AI agents - traces, RCA, and team memory
12 followers
Datadog for AI agents - traces, RCA, and team memory
12 followers
Agents fail silently. You fire one off, it runs, nothing comes back - no trace, no cost data, no idea which call broke. Tracea captures every tool call, LLM response, and cost spike. Automatic RCA tells you exactly why it failed. YAML detection rules catch loops, spikes, and silent errors before they hit production. Self-hosted. One Docker command. No data leaves your network. Company Brain turns every session into team memory - agents start smarter each run.






Hey Product Hunt! 👋 Launching today as part of the @gustaf x @Y Combinator builder challenge.
I'm Darshan, the maker of @Tracea.
I built this because I kept running AI agent sessions that would fail silently - no trace of what broke, no cost visibility, no way to debug after the fact. I open-sourced an early version called @Observagent and 300 developers found it with zero marketing. That told me the problem was real.
@Tracea is the production-grade version:
🔍 Full event timeline: every tool call, LLM response, cost and latency spike in order
🧠 Automatic RCA: understand exactly why an agent failed
🚨 Detection + alerting: catch cost spikes, loops, and silent errors before they hurt
💡 Company Brain: sessions synthesized into team knowledge so agents start smarter each run
🔒 Self-hosted: one Docker command, nothing leaves your network
Works with every framework out of the box. No SDK lock-in, no integration work.
Would love to hear how you're currently handling visibility into your agent runs. Happy to answer anything!