PandaProbe
p/pandaprobe
open source agent engineering platform
0 reviews1 follower
Start new thread
trending
Sina Tayebati

8h ago

why are ai agents still so hard to debug in production?

feels like the industry figured out how to build ai agents faster than how to understand them.

everyone demos agents.
very few teams can confidently answer:

  • why an agent failed

  • what changed between runs

  • whether quality is improving or regressing

  • or if the agent is actually reliable over time

curious how people here are handling this today.