Dedy Ariansyah

Building sovereign AI

Dedy Ariansyah • 1d ago

Auditi - Open source AI agents observability and evaluation

Tracing and evaluation in one open-source tool. LangSmith is closed-source, Langfuse is overcomplicated, and most logging tools lack built-in eval. Auditi is meant to close all three gaps. Two-line auto-instrumentation captures all OpenAI, Anthropic, and Google API calls. 7+ LLM-as-Judge evaluators run automatically on traces, with human annotation workflows for when AI judges aren't enough. It also offers real-time cost tracking and can turn production traces into fine-tuning datasets. Self-host with docker compose up. Built with a Python SDK, FastAPI, and React.

Nika • 4d ago

How much do you trust AI agents?

With the advent of clawdbots, it's as if we've all lost our inhibitions and "put our lives completely in their hands."

I'm all for delegating work, but not for handing them too much personal or sensitive stuff.