william argo

AI Behavioural Evaluation

About

I research how AI behaves when it meets real work. It started as fascination with what these systems could do, then turned into documenting the surprising failure modes nobody was talking about. The limits and the hype pushed me into writing papers on how these systems could be improved, and the testing methods I use grew out of that work. None of it was planned. It came from seeing gaps everywhere and building what the field was missing. I share my research and insights through Inquisitor Labs.

Links

Inquisitor Labs Home Page

Buy on Leanpub

Badges

Tastemaker

Maker History

Evaluate & Test AI Using Real-World WorkAI can't be tested with magic prompts -only real work.
Jul 2026

🎉

Joined Product HuntJune 18th, 2026

Forums

•

4h ago

Evaluate & Test AI Using Real-World Work - AI can't be tested with magic prompts -only real work.

A practical field manual for evaluating and testing AI systems using real‑world work, now aligned with the requirements of the EU AI Act. Learn how to expose hidden failures, assess risks, and build reliable AI systems before deployment.