william argo

william argo

AI Behavioural Evaluation

About

I research how AI behaves when it meets real work. It started as fascination with what these systems could do, then turned into documenting the surprising failure modes nobody was talking about. The limits and the hype pushed me into writing papers on how these systems could be improved, and the testing methods I use grew out of that work. None of it was planned. It came from seeing gaps everywhere and building what the field was missing. I share my research and insights through Inquisitor Labs.

Badges

Tastemaker
Tastemaker

Maker History

Forums

4h ago

Evaluate & Test AI Using Real-World Work - AI can't be tested with magic prompts -only real work.

A practical field manual for evaluating and testing AI systems using real‑world work, now aligned with the requirements of the EU AI Act. Learn how to expose hidden failures, assess risks, and build reliable AI systems before deployment.
View more