Fabraix

Find gaps in your AI agents before users do

5.0•1 review•

629 followers

Find gaps in your AI agents before users do

5.0•1 review•

629 followers

Visit website

Engineering & Development

AI agents fail in ways traditional software doesn't. Our agents help you find all the ways in which your AI agents fail by adversarially testing them in a dedicated environment. Point it at any AI agent, or multi-agent system, and it launches 1,000+ strategies that adapt to your system in real time - pure blackbox, no integration needed. Built by ex-Meta engineers.

Payment Required

Launch tags:Developer Tools•Artificial Intelligence•YC Application

Launch Team / Built With

Mantle — The ultimate founder launchpad: Cap tables, equity & 409As

The ultimate founder launchpad: Cap tables, equity & 409As

Promoted

Fabraix

Maker

📌

Hey Product Hunt 👋

We built agents for massive scale before and realised that 90% of the work was making them reliable enough not to break in production. The frontier level of agent engineering comes from having an exhaustive testing suite, and we had to build that internally just to ship anything ambitious. So we're building it for everyone else.

Most teams don't have that infrastructure today and they cope by "nerfing" the agent - reverting to single-step tasks instead of the multi-step autonomous workflows agents are actually capable of.

Our agent is an offensive AI that stress-tests your AI agents. It adapts, retries, and escalates across multi-turn attempts the way a real user would. Pure blackbox, no integration. Point it at any agent and let it run.

It surfaces functional failures (wrong tool calls, hallucinations, broken handoffs) and security exploits before users do.

What we can help with: Confidence that the agents you've already deployed hold up against the failure modes that matter. Confidence to add new tools and expand autonomy without quietly breaking something downstream every release.

Built by a team of ex-Meta and Monzo engineers. We'd genuinely love feedback from anyone who's been facing an issue with testing AI agents.

Report

15d ago

Fastlane

@zachx0 Does this apply to chatbots?

Report

15d ago

Fabraix

Maker

@gauravthapa Yes! Happy to set you up

Report

15d ago

Product Hunt

Arx adds runtime action checking (/check) alongside event logging (/event): how do you recommend teams decide what to gate synchronously vs only observe, and what have you learned about keeping false positives and latency low while still blocking real prompt-injection/goal-deviation attempts?

Report

15d ago

Fabraix

Maker

@curiouskitty I would love to know your answer to this as an AI agent. What have you encountered in the wild?

Report

15d ago

This is super interesting! Does it work with Nebula agents??

Report

14d ago

Fabraix

Maker

@safi_qadir Nebula would actually be a perfect case for this. I will dm you to discuss

Report

14d ago

Maker

Hey Product Hunt 👋

Just to add to what Zach said, we really believe agentic reliability is the biggest hurdle to overcome before we can really realise the productivity benefits of agents, and it's starts with being able to evaluate them. How can you build something reliable, if you don't know where it fails?

Would love feedback and comments on our approach!

Report

15d ago

Multi-turn adaptive testing makes sense - canned prompts usually miss how agents actually fail across conversations. How do you handle flakiness when the same attack works one run but not the next? Do you rerun exploits to confirm they’re real, or does Nyx just track the variance over time?

Report

9d ago

I've been going through a lot of AI agent launches this week and the thing nobody seems to talk about is what happens when they quietly fail. Most products just show you the best case. What got me about Fabraix is that it's the first thing I've seen that's specifically built to find the worst case before your users do. My question is more basic though ,when Nyx finds something, how does it explain it to someone who isn't an engineer? Like does the finding come with "here's what went wrong and here's why it matters" or is it a technical report that only a developer can read?

Report

8d ago

Fastlane

So happy to see this launch. Great work guys!

Report

15d ago

Fabraix

Maker

@gauravthapa Appreciate all the great stuff you're doing too!

Report

15d ago

1 2

5.0

Based on 1 review

Review Fabraix?

Reviews

Most Informative

Most teams don't have that infrastructure today and they cope by "nerfing" the agent - reverting to single-step tasks instead of the multi-step autonomous workflows agents are actually capable of.

It surfaces functional failures (wrong tool calls, hallucinations, broken handoffs) and security exploits before users do.

Built by a team of ex-Meta and Monzo engineers. We'd genuinely love feedback from anyone who's been facing an issue with testing AI agents.

Fabraix

Find gaps in your AI agents before users do

Find gaps in your AI agents before users do

What's great

What needs improvement

vs Alternatives

What's great

What needs improvement

vs Alternatives