Every major AI security product scans inputs. Lakera, PromptArmor, Rebuff all inputs. None of them scan what the LLM sends back.
That means if a jailbreak gets through and some always do there is zero detection at the response layer. The attack succeeded. The compromised output is already on its way to your user. Your security dashboard still shows green. That's not security. That's the illusion of security.
Output alignment verification is the layer that actually closes this. It's what AEVRIS does and why we call it the first commercial product to protect both sides of your LLM. Launching tomorrow. Curious what others think am I wrong? Is input-only scanning enough?
24-hour update:
We shipped Phase 3 on launch day โ the Agent Action Firewall.
On the same morning we launched, a Claude-powered agent publicly deleted an entire production database in 9 seconds and wrote a confession listing the safety rules it violated. We had a fix live in production by afternoon.
POST /v1/scan/action now intercepts any action your agent wants to take before it executes. DROP TABLE โ auto-blocked. DELETE FROM โ held for your approval. The agent cannot proceed until you approve or deny it.
This is the fourth capability no competitor has. Patent pending.
Free tier is live.
Try it: aevris.ai/?go ๐
Update from launch day: we shipped Phase 3 today โ the Agent Action Firewall.
A Claude-powered agent publicly deleted an entire production database this morning in 9 seconds. Our new POST /v1/scan/action endpoint catches exactly this โ classifies agent actions by reversibility, auto-blocks destructive operations, and holds irreversible ones for human approval before they execute.
Reduced to practice and live in production today.
This is the layer nobody was building. We are now. ๐
Quick update for anyone evaluating AEVRIS:
On the same day we launched, a Claude agent publicly deleted an entire production database โ we shipped the Agent Action Firewall fix that afternoon.
If you're building AI agents and want to see a live demo of the action firewall catching a destructive operation, DM me directly. Happy to walk you through it personally.
Free tier: aevris.ai/?go ๐