Launching today

ElevenAgents Guardrails 2.0
Configurable safety control for enterprise agent deployment.
98 followers
Configurable safety control for enterprise agent deployment.
98 followers
Voice agents drift, get manipulated, or go off-brand in production. Guardrails 2.0 adds real-time policy enforcement, prompt injection protection, and custom rules to ElevenAgents. For enterprise teams deploying agents at scale.

Voice agents are moving fast into production. But most teams don't have a way to enforce what their agents should and shouldn't say, especially when users actively try to push past system instructions.
ElevenAgents Guardrails 2.0 is a redesigned safety layer that validates user inputs and agent responses in real time, before anything reaches the end user.
Agents drift in long conversations. System prompts don't hold under pressure. One wrong response in healthcare or banking breaks trust fast. Guardrails 2.0 gives teams three independent enforcement layers: system prompt hardening, user input validation, and agent response validation. Custom Guardrails let you define policies in natural language and enforce them automatically across every call.
What makes it interesting:
🔒 Custom rules run as an independent parallel check, not a filter on the main model
🎯 Pre-built protections for focus, content safety, and prompt injection
⚙️ Execution modes tuned for voice latency tradeoffs
🚪 Configurable exit strategies when a guardrail fires
📋 Conversation history redaction for compliance-sensitive deployments
Built for enterprise teams deploying voice agents in regulated industries: healthcare, banking, and retail.
Note: currently in alpha. Redaction and Zero Retention Mode are enterprise-tier.
If you're moving voice agents from pilot to production, this is the infrastructure layer that makes it viable.
P.S. I hunt the latest and greatest launches in tech, SaaS and AI, follow to be notified → @rohanrecommends
This solves a real problem. We're building Kepion — a multi-agent platform with 31 AI agents that can create and run businesses. Voice interface is on our roadmap (Telegram voice messages, morning briefings, phone calls via Twilio), and ElevenLabs is our primary TTS choice.
The guardrails angle is especially relevant for us because our agents handle business-critical tasks — market research, financial analysis, legal checks. When these go through voice, the stakes are higher: a hallucinated number spoken aloud sounds more authoritative than one typed in chat.
We already have a supervisor layer (Warden for quality, Sentinel for fact-checking) that catches issues in text outputs. Guardrails 2.0 adds the missing piece: catching problems in the voice layer itself — prompt injection through voice input, off-brand responses, policy violations.
Question: does Guardrails 2.0 work with custom voice agents that use ElevenLabs TTS via API, or only with ElevenAgents specifically? We'd want to apply these rules to our own agent pipeline that outputs through ElevenLabs voices.
Sounds like a must have tool, like a shield that saves u from a PR crisis
Documentation.AI
Sometimes the voice agents can drift, and guardrails can help prevent that from happening. Congrats!!