AegisLM

See how easily your AI can be broken — in seconds

5 followers

See how easily your AI can be broken — in seconds

5 followers

Visit website

AI Metrics and Evaluation

AegisLM shows how easily modern AI can be broken. Test any model for prompt injection, jailbreaks, and data leaks in seconds. Input a prompt, run attacks, and see where it fails. Designed for builders who want to stress-test AI systems under real-world conditions. Try built-in attacks or create your own.

Free

Launch tags:Developer Tools•Artificial Intelligence

Launch Team / Built With

ElevenAgents by ElevenLabs — Scale conversations without scaling your team

Scale conversations without scaling your team

Promoted

Maker

📌

I built AegisLM as a small prototype after noticing how easy it is to break many AI systems with simple prompt injections or jailbreak-style inputs. This isn’t a polished product yet—it’s an early attempt to explore AI security from an adversarial perspective. Instead of focusing on what models can do, I wanted to see how they fail under real-world attacks. You can try a few basic attack scenarios, tweak inputs, and see where things break. Would really appreciate feedback on: What types of attacks I should test next Where the tool feels weak or incomplete How this could be made more useful in practice If you’re building with AI, curious to know—how do you currently test for failures?

Report

16d ago

RiteKit Company Logo API

@aca_050 This is a genuinely important angle—most teams obsess over capabilities and miss the failure modes. The adversarial testing approach makes sense, especially since prompt injection and jailbreaks are real production risks now. For the next phase, I'd suggest testing against models with different safety training, since vulnerabilities often vary by architecture and training approach.

Report

13d ago

Maker

@osakasaul Yeah, I've started observing that too. The behavior of the models varies greatly when they are given the same prompt based on their alignment or the structure of the system prompt.

Right now I’m mostly focused on making the failure cases easier to reproduce consistently, but comparing models with different safety training approaches is definitely something I want to dig into more.

Specifically, I would love to see how an open model behaves relative to a heavily aligned API model under the same prompts.

Report

13d ago

Reviews

Most Informative