All activity
Sujal Meghwal left a comment
If I were evaluating it seriously, I’d benchmark three things first: 1) Long-context failure modes: instruction drift, prompt injection persistence, and context poisoning across the full 65K window (especially multi-turn reasoning chains). 2) OpenAI-compat edge cases: tool/function calling consistency, streaming behavior, and how error states are handled under load compared to GPT-style APIs....
What would you build or benchmark with 5M free tokens on a reasoning model?
Chirag Arya
NullStrike Security is an offensive security firm focused on real-world, manual penetration testing. Unlike automated scanners, we simulate real attacker behavior to find exploitable vulnerabilities across web apps, APIs, cloud (AWS, Azure, GCP), and AI/LLM systems. Our reports focus on impact, exploit chains, and clear remediation, helping startups and growing teams fix what actually matters before attackers do.

NullStrike Security: Manual penetration testing for cloud, web, APIs & AI systems
Sujal Meghwal left a comment
I built NullStrike Security after seeing too many teams rely only on automated scans and compliance checklists while real, exploitable vulnerabilities went unnoticed. Our focus is simple: manual, attacker-style testing across web, APIs, cloud, and AI/LLM systems, with reports that show real impact and clear fixes. I’d love feedback from founders, engineers, and security teams: what security...

