GitHub - ReskLogits is a logits processor that implements a shadow ban
ReskLogits is a logits processor that implements a "shadow ban" system to filter dangerous content during text generation by language models (LLMs). - Resk-Security/resk-logits
800 downloads in just 3 days: The LLM security market has spoken. Meet ReskLogits, the library redefining content moderation for Generative AI!
Say goodbye to abrupt Hard Blocks. Hello to the invisible Shadow Ban.
ReskLogits is an innovative logits processor that doesn't outright ban, but subtly penalizes dangerous tokens.
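To make the difference between a hard block and a soft penalty concrete, here is a minimal sketch in plain Python. It is not the ReskLogits API: the function names `hard_block` and `shadow_penalty`, the `penalty` value, and the toy logits are all assumptions chosen for illustration.

```python
import math

def softmax(logits):
    """Convert raw logits into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def hard_block(logits, flagged_ids):
    """Hard block: flagged tokens become impossible (probability 0)."""
    return [float("-inf") if i in flagged_ids else x
            for i, x in enumerate(logits)]

def shadow_penalty(logits, flagged_ids, penalty=5.0):
    """Soft penalty: flagged tokens stay possible but become much less likely.

    `penalty` is a hypothetical tuning knob, not a documented ReskLogits value.
    """
    return [x - penalty if i in flagged_ids else x
            for i, x in enumerate(logits)]

# Toy vocabulary of four tokens; token 3 is assumed to be flagged as dangerous.
logits = [2.0, 1.0, 0.5, 3.0]
flagged = {3}

print(softmax(logits)[3])                           # original probability
print(softmax(hard_block(logits, flagged))[3])      # exactly zero
print(softmax(shadow_penalty(logits, flagged))[3])  # small, but not zero
```

With the soft penalty, the model simply drifts toward its other candidates instead of hitting a wall, which is the behavior the "shadow ban" name suggests.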
How does it work? We use a vectorized Aho-Corasick algorithm (GPU/CPU compatible) to detect harmful patterns in real time.
The result? A smooth user experience (the "Shadow Ban" is invisible) and maximum security (increased jailbreak resistance thanks to stateful detection).
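For readers curious what stateful Aho-Corasick detection over token IDs might look like, here is a rough single-threaded sketch. It is not the vectorized GPU implementation in ReskLogits: the class `TokenAhoCorasick`, its method names, and the example token IDs are all hypothetical. The idea is that the automaton state is carried from one generation step to the next, so a banned sequence is caught even when it is spread over several steps.

```python
from collections import deque

class TokenAhoCorasick:
    """Aho-Corasick automaton over token-ID sequences (illustrative only)."""

    def __init__(self, patterns):
        self.goto = [{}]     # per-state transitions: token id -> next state
        self.fail = [0]      # failure link for each state
        self.out = [False]   # True if a banned pattern ends at this state
        for pattern in patterns:
            self._insert(pattern)
        self._build_failure_links()

    def _insert(self, pattern):
        state = 0
        for tok in pattern:
            if tok not in self.goto[state]:
                self.goto.append({})
                self.fail.append(0)
                self.out.append(False)
                self.goto[state][tok] = len(self.goto) - 1
            state = self.goto[state][tok]
        self.out[state] = True

    def _build_failure_links(self):
        # Breadth-first traversal; depth-1 states keep failure link 0.
        queue = deque(self.goto[0].values())
        while queue:
            state = queue.popleft()
            for tok, nxt in self.goto[state].items():
                f = self.fail[state]
                while f and tok not in self.goto[f]:
                    f = self.fail[f]
                self.fail[nxt] = self.goto[f].get(tok, 0)
                # A state also matches if its longest proper suffix matches.
                self.out[nxt] = self.out[nxt] or self.out[self.fail[nxt]]
                queue.append(nxt)

    def step(self, state, tok):
        """Advance the automaton by one generated token."""
        while state and tok not in self.goto[state]:
            state = self.fail[state]
        return self.goto[state].get(tok, 0)

    def risky_next_tokens(self, state):
        """Token IDs that would complete a banned pattern from `state`."""
        risky = set()
        s = state
        while True:
            for tok, nxt in self.goto[s].items():
                if self.out[nxt]:
                    risky.add(tok)
            if s == 0:
                break
            s = self.fail[s]
        return risky

# Hypothetical banned token-ID sequences.
ac = TokenAhoCorasick([(5, 7, 9), (7, 3)])
state = 0
for tok in [1, 5, 7]:          # tokens generated so far
    state = ac.step(state, tok)
print(ac.risky_next_tokens(state))  # tokens to penalize at the next step
```

In a real logits processor, the set returned by `risky_next_tokens` would be the tokens whose logits get lowered at the next step, rather than being masked outright.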
Your LLM can now avoid generating harmful content without ever breaking the conversation flow.