ReskLogits is a logits processor that implements a shadow ban

by•8mo ago

ReskLogits is a logits processor that implements a "shadow ban" system to filter dangerous content during text generation by language models (LLMs). - Resk-Security/resk-logits

Replies

Maker

🛡️ 800 downloads in just 3 days: the LLM security market has spoken. Meet ReskLogits, a library that rethinks content moderation for generative AI.

Say goodbye to abrupt hard blocks and hello to the invisible shadow ban. ReskLogits is a logits processor that doesn't ban dangerous tokens outright; it subtly penalizes them during generation.

How does it work? We use a vectorized Aho-Corasick algorithm (GPU/CPU compatible) to detect harmful patterns in real time.

The result? A smooth user experience (the shadow ban is invisible 👻) and stronger security (increased jailbreak resistance thanks to stateful detection).

👉 Your LLM can now avoid generating harmful content without ever breaking the conversation flow.
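To make the idea concrete, here is a minimal pure-Python sketch of a stateful shadow ban built on an Aho-Corasick automaton over token IDs. It is an illustration only and not the ReskLogits API: the names `TokenAhoCorasick`, `ShadowBanProcessor`, `observe` and `penalty` are hypothetical, the banned patterns are made-up token-ID sequences, and the loop over the vocabulary stands in for the vectorized GPU/CPU version described above.

```python
from collections import deque


class TokenAhoCorasick:
    """Aho-Corasick automaton over token-ID sequences (hypothetical helper)."""

    def __init__(self, patterns):
        self.goto = [{}]          # goto[state][token] -> next state
        self.fail = [0]           # failure link of each state
        self.terminal = [False]   # True if a banned pattern ends in this state
        # Build the trie of banned token sequences.
        for pattern in patterns:
            state = 0
            for tok in pattern:
                if tok not in self.goto[state]:
                    self.goto.append({})
                    self.fail.append(0)
                    self.terminal.append(False)
                    self.goto[state][tok] = len(self.goto) - 1
                state = self.goto[state][tok]
            self.terminal[state] = True
        # Breadth-first pass to compute failure links.
        queue = deque(self.goto[0].values())
        while queue:
            state = queue.popleft()
            for tok, nxt in self.goto[state].items():
                f = self.fail[state]
                while f and tok not in self.goto[f]:
                    f = self.fail[f]
                self.fail[nxt] = self.goto[f].get(tok, 0)
                # A state also matches if its longest proper suffix matches.
                self.terminal[nxt] = self.terminal[nxt] or self.terminal[self.fail[nxt]]
                queue.append(nxt)

    def step(self, state, tok):
        """Return the state reached after reading `tok` from `state`."""
        while state and tok not in self.goto[state]:
            state = self.fail[state]
        return self.goto[state].get(tok, 0)


class ShadowBanProcessor:
    """Soft-penalizes tokens that would complete a banned pattern (sketch)."""

    def __init__(self, patterns, penalty=5.0):
        self.automaton = TokenAhoCorasick(patterns)
        self.penalty = penalty    # assumed value; subtracted from the logit
        self.state = 0            # automaton state kept across generation steps

    def observe(self, tok):
        """Feed the token that was actually generated (stateful detection)."""
        self.state = self.automaton.step(self.state, tok)

    def __call__(self, logits):
        """Return a copy of `logits` with risky tokens pushed down, not removed."""
        out = list(logits)
        for tok in range(len(out)):
            if self.automaton.terminal[self.automaton.step(self.state, tok)]:
                out[tok] -= self.penalty
        return out


# Usage with made-up token IDs: ban the sequence 1, 2, 3 and the single token 4.
processor = ShadowBanProcessor([[1, 2, 3], [4]], penalty=5.0)
before = processor([0.0] * 6)     # only token 4 is penalized so far
processor.observe(1)
processor.observe(2)
after = processor([0.0] * 6)      # token 3 would now complete 1, 2, 3
```

Because the penalty is subtracted instead of setting the logit to negative infinity, a penalized token remains possible but unlikely, which is the difference between a shadow ban and a hard block. Keeping the automaton state between steps is what lets a pattern be caught even when it is spread across several generated tokens.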

8mo ago