Gurutva Murdia

Duplex - Multiplexed LLM Infrence Engine

by
Duplex is an advanced inference engine allowing developers to multiplex real-time prompt tests across local node weights (Ollama, LM Studio) and frontier cloud models simultaneously. It features a decentralized paradigm that allows users to route and compare multiple artificial intelligence models simultaneously in real time. Focused towards extreme user first privacy and engeneering first implementation

Add a comment

Replies

Best
Gurutva Murdia
"Hi all! Excited to launch Duplex today. It’s a tool designed to help you Infrence with LLM's without the usual friction. It’s been a fun journey going all solo building this project, and I can't wait to hear your takes / feedback on it. What do you think?"