Video SDK
Your complete platform for real-time communication
4.7•12 reviews•3K followers
VideoSDK provides developer tools and low-latency infrastructure
to build, scale, and secure immersive live audio/video + AI communication.
This is the 5th launch from Video SDK.
AI Voice Agent SDK
Real-time Voice AI Agents
We are open-sourcing the easiest way for developers to build real-time Voice Agents and Virtual Avatars into any app—telephony, web, mobile, robotics, wearables, and beyond.





Free
Launch Team

Video SDK
👋 Hey Product Hunt, I’m Arjun, co-founder of VideoSDK.
I'm beyond excited to launch our Open-Source AI Voice Agent SDK.
Today, voice is becoming the new UI. We expect agents to understand us, respond instantly, and work seamlessly across web, mobile, and even telephony. But to achieve this, developers have to stitch together STT, LLM, and TTS, glued with HTTP endpoints and a prayer.
This most often results in agents that sound robotic, hallucinate, and fail in production environments without observability.
So we built something to solve that: end-to-end infrastructure to build, deploy, and monitor your AI Voice Agents.
Here’s what it offers:
Global WebRTC infra with <80ms latency
Native turn detection, VAD, and noise suppression
Modular pipelines for STT, LLM, TTS, avatars, and real-time model switching
Built-in RAG + memory for grounding and hallucination resistance
SDKs for web, mobile, Unity, IoT, and telephony — no glue code needed
Agent Cloud to scale infinitely with one-click deployments — or self-host with full control
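To make the "modular pipelines" and "real-time model switching" items above concrete, here is a minimal sketch of a cascading STT, LLM, TTS chain with swappable stages. It is not the VideoSDK API: `CascadingPipeline`, `handle_turn`, and the stand-in provider classes are hypothetical names chosen for illustration, and the providers are toy stubs so the example runs without network access.

```python
from dataclasses import dataclass
from typing import Protocol


class SpeechToText(Protocol):
    def transcribe(self, audio: bytes) -> str: ...


class LanguageModel(Protocol):
    def reply(self, prompt: str) -> str: ...


class TextToSpeech(Protocol):
    def synthesize(self, text: str) -> bytes: ...


@dataclass
class CascadingPipeline:
    """Hypothetical STT -> LLM -> TTS chain (illustrative, not the SDK's API)."""
    stt: SpeechToText
    llm: LanguageModel
    tts: TextToSpeech

    def handle_turn(self, audio: bytes) -> bytes:
        text = self.stt.transcribe(audio)    # speech in -> text
        answer = self.llm.reply(text)        # text -> response text
        return self.tts.synthesize(answer)   # response text -> speech out


# Stand-in providers (assumptions): real ones would call external services.
class EchoSTT:
    def transcribe(self, audio: bytes) -> str:
        return audio.decode("utf-8")


class UppercaseLLM:
    def reply(self, prompt: str) -> str:
        return prompt.upper()


class ReverseLLM:
    def reply(self, prompt: str) -> str:
        return prompt[::-1]


class BytesTTS:
    def synthesize(self, text: str) -> bytes:
        return text.encode("utf-8")


pipeline = CascadingPipeline(stt=EchoSTT(), llm=UppercaseLLM(), tts=BytesTTS())
first = pipeline.handle_turn(b"hello")

# Swap the model at runtime without touching the other stages.
pipeline.llm = ReverseLLM()
second = pipeline.handle_turn(b"hello")
```

Because each stage only has to satisfy a small interface, replacing one provider leaves the rest of the chain unchanged, which is the property a provider-agnostic pipeline relies on.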
Think of it like moving from a walkie-talkie to a modern cell tower that handles thousands of calls.
VideoSDK gives you the infrastructure to build voice agents that actually work in the real world, at scale.
I'd love your thoughts and questions! Happy to dive deep into architecture, use cases, or crazy edge cases you've been struggling with.
@arjun_kava1 Design is so sleek and user-friendly!
Video SDK
@1mirul Thanks a lot for your kind words.
Sellible
@arjun_kava1 very cool! Looking forward to trying out the sdk soon.
Video SDK
@preetraj Thanks a lot, Preet for your kind words.
This looks promising; it makes your main product a full-stack video and audio framework for building agents.
Congrats on the launch, team!
Video SDK
@theanimeshs Thanks so much! Really appreciate the support! 🙌
SyncSignature
Another amazing launch! Let's go team @arjun_kava1 @sagar_kava
Video SDK
@sagar_kava @neelptl2602 Thanks a lot for your support.
Kombai
Congrats on the launch, @arjun_kava1.
Video SDK
@sourabh_upreti Thanks so much! Appreciate the support 🙌
YouMind
Congrats on the launch! 🎉 @Video SDK This looks like a game-changer for voice AI development. The <80ms latency with global WebRTC infrastructure sounds impressive. Quick question - how does your native turn detection handle overlapping speech or interruptions? That's always been a challenge with voice agents. Also curious about the pricing model for the Agent Cloud vs self-hosting options!
Video SDK
@nicoleastor Thanks! 🙌 Super glad you found it interesting. Turn detection handles overlaps smartly in real-time — curious, are you exploring voice agents for a specific use case?
Video SDK
Congrats on the launch, team!!! 🥳
Introducing Voice Agent SDK: an open-source framework to build real-time voice agents that actually work in production.
Built on VideoSDK, it empowers agents to join meetings, listen, speak, and think — all with under 80ms latency.
The cascading pipeline supports STT, LLM, TTS, VAD, and Turn Detection — fully provider-agnostic.
With A2A and MCP, you get multi-agent collaboration and seamless integration with external tools and services.
We can’t wait to see what the community builds with Voice Agent SDK — go create something amazing!
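As a rough illustration of what the VAD and turn-detection stages mentioned above do, here is a simple energy-threshold sketch. It is an assumption-laden stand-in, not the SDK's detector: production systems typically use trained models, and the function names, the `energy_threshold` value, and the `silence_frames` count are all hypothetical.

```python
import math
from typing import Iterable, List


def rms(frame: List[float]) -> float:
    """Root-mean-square energy of one audio frame (samples in [-1, 1])."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def detect_turn_end(frames: Iterable[List[float]],
                    energy_threshold: float = 0.1,
                    silence_frames: int = 3) -> int:
    """Return the frame index at which the speaker's turn is judged over, or -1.

    The turn ends once speech has been heard and is followed by
    `silence_frames` consecutive frames below `energy_threshold`.
    """
    heard_speech = False
    quiet_run = 0
    for index, frame in enumerate(frames):
        if rms(frame) >= energy_threshold:
            # Voice activity: remember it and reset the silence counter.
            heard_speech = True
            quiet_run = 0
        elif heard_speech:
            quiet_run += 1
            if quiet_run >= silence_frames:
                return index
    return -1


loud = [0.5, -0.5, 0.5, -0.5]
quiet = [0.01, -0.01, 0.01, -0.01]

# A short pause (one quiet frame) should not end the turn; three in a row should.
stream = [quiet, loud, loud, quiet, loud, quiet, quiet, quiet, loud]
turn_end = detect_turn_end(stream)
```

The silence counter resets whenever speech resumes, so brief pauses inside a sentence do not hand the turn to the agent prematurely.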
Video SDK
@deep_bhupatkar Thanks so much! Really appreciate the support! 🙌
Video SDK's AI Voice Agent SDK, with its low-latency infrastructure and modular pipelines, is a great help for developers building real-time voice applications! For developers who want to bring their own AI models, does the SDK support easy integration of custom models?