Launching today
Vaani
Lip-synced AI dubbing for creators, brands and studios
423 followers
Lip-synced AI dubbing for creators, brands and studios
423 followers
Vaani is a voice-preserving AI dubbing tool to help you dub in 40+ languages, in one go, at a fraction of the cost of a traditional dub session. Where other tools give you a generic AI voice and lips that drift off-beat, Vaani clones your voice, preserves your music, and holds the meaning across languages, with frame-accurate lip sync. Built for anyone creating videos, from creators and brands to media companies, OTTs and studios.
Interactive








Free Options
Launch Team / Built With



Vaani
Hey Product Hunt 👋
I'm Abhinav Mohan, founder of Vaani.
🎬 How Vaani started
I was working on a personal video project a while back and wanted to make it accessible in a few languages beyond English. So I went looking for a tool and was genuinely shocked at what came back.
Voices that sounded like a customer-service IVR reading off a script. Lip sync that drifted two or three frames every line. Output that felt robotic in a way audiences clock instantly, even if they can't say why.
I kept thinking someone must have solved this for serious creators by now. Turns out, not quite. Most tools optimise for speed and the lowest acceptable output, not for keeping your actual voice intact across languages.
The more creators and brand teams I talked to, the clearer it became everyone was hitting the same wall.
And the data backs it up: a well-made dub routinely outperforms the original by 2–3× in non-English markets. But bad auto-dubbing actually hurts creators in the global YouTube algorithm, watch time drops, ranking falls, and channels lose ground in markets they used to win.
The wrong tool punishes you for trying to go global. That gap is exactly what we built Vaani for.
✨ What Vaani does
Vaani captures the fingerprint of your voice, timbre, cadence, breath, the way you swallow small words, and redraws it in 40+ languages with frame-accurate lip sync. One shoot. One render. Heard everywhere.
The core loop:
• Drop a video
• Pick the languages
• Vaani traces your voice, regenerates it in each script, and re-times every mouth frame by frame
👉 Try the interactive demo
⚡ Why Vaani is radically better
• Most tools translate words. Vaani retells meaning, so jokes land and tone holds.
• Most tools overdub audio. Vaani re-times every mouth frame by frame, so lips actually match. No other tool does frame-accurate lip sync at this scale.
• Most tools give one generic voice. Vaani captures your voice fingerprint and redraws it across 40+ languages, with more Indian languages than any other dubbing tool.
• Most tools make you dub one video at a time. Glot is a node-based batch canvas, drop multiple videos, fan out to multiple languages, add lip sync, render in parallel, download a zip.
• And it does all this at a fraction of the cost of existing tools.
👥 Who it's for
• YouTubers and podcasters reactivating their back catalog in 40+ markets
• Global brands shipping one ad in every language they sell in, same budget
• OTTs and studios regionalizing long-form content without losing the lead actor
• Course creators and educators reaching global cohorts
• Filmmakers who want a lip-synced, voice-cloned dub at a fraction of traditional cost
7 free minutes at signup,. Valid the next 7 days.
👉 Try Vaani free at https://vaani.media
💬 What we're hoping for today
Honestly? Feedback. Not upvotes. Specifically:
• What kind of content you'd try Vaani on first
• What's missing, every request goes into Linear, no exceptions
• If something breaks, please yell at us in the comments 🐛
🤝 The team
Built by a small team of four out of Bangalore 🇮🇳. I'll be in the comments all day myself, but the team's reading along, drop any technical, design, or roadmap question and we'll get it answered.
📬 Reach us
• 📧 hello@vaani.media
• LinkedIn: https://www.linkedin.com/company/advant-ai
• Discord Community: https://discord.gg/nJ5Meuh6v
Huge thanks to @rohanrecommends for hunting us, the @producthunt team for the launch infrastructure, and every one of you for reading this far. ❤️
I'll be in the comments all day (and most of the night, we're in IST, please forgive typos after midnight 🌙). Ask me anything.
Abhinav Mohan
Founder, Vaani
@rohanrecommends @producthunt @abhinav_cinephile The voice-preserving approach is really interesting. A lot of AI dubbing tools get the translation done, but lose the creator's identity in the process. How are you handling cultural nuances and tone across different languages to make sure the dubbed version still feels authentic to the original content?
Vaani
@rohanrecommends @producthunt @nicole_hynek
Hi Nicole, cultural nuance is the harder half of this.
Two things we lean on. The script step is not literal translation, we try to preserve intent and emotional weight so a joke can take a different shape in another language and still land. The voice clone tries to carry your cadence and emotional register, not just timbre, so the dub holds onto how you stressed words and paused.
Where we still rely on humans. Very regional humor and specific cultural references still need a quick review pass.
Happy to dig into a specific case if you have one in mind.
Abhinav
@rohanrecommends @producthunt @abhinav_cinephile Kudos. Are users allowed to edit every line or change them before exporting their final project?
The creator-trust angle here is strong. Bad dubbing doesn’t just sound off; it can make the original creator feel less credible in a market they’re trying to enter.
One workflow detail I’d care about: can users review the meaning/tone changes separately from the lip-sync render? For brand, education, and founder-led videos, the scary failure mode isn’t only “the lips drift,” it’s “the joke, caveat, or claim survived grammatically but not contextually.”
If Vaani can show a lightweight line-by-line review before final export — original intent, localized wording, voice/tone note, then render — I think teams would trust it for much higher-stakes content than simple back-catalog translation.
Vaani
@jim_jeffers Hi Jim, this is the sharpest framing of the problem I have read all day.
The failure mode you described, joke or caveat or claim surviving grammar but losing context, is exactly the one we have been designing against. Lip drift is a known visual problem with a known fix. The semantic drift is the dangerous one because nothing in the dub looks wrong, it just lands wrong.
On your workflow question. We do have an advanced editor (currently used with B2B customers) that gives a line level review pass before the lip sync render. The spec is close to what you sketched. Reviewers can see the original, the localized version, and approve or flag changes before the expensive part of the pipeline runs.
We chose to launch the one click version first because the quality of the auto output is the headline most people need to see. But your read is the right one. Teams buying dubbing for brand or founder content care less about throughput and more about not getting it wrong, and a review layer is what makes that possible. The editor lands as a separate launch soon.
Abhinav
@abhinav_cinephile That “semantic drift” phrase is exactly it — nothing looks broken, but the creator’s actual judgment changed.
Tiny future test I’d love to see: one original line + localized line + reviewer flag where the wording is grammatically fine but the caveat/joke/claim no longer feels like the creator would stand behind it. That is the layer that would make the editor feel trustworthy for founder/brand content.
Mailwarm
Do you let users review and change individual lines before export, or is it mostly one click end to end?
Vaani
@thamibenjelloun
Hi Thami, the honest answer is nuanced.
For most creators editing line by line is not actually useful. If you are a Hindi creator dubbing into French you likely do not speak French, so the only editing you can really do is fix brand names or specific English terms anyway. So we focused on making the auto translation good enough that you do not need to touch it. Brand names and cultural nuance preservation are handled inside the dub itself.
We do have an advanced editor with full line by line control that we use with B2B customers (studios running whole catalog passes), but we chose to launch the one click version first because we wanted people to see the quality of the auto output first. The editor lands as a separate launch soon.
Abhinav
Refocus
Voice preservation is the part I keep coming back to. Most dubbing tools flatten the speaker into a generic AI voice, so holding the original timbre across 40+ languages is a real differentiator. I work on voice AI for older adults, and we've found that a familiar-sounding voice changes how much people trust and engage with it, sometimes more than raw transcription accuracy. Curious how you handle emotional prosody: does the clone carry tone and pacing across languages, or mostly the timbre? Impressive that the lip sync stays frame-accurate at that scale.
Vaani
@igorgurovich Hi Igor, the older adults trust angle resonates, it is exactly the thing we keep landing on too.
On prosody. We try to carry tone and pacing, not only timbre. The original audio is the source of expressive features and those get mapped into the target language.
Honest caveat. It ports better in some pairs than others. English to Spanish or Hindi feels close to seamless. English to Japanese or Mandarin is harder because emotion is encoded differently in pitch and tone. We map as much as we can, more a translation of prosody than a clean copy.
Thanks on the lip sync, that one took the most iterations.
Abhinav
Stripo.email
Wishing you success with the launch! Out of curiosity, which type of content do users test first: short-form videos, ads, or long-form series?
Vaani
@marianna_tymchuk Hi Marianna, thanks for the well wishes.
The pattern we have seen so far is that creators tend to start with short form to validate the quality, then move into longer pieces once they trust the output. Ads tend to come in between, the production stakes are higher but the length is still small.
On the long form side, we now handle up to 30 minutes in a single run, and longer videos can be added too. So once a creator is past the initial trust check, they can put a whole episode or feature through the pipeline.
Curious what made you ask.
Abhinav
ReplyMind
Love how Vaani focuses on keeping the creator’s identity intact instead of just translating words.
The frame-accurate lip sync + voice fingerprinting sounds like a game changer, especially for long-form content and ads.
Quick question: how does it handle humor, cultural nuances, and fast-paced dialogue?
Killing it Abhinav this has huge potential! 🔥
Vaani
Hi Moon, thanks for that. Three good questions, taken in order:
Humor. This is the hardest of the three. Universal humor (timing, irony, sarcasm in delivery) carries over because the voice clone holds onto cadence and the way you stress a punchline. Regional humor and wordplay still benefit from a human pass, it is fundamentally a translation problem more than a voice one. We treat that as something to flag, not pretend we have solved.
Cultural nuances. The translation step preserves intent and emotional weight over exact words. Brand names, regional terms, and references that do not have direct equivalents get adapted rather than literally translated.
Fast paced dialogue. The lip sync runs frame by frame so the timing stays tight even when the speaker rattles off lines. The voice clone carries the pace from the original audio, so the dub does not slow down to fit the new language, it adjusts the rhythm to stay in sync with how you actually talk.
Appreciate the kind words. More soon.
Abhinav
DIY UX Test
The lip-sync is the hard part most dubbing tools skip, so leading with it is a strong signal. Congrats on the launch. How do you handle emotion and tone carrying across languages — does the dub keep the original delivery?