A TTS model should give you two things: an oscar-worthy performance and a verifiable signature to prove it's yours. DramaBox is the first to do both. Describe a scene the way you would to an actor, like 'a talk show host gasps in mock shock, bursts into laughter,' and the model interprets it as performance. Every output is watermarked with Resemble Watermarker. Open source, English-only for now, find it in your Resemble account or on Hugging Face.
Chatterbox Turbo is a 350M parameter open-source TTS model. It features paralinguistic tags (control laughs, sighs, etc.), zero-shot cloning, and runs 6x faster than real-time. Uniquely includes built-in PerTh watermarking for safety.
Resemble Clone can create the audio track for an immersive VR experience, an animated film, generate an entire audiobook, or power the voice of an Alexa Skill. Resemble AI solves the scalability problems that creatives face when creating speech content.
Resemble AI’s new built-in integration with GPT-3 can generate realistic ad copy and conversations instantly. You can use the existing library of professional voices, or create your own custom AI voice, to generate high-quality voiceovers and dialogues.
Edit revolutionizes audio editing. Simply upload your audio, edit the auto-generated transcript, and our AI creates new audio matching your changes. Perfect for podcasters, content creators, and businesses looking to streamline their audio production process.
Resemble’s Unity plugin extends Resemble Clone; a product that allows users to clone their voice with a few mins of data. Developers can add content through the GUI within Unity and tweak speech style and emotion by applying various emotions to the text.