Alan Cowen's profile on Product Hunt

All activity

17d ago

TADA (Text-Acoustic Dual Alignment) is Hume AI's open-source speech-language model that synchronizes text and audio one-to-one. TADA synchronizes text and speech into a single continuous stream via 1:1 token alignment. Generating audio at 5x the speed of conventional LLM-based TTS systems completely eliminates skipped words and content hallucinations across 1000+ tests.

TADA1:1 text-acoustic alignment for 5x faster speech generation

Alan CowenlaunchedOctave 2 by Hume AI

6mo ago

Introducing Octave 2. What’s new: - Fluent in 11+ languages - 40% faster (<200ms latency⁠⁠) & 50% cheaper than Octave 1 - Multi-speaker conversation - More reliable pronunciation - New voice conversion & phoneme editing capabilities

Octave 2 by Hume AIThe next-generation multilingual text-to-speech model

Alan CowenlaunchedEVI 3: Understand and generate any voice

10mo ago

Hume AI's EVI 3 is a new speech-language model for highly expressive, realistic & emotionally intelligent voice AI. Generates any voice/personality from prompts. Outperforms GPT-4o in empathy & naturalness.

EVI 3: Understand and generate any voiceHume AI's new voice that truly understands emotion

Alan Cowenleft a comment

1yr ago

Hey Product Hunt! I’m Alan Cowen, CEO and Chief Scientist at Hume AI. We're launching Octave, the first of a new generation of text-to-speech models. Traditional TTS models focus on the mechanical process of turning letters into sounds. Octave isn't a traditional TTS model, but a voice-enabled LLM, trained on 1000x more language. As a result, it understands the cognitive and emotional aspects...

Octave TTSDescribe any AI voice and prompt its emotional delivery

Alan CowenlaunchedOctave TTS

1yr ago

The first LLM for text-to-speech. While other TTS just “reads” words, Octave grasps their meaning. Create any AI voice with a descriptive prompt, guide its emotional delivery (angrier! more sarcasm!), and bring your stories to life with human-like expression.

Octave TTSDescribe any AI voice and prompt its emotional delivery