Mati Staniszewski

11mo ago

I'm the ElevenLabs CEO - what do you want to do with voice AI but can't? (AMA)

Hi Everyone!
Solving AI audio end-to-end means tackling both generation and understanding - from text-to-speech to speech-to-text and everything in between. At ElevenLabs, we're working on breakthroughs in AI audio that bridge research and real-world use.
Ask me anything about what we're building, the challenges of scaling AI speech models, and where this space is headed. Also keen to hear what you've built with ElevenLabs!

Tereza Hurtová

22d ago

API audio: occasional volume drop?

Hey @ElevenLabs! The voice quality is miles ahead of anything else we've tried. Huge fan of what you guys are building! We're actually using it in our internal tool to generate audio summaries of specific Spotify topics. It works like magic, but we've noticed one tiny hiccup: the audio occasionally fades out or loses volume mid-sentence. Has anyone else experienced this, or is there a specific API parameter we should look into to keep it consistent? Keep up the great work, looking forward to seeing where you take the product next!

Zac Zuo

4mo ago

ElevenLabs UI - Open-source components for AI audio & voice agents

ElevenLabs UI is an open-source component library built on shadcn/ui to help you build AI audio and voice agent experiences faster. It provides pre-built, customizable components for voice chat, transcription, and more, all under an MIT license.

Realistic audio with expanded emotional range

I'm trying to create realistic audio to support training scenarios for frontline staff in homeless shelters and housing who work with clients. The challenge is finding realistic voices that have a wide range of emotional affect. We are hoping to find a generative approach to developing multiple voices rather than creating voices with actors or ourselves. We've tried v3 Voice Design, which improves on monotone generated voices, but not by much. We want voices that go from soft whispers to screaming and everything in between. Perhaps I'm not very good at prompting, but I've made various attempts. Again, we're trying to do this without needing to record every voice, which is not sustainable for our approach. Any recommendations? Thanks!

Zac Zuo

3mo ago

ElevenLabs Image & Video - The best audio, image & video models now in one platform

ElevenLabs now has image and video generation. Generate visuals with top models like Sora, Veo, and Kling, then export to the Studio to add high-quality voiceovers, music, AI sound effects, and captions. It's a unified creative platform.

Ankit Sharma

6mo ago

Eleven Music - The highest quality AI music model

Generate original, royalty-free songs with our AI music generator. Turn simple text prompts into custom music in seconds. Free song maker for musicians, producers, and content creators.

ElevenLabs Studio 3.0 - The best AI audio models in one powerful editor

Create, edit, and publish with AI. Add voiceovers, music, and sound effects, clean audio, and sync everything in one seamless editor.

Ankit Sharma

3mo ago

Scribe v2 Realtime - The most accurate real-time Speech to Text model.

Built for voice agents, meeting notetakers, and live applications, it transcribes in 150ms across 90+ languages, including English, French, German, Italian, Spanish, Portuguese, Hindi, and Japanese.

Ankit Sharma

6mo ago

Eleven Music API - First Music API trained on licensed data, commercial-ready

You can now integrate the highest quality AI music into your products and workflows. Since launch, creators have generated over 750k songs with Eleven Music.

Ankit Sharma

1yr ago

Voice Isolator by ElevenLabs - Free voice isolator and background noise remover

Remove unwanted background noise and extract crystal clear dialogue from any audio to make your next podcast, interview, or film sound like it was recorded in the studio.