Qwen3

Qwen3

Think Deeper or Act Faster

5.0
β€’10 reviewsβ€’

1.4K followers

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. - QwenLM/Qwen3
This is the 12th launch from Qwen3. View more

Qwen3-TTS

Launched this week
Voice design, cloning & 97ms streaming
A family of SOTA speech models (0.6B & 1.7B) supporting 10 languages. Features prompt-based Voice Design, 3s zero-shot cloning, and extreme low-latency streaming.
Qwen3-TTS gallery image
Qwen3-TTS gallery image
Qwen3-TTS gallery image
Qwen3-TTS gallery image
Qwen3-TTS gallery image
Free
Launch tags:Open Sourceβ€’Artificial Intelligenceβ€’Audio
Launch Team
Famulor AI
Famulor AI
One agent, all channels: phone, web & WhatsApp AI
Promoted

What do you think? …

Zac Zuo

Hi everyone!

The Qwen team just dropped what might be the most comprehensive open-source TTS release we have seen. Qwen3-TTS combines three things that are usually mutually exclusive: SOTA quality, extreme speed, and creative control.

The "Voice Design" feature is really robustβ€”just describing the persona (e.g., "sad old man") works surprisingly well.

Technically, the efficiency is wild. They use a 12Hz tokenizer to compress speech without losing detail, bringing the latency down to just 97ms 🀯

Open source TTS just raised the bar again. If you are building anything with voice, you might wanna check this out.

Demo Here.

Eugene Chernyak

97ms latency thats faster than I can decide what to have for lunch! This is a massive win for the open-source community. The voice design sounds like a dream for creators who are tired of hearing the same 3 robotic voices everywhere. Can’t wait to try describing a caffeinated marketing manager on a Monday morning - that would be my perfect persona:D Congrats on the launch!

Jim Engine

Okay but which languages? Why not show the 10 languages more obvious