Qwen3

Think Deeper or Act Faster


Qwen3 is the large language model series developed by the Qwen team at Alibaba Cloud. - QwenLM/Qwen3
This is the 12th launch from Qwen3.

Qwen3-TTS

Launching today
Voice design, cloning & 97ms streaming
A family of SOTA speech models (0.6B & 1.7B) supporting 10 languages. Features prompt-based Voice Design, 3s zero-shot cloning, and extremely low-latency streaming.
Free

Zac Zuo

Hi everyone!

The Qwen team just dropped what might be the most comprehensive open-source TTS release we have seen. Qwen3-TTS combines three things that are usually mutually exclusive: SOTA quality, extreme speed, and creative control.

The "Voice Design" feature is really robust—just describing the persona (e.g., "sad old man") works surprisingly well.

Technically, the efficiency is wild. They use a 12Hz tokenizer to compress speech without losing detail, bringing the latency down to just 97ms 🤯
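
For a rough sense of what a 12Hz tokenizer implies, here is a minimal back-of-the-envelope sketch. It assumes "12Hz" means 12 token frames per second of audio. The function names are hypothetical and are not part of any Qwen3-TTS API, and the sketch says nothing about how the 97ms figure itself is measured.

```python
import math


def frame_duration_ms(frame_rate_hz: float) -> float:
    """Milliseconds of audio covered by one token frame (assumed meaning of the rate)."""
    return 1000.0 / frame_rate_hz


def frames_for_audio(seconds: float, frame_rate_hz: float = 12.0) -> int:
    """Number of token frames needed to cover `seconds` of audio, rounded up."""
    return math.ceil(seconds * frame_rate_hz)


if __name__ == "__main__":
    # One frame at 12 Hz spans roughly 83.3 ms of audio.
    print(f"{frame_duration_ms(12.0):.1f} ms per frame")
    # A 3 s reference clip (the cloning length mentioned on this page) is 36 frames.
    print(frames_for_audio(3.0), "frames for 3 s")
```

Under that assumption, each frame covers about 83 ms of speech, so a low frame rate keeps the token sequence short for a given clip.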

Open source TTS just raised the bar again. If you are building anything with voice, you might wanna check this out.

Demo Here.

Eugene Chernyak

97ms latency? That's faster than I can decide what to have for lunch! This is a massive win for the open-source community. The voice design sounds like a dream for creators who are tired of hearing the same 3 robotic voices everywhere. Can't wait to try describing a caffeinated marketing manager on a Monday morning. That would be my perfect persona :D Congrats on the launch!

Jim Engine

Okay, but which languages? Why not show the 10 languages more prominently?