Fish Audio

Expressive Voice Cloning and Text-to-Speech

Rating: 4.5 (6 reviews)

882 followers

Fish Audio is a text-to-speech model built for expressive, emotionally rich speech. It generates lifelike voices that capture emotion, rhythm, and nuance. Fish Audio Voice Clone recreates a natural-sounding voice from just 10 seconds of audio, preserving accent, tone, and speaking habits. It is built by the open-source team behind So-VITS-SVC and Bert-VITS2, with the aim of giving a soul to every voice.

Launch date
Fish Audio S1
Expressive Voice Cloning and Text-to-Speech

Launched on October 20th, 2025

Fish Speech 1.4
Open-Source Multilingual Text-to-Speech with Voice Cloning

Launched on September 11th, 2024

Fish Speech
Few-shot voice cloning and text-to-speech

Launched on July 18th, 2024