Fish Audio

Fish Audio

Expressive Voice Cloning and Text-to-Speech

4.5
•6 reviews•

883 followers

Fish Audio is the most expressive and emotionally rich text-to-speech model. It generates lifelike voices that capture emotion, rhythm, and nuance with remarkable realism. Fish Audio Voice Clone recreates a natural voice from just 10 seconds of audio—preserving accent, tone, and speaking habits. Proudly built by the open-source team behind So-VITS-SVC and Bert-VITS2, giving a soul to every voice.

Products used by Fish Audio

Explore the tech stack and tools that power Fish Audio. See what products Fish Audio uses for development, design, marketing, analytics, and more.