Zhizhuo Zhou

Fish Audio
CS PhD @ Stanford
238 points
All activity
We've open-sourced Fish Audio S2, a new generation of expressive TTS that lets you direct voices with natural language. Add cues like [whisper] or [laughing nervously], generate multi-speaker dialogue in one pass, and create scary-real voices across 80+ languages.
Fish Audio S2: Real Expressive AI Voices
Zhizhuo Zhou left a comment
Wow this is really full stack!
Atoms: Turn your ideas into products that sell
Fish Audio S1 is the most expressive and emotionally rich TTS model—creating lifelike voices that capture emotion, rhythm, and nuance. Clone any voice in 10 seconds, preserving accent, tone, and speaking habits with unmatched realism.
Fish Audio S1: Expressive Voice Cloning and Text-to-Speech
Zhizhuo Zhou left a comment
The UI is crazy good
Integrity: Unified project brain: docs, canvases, and AI chats together
Zhizhuo Zhou left a comment
this looks lit
My Financé: The dashboard of your finances