Launching today
MiMo-V2.5 Voice

Bilingual ASR for dialects, code-switching, and songs

5.0
1 review

63 followers

MiMo-V2.5-ASR is an 8B-parameter open-source speech recognition model from Xiaomi that transcribes Mandarin, English, eight Chinese dialects, code-switched speech, and song lyrics. It is built for ML engineers, researchers, and developers working on real-world voice applications.
Free