Launching today

KugelAudio
Real-time text-to-speech model you can self-host
32 followers
Real-time text-to-speech model you can self-host
32 followers
Most natural real-time TTS with voice cloning and sub-60ms latency, on-prem or via API. Grammar-aware normalization reads phone numbers, IBANs, addresses, and medications naturally across 25+ languages, with word-level timestamps and IPA support. Adapters for LiveKit, Pipecat, and Vapi. Built by 4 in Berlin.





KugelAudio
Congratulations on the launch guys! Few questions:
1. Websites list languages like Hindi to be supported but can not find any voice related to that on playground.
2. I checked the websocket streaming API for TTS.. is it possible to have multi context support (just like elevenlabs) in the streaming API? is that part of plan in future?
Cheers!
Pushary
Sensational for German TTS
Congrats on the launch guys!