
What's great
Creating with Text to Speech software, even with the best AI tools is generally an iterative process, trying out voices, editing scripts for pacing and pronunciation, etc., especially if you want multiple speakers or an audiobook with multiple characters. Vois supports up to ten speakers/characters with automatic recognition when importing scripts. Very few TTS systems do that at present, combine that with it’s killer feature, it runs locally on your computer and does not operate on a token or time basis, just a very reasonable monthly or annual fixed cost. So however many interactions, generations or how much text you throw at it the cost is the same. This is new software, with a responsive developer who is actively supporting users and with considerable plans to build on a strong foundation. For an annual price close to the monthly cost of its competitors this is well worth trying, and the free trial works with unlimited text and 10 generations a day, just no export.
What needs improvement
Speed of generation on Windows system needs GPU acceleration and does not yet compare with performance on Apple systems. Although there are a wide range of languages and styles there is room for more, and for customization
vs Alternatives
A responsive developer who is actively supporting users, and with considerable plans to build on a strong foundation. For the annual price close to the monthly cost of its competitors this is well worth considering.
Does on-device TTS truly run fully offline?
Yes
Any issues with license or usage for commercial projects?
No.
Does it handle very long books without crashes?
I have yet to try that.


ElevenLabs
NotevibesLM