All activity
Stephen Jonesleft a comment
I'm sure matching transcription text to audio is a hard problem to solve but it seems like the quality of the results is highly dependent on the accuracy of the _large_ number of face poses provided by the user. In other words, your tool could perform perfectly but if the user-provided face poses are bad, the results will be invariably bad. Suggestion: It would be better if you provided...

UNOMI 3D Lip SyncAutomated 3D lip syncing software for animators & gamers
