VALL-E - AI that can mimic a person's voice with just a 3-second sample

by

Raycast

•3yr ago

VALL-E shows emergent in-context learning capabilities and can synthesize high-quality personalized speech from only a 3-second sample. It also preserves the speaker's emotion and acoustic environment in the synthesized speech.

Replies

Raycast

Hunter

Now anyone can have James Earl Jones (or anyone else for that matter!) voice their radio ads. Also relevant given Apple Books's just-announced Digital Narration audiobooks.

3yr ago

@chrismessina how does one try it? I followed the links... nothing.

3yr ago

Raycast

Hunter

@mefranco it's not publicly available to test yet, but if you follow the links, you'll see a bunch of audio file tests that show what it's capable of.

3yr ago

@chrismessina Impressive … most impressive.

3yr ago

Should give it a try

3yr ago

SocialBu

The product is great and the launch is successful.

3yr ago

Cool...

3yr ago

Congrats on the launch. We need more tools like these to enhance our creativity. Keep going, guys!

3yr ago

Evoke

Wow! Only a 3-second sample. I'd rather see some examples in the images than how it works, though. Congrats on your launch!

3yr ago

Raycast

Hunter

@richard_gao2 did you click thru to any of the links?

3yr ago

Evoke

@chrismessina Yep. I was mainly referring to the PH post, not the website. My bad, should have specified.

3yr ago

What an innovative idea! Congratulations on a job well done!

3yr ago

Ansy.ai

How do I try it?

3yr ago

Zappi Ad Predictor

Frankly terrifying - but incredible nonetheless 😅 Awesome launch, makers!

3yr ago

Nice!

3yr ago