
Microsoft calls VALL-E a "neural codec language model" that generates speech from text input and a short audio sample of a target speaker. According to Microsoft, it can mimic a speaker's voice from a sample as short as 3 seconds. VALL-E is not yet generally available.
from Gadgets News – Latest Technology News, Mobile News & Updates https://ift.tt/JIV1Kp4
https://ift.tt/sWq5jCD