Microsoft releases VALL-E which imitates human speech in just 3 seconds of audio

10/01/2023

Microsoft has recently released an artificial intelligence tool called VALL-E that imitates human speech with just 3 seconds of audio.

The tool has been trained with 60,000 hours of English speech data and uses 3-second clips of a specific voice to generate content. Unlike many current AI tools, VALL-E can replicate the emotion and tone of a speaker, even if the speaker has never spoken the words themselves.

A paper from Cornell University used VALL-E to synthesize several sounds, and you can listen to these AI-synthesized audios on GitHub.

In many cases, Vall-E outperforms current text-to-speech models, the researchers note. However, the study also writes that there are currently several problems with the AI model. For example, some words in the text hints may be unpronounced, missed entirely, or appear twice in the output. Additionally, the model currently has difficulty imitating certain voices, especially those with accents.

Like other new AI technologies, VALL-E has raised concerns about safety and ethics. Microsoft has issued an ethics statement regarding the use of VALL-E, but there is no clarity on future uses.

Currently, Microsoft Vall-E is not yet open source. Microsoft has created a Vall-E repository on GitHub, but it currently only contains a description file.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

RELATED ARTICLES

Starting from 48,900, Geely Panda Karting officially starts pre-sale

Ford: Expand charging network, fuel/ hybrid/ pure electric in parallel

Chery’s two new cars are exposed, targeting overseas markets