Neural TTS
40 TopicsIntroducing AI-generated voices for Azure neural text to speech service
In this blog, we introduce two new voices created using the latest controllable new voice generation technology, a masculine voice named AIGenerate1 and a feminine voice named AIGenerate2, and provide a deeper view on the technology behind.12KViews4likes9CommentsNew technical research is advancing Azure’s Neural Text-to-Speech service
Our latest research innovation, code named NaturalSpeech, brings a new milestone to neural TTS achieving no significant difference with natural human recordings using side-by-side CMOS as metrics on a popular TTS dataset (LJSpeech) for the first time.7.4KViews0likes0CommentsCreating a branded AI voice that conveys emotions and speaks multiple languages
Today at Microsoft Inspire 2023, we're excited to announce the general availability (GA) of the new multi-style and multi-lingual custom neural voice (CNV) features inside Text to Speech, part of the Azure AI Speech capability.10KViews0likes0CommentsLow-resource technology updates for Azure Neural Text-to-Speech
In this blog, we summarize the latest updates on our low-resource technology, which has enabled Azure Neural TTS to expand to global languages quickly and allows speakers of under-represented languages to equally benefit from our product.6.7KViews0likes2CommentsAzure Custom Neural Voice introduces new emotional styles to support brand voices
Today we are glad to introduce the multi-style capability of Custom Neural Voice, a new feature in public preview, which enables users to create a brand or character voice that speaks with different emotions.6.9KViews1like0CommentsAzure Neural TTS previews a new contextual voice model for long-form paragraph reading
In this blog, we introduce a new technical innovation that considers contextual information to model TTS voices for paragraph or long-form content reading. This new technology significantly improves the coherence and expressiveness when generating long audios, using Paragraph MOS (Mean Opinion Score) as metrics. With this new technology, we are glad to announce the public preview of Roger, a contextual voice model in English (US), to enable customers to generate more expressive and natural-sounding long-form audio content using Azure Neural TTS service.7.2KViews2likes1Comment