Azure TTS Synthesized Characters Accuracy Issue

kong qiang 0 Reputation points
2025-02-09T08:11:39.0733333+00:00

When synthesizing SSML, for example, reading the following content:

xml

复制

<

运行 HTML

The backend counts 4 characters. However, if additional XML tags are added, such as:

xml

复制

<

运行 HTML

or using `

Azure Translator
Azure Translator
An Azure service to easily conduct machine translation with a simple REST API call.
445 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Saideep Anchuri 2,110 Reputation points Microsoft Vendor
    2025-02-11T08:48:58.35+00:00

    Hi kong qiang

    Welcome to Microsoft Q&A Forum, thank you for posting your query here!

    When synthesizing SSML, the character count can vary based on the inclusion of special characters and XML tags. For instance, the character < must be represented as < in SSML for proper parsing. Adding XML tags or special characters that need escaping will increase the total character count. Therefore, the backend may count raw characters differently than the processed SSML due to these conversions, leading to discrepancies in the character count.

    Kindly refer below link: special-characters

    Thank You.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.