Hello Ana Laura Rocha Ciappesoni,
Welcome to the Microsoft Q&A and thank you for posting your questions here.
I understand that your AI text-to-speech is misreading a word in Catalan (tomàquets) but it reads perfectly its singular form (tomàquet) and you need a solution to this.
In my first reading I thought of linguistic customizations for pronunciation accuracy, and in my second view I thought of latency based on the previous projects.
In CAT 1:
- Check if the TTS service in use supports SSML, IPA input, or custom dictionary entries. For Microsoft Azure TTS, for instance, SSML can be utilized to fine-tune specific words, you can achieve this:
<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="ca-ES"> <voice name="ca-ES-JoanNeural"> Tomàquet <phoneme alphabet="ipa" ph="toˈmakəts"/> tomàquets. </voice> </speak>
- If possible, use the phoneme tag to force correct pronunciation for problematic words, you might need to employ a service of linguistic expert if there is a need.
- For Azure or other advanced TTS services, you can often report a pronunciation issue through their support or feedback portal.
- Before switching to other services like Narakeet or LOVO AI, ensure your primary use case isn't hindered by changing providers. If alternate services handle "tomàquets" better, use them selectively for specific content pipelines.
in CAT 2:
- If latency is due to network distance, using a TTS endpoint in a region geographically closer to your users can help. For example, the UK South region endpoint is
https://uksouth.tts.speech.microsoft.com/cognitiveservices/v1
. Make sure your SDK is configured to use this endpoint if you're based in or near the UK. - Use concise SSML inputs and minimize long pauses or complex tags that could slow down processing, this will provide faster TTS processing.
- Latency can also stem from unstable internet connections, try to check that your application runs on a stable, high-speed network.
- If your service serves global users, integrating Azure Front Door or a Content Delivery Network (CDN) could reduce latency by optimizing request routing and providing caching where possible.
I hope this is helpful! Do not hesitate to let me know if you have any other questions.
Please don't forget to close up the thread here by upvoting and accept it as an answer if it is helpful.