Random Words Detected by Azure Speech Recognizer in Silence
Hello Azure Support Team, I am currently using the Azure Speech Service to recognize speech inputs in my application. The setup of my speech recognizer is as follows: export const createSpeechRecognizer = () => { const speechRecognitionConfig =…
How to perform speech recognition to get speech input over a telephony phone call in Microsoft Azure?
Things We Have Done: Created an Azure Communication Service (ACS) instance and acquired an active phone number. Set up an event subscription to host the callback link required to interact with the purchased phone number. Deployed Azure Speech…
Speech SDK How to get proper intonation when synthesizing a partial sentance.
Hi, I'm working on reducing, to the absolute minimum possible, the round-trip latency in a voice bot project that uses an LLM and TTS . I have a number of strategies and one is send the first few words produced by the LLM for synthesis while the LLM…
TranslationRecognizer has stopped sending Synthesizing event in the past few days
My existing code (C#) has been using Microsoft.CognitiveServices.Speech SDK (1.41.1) to perform speech translation with voice synthesis successfully in the past 3 months. In the past few days, the Synthesizing event has stopped firing. (Other events…
Azure Speech Studio Andrew Multilingual voice sounds glitchy
I'm having some issues with the Andrew Multilingual (en-US-AndrewMultilingualNeural) voice in the Azure Speech Studio. There's a few instances in which the voice sounds raspy and really kind of glitchy. It seems to have a lot of trouble with the word…
Issue with Media Playback in Azure Communication Services Using Python
Context: We are building a bot using Azure Communication Services (ACS) and Azure Speech Services to handle phone calls. The bot uses text-to-speech (TTS) to play questions during calls and captures user responses. What We’ve Done: Created an ACS…
Azur language identification is not detecting language perfectly.
I am using Azure Continuous Language Identification in Azure Speech SDK with three languages: en-IN, hi-IN, mr-IN. It behaves differently, as when I say something in English, it transcribes it into Hindi or Marathi. In short, detecting the wrong…
Create a custom keyword for your virtual assistant in Speech Studio fails to process
When creating a new model under 'Create a custom keyword for your virtual assistant', the model creation process never completes for the Basic Tier. It just stays at 'Processing' after an entire day. This is my first time running through this process,…
Is there a way to permanently add IPA pronunciation to each SubscriptionID without referencing Lexicon files?
In the Text to speech service, I found many words that are pronounced incorrectly, and I solved this by adding phonemes and graphemes to the lexicon file. However, the problem is that each of my service users has different vocabularies and needs to…
Help Integrating Microsoft.CognitiveServices.Speech.Extension.MAS into Unity for AEC and Noise Suppression
I am working on a Unity project (version 2023.2.20f1) that utilizes the Microsoft.CognitiveServices.Speech v1.41.1 NuGet package for speech recognition. I am trying to extend this functionality to include acoustic echo cancellation (AEC) and noise…
video translation in Azure AI Speech - Supported regions
Dear Sir, Currently, video translation in Azure AI Speech is only supported in the East US region. Could you please let me know when can we expect the video translation in Azure AI Speech to be supported in other regions as well like West…
Azure Speech Avatar Live Chat Playground limit
Hi there, Is there some kind of limit for Azure Speech service playground for testing Live Avatar. I play with the Avatar options, works very nice for quick demos, but if I give long scenario (like prompt) in the settings (box for details about the…
microsoft tts reads english wierd when its set to hebrew.
when the hebrew tts reads a sentence with seperated english letters it reads it weird. for example : "השם שלך הוא P A Z" doesn't say the letter "P", "A", "Z", instead it pronounces the sound of the letter. how can…
Multilingual Voice returns wrong language when using numbers text to speech
Hello, when I am using your api and the following voice: "en-US-AndrewMultilingualNeural" I also use "es-ES" and "en-US" to specify when to use each language. It works when using text but when a just a number like…
Why does Azure Speech-to-Text detect French accurately in a standalone Python script but perform poorly in a real-time video call integration?
I'm working on a real-time translation project using Azure Speech Services. When I run my translation code in a standalone Python script, it accurately recognizes and translates French and English speech. However, when the same Speech-to-Text…
Slovak Text-to-Speech Pronunciation Issues with 'r' and 'l' Since September Update
Hello, We are using your Text-to-Speech service in the Slovak language. Since around September, we’ve noticed that certain words containing the letters "r" or "l" are being pronounced incorrectly. For example, the word prvý is…
Custom Speech Training stuck on "processing" since 4 days
I trained four custom speech recognition models for the locales de-DE, fr-FR, en-GB, pl-PL (on Friday the 15th) in the Speech Studio but they are stuck on "processing" since then. How can I solve this issue? It's really an urgent…
How to sign for STT service from neuralspace on Azure?
I want to sign STT SaaS service. However, after configuring it on Azure, when I click on "Configure Account" I am redirected to https://azmarketplace.neuralspace.ai/?token=X when I got a 503 error without an option to ask help there. How to…
How to identify filler words in Azure AI Speech
Hi team. Is there any feature in Azure Speech that can help us identify filler words? Please point me to the right documentation if there is any. Thanks, Sai Vishnu Soudri
Issues Accessing Azure Speech to Text REST API Version 2024-11-15
How can the latest Azure Speech to Text REST API, version 2024-11-15, be used? The documentation states this version is generally available, but an attempt to access the API using their migration guide results in a "resource not found" error.…