Azure speech to text appears very slow
Hi team, We have observed that the Azure speech-to-text is very slow. I am using continuousRecognitionAsync and I observe that Azure takes a total of close to 6s for just 3s audio. The parameters that I've set are: EndSilenceTimeoutMs =…
Cannot connect to text-to-speech service.
I am following this text-to-speech guide: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/text-to-speech-avatar/real-time-synthesis-avatar With a valid key and the correct region, I am getting the error below. Any ideas on why this…
How to add a custom id for a single audio file in a speech to text batch process?
Hi, i am processing multiple audio files using azure speech to text batch process using contentUrls, So i have lets have 5 public urls, and when i get the result of these 5 i get it as "contenturl_0.json", "contenturl_1.json",2,3,4.…
Ryan Multilingual Voice Changed
Hello. I've been using the Ryan Multinlingual Voice for a couple months now and it was consistent. Today, I was using it again and I heard a completely different voice. I previewed the Ryan Multilingual voice and it had the same voice as before, but when…
Azure Speech-to-Text: language or quota limitaion
I want to use Azure's speech-to-text for my chatbot service. The use-case is to provide STT results when the user gives voice (usually one or two sentences). Here, the user does not select a language. I tried using Azure OpenAI's whisper, but the…
Custom Speech Model Training Stuck at Processing - Cannot be canceled or deleted
Steps: Uploaded a zip file with one wav file - 11 seconds, and text file with the utterance. Started training - the training is in "processing" stage for more than a day No option to cancel and cannot delete it. The speech resource is…
Which Azure Speech SDK Feature to Use for Real-Time Meeting Transcription with Speaker Diarization?
Hi , I am working on real-time meeting transcription using Azure Speech SDK and need: Accurate speaker diarization (identify who is speaking). Sentence-level segmentation (avoid merging multiple sentences into one recognition event). Improved…
Change Language for Synthesizer in TranslationRecognizer
Hi All, I use the TranslationRecognizer from the Speech SDK to do transcription and translation. I also use the synthesizing to change the language. This is how the recognizer ist created: def create_translation_recognizer(self): …
Azure Real time speech to text not working through python but curl is working
We have enabled Azure speech to text service with private end point, when we try to use below curl command it we able to get output curl -i --location 'https://xxxxxxxxxxx?language=en-US' --header 'Accept: application/json' --header…
Possible errors during batch speech to text?
Hi, im using azure speech to text batch api, what are the possible error messages that could come in the transcription report for the failed transcriptions, not for the failed batches, for the failed transcriptions. If i send 1000 files for transcription…
Issues with Processing Encrypted WAV Files Using Fast Transcription API
I have successfully used the Whisper model to transcribe a batch of 50 encrypted WAV files without manually inputting the file paths. However, I'm encountering difficulties when attempting to perform the same task with the Fast Transcription API. Could…
Azure speech service continuous speech recognition
I'm preyty new to Azure speech service and I'm using twilo/plivo service for connecting a number with azure stt and process it further after transcription. My problem is when I speak something, it's detecting well, and when I stop speaking or stay…
How do i change the request limit for whisper model in azure
Hi everyone! 👋 I’m currently using the Whisper model in Azure OpenAI Service and running into issues with the default request limits. I need to increase the request limit to better handle my application’s workload. Here’s what I’ve tried/checked so…
Getting Error when using TTS service with text containing special character '<'
I am using microsoft-cognitiveservices-speech-sdk for text to speech. Before sending the text to SpeechSynthesizer i am creating a ssml and then passing the smml to speakSsmlAsync method. I am getting a issue in this process , if my text contain…
Custom Neural Voice
We have full copyright and rights of an Author who is deceased and we would like to create his Voice via Custom Neural Voice - speech AI. Somehow we are unable to do so because Speech Portal requires the author to consent that his voice can be used.…
I can not open the Speech Studio
I can not open the Speech Studio, the Page is not loading, the page just keeps loading and loading and stay white
Running a Custom Text to Speech Model On Premise
Hi there, at my company, we are currently relying on Azure services for Speech-To-Text (STT) and Text-To-Speech (TTS) processing. These services work quite reliably for us and we would like to continue using these services. However recently, I got the…
Realtime Recognizer not utilising with Semantic Segmentation
Hi all! I'm using the Azure speechsdk.SpeechRecognizer for transcribing streamed real-time audio. While the transcription works, continuous talking will result in large paragraphs being outputted rather than sentence by sentence. I included the…
Speech Studio Bug report: Editing intonations has no effect for some Languages/Voices
Hi, Please see the screenshot below: That works fine. Which creates the following SSML: …
How to receive usage details for Azure Speech Service Batch Transcription?
I am using Azure Speech Service to transcribe audio files vie Batch Transcription. When I report on the costs, there is no way to report on the actual usage (time) based on which costs are calculated. I do see the requests, but if I change the scope to…