How to solve HTTP 400 while copying a stt-model from a ressource to another ressource
Hello, Before it works, but now it doesn't work anymore. Does anyone know how to solve it? Translated version looks like this: ' Error copying model into project Error copying model abc into project new. Error: Status: 400. Could not complete…
Recognizing speech... Info: Error opening streams - read error=1
I have this code to call the Speech recognition and it always fails with opening streams error. I tried both ways - audio stream and wav file. neither is working. def recognize_speech_from_base64(base64_audio: str,languagecode): """ …

Text to speech avatar: increase concurrent connections
Last time we had to do this via support ticket now its making me post here. We want to request higher limits for our AI Speech resource. I'm not posting the extra info needed until i hear back from an engineer.
Speech Service Batch Transcription 500 Error
I have a video processing API deployed for 2 weeks. One of the endpoints uses Azure Speech Service Batch Transcription (UKSouth). Users started complaining today as results were not delivered within ETA. I tried listing jobs in the queue and I got…
Andrew voice is now worse, but not in all regions
I've been using the Andrew voice for a while, and from time to time it gets worse, and someone fixes it. What happened this time is that in region West Europe the voice suddenly changed, and it really sucks. But I tried region Central US, and it sounds…
I need a bing keycode
How do I get a bing ketcode for speech recognition?
Exception [SPXERR_RUNTIME_ERROR 0x1b] When Using SpeechRecognizer with AudioProcessingOptions for Echo Cancellation
I am trying to implement Acoustic Echo Cancellation (AEC) using Microsoft's Speech SDK in C#. I am using AudioProcessingOptions.Create(AudioProcessingConstants.AUDIO_INPUT_PROCESSING_ENABLE_DEFAULT) to enable audio processing. However, I am encountering…
Adjusting Audio Speed in Azure AI Speech
Is it possible to adjust the speed of audio generated using an OpenAI voice? I know that OpenAI's REST API supports a parameter for speed, but I couldn't find anything similar in the Azure AI Speech documentation. Thanks in advance!
How to solve problem of a failed speech model
I'm building a speech model on Azure. The data for training was successfully uploaded, but the model that uses the data failed with an "Internal error" message. Pls tell me what to do. Thanks! Elise
How to get omissions and insertions from scripted assessments in streaming mode
Hi, I'm trying to use Pronunciation Assessment to assess audio files that are between 60 and 120 seconds. This is in csharp dotnet, with the Azure Speech Services SDK and en-GB. Because of the time limit on the non-streaming method…
Automating Deployment of Azure Speech-to-Text Model Across Subscriptions
I have successfully deployed an Azure Speech-to-Text model in my subscription. Now, I need to automate the process of deploying the same model to my colleague's subscription. Current Setup: I have an existing Speech-to-Text service deployed in my…
How to collect user voice in real-time from the browser and then send it to Azure Speech-to-Text via WebSocket?
I'm almost driven crazy by this problem. The audio stream I capture with MediaRecorder on Chrome only supports the webm format, while the Azure API only supports wav and ogg formats. And there is no complete example telling me how to create a support for…

408 Status code in Azure Speech to text Batch transcription
I am trying to transcribe an audio file with Azure speech to text Batch transcription. The transcription is successful. But when I try to get the transcription files using the below API curl -v -X GET…
Getting 408 in Batch speech to text transcription file
I am running azure speech to text using batch transcription API. I am able to create the transcription. It is getting succeed. But when trying to access the transcription files using the below API, I am getting 408 error. curl -v -X GET…
How to train a custom model from a custom model in Azure Speech Studio?
Hello! I have a custom model in Azure Speech Studio that I've trained a few months back with audio + human-labeled transcripts. Now, I need to further train the model to ensure it properly filters profanity words. I thought I could just select my custom…
GSTREAMER Command for .wav conversion
Good day! I am using GStreamer command to convert the input file to .wav format. "gst-launch-1.0 filesrc location=\"{inputVideoAudioPath}\" ! decodebin ! audioconvert ! audioresample ! audio/x-raw,format=S16LE,rate=16000,channels=1 !…
PushAudioInputStream write uses high CPU and memory when under load
Hi team, I observe high CPU and Memory usage when sending audio using the PushAudioInputStream write method during load. I am using the Java SDK version 1.42.0 Our use case involves getting multiple streams of Audio which we need to send to Azure for…
How to access confidence score of speech recognition results in JavaScript
I'm using real-time Azure AI Speech recognition with automatic language recognition. I would like to filter out low-confidence recognition results, which mostly caused by background noise or unclear microphone input. I'm using Node.js SDK. It seems there…
I would like to know if there are any other avatars besides the Asian figures for text to speech ? How do I access them?
I would like to have the option to selfct differen avatar figures besides the Asian one shown.
Azure platform: Can't Call API Key for Speech service in Google sheets.
Hi I'm new to azure and I can't call my Azure Speech service (Text to speech) API key in google sheets that's stored in my Key Vault secrets (for The Speech Service.). Can anyone help me? So far I've: Created a Resource Group. Added a Key Vault to…