Azure AI Speech

4 answers

Error when returning audio stream from server using speech synthesis

I was able to generate and produce audio speech on my local server. The API was generating a wav file and storing it in my local file system. The speech was played by an HTML audio tag picking up the local file. When I deployed to the Azure web…

asked

Rebecca Peltz 30

edited an answer

Saideep Anchuri 3,850 Microsoft External Staff

1 answer

How to enable Text-to-Speech (TTS) for Speech Services

I have an active Azure Speech Services resource under the Standard (S0) tier in the East US region. My API Key works for listing available voices (voices/list), but any Text-to-Speech (TTS) request returns a "404 Resource Not Found"…

asked

Mohammed Riyas 0

edited an answer

Vinodh247 29,361 MVP

1 answer

Azure Pronunciation Assessment API – Inconsistent Scores Between Python and TypeScript

Hi Azure Team, We are experiencing a significant discrepancy in pronunciation assessment scores when using the Azure Speech Pronunciation Assessment API with the Python SDK versus the TypeScript SDK. The same audio file and reference text produce good…

asked

Waleed 0

commented

Willem de Groot 0

1 answer

Exception [SPXERR_RUNTIME_ERROR 0x1b] When Using SpeechRecognizer with AudioProcessingOptions for Echo Cancellation

I am trying to implement Acoustic Echo Cancellation (AEC) using Microsoft's Speech SDK in C#. I am using AudioProcessingOptions.Create(AudioProcessingConstants.AUDIO_INPUT_PROCESSING_ENABLE_DEFAULT) to enable audio processing. However, I am encountering…

asked

Gaurav Rastogi 20

accepted

Gaurav Rastogi 20

1 answer

Is there a mapping between SessionId and LogId

Hi team, I would like to know if there is a mapping between sessionId that Azure SDK provides to the logId with which the transcripts and recordings are logged on Azure. I am referring to this [blog] to log transcripts with Azure. While I can retrieve…

asked

Sai Vishnu Soudri 60

commented

Pavankumar Purilla 4,025 Microsoft External Staff

3 answers

Azure Text-to Speech Error Code: 0x38 (SPXERR_AUDIO_SYS_LIBRARY_NOT_FOUND) when using Python 3.10 on Wep App Service

I followed these 2 guides and everything works fine on my local windows 10 machine using python 3.10., but when I deploy my code using VS Code to my linux basic web app service, running python 3.10, I receive the following…

asked

Jared Rice 30

commented

Rebecca Peltz 30

0 answers

zh-CN-XiaochenMultilingualNeural voice request timeout

Hey, When using voice zh-CN-XiaochenMultilingualNeural request is timed out. Other chinese voices, for example, zh-CN-XiaoxiaoMultilingualNeural or zh-CN-XiaoqiuNeural works perfectly. Where is the problem?

asked

Maks Lysohorov 0

commented

kothapally Snigdha 1,635 Microsoft External Staff

1 answer

Introducing interpretation in Microsoft Teams using Azure AI Speech. But when and how?

Hello, I saw a few weeks ago the following Microsoft Azure Video where a call was translated in realtime. https://www.youtube.com/watch?v=r8gzes7aA7s Will be good to test this and be part of the BETA Testgroups. Where can I find more information about…

asked

Jose Lopez Moreno-ADM 0

commented

Schroeter, Thomas 0

2 answers

Azure TTS Speaker Separation Problem

Hello, I followed the instructions at https://learn.microsoft.com/ko-kr/azure/ai-services/speech-service/get-started-stt-diarization?tabs=windows&pivots=programming-language-csharp, but I'm encountering an error. For reference, the subscription key…

asked

JW J 25

commented

JW J 25

1 answer

Internal error while fine tuning Whisper

While trying to fine tuning Whisper Large V2 I get an internal error: "Model training failedAn internal server error has occurred. Retry after a while. If the issue persists". I've tried several times and different region but the problem…

asked

Pier Luigi Susini 0

answered

Prashanth Veeragoni 1,190 Microsoft External Staff

1 answer

Azure Text to Speech Error: Unable to contact server. StatusCode: 1006,

Dear all,We use azure text to speech to synthesise speech. However, recently we keep seeing this error when using Azure Text to Speech service in our application: Error details: Unable to contact server. StatusCode: 1006,…

asked

Learn StepWise 0

commented

Prashanth Veeragoni 1,190 Microsoft External Staff

0 answers

Azure TTS: Getting non speech audio bytes at beginning and ending of TTS speech

We use Azure's Rest API with the TTS service to generate audio for one of our products. From our logs, it seems starting on February 28, 2025, we started getting audio back with non-speech bytes (two audio blips) at the beginning and end of the audio…

asked

Tom Westrick 0

edited the question

Tom Westrick 0

1 answer

TTS Azure - Connection was closed by the remote host. Error code: 1011

I am using the azure text to speech service with SDK with both python and java. azure-cognitiveservices-speech package with python com.microsoft.cognitiveservices.speech:client-sdk with java. My request has been working for at least a year and…

asked

Daniel Engelson 0

edited a comment

Daniel Engelson 0

1 answer

Why does ConversationTranscriber lose connection to server after about 20 min?

We are doing a live transcription speech to text using the ConversationTranscriber. After calling startTranscribingAsync it runs for about 20 min before a canceled event is received with an error saying "Unable to contact server. StatusCode: 0,…

asked

Mojica, Christopher 0

answered

Mojica, Christopher 0

0 answers

Azure Pronunciation Assessment Playground Stuck at "Validating Resource"

Hi Azure Team, We are experiencing an issue with the Azure Speech Pronunciation Assessment Playground. When selecting the speech resource (hub), it gets stuck at "Validating Resource" indefinitely. This issue started recently, but the same…

asked

Waleed 0

commented

SriLakshmi C 3,015 Microsoft External Staff

1 answer

Reuse SpeechRecognizer and stream for multiple audio streams?

Hi team, is there any best practice on how to reuse the SpeechRecognizer for stream recognizing user audios? In our application, we know where user start talking and end talking so we can signal speech recognizer for it. The reason I wanted to reuse…

asked

LeetGPT 60

edited a comment

eiden 0

1 answer

Azure speechsdk is not working after the deployment of the app in linux based web application (App service)

We have identified a issue with accessing the client’s microphone in our Azure DevOps-deployed web application. The problem arises because the use_default_microphone=True setting in Azure Speech SDK tries to access the server’s microphone instead of the…

asked

Bharani Karthick S 20

accepted

Bharani Karthick S 20

2 answers

How to solve problem of a failed speech model

I'm building a speech model on Azure. The data for training was successfully uploaded, but the model that uses the data failed with an "Internal error" message. Pls tell me what to do. Thanks! Elise

asked

Elise 0

answered

Suwarna S Kale 1,186

2 answers

Few questions on the limitations of phrase list feature in Azure AI Speech

Hi. I was looking at the phrase-list feature in speech-text streaming flow and I have a few questions: Is there a limit on the number of words we can provide as part of phraseListGrammar? Is there a limit on the number of characters in each word? Is…

asked

Sai Vishnu Soudri 60

answered

Nikolay Yordanov 0

0 answers

Speech Service Batch Transcription 500 Error

I have a video processing API deployed for 2 weeks. One of the endpoints uses Azure Speech Service Batch Transcription (UKSouth). Users started complaining today as results were not delivered within ETA. I tried listing jobs in the queue and I got…

asked

Javier Orozco 5

commented

VasaviLankipalle-MSFT 18,571

Filter

Content

1,940 questions with Azure AI Speech tags

Error when returning audio stream from server using speech synthesis

How to enable Text-to-Speech (TTS) for Speech Services

Azure Pronunciation Assessment API – Inconsistent Scores Between Python and TypeScript

Exception [SPXERR_RUNTIME_ERROR 0x1b] When Using SpeechRecognizer with AudioProcessingOptions for Echo Cancellation

Is there a mapping between SessionId and LogId

Azure Text-to Speech Error Code: 0x38 (SPXERR_AUDIO_SYS_LIBRARY_NOT_FOUND) when using Python 3.10 on Wep App Service

zh-CN-XiaochenMultilingualNeural voice request timeout

Introducing interpretation in Microsoft Teams using Azure AI Speech. But when and how?

Azure TTS Speaker Separation Problem

Internal error while fine tuning Whisper

Azure Text to Speech Error: Unable to contact server. StatusCode: 1006,

Azure TTS: Getting non speech audio bytes at beginning and ending of TTS speech

TTS Azure - Connection was closed by the remote host. Error code: 1011

Why does ConversationTranscriber lose connection to server after about 20 min?

Azure Pronunciation Assessment Playground Stuck at "Validating Resource"

Reuse SpeechRecognizer and stream for multiple audio streams?

Azure speechsdk is not working after the deployment of the app in linux based web application (App service)

How to solve problem of a failed speech model

Few questions on the limitations of phrase list feature in Azure AI Speech

Speech Service Batch Transcription 500 Error