Issue with Ryan Multilingual Voice – Accent Changing from American to Australian
Experiencing an issue where the Ryan Multilingual Voice unexpectedly changes its accent from American to Australian during audio generation. Our bot language is set to English - US, and we are consistently using the Ryan Multilingual Voice. We have tested…
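One thing worth trying is pinning the spoken locale explicitly in SSML. A minimal sketch, assuming the short voice name "en-US-RyanMultilingualNeural" (adjust it to the exact name listed for Ryan Multilingual in your resource) and placeholder key/region:

```python
# Sketch: lock a multilingual voice to en-US by wrapping the text in <lang xml:lang="en-US">.
import azure.cognitiveservices.speech as speechsdk

speech_config = speechsdk.SpeechConfig(subscription="YOUR_KEY", region="YOUR_REGION")
audio_config = speechsdk.audio.AudioOutputConfig(filename="ryan_en_us.wav")
synthesizer = speechsdk.SpeechSynthesizer(speech_config=speech_config, audio_config=audio_config)

ssml = """
<speak version='1.0' xmlns='http://www.w3.org/2001/10/synthesis' xml:lang='en-US'>
  <voice name='en-US-RyanMultilingualNeural'>
    <lang xml:lang='en-US'>
      This sentence should keep an American English pronunciation.
    </lang>
  </voice>
</speak>
"""
result = synthesizer.speak_ssml_async(ssml).get()
print(result.reason)
```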
How to fix destination container url folder in Azure Batch Transcription?
import os
import datetime
import json
import time
import requests
from azure.storage.blob import BlobServiceClient
# ======================= Configuration Settings =======================
# Provided container URL for audio files…
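For reference, a minimal sketch of directing batch transcription output to your own container via properties.destinationContainerUrl, assuming the v3.2 REST endpoint and placeholder key, region, and SAS URLs:

```python
# Sketch: create a batch transcription whose result files are written to a
# caller-supplied destination container (container-level SAS with write/list).
import requests

SPEECH_KEY = "YOUR_SPEECH_KEY"
ENDPOINT = "https://YOUR_REGION.api.cognitive.microsoft.com/speechtotext/v3.2/transcriptions"

payload = {
    "displayName": "batch-with-destination-container",
    "locale": "en-US",
    "contentUrls": ["https://YOUR_STORAGE.blob.core.windows.net/audio/file1.wav?<SAS>"],
    "properties": {
        # Results are written into this container by the service.
        "destinationContainerUrl": "https://YOUR_STORAGE.blob.core.windows.net/results?<SAS>"
    },
}

resp = requests.post(
    ENDPOINT,
    headers={"Ocp-Apim-Subscription-Key": SPEECH_KEY, "Content-Type": "application/json"},
    json=payload,
)
resp.raise_for_status()
print(resp.json()["self"])  # URL of the created transcription job
```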
Japanese Voice-to-Text: Preventing Unwanted Kanji Transcription for Names
When using Azure Speech to Text for batch transcription of conversations in Japanese, there is an issue with person names being transcribed into incorrect Kanji characters. A custom speech model has been created to handle specific industry terms, but…
How to solve the problem of a failed speech model
I'm building a speech model on Azure. The data for training was successfully uploaded, but the model that uses the data failed with an "Internal error" message. Please tell me what to do. Thanks! Elise
Features of the Speech to Text container
Hello community, I would like to ask for some info about Speech to Text containers. I would like to run the service on premises, and I can't find anything in the documentation about whether these services are available in the on-premises version as well: …
How to get omissions and insertions from scripted assessments in streaming mode
Hi, I'm trying to use Pronunciation Assessment to assess audio files that are between 60 and 120 seconds long. This is in C# (.NET), with the Azure Speech Services SDK and en-GB. Because of the time limit on the non-streaming method…
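A minimal sketch of the streaming approach (shown in Python for consistency with the rest of this page; the C# SDK exposes the same options): enable miscue so per-word results carry "Omission"/"Insertion" error types, then collect them from each recognized event during continuous recognition.

```python
# Sketch: scripted pronunciation assessment over a long file with miscue enabled.
import threading
import azure.cognitiveservices.speech as speechsdk

speech_config = speechsdk.SpeechConfig(subscription="YOUR_KEY", region="YOUR_REGION")
speech_config.speech_recognition_language = "en-GB"
audio_config = speechsdk.audio.AudioConfig(filename="assessment.wav")

pa_config = speechsdk.PronunciationAssessmentConfig(
    reference_text="Full script the speaker was asked to read...",
    grading_system=speechsdk.PronunciationAssessmentGradingSystem.HundredMark,
    granularity=speechsdk.PronunciationAssessmentGranularity.Word,
    enable_miscue=True,  # required to surface omissions and insertions
)

recognizer = speechsdk.SpeechRecognizer(speech_config=speech_config, audio_config=audio_config)
pa_config.apply_to(recognizer)

miscues = []
done = threading.Event()

def on_recognized(evt):
    # Per-word error_type is "None", "Mispronunciation", "Omission", or "Insertion".
    result = speechsdk.PronunciationAssessmentResult(evt.result)
    for word in result.words:
        if word.error_type in ("Omission", "Insertion"):
            miscues.append((word.word, word.error_type))

recognizer.recognized.connect(on_recognized)
recognizer.session_stopped.connect(lambda evt: done.set())
recognizer.canceled.connect(lambda evt: done.set())

recognizer.start_continuous_recognition()
done.wait()
recognizer.stop_continuous_recognition()
print(miscues)
```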
Difference in results for Batch speech to text in speech studio and API
Hi, I was running a dual-channel Hindi audio file through both Speech Studio and the batch speech to text API, but I am getting different results from the two, and the difference is consistent. The Speech Studio version is the correct one, as it recognises the phrases…
Which Azure Speech SDK Feature to Use for Real-Time Meeting Transcription with Speaker Diarization?
Hi, I am working on real-time meeting transcription using the Azure Speech SDK and need: accurate speaker diarization (identify who is speaking); sentence-level segmentation (avoid merging multiple sentences into one recognition event); improved…
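A minimal sketch of the real-time path with speaker labels, using the ConversationTranscriber; the segmentation-timeout property is an assumption about tuning sentence splitting and can be dropped, and key/region are placeholders.

```python
# Sketch: real-time transcription with speaker ids from the default microphone.
import threading
import azure.cognitiveservices.speech as speechsdk

speech_config = speechsdk.SpeechConfig(subscription="YOUR_KEY", region="YOUR_REGION")
speech_config.speech_recognition_language = "en-US"
# A shorter silence timeout tends to break long utterances into more segments.
speech_config.set_property(speechsdk.PropertyId.Speech_SegmentationSilenceTimeoutMs, "500")

audio_config = speechsdk.audio.AudioConfig(use_default_microphone=True)
transcriber = speechsdk.transcription.ConversationTranscriber(
    speech_config=speech_config, audio_config=audio_config
)

done = threading.Event()
transcriber.transcribed.connect(
    lambda evt: print(f"[{evt.result.speaker_id}] {evt.result.text}")
)
transcriber.session_stopped.connect(lambda evt: done.set())
transcriber.canceled.connect(lambda evt: done.set())

transcriber.start_transcribing_async().get()
done.wait(timeout=60)  # in a real app, stop on your own signal instead
transcriber.stop_transcribing_async().get()
```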
Running Web Avatar Code Sample but with Audio-Only mode
I've been testing Microsoft's Web/Avatar code sample (cognitive-services-speech-sdk/samples/python/web/avatar at master · Azure-Samples/cognitive-services-speech-sdk) and successfully got it running. However, the pricing for the speech service with an…
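If avatar pricing is the concern, a minimal sketch of an audio-only alternative (standard neural TTS rather than the avatar real-time channel, so no avatar session is started); the key, region, and voice name are placeholders.

```python
# Sketch: synthesize speech to the default speaker without the avatar pipeline.
import azure.cognitiveservices.speech as speechsdk

speech_config = speechsdk.SpeechConfig(subscription="YOUR_KEY", region="YOUR_REGION")
speech_config.speech_synthesis_voice_name = "en-US-JennyNeural"

# Default speaker output; pass audio_config=None instead to keep the bytes in
# result.audio_data and play them through your own audio path.
audio_config = speechsdk.audio.AudioOutputConfig(use_default_speaker=True)
synthesizer = speechsdk.SpeechSynthesizer(speech_config=speech_config, audio_config=audio_config)

result = synthesizer.speak_text_async("Hello, this is the audio-only path.").get()
print(result.reason)
```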
Voice Activation/Assistant questions on Windows
Hi MSFT team, I want to ask: does an ATL service (.exe) count as a UWP app for the purpose of using voice assistant features on Windows? As we know, only UWP apps can use voice activation/assistant. When voice activation returns a keyword-detected event, how can I get the SignalId or…
My webhook throws a BadHttpRequestException "Unexpected end of request content."
I have a webhook that receives updates from Azure AI Speech when a batch transcription is complete and I am regularly getting Microsoft.AspNetCore.Server.Kestrel.Core.BadHttpRequestException "Unexpected end of request content.". My function app…
How to add a custom id for a single audio file in a speech to text batch process?
Hi, I am processing multiple audio files using the Azure Speech to Text batch process with contentUrls. So let's say I have 5 public URLs; when I get the results for these 5, they come back as "contenturl_0.json", "contenturl_1.json", 2, 3, 4.…
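The result file names are generated by the service, but each result JSON carries a "source" field with the original audio URL, so results can be mapped back to your own identifiers. A minimal sketch, where the key, region, transcription id, and the my_ids lookup are placeholders:

```python
# Sketch: map batch transcription result files back to custom ids via "source".
import requests

SPEECH_KEY = "YOUR_SPEECH_KEY"
transcription_url = (
    "https://YOUR_REGION.api.cognitive.microsoft.com/speechtotext/v3.2/transcriptions/<id>"
)
headers = {"Ocp-Apim-Subscription-Key": SPEECH_KEY}

# Built when submitting the job: contentUrl -> your own id.
my_ids = {
    "https://YOUR_STORAGE.blob.core.windows.net/audio/call1.wav?<SAS>": "customer-42",
}

files = requests.get(f"{transcription_url}/files", headers=headers).json()
for f in files["values"]:
    if f["kind"] != "Transcription":
        continue  # skip the report file
    result = requests.get(f["links"]["contentUrl"]).json()
    custom_id = my_ids.get(result["source"], "unknown")
    print(custom_id, result["source"])
```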
How to find the associated audio recording for a recognized sentence (RecognitionResult.ResultId?)
The audio recording has some metadata, but I am unable to correlate it with the RecognitionResult's ResultId. Would you be able to guide me on how to link the RecognitionResult with the audio logs? Thank you! { "self":…
Looking for a way to track usage per user when using one subscription key per organization for speech-to-text
Hi, we are right in the middle of developing a reporting application for health care use. Average use will be 20 concurrent users of AI Speech to Text per customer/organization. Technically, everything seems to be working fine. However, when using 1…
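With a single shared subscription key, per-user usage generally has to be metered in the application itself. A minimal sketch, where user_id, the usage dictionary, and the key/region are hypothetical placeholders for your own reporting layer:

```python
# Sketch: accumulate recognized audio duration per user (duration is reported
# in 100-nanosecond ticks on each recognized result).
from collections import defaultdict
import azure.cognitiveservices.speech as speechsdk

usage_seconds = defaultdict(float)

def track_usage(user_id: str, evt: speechsdk.SpeechRecognitionEventArgs) -> None:
    usage_seconds[user_id] += evt.result.duration / 10_000_000  # ticks -> seconds

def make_recognizer(user_id: str, wav_path: str) -> speechsdk.SpeechRecognizer:
    speech_config = speechsdk.SpeechConfig(subscription="SHARED_KEY", region="YOUR_REGION")
    audio_config = speechsdk.audio.AudioConfig(filename=wav_path)
    recognizer = speechsdk.SpeechRecognizer(speech_config=speech_config, audio_config=audio_config)
    recognizer.recognized.connect(lambda evt: track_usage(user_id, evt))
    return recognizer
```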
Are TurboMultilingualNeural voices production ready?
Hi Microsofties, there is a conflict in voice status: these voices are listed as "public preview" in web…
WebSocket Authentication Error (401) When Using Private Endpoint for Azure Speech Services
Hello, I am experiencing an authentication issue when attempting to use Azure Cognitive Services Speech-to-Text with a Private Endpoint from an Azure App Service. Below are the details of my configuration and the error message I am receiving. Issue…
Speech to text costing $1/hr is crazy. No bulk pricing available.
Hey, We are running an app that takes notes for doctors. Paying $1 per hr of speech to text is bankrupting our company. How come Azure doesn't offer any subscriptions or bulk discounts?
Azure Bot that joins Microsoft Teams Call and transfers speech to text using Azure Speech Service
I configured an Azure Bot to join Microsoft Teams calls, and a calling endpoint is provided. Now, using .NET, I have an implementation for the calling endpoint, and the speech service starts converting speech to text when the call is answered.…
Does Custom Neural Voice support Viseme output?
I could not find any documentation on the SSML supported by custom neural voice. I would like to know if Viseme events are supported.
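One way to check is to subscribe to viseme events against the deployed voice. A minimal sketch, where the custom voice name and deployment id are placeholders; whether a given Custom Neural Voice actually emits visemes is the thing being verified, not something this sketch guarantees.

```python
# Sketch: listen for viseme_received while synthesizing with a custom voice.
import azure.cognitiveservices.speech as speechsdk

speech_config = speechsdk.SpeechConfig(subscription="YOUR_KEY", region="YOUR_REGION")
speech_config.endpoint_id = "YOUR_CUSTOM_VOICE_DEPLOYMENT_ID"
speech_config.speech_synthesis_voice_name = "YourCustomNeuralVoiceName"

synthesizer = speechsdk.SpeechSynthesizer(speech_config=speech_config, audio_config=None)
synthesizer.viseme_received.connect(
    lambda evt: print(f"viseme {evt.viseme_id} at {evt.audio_offset / 10_000} ms")
)
result = synthesizer.speak_text_async("Testing viseme output.").get()
print(result.reason)
```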
[ISSUE] Pronunciation Assessment in Polish Fails
Hello Azure Team, I have noticed that audio passed to Azure Pronunciation Assessment in Polish doesn't work at all. Uploaded audio and even the live microphone in your Azure Speech dashboard fail in Polish. I haven't tested all the languages, but the…