1,774 questions with Azure AI Speech tags

Sort by: Updated
1 answer

Unable to Get Logical Results with Azure Pronunciation Assessment

I'm trying to use the pronunciationAssessment feature in the Azure Speech SDK, but I cannot get reasonable result. I've tested this with the word "school" and other words as well, but I always get a result of 0—no matter whether the word was…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-11-07T20:28:30.9833333+00:00
tzviya langenthal 0 Reputation points
commented 2024-11-09T16:09:38.6533333+00:00
tzviya langenthal 0 Reputation points
0 answers

How to set Speech sensitivity of Speech to Text to ignore all noise.

I need to set the speech sensitivity so that I can change it in noisy enviroments. How to set Speech sensitivity of Speech to Text to ignore all noise.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-11-06T15:18:10.4833333+00:00
saumyaraj zala 0 Reputation points
commented 2024-11-08T18:55:09.5833333+00:00
kothapally Snigdha 260 Reputation points Microsoft Vendor
1 answer

IPA phoneme for "Herrera" doesn't sound right

Hi, Here's what I'm using for the IPA phoneme for the Spanish name "Herrera." /eˈreɾa/ However, the first "r" isn't rolled and the second "r" sounds like a T. Is there another phoneme element I can use to get the rolled…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-11-07T16:09:45.13+00:00
ckamida 0 Reputation points
commented 2024-11-08T17:25:37.5733333+00:00
YutongTie-MSFT 52,861 Reputation points
1 answer

Realtime Recognizer not utilising with Semantic Segmentation

Hi all! I'm using the Azure speechsdk.SpeechRecognizer for transcribing streamed real-time audio. While the transcription works, continuous talking will result in large paragraphs being outputted rather than sentence by sentence. I included the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-11-08T02:34:01.7166667+00:00
Thomas Bauer 0 Reputation points
commented 2024-11-08T15:21:50.5+00:00
Thomas Bauer 0 Reputation points
1 answer

Azure Speech Service Batch Synthesis

Azure Speech Service Batch Synthesis API is not creating the file as MP3 when the output format is correct (audio-24khz-160kbitrate-mono-mp3). Speech is created as WMA file

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-11-06T16:14:19.5+00:00
C-Gregory Padron 0 Reputation points
edited the question 2024-11-08T08:47:44.0333333+00:00
vipullag-MSFT 26,391 Reputation points
0 answers

speech to text twilio telugu transcript is not coming empty transcript and intitally system is not responding

async def receive_json(self, text_data): try: event = text_data.get('event') if event == 'connected': logger.info("WebSocket connected event received") elif event == 'start': …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
Azure Functions
Azure Functions
An Azure service that provides an event-driven serverless compute platform.
5,110 questions
asked 2024-11-04T06:03:08.5633333+00:00
MR. JACK!! 0 Reputation points
edited the question 2024-11-08T08:44:11.4+00:00
vipullag-MSFT 26,391 Reputation points
0 answers

How to fix Exception with an error code: 0xe (SPXERR_MIC_NOT_AVAILABLE)

I have built a chatbot bot framework and am now looking to integrate speech functionality for the bot. I am trying to run the below code from ms learn quickstart for speech sdk using python. …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-11-08T03:08:41.2733333+00:00
Saeshav Subash 0 Reputation points
commented 2024-11-08T07:33:14.5666667+00:00
navba-MSFT 24,985 Reputation points Microsoft Employee
1 answer

Inconsistencies in IPA Pronunciation in Text to Speech

Hi, I'm using SSML to ensure specific pronunciation, however, I'm experiencing some inconsistencies. For example, here's the word 'would': <speak version='1.0' xmlns='http://www.w3.org/2001/10/synthesis' xml:lang='en-US'> <voice…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-11-07T16:00:21.8+00:00
Chris Enzweiler 0 Reputation points
answered 2024-11-08T04:49:00.7266667+00:00
Avinash Devarakonda 325 Reputation points Microsoft Vendor
0 answers

Issue with Continuous Speech Recognition Returning Omitted Words in Azure Speech Service

Dear Azure Technical Support, I’m using the Azure Speech Service for continuous speech recognition, following the official JavaScript sample from the cognitive-services-speech-sdk repository. I’ve encountered a behavior I’d like to clarify. When using…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-11-07T15:55:09.6266667+00:00
gabriel angelo 0 Reputation points
commented 2024-11-07T22:17:41.3866667+00:00
YutongTie-MSFT 52,861 Reputation points
1 answer One of the answers was accepted by the question author.

Will word boundary event always be triggered before the Synthesizing event?

We are using speech SDK to do text to speech, and we need to highlight the speaking word by leveraging the word boundary event. From…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-11-06T03:09:33.8733333+00:00
Yu Lan 76 Reputation points Microsoft Employee
commented 2024-11-07T09:58:50.6566667+00:00
Yu Lan 76 Reputation points Microsoft Employee
0 answers

Bug Report: Mispronunciation of Isolated Hungarian Words in Azure Neural TTS (hu-HU-NoemiNeural), but not in context

Description: The Azure Neural TTS system is mispronouncing specific Hungarian words when using the hu-HU-NoemiNeural voice. The issue affects more than half of the vocabulary words in a recent production run of words (full SSML shared at bottom of this…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-09-24T17:22:22.3+00:00
Verbari LLC 20 Reputation points
commented 2024-11-07T05:21:31.0933333+00:00
navba-MSFT 24,985 Reputation points Microsoft Employee
0 answers

How to disable the default "Disfluency Removal" of filler words after STT transcription in Azure AI Speech?

Azure AI Speech Services defaults to removing many filler words (uh, eh, etc.) via post-transcription "Disfluency Removal". My use case includes presentation analysis for filler words, which requires a verbatim transcript. Is there a…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-10-19T02:25:07.9633333+00:00
Dennis 0 Reputation points
commented 2024-11-07T05:15:54.73+00:00
navba-MSFT 24,985 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Can Pronunciation assessment be used with REST API?

Is it possible to utilize Pronunciation assessment with REST API and if so, what are the necessary steps to make it work?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-01-08T15:23:15.05+00:00
HyunHo Shin 20 Reputation points
commented 2024-11-06T17:15:10.1+00:00
Light 0 Reputation points
0 answers

Speech service SDK usage and issues

I am trying to connect the Azure Speech with my Azure OpenAI so that I have the option to use Azure OpenAI to ask queries either by text or voice method. Currently, I have issues with connecting the Azure AI Speech with my backend which is node.js. I am…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,238 questions
asked 2024-11-04T06:04:48.15+00:00
Farah Diana Masri 5 Reputation points
commented 2024-11-06T10:19:03.5366667+00:00
romungi-MSFT 46,831 Reputation points Microsoft Employee
0 answers

Azure TTS Error 404

I get error 404 when trying to fetch the mp3 file via fetch. I am using Node.js in the backend. More details: I created a functionality in my app that creates an XML document containing all SSML tags as specified by Microsoft Azure. Is it possible some…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-10-30T10:59:10.9833333+00:00
Migs Marbella 0 Reputation points
commented 2024-11-05T12:08:53.1766667+00:00
santoshkc 9,400 Reputation points Microsoft Vendor
1 answer

Issue with Continuous Language Identification in Azure Speech SDK for Angular Application

We are currently using the "microsoft-cognitiveservices-speech-sdk" in our Angular application (version 14) for speech transcription and translation. The transcription and translation functionality is working as expected. However, we are…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-10-14T04:41:10.32+00:00
sanjay.bisht 0 Reputation points
edited an answer 2024-11-05T07:19:20.3366667+00:00
romungi-MSFT 46,831 Reputation points Microsoft Employee
0 answers

Azure Speech Studio Andrew Multilingual voice sounds glitchy

I'm having some issues with the Andrew Multilingual (en-US-AndrewMultilingualNeural) voice in the Azure Speech Studio. There's a few instances in which the voice sounds raspy and really kind of glitchy. It seems to have a lot of trouble with the word…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-10-23T14:32:22.36+00:00
Rene Lems 0 Reputation points
edited a comment 2024-11-04T13:39:28.33+00:00
Alexis Toro 0 Reputation points
1 answer

SpeakSsmlAsync Result always Canceled

Hello, I am building a project using Azure's SpeechSynthesizer. SpeechLog.txt I am running into the following problem: when calling SpeakSsmlAsync(ssmlText), the result always has a canceled state, and I am having a hard time understanding why. When I…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,902 questions
asked 2024-11-02T21:12:50.3466667+00:00
Sven Wind 0 Reputation points
answered 2024-11-04T04:31:00.9066667+00:00
navba-MSFT 24,985 Reputation points Microsoft Employee
0 answers

I need to know wether this API "Post-call transcription and analytics" can work with nodejs?

I need to know wether this API "Post-call transcription and analytics" can work with nodejs? If it is not, where I can get a proper Conversation converstion API with multi user and multi language dedection and retrun a text with given…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-10-30T05:40:45.9666667+00:00
Nidoos Solutions 0 Reputation points
commented 2024-11-02T13:04:24.58+00:00
santoshkc 9,400 Reputation points Microsoft Vendor
1 answer

When using batch speech transscription the ITN feature only applies to the first option of the nBest results.

When using batch transscription the ITN feature only applies to the first option of the nBest results, whitch is not necessarily the one with the highest confidence. The batch transscription service returns a json result with the following structure…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
asked 2024-10-28T19:17:58.12+00:00
Julian Kopka Heerup 0 Reputation points
commented 2024-11-01T09:39:00.3233333+00:00
Avinash Devarakonda 325 Reputation points Microsoft Vendor