1,772 questions with Azure AI Speech tags

Sort by: Updated
0 answers

How to set Speech sensitivity of Speech to Text to ignore all noise.

I need to set the speech sensitivity so that I can change it in noisy enviroments. How to set Speech sensitivity of Speech to Text to ignore all noise.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-11-06T15:18:10.4833333+00:00
saumyaraj zala 0 Reputation points
commented 2024-11-07T23:28:40.7233333+00:00
kothapally Snigdha 260 Reputation points Microsoft Vendor
0 answers

Issue with Continuous Speech Recognition Returning Omitted Words in Azure Speech Service

Dear Azure Technical Support, I’m using the Azure Speech Service for continuous speech recognition, following the official JavaScript sample from the cognitive-services-speech-sdk repository. I’ve encountered a behavior I’d like to clarify. When using…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-11-07T15:55:09.6266667+00:00
gabriel angelo 0 Reputation points
commented 2024-11-07T22:17:41.3866667+00:00
YutongTie-MSFT 52,856 Reputation points
0 answers

Unable to Get Logical Results with Azure Pronunciation Assessment

I'm trying to use the pronunciationAssessment feature in the Azure Speech SDK, but I cannot get reasonable results. Here's the code I ran: using Microsoft.CognitiveServices.Speech; using Microsoft.CognitiveServices.Speech.Audio; using…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-11-07T20:28:30.9833333+00:00
tzviya langenthal 0 Reputation points
edited the question 2024-11-07T21:25:28.16+00:00
VarunTha 9,185 Reputation points Microsoft Vendor
0 answers

IPA phoneme for "Herrera" doesn't sound right

Hi, Here's what I'm using for the IPA phoneme for the Spanish name "Herrera." /eˈreɾa/ However, the first "r" isn't rolled and the second "r" sounds like a T. Is there another phoneme element I can use to get the rolled…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-11-07T16:09:45.13+00:00
ckamida 0 Reputation points
commented 2024-11-07T19:26:06.7233333+00:00
YutongTie-MSFT 52,856 Reputation points
0 answers

Inconsistencies in IPA Pronunciation in Text to Speech

Hi, I'm using SSML to ensure specific pronunciation, however, I'm experiencing some inconsistencies. For example, here's the word 'would': <speak version='1.0' xmlns='http://www.w3.org/2001/10/synthesis' xml:lang='en-US'> <voice…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-11-07T16:00:21.8+00:00
Chris Enzweiler 0 Reputation points
1 answer One of the answers was accepted by the question author.

Will word boundary event always be triggered before the Synthesizing event?

We are using speech SDK to do text to speech, and we need to highlight the speaking word by leveraging the word boundary event. From…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-11-06T03:09:33.8733333+00:00
Yu Lan 76 Reputation points Microsoft Employee
commented 2024-11-07T09:58:50.6566667+00:00
Yu Lan 76 Reputation points Microsoft Employee
0 answers

speech to text twilio telugu transcript is not coming empty transcript and intitally system is not responding

async def receive_json(self, text_data): try: event = text_data.get('event') if event == 'connected': logger.info("WebSocket connected event received") elif event == 'start': …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
Azure Functions
Azure Functions
An Azure service that provides an event-driven serverless compute platform.
5,095 questions
SAP HANA on Azure Large Instances
SAP HANA on Azure Large Instances
Microsoft branding terminology for an Azure offer to run HANA instances on SAP HANA hardware deployed in Large Instance stamps in different Azure regions.
124 questions
asked 2024-11-04T06:03:08.5633333+00:00
MR. JACK!! 0 Reputation points
edited the question 2024-11-07T08:57:41.3733333+00:00
SriLakshmi C 505 Reputation points Microsoft Vendor
0 answers

Bug Report: Mispronunciation of Isolated Hungarian Words in Azure Neural TTS (hu-HU-NoemiNeural), but not in context

Description: The Azure Neural TTS system is mispronouncing specific Hungarian words when using the hu-HU-NoemiNeural voice. The issue affects more than half of the vocabulary words in a recent production run of words (full SSML shared at bottom of this…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-09-24T17:22:22.3+00:00
Verbari LLC 20 Reputation points
commented 2024-11-07T05:21:31.0933333+00:00
navba-MSFT 24,910 Reputation points Microsoft Employee
0 answers

How to disable the default "Disfluency Removal" of filler words after STT transcription in Azure AI Speech?

Azure AI Speech Services defaults to removing many filler words (uh, eh, etc.) via post-transcription "Disfluency Removal". My use case includes presentation analysis for filler words, which requires a verbatim transcript. Is there a…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-10-19T02:25:07.9633333+00:00
Dennis 0 Reputation points
commented 2024-11-07T05:15:54.73+00:00
navba-MSFT 24,910 Reputation points Microsoft Employee
1 answer

Azure Speech Service Batch Synthesis

Azure Speech Service Batch Synthesis API is not creating the file as MP3 when the output format is correct (audio-24khz-160kbitrate-mono-mp3). Speech is created as WMA file

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
Azure Batch
Azure Batch
An Azure service that provides cloud-scale job scheduling and compute management.
339 questions
asked 2024-11-06T16:14:19.5+00:00
C-Gregory Padron 0 Reputation points
answered 2024-11-07T02:17:09.08+00:00
Avinash Devarakonda 245 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

Can Pronunciation assessment be used with REST API?

Is it possible to utilize Pronunciation assessment with REST API and if so, what are the necessary steps to make it work?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-01-08T15:23:15.05+00:00
HyunHo Shin 20 Reputation points
commented 2024-11-06T17:15:10.1+00:00
Light 0 Reputation points
0 answers

Speech service SDK usage and issues

I am trying to connect the Azure Speech with my Azure OpenAI so that I have the option to use Azure OpenAI to ask queries either by text or voice method. Currently, I have issues with connecting the Azure AI Speech with my backend which is node.js. I am…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,227 questions
asked 2024-11-04T06:04:48.15+00:00
Farah Diana Masri 5 Reputation points
commented 2024-11-06T10:19:03.5366667+00:00
romungi-MSFT 46,831 Reputation points Microsoft Employee
0 answers

Azure TTS Error 404

I get error 404 when trying to fetch the mp3 file via fetch. I am using Node.js in the backend. More details: I created a functionality in my app that creates an XML document containing all SSML tags as specified by Microsoft Azure. Is it possible some…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-10-30T10:59:10.9833333+00:00
Migs Marbella 0 Reputation points
commented 2024-11-05T12:08:53.1766667+00:00
santoshkc 9,240 Reputation points Microsoft Vendor
1 answer

Issue with Continuous Language Identification in Azure Speech SDK for Angular Application

We are currently using the "microsoft-cognitiveservices-speech-sdk" in our Angular application (version 14) for speech transcription and translation. The transcription and translation functionality is working as expected. However, we are…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-10-14T04:41:10.32+00:00
sanjay.bisht 0 Reputation points
edited an answer 2024-11-05T07:19:20.3366667+00:00
romungi-MSFT 46,831 Reputation points Microsoft Employee
0 answers

Azure Speech Studio Andrew Multilingual voice sounds glitchy

I'm having some issues with the Andrew Multilingual (en-US-AndrewMultilingualNeural) voice in the Azure Speech Studio. There's a few instances in which the voice sounds raspy and really kind of glitchy. It seems to have a lot of trouble with the word…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-10-23T14:32:22.36+00:00
Rene Lems 0 Reputation points
edited a comment 2024-11-04T13:39:28.33+00:00
Alexis Toro 0 Reputation points
1 answer

SpeakSsmlAsync Result always Canceled

Hello, I am building a project using Azure's SpeechSynthesizer. SpeechLog.txt I am running into the following problem: when calling SpeakSsmlAsync(ssmlText), the result always has a canceled state, and I am having a hard time understanding why. When I…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,897 questions
asked 2024-11-02T21:12:50.3466667+00:00
Sven Wind 0 Reputation points
answered 2024-11-04T04:31:00.9066667+00:00
navba-MSFT 24,910 Reputation points Microsoft Employee
0 answers

I need to know wether this API "Post-call transcription and analytics" can work with nodejs?

I need to know wether this API "Post-call transcription and analytics" can work with nodejs? If it is not, where I can get a proper Conversation converstion API with multi user and multi language dedection and retrun a text with given…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-10-30T05:40:45.9666667+00:00
Nidoos Solutions 0 Reputation points
commented 2024-11-02T13:04:24.58+00:00
santoshkc 9,240 Reputation points Microsoft Vendor
1 answer

When using batch speech transscription the ITN feature only applies to the first option of the nBest results.

When using batch transscription the ITN feature only applies to the first option of the nBest results, whitch is not necessarily the one with the highest confidence. The batch transscription service returns a json result with the following structure…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-10-28T19:17:58.12+00:00
Julian Kopka Heerup 0 Reputation points
commented 2024-11-01T09:39:00.3233333+00:00
Avinash Devarakonda 245 Reputation points Microsoft Vendor
1 answer

Getting error code 0x38 (SPXERR_AUDIO_SYS_LIBRARY_NOT_FOUND) when deployed to cloud.

I am working on an interactive real time communication that uses both speech synthesizer and recognizer. In development it is working fine but when I deployed through azure web app in a Linux server it was giving the error. I don't want to process any…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-10-30T11:40:14.46+00:00
Sajib Hossain 0 Reputation points
answered 2024-10-31T00:21:30.1166667+00:00
YutongTie-MSFT 52,856 Reputation points
1 answer

Stopping Audio Playback Mid-Stream with Microsoft Neural TTS Service and Speech SDK

I'm working with the Microsoft Neural Text-to-Speech (TTS) service using the Speech SDK. I've successfully implemented audio playback, but I'm facing a challenge with controlling the playback mid-stream. My question is: How can I implement a feature to…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,772 questions
asked 2024-10-27T06:32:10+00:00
MD SHAKIL KHAN 0 Reputation points
commented 2024-10-30T15:03:32.6666667+00:00
Saideep Anchuri 240 Reputation points Microsoft Vendor