Issue with Speech-to-Text Integration in Azure Communication Services Using C#

Admin Saad

Context: We are building a bot using Azure Communication Services (ACS) and Azure Speech Services to handle phone calls. The bot asks questions (via TTS) and captures user responses using speech-to-text (STT).

What We’ve Done:

Created an ACS instance and acquired an active phone number.
Set up an event subscription to handle callbacks for incoming calls.
Integrated Azure Speech Services for STT in C#.

Achievements:

Successfully connected calls using ACS.
Played TTS prompts generated from an Excel file.

Challenges:

User responses are not being captured. Despite setting InitialSilenceTimeout to 10 seconds, the bot skips to the next question after 1–2 seconds without recognizing speech.
The bot does not reprompt the user even when no response is detected.

Help Needed:

How can we ensure accurate real-time speech-to-text capture during ACS telephony calls?
Are there better configurations or alternate approaches for speech recognition in ACS?

Additional Context:

We are following the official ACS C# sample.
We are using the Azure Speech Services and ACS SDKs.

Code Snippet (C#):

// Recognize user speech
async Task
