Azure speech service continuous speech recognition

VENKATA PRAKASH NALLAMOTHULA 0 Reputation points
2025-01-26T17:30:53.98+00:00

I'm preyty new to Azure speech service and I'm using twilo/plivo service for connecting a number with azure stt and process it further after transcription.

My problem is when I speak something, it's detecting well, and when I stop speaking or stay silent, it will automatically process the empty speech which contains empty transcription text and it is returning it, this happens for every 10-15 seconds.. it automatically detecting speech.. I'm not cancelling the continuous recognition until the end of the call.

Anyone has similar experiences or anything I can change in speech configuration? Please let me know.

I used azure SDK and used both intial and speech segmentation timeout, but no change.. I'm using it for real time so I cannot add more then a second.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,880 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Azar 25,935 Reputation points MVP
    2025-01-27T06:43:39.2033333+00:00

    Hi there VENKATA PRAKASH NALLAMOTHULA

    Thnx for using QandA platform

    this can be if the service is configured to handle silence too aggressively. you can try adjusting the speechEndSilenceTimeoutMs, the service waits before considering speech to have ended. A value around 800ms to 1000ms works well for real-time scenarios. For background noise, enabling noise suppression or using a custom speech model to your environment can reduce false positives.

    If this helps kindly acept the answer thanksnmuch.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.