How to disable the default "Disfluency Removal" of filler words after STT transcription in Azure AI Speech?

Dennis 0 Reputation points
2024-10-19T02:25:07.9633333+00:00

By default, Azure AI Speech removes many filler words (uh, eh, etc.) through post-transcription "Disfluency Removal". My use case is analyzing presentations for filler words, which requires a verbatim transcript. Is there a transcription configuration property that disables "Disfluency Removal"? I'm using the Python swagger-client and processing as in this sample: https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/batch/python/python-client/main.py

Perhaps something of the form:

properties = swagger_client.TranscriptionProperties()
properties.SpeechServiceResponse_PostProcessingOption = "None"

Do trained custom speech models also apply "Disfluency Removal" by default? If they do not, perhaps training one would be a workaround?
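To narrow down where the fillers disappear, one option is to compare the different text forms in the batch result file. The following is a minimal Python sketch, not a confirmed fix. It assumes the result JSON has a `recognizedPhrases` list whose entries carry `nBest` candidates with `lexical` and `display` strings. The filler list, the function names, and the sample data are hypothetical, and whether `lexical` actually retains fillers needs to be checked against real output.

```python
import re
from collections import Counter

# Hypothetical filler list for illustration; adjust it for your own analysis.
FILLERS = {"uh", "um", "eh", "er", "hmm"}


def count_fillers(text):
    """Count filler tokens in a transcript string, ignoring case and punctuation."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(t for t in tokens if t in FILLERS)


def compare_forms(result):
    """Return per-phrase filler counts for the lexical and display forms.

    `result` is a parsed batch transcription result file (a dict).
    Only the top nBest candidate of each phrase is inspected.
    """
    rows = []
    for phrase in result.get("recognizedPhrases", []):
        nbest = phrase.get("nBest") or []
        if not nbest:
            continue
        best = nbest[0]
        rows.append({
            "lexical": count_fillers(best.get("lexical", "")),
            "display": count_fillers(best.get("display", "")),
        })
    return rows


# Made-up sample shaped like a result file; this is not real service output.
sample = {
    "recognizedPhrases": [
        {"nBest": [{"lexical": "uh so um this is the plan",
                    "display": "So this is the plan."}]}
    ]
}

print(compare_forms(sample))
```

If the `lexical` counts are consistently higher than the `display` counts on real results, the fillers survive recognition and are dropped only during display formatting; if both are zero, they are removed earlier.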

Azure AI Speech
An Azure service that integrates speech processing into apps and services.