Real-Time Audio Transcription and Translation Using Azure

Kayla Farivar 75 Reputation points
2025-01-23T00:42:25.1233333+00:00

I want to create a continuous translator that can specifically output audio into english text. I've been looking into the translation and language detection on Azure but I finding trouble having the tools separate. I have it as mic -> language detection -> translator -> text, but if midways the language is switched the language detector sends the wrong language to the translator. I need this all to be real time preferably. I don't mind having a list of languages as options in the beginning. Does azure have a system with this already that allows me to not establish the input language into the translator? Or is there a better way to go about all this?

Also, if any advice could point to resources in Python i'd appreciate it :) !

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,881 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Saideep Anchuri 1,370 Reputation points Microsoft Vendor
    2025-01-23T02:33:13.8166667+00:00

    Hi Kayla Farivar

    Welcome to Microsoft Q&A Forum, thank you for posting your query here!

    Please use multilingual Speech Translation service for better results. It automatically detects source language and translate into target language. You might need to tune the continuous mode setting and can opt custom model approach to finetune your results.

    kindly refer below documentation: speech-translation

    Thank You.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.