Hello, thanks for reaching out to us, I can see three possible solutions here for your reference.
The first one is training your custom model or fine-tuning the base model, which is more related to a better speech recognition result.
Document reference for Fine-tuning - https://learn.microsoft.com/en-us/azure/ai-services/speech-service/custom-speech-ai-foundry-portal
Document reference for Custom model - https://learn.microsoft.com/en-us/azure/ai-services/speech-service/custom-speech-overview
The second one is combining Azure Language Service and Azure Speech Service, because Azure Language Service is good at extract the address, to extract an address from text using Azure Language Service, you would utilize the "Named Entity Recognition" (NER) feature, which identifies and classifies entities like locations (including addresses) within a text. You can play with it in the Azure Language Studio and see how it works on your scenario.
The last one I may try is Azure OpenAI, you may combine Azure Speech Service with Azure OpenAI to get a better result of address, but please also consider the price.
I hope this helps!
Regards,
Yutong
-Please kindly accept the answer if you feel helpful to support the community, thanks a lot.