Hi Peter Ma,
Greetings & Welcome to the Microsoft Q&A forum! Thank you for sharing your query.
Azure AI offers a robust solution for inference as a service, similar to AWS Bedrock. You can use the Azure AI Model Inference API, which allows you to consume various models, including those from OpenAI, Cohere, and Meta, using a unified API.
Regarding LoRA-based models, Azure AI does support them. You can fine-tune and deploy LoRA fine-tuned models, such as Llama 3.1 8B.
For more information: https://techcommunity.microsoft.com/blog/aiplatformblog/the-future-of-ai-deploying-your-lora-fine-tuned-llama-3-1-8b-on-azure-ai-why-its/4276562
I hope this information helps.