Severe Latency in Azure OpenAI Services (o1 and o3-mini Models) – Response Times Over 2 Minutes for Simple Queries

Movin Silva 105 Reputation points
2025-02-19T07:09:00.0066667+00:00

We are experiencing severe latency with the o1 and o3-mini models on Azure OpenAI, including in the Azure Playground. Even a simple query such as "Who are you?" takes more than 2 minutes to return a response. The delay is consistent across repeated attempts and is causing considerable disruption.
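
To make the delay easier to reproduce outside the Playground, the response time can be captured with a small timing harness. This is only a minimal sketch: `send_query` is a hypothetical callable standing in for whatever client call sends the prompt to the deployment, and it is not tied to any specific SDK.

```python
import statistics
import time
from typing import Callable, Dict, List


def measure_latency(send_query: Callable[[str], str], prompt: str, runs: int = 3) -> List[float]:
    """Return the wall-clock seconds taken by each of `runs` calls."""
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        send_query(prompt)  # hypothetical wrapper around the model call
        timings.append(time.perf_counter() - start)
    return timings


def summarize(timings: List[float]) -> Dict[str, float]:
    """Reduce the raw timings to min, median, and max seconds."""
    return {
        "min": min(timings),
        "median": statistics.median(timings),
        "max": max(timings),
    }
```

Running `summarize(measure_latency(send_query, "Who are you?"))` against each deployment would give comparable numbers to attach to a support request.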

Could you please investigate the cause of this latency and advise on how to resolve it? Response times this long are abnormal for such trivial requests.

Thank you for your assistance!

Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.