Azure online endpoint is Scaling taking long time

Tran Hong Thu (DPS.VI.DTS) 0 Reputation points
2025-02-13T07:38:15.4833333+00:00

I have a project to run an AI model using an online endpoint as a backend service, the endpoint is configured (manually set in the portal) to be auto-scale based on the number of requests.

Expect the endpoint to scale up between 1 to 2 minutes, like other services such as virtual machine scale set, etc...

But with the ML online endpoint, scaling takes a long time, about 12-18 minutes.

Do you have suggestions for speeding up the scaling time?

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
3,121 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.