Azure online endpoint is Scaling taking long time
Tran Hong Thu (DPS.VI.DTS)
0
Reputation points
I have a project to run an AI model using an online endpoint as a backend service, the endpoint is configured (manually set in the portal) to be auto-scale based on the number of requests.
Expect the endpoint to scale up between 1 to 2 minutes, like other services such as virtual machine scale set, etc...
But with the ML online endpoint, scaling takes a long time, about 12-18 minutes.
Do you have suggestions for speeding up the scaling time?
Sign in to answer