Batch AI service quotas and limits
As with other Azure services, there are limits on certain resources associated with the Batch AI service. In Batch AI, these limits are default quotas applied at the subscription level for each region where the service is available. This article discusses those defaults, and how you can request quota increases.
Keep these quotas in mind as you design and scale up your Batch AI resources. For example, if your cluster doesn't reach the target number of nodes you specified, then you might have reached a Batch AI cores limit for your subscription.
If you plan to run production workloads in Batch AI, you may need to increase one or more of the quotas above the default.
Note
A quota is a credit limit, not a capacity guarantee. If you have large-scale capacity needs, please contact Azure support.
Resource quotas
Resource | Default limit | Maximum limit |
---|---|---|
Dedicated cores per region | 10 - 24 | N/A1 |
Low-priority cores per region | 10 - 24 | N/A2 |
Clusters per region | 20 | 2003 |
Note
Default limits vary depending on the type of subscription you have.
1 The number of dedicated cores per region can be increased, but the maximum number is unspecified. Contact Azure support to discuss increase options.
2 The number of low-priority cores per region can be increased, but the maximum number is unspecified. Contact Azure support to discuss increase options.
3 Contact Azure support if you want to request an increase beyond this limit.
Other limits
The following are strict limits, which cannot be exceeded once hit.
Resource | Maximum limit |
---|---|
Maximum workspaces per resource group | 800 |
Maximum cluster size | 100 nodes |
Maximum GPU MPI processes per node | 1-4 |
Maximum GPU workers per node | 1-4 |
Maximum job lifetime | 7 days1 |
Maximum parameter servers per node | 1 |
1 The maximum lifetime refers to the time that a job begins running and when it completes. Completed jobs persist indefinitely; data for jobs not completed within the maximum lifetime is not accessible.
View Batch AI quotas
View your current Batch AI subscription quotas in the Azure portal.
- On the left pane, click on All services. Then search for Batch AI and click to open the service.
- Click on Usage + quotas on the Batch AI menu.
- Select your subscription to view the quota limits.
Increase a Batch AI cores quota
Follow these steps to request a quota increase for your Batch AI subscription using the Azure portal.
On the left pane, click on All services. Then search for Batch AI and click to open the service.
Click on New support request on the Batch AI menu.
In Basics:
a. Issue Type > Quota
b. Subscription > Select your subscription.
c. Quota type > Batch AI
d. Support plan > Select your support plan.
Click Next.
In Problem:
a. Select a Severity according to your business impact.
b. In Quota Details, specify the location, quota type, and resource type. Specify the new limit you want to request. Click Save and continue.
c. Optional - Upload any relevant files with more information regarding your reason for increase.
Click Next.
In Contact information:
a. Select a Preferred contact method.
b. Verify and enter the required contact details.
Click Create to submit the support request.
Once you've submitted your support request, Azure support will contact you. Completing the request can take up to 2 business days.
Next steps
After becoming familiar with the quota limits, check out the following articles for getting started with using Batch AI.