Azure OpenAI instance, modifying deployment capacity to permit another model deployment

TH-4622 20 Reputation points
2025-02-27T12:51:25.97+00:00

I have an Azure OpenAI instance with multiple models. I want to deploy another model, so modified the deployment capacity of one of them (4o) downwards from 1000K TPM to 950K TPM to permit another 4o model deployment within quota. I selected Standard deployment type. This didn't allow me to deploy the model... it states "Sweden Central (no quota)", the only location that it will permit me to deploy to is Japan East, even when the TPM limit I selected was very low, e.g. 20K TPM.

In AI Foundry, under Quota for this 4o model it states Quota Allocation: 950K of 1M, which suggests to me there should be capacity? Maybe there's also a limit of number of models per region? Seems odd that I can deploy to Japan East but nowhere else? For data sovereignty reasons within the firm I am only permitted to deploy to EU, hence Sweden Central.

What do I need to do to be able to deploy the additional 4o model please?

Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
3,174 questions
{count} votes

Accepted answer
  1. Saideep Anchuri 3,140 Reputation points Microsoft Vendor
    2025-02-27T17:04:32.0966667+00:00

    Hi TH-4622

    You, must be going with default version 2024-11-20 which selects Japan east as default region , please change to model version to 2024-05-13/ 2024-08-06 to create model deployment with 600k, 300k.

    Kindly refer below screenshot:

    User's image

    kindly refer below link: deployment-model-availability

    Thank You.

    1 person found this answer helpful.

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.