I would like to increase my openAI azure deployment token rate limit from 450k to 7.5M

Solutions Chimera 0 Reputation points
2025-01-04T13:19:50.9066667+00:00

Hi!

We are going to launch our MVP to 500 test users,
which runs with openAI gpt-4o azure deployment.

It has a max 450k token rate limit / minute, which will be not enough for us on
more busy hours, as we going to have 500 users for testing purposes,
and one single call for us because of the usecase is around 15k token.

So it is roughly 30 call/minute, we would like to increase it to 500 call/minute which would be 7.5M token per minute limit.

At least for a month between january 17 and february 17, as that will be our UAT.

Thank you in advance!

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,480 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Anthony LAVABRE 1,065 Reputation points
    2025-01-04T18:26:24+00:00

    Hi,

    You can check the region where you have deploy your Azure OpenAI on this link because depending on the region, this is not possible : https://learn.microsoft.com/en-us/azure/ai-services/openai/quotas-limits

    You need to open a support ticket on the Azure portal for a token increase because you have reached the limit.

    Sincerely,


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.