Quota for Azure OpenAI Service Model Changed to 8k Without Notice

takeolexus 130 Reputation points
2025-03-10T03:22:54.3333333+00:00

Hello,

When I initially set up gpt-4o, the "Tokens per Minute Rate Limit" was 150K.

User's image

Recently, upon checking the quota, it displayed as "150K of 8K", and when viewing the model's editing screen, it showed "undefinedundefine".

User's image

Here is the quota status:User's image

When you press the edit button, it shows 150K.User's image

Could you please explain why this situation has occurred? Additionally, is the effective quota currently 8K?

Thank you for your assistance.

Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
3,223 questions
{count} votes

Accepted answer
  1. SriLakshmi C 3,250 Reputation points Microsoft External Staff
    2025-03-11T07:40:09.5166667+00:00

    Hello takeolexus,

    It appears that the quota limits for your Azure OpenAI Service model have been adjusted. Initially, the Tokens per Minute (TPM) limit was set to 150K, but it now reflects 8K TPM. This change may be due to updates in the service's quota policies or restrictions based on your subscription type.

    I attempted to reproduce the issue in my environment, but the quota remained unchanged, as shown in the screenshot below.

    User's image

    The effective quota currently stands at 8K TPM, which aligns with the lower limits applicable to certain subscription types, such as Monthly credit card-based plans, Azure for Students, and Free Trials. If your subscription falls into one of these categories, this will likely explain the quota reduction.

    Please refer this Other offer types.

    I Hope this helps. Do let me know if you have any further queries.


    If this answers your query, do click Accept Answer and Yes for was this answer helpful.

    Thank you!

    1 person found this answer helpful.

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.