Can someone explain how an OpenAI service provisioned purchase equates to in TPM quota limits?

Jeremy Lau 20 Reputation points
2025-03-05T13:32:17.33+00:00

Hi there,

I am considering making an Azure openAI service provisioned reservation purchase. Take the following specification:

  • Region: Australia East
  • Scope: Shared
  • Product type: Provision Managed globally

Purchasing this on a month basis equates to $260 usd.

Can someone please help me explain how this then translates to how many PTUs I am expected to receive from this?

or better if someone can explain what the equivalent TPM quota limit this would be or how I am able to translate or configure that with this purchase.

any answers will be great please.

thanks,

Jeremy

Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
3,203 questions
{count} votes

1 answer

Sort by: Most helpful
  1. santoshkc 13,180 Reputation points Microsoft External Staff
    2025-03-05T15:11:37.9333333+00:00

    Hi @Jeremy Lau,

    Thank you for reaching out to Microsoft Q&A forum!

    When you are using an Azure OpenAI provisioned service for $260/month in Australia East with a Shared scope, you are essentially reserving a set number of Provisioned Throughput Units (PTUs), which directly determine your Tokens Per Minute (TPM) quota. The exact PTUs you receive for this cost depend on Azure’s pricing and allocation policies for the region. PTUs define how much capacity you have for generating and processing tokens, and each model (e.g., GPT-4-Turbo, GPT-3.5-Turbo) has a specific PTU-to-TPM mapping. For example, a single PTU might correspond to a few thousand tokens per minute, but this varies based on the model and Azure’s internal provisioning. Since your purchase falls under Provision Managed Globally, your service is likely pooled with others, meaning your TPM allocation may be dynamically managed rather than fixed.

    To estimate provisioned capacity using request level data, open the capacity planner in the Azure AI Foundry. The capacity calculator is under Shared resources > Model Quota > Azure OpenAI Provisioned.

    The Provisioned option and the capacity planner are only available in certain regions within the Quota pane, if you don't see this option setting the quota region to Sweden Central will make this option available. For more info, please look into this page.

    To determine the exact TPM quota for your purchase, you can check the Azure OpenAI Quota page. If your TPM is lower than needed, you may be able to increase PTUs by requesting a quota adjustment, though availability depends on regional capacity.

    For more info, please refer to:

    Let me know if you need help finding specific PTU-to-TPM mappings for your region and model.

    1 person found this answer helpful.
    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.