Azure Open AI GPT 4o Mini - Image Input - Token Limitation Error

Rahul Verma 5 Reputation points
2024-12-04T08:25:47.61+00:00

We have a standard subscription. Every time, we try to use the API for any image analysis, we get token limit error. See the attached image. I am using a highly scaled down image. 431x391, 29KB in size.

Screenshot 2024-12-04 at 13.50.16

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,480 questions
{count} vote

1 answer

Sort by: Most helpful
  1. Max Lacy 340 Reputation points
    2024-12-06T15:59:32.08+00:00

    I understand you are experiencing a limit issue when trying to utilize Open AI GPT 4o Mini specifically for image input. The error message "Requests the ChatCompletions_Create Operation under Azure OpenAi API version 2024-10-01 -preview have exceeded token rate limit of pricing if you would like to further increase the default rate limit..." can be misleading in that it points the user to a quota increase link.

    When a deployment is created, the assigned TPM will directly map to the tokens-per-minute rate limit enforced on its inferencing requests. A Requests-Per-Minute (RPM) rate limit will also be enforced whose value is set proportionally to the TPM assignment using the following ratio:

    6 RPM per 1000 TPM.

    When you created your AI GPT 4o Mini deployment the TPM was likely set to a different threshold than the GPT 4o TPM threshold resulting in different RPMs rate limits for each.

    To increasing your token per minute in the Azure AI Portal. Navigate to Deployments | <select deployment> | Edit.
    User's image

    If you found this answer helpful, consider accepting it.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.