If you can't find answers to your questions in this document and still need help, check the Azure AI services support options guide.
General
What's the difference between Azure OpenAI service and Azure AI model inference?
Azure OpenAI Service gives customers access to advanced language models from OpenAI. Azure AI model inference extends that capability, giving customers access to all the flagship models in Azure AI under the same service, endpoint, and credentials. It includes models from Azure OpenAI, Cohere, Mistral AI, Meta Llama, AI21 Labs, and others. Customers can seamlessly switch between models without changing their code.
Both Azure OpenAI Service and Azure AI model inference are part of the Azure AI services family and build on top of the same security and enterprise promise of Azure.
While Azure AI model inference focuses on inference, Azure OpenAI Service can be used with more advanced APIs like batch, fine-tuning, assistants, and files.
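As an illustration of switching models without code changes, here's a minimal sketch using the Azure AI Inference SDK for Python (`azure-ai-inference`). The endpoint, key variable, and deployment names are placeholders for your own resource:

```python
import os

from azure.ai.inference import ChatCompletionsClient
from azure.core.credentials import AzureKeyCredential

# Placeholder endpoint for an Azure AI services resource; replace with your own.
client = ChatCompletionsClient(
    endpoint="https://<resource-name>.services.ai.azure.com/models",
    credential=AzureKeyCredential(os.environ["AZURE_AI_KEY"]),
)

# Hypothetical deployment names; the request code is identical for both.
for deployment in ["my-gpt-4o-deployment", "my-mistral-large-deployment"]:
    response = client.complete(
        messages=[{"role": "user", "content": "Say hello in one sentence."}],
        model=deployment,  # only the deployment name changes
    )
    print(deployment, "->", response.choices[0].message.content)
```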
What's the difference between OpenAI and Azure OpenAI?
Azure AI Models and Azure OpenAI Service give customers access to advanced language models from OpenAI with the security and enterprise promise of Azure. Azure OpenAI codevelops the APIs with OpenAI, ensuring compatibility and a smooth transition from one to the other.
Customers get the security capabilities of Microsoft Azure while running the same models as OpenAI. Azure OpenAI offers private networking, regional availability, and responsible AI content filtering.
Learn more about the Azure OpenAI Service.
What's the difference between Azure AI services and Azure AI Foundry?
Azure AI services are a suite of AI services that provide prebuilt APIs for common AI scenarios. They're part of the Azure AI Foundry platform and can be used in the Azure AI Foundry portal to enhance your models with prebuilt AI capabilities.
Models
Why aren't all the models in the Azure AI model catalog supported in Azure AI services?
Azure AI model inference in AI services supports all the models in the Azure AI catalog that offer pay-as-you-go billing. For more information, see the Models article.
The Azure AI model catalog contains a wider list of models; however, those models require compute quota from your subscription, and you need a project or AI hub to host the deployment. For more information, see deployment options in Azure AI Foundry.
SDKs and programming languages
Which are the supported SDKs and programming languages for Azure AI model inference?
You can use the Azure AI Inference SDK with any model supported by Azure AI model inference in Azure AI services, the `AzureOpenAI` class in the OpenAI SDK, or the Azure OpenAI SDK.
Cohere SDK, Mistral SDK, and other model provider-specific SDKs aren't supported when connected to Azure AI services.
For more information, see supported SDKs and programming languages.
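As a concrete illustration, here's a minimal sketch using the Azure AI Inference SDK's typed message classes; the endpoint and deployment name are hypothetical placeholders. The `AzureOpenAI` class from the OpenAI SDK (shown in a later answer) is another supported path.

```python
import os

from azure.ai.inference import ChatCompletionsClient
from azure.ai.inference.models import SystemMessage, UserMessage
from azure.core.credentials import AzureKeyCredential

# Placeholder endpoint for an Azure AI services resource.
client = ChatCompletionsClient(
    endpoint="https://<resource-name>.services.ai.azure.com/models",
    credential=AzureKeyCredential(os.environ["AZURE_AI_KEY"]),
)

response = client.complete(
    messages=[
        SystemMessage(content="You are a helpful assistant."),
        UserMessage(content="Which SDKs does Azure AI model inference support?"),
    ],
    model="my-deployment",  # hypothetical deployment name
)
print(response.choices[0].message.content)
```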
Does Azure AI model inference work with the latest Python library released by OpenAI (version>=1.0)?
Azure AI services support the latest release of the OpenAI Python library (version>=1.0).
I'm making a request for a model that supports Azure AI model inference, but I'm getting a 404 error. What should I do?
Ensure you created a deployment for the given model and that the deployment name exactly matches the value you're passing in the `model` parameter. Although routing isn't case sensitive, ensure there's no special punctuation or spaces, as these are common mistakes.
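For example, in a hypothetical setup where the deployment is named `my-mistral-large`, the `model` value must be exactly that string:

```python
import os

from azure.ai.inference import ChatCompletionsClient
from azure.core.credentials import AzureKeyCredential

client = ChatCompletionsClient(
    endpoint="https://<resource-name>.services.ai.azure.com/models",  # placeholder endpoint
    credential=AzureKeyCredential(os.environ["AZURE_AI_KEY"]),
)

response = client.complete(
    messages=[{"role": "user", "content": "Hello"}],
    model="my-mistral-large",  # must match the deployment name exactly, or the service returns 404
)
```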
I'm using the `azure-ai-inference` package for Python and I get a 401 error when I try to authenticate using keys. What should I do?
Azure AI Services resources require `azure-ai-inference>=1.0.0b5` for Python. Ensure you're using at least that version.
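One quick way to verify the installed version:

```python
# Print the installed azure-ai-inference version; it should be at least 1.0.0b5.
from importlib.metadata import version

print(version("azure-ai-inference"))
# If it's older, upgrade with: pip install --upgrade "azure-ai-inference>=1.0.0b5"
```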
I'm using OpenAI SDK and indicated the Azure OpenAI inference endpoint as base URL (https://<resource-name>.openai.azure.com). However, I get a 404 error. What should I do?
Ensure you're using the correct endpoint for the Azure OpenAI service and the right set of credentials. Also, ensure that you're using the `AzureOpenAI` class from the OpenAI SDK, as the authentication mechanism and URLs used are different.
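Here's a minimal sketch of the working configuration, assuming hypothetical resource and deployment names and an API version valid for your resource:

```python
import os

from openai import AzureOpenAI

# AzureOpenAI adds the deployment-based routing and api-version query
# parameter that Azure expects; the plain OpenAI class doesn't, which is
# why pointing it at the Azure endpoint returns 404.
client = AzureOpenAI(
    azure_endpoint="https://<resource-name>.openai.azure.com",
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-06-01",  # assumption: pick a version supported by your resource
)

response = client.chat.completions.create(
    model="my-gpt-4o-deployment",  # hypothetical deployment name, not the raw model ID
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)
```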
Does Azure AI model inference support custom API headers? We append custom headers to our API requests and are seeing HTTP 431 failure errors.
Our current APIs allow up to 10 custom headers, which are passed through the pipeline and returned. We noticed some customers now exceed this header count, resulting in HTTP 431 errors. There's no solution for this error other than to reduce header volume. In future API versions, we'll no longer pass through custom headers. We recommend that you don't depend on custom headers in future system architectures.
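As a hedged sketch, assuming the Azure AI Inference SDK for Python (which forwards a per-request `headers` mapping through azure-core), keeping the custom header count at 10 or fewer avoids the 431 error. The endpoint, deployment, and header names here are hypothetical:

```python
import os

from azure.ai.inference import ChatCompletionsClient
from azure.core.credentials import AzureKeyCredential

client = ChatCompletionsClient(
    endpoint="https://<resource-name>.services.ai.azure.com/models",  # placeholder endpoint
    credential=AzureKeyCredential(os.environ["AZURE_AI_KEY"]),
)

response = client.complete(
    messages=[{"role": "user", "content": "Hello"}],
    model="my-deployment",  # hypothetical deployment name
    headers={"x-trace-id": "abc-123"},  # custom headers; keep the total at 10 or fewer
)
```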
Pricing and Billing
How is Azure AI model inference billed?
You're billed for inputs and outputs to the APIs, typically in tokens. There's no cost associated with the resource itself or with the deployments.
The token price varies per model, and you're billed per 1,000 tokens. You can see the pricing details before deploying a given model. For more information about billing, see Manage cost.
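As a worked example with hypothetical prices (check the actual per-1,000-token rates for your model before deploying):

```python
# Hypothetical rates: $0.003 per 1,000 input tokens, $0.009 per 1,000 output tokens.
input_tokens = 12_500
output_tokens = 3_200

cost = (input_tokens / 1000) * 0.003 + (output_tokens / 1000) * 0.009
print(f"${cost:.4f}")  # $0.0663
```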
Where can I see the bill details?
Billing and costs are displayed in Azure Cost Management + Billing. You can see the usage details in the Azure portal.
Billing isn't shown in the Azure AI Foundry portal.
How can I place a spending limit to my bill?
You can set up a spending limit in the Azure portal under Azure Cost Management + Billing. This limit prevents you from spending more than the amount you set. Once the spending limit is reached, the subscription is disabled and you can't use the endpoint until the next billing cycle.
Data and Privacy
Do you use my company data to train any of the models?
Azure AI model inference doesn't use customer data to retrain models, and customer data is never shared with model providers.
Customer Copyright Commitment
How do I obtain coverage under the Customer Copyright Commitment?
The Customer Copyright Commitment is a provision to be included in the December 1, 2023, Microsoft Product Terms that describes Microsoft’s obligation to defend customers against certain non-Microsoft intellectual property claims relating to Output Content. If the subject of the claim is Output Content generated from the Azure OpenAI Service (or any other Covered Product that allows customers to configure the safety systems), then to receive coverage, customers must have implemented all mitigations required by the Azure OpenAI Service documentation in the offering that delivered the Output Content.

The required mitigations are documented here and updated on an ongoing basis. For new services, features, models, or use cases, new CCC requirements will be posted and take effect at or following the launch of such service, feature, model, or use case. Otherwise, customers will have six months from the time of publication to implement new mitigations to maintain coverage under the CCC. If a customer tenders a claim, the customer will be required to demonstrate compliance with the relevant requirements.

These mitigations are required for Covered Products that allow customers to configure the safety systems, including Azure OpenAI Service; they don't impact coverage for customers using other Covered Products.