你当前正在访问 Microsoft Azure Global Edition 技术文档网站。 如果需要访问由世纪互联运营的 Microsoft Azure 中国技术文档网站,请访问 https://docs.azure.cn

Azure AI model inference frequently asked questions

If you can't find answers to your questions in this document, and still need help check the Azure AI services support options guide.

General

What's the difference between Azure OpenAI service and Azure AI model inference?

Azure OpenAI Service gives customers access to advanced language models from OpenAI. Azure AI model inference extends such capability giving customers access to all the flagship models in Azure AI under the same service, endpoint, and credentials. It includes Azure OpenAI, Cohere, Mistral AI, Meta Llama, AI21 labs, etc. Customers can seamlessly switch between models without changing their code.

Both Azure OpenAI Service and Azure AI model inference are part of the Azure AI services family and build on top of the same security and enterprise promise of Azure.

While Azure AI model inference focus on inference, Azure OpenAI Service can be used with more advanced APIs like batch, fine-tuning, assistants, and files.

What's the difference between OpenAI and Azure OpenAI?

Azure AI Models and Azure OpenAI Service give customers access to advanced language models from OpenAI with the security and enterprise promise of Azure. Azure OpenAI codevelops the APIs with OpenAI, ensuring compatibility and a smooth transition from one to the other.

Customers get the security capabilities of Microsoft Azure while running the same models as OpenAI. It offers private networking, regional availability, and responsible AI content filtering.

Learn more about the Azure OpenAI service.

What's the difference between Azure AI services and Azure AI Foundry?

Azure AI services are a suite of AI services that provide prebuilt APIs for common AI scenarios. Azure AI Services is part of the Azure AI Foundry platform. Azure AI services can be used in Azure AI Foundry portal to enhance your models with prebuilt AI capabilities.

Models

Why aren't all the models in the Azure AI model catalog supported in Azure AI services?

Azure AI model inference in AI services supports all the models in the Azure AI catalog having pay-as-you-go billing. For more information, see the Models article.

The Azure AI model catalog contains a wider list of models, however, those models require compute quota from your subscription. They also need to have a project or AI hub where to host the deployment. For more information, see deployment options in Azure AI Foundry.

SDKs and programming languages

Which are the supported SDKs and programming languages for Azure AI model inference?

You can use Azure Inference SDK with any model supported by Azure AI model inference in Azure AI services, the AzureOpenAI class in OpenAI SDK, or the Azure OpenAI SDK.

Cohere SDK, Mistral SDK, and model provider-specific SDKs aren't supported when connected to Azure AI services.

For more information, see supported SDKs and programming languages.

Does Azure AI model inference work with the latest Python library released by OpenAI (version>=1.0)?

Azure AI services support the latest release of the OpenAI Python library (version>=1.0).

I'm making a request for a model that supports Azure AI model inference, but I'm getting a 404 error. What should I do?

Ensure you created a deployment for the given model and that the deployment name matches exactly the value you're passing in model parameter. Although routing isn't case sensitive, ensure there's no special punctuation or spaces as these are common mistakes.

I'm using the `azure-ai-inference` package for Python and I get a 401 error when I try to authenticate using keys. What should I do?

Azure AI Services resource requires the version azure-ai-inference>=1.0.0b5 for Python. Ensure you're using that version.

I'm using OpenAI SDK and indicated the Azure OpenAI inference endpoint as base URL (https://<resource-name>.openai.azure.com). However, I get a 404 error. What should I do?

Ensure you're using the correct endpoint for the Azure OpenAI service and the right set of credentials. Also, ensure that you're using the class AzureOpenAI from the OpenAI SDK as the authentication mechanism and URLs used are different.

Does Azure AI model inference support custom API headers? We append custom headers to our API requests and are seeing HTTP 431 failure errors.

Our current APIs allow up to 10 custom headers, which are passed through the pipeline, and returned. We noticed some customers now exceed this header count resulting in HTTP 431 errors. There's no solution for this error, other than to reduce header volume. In future API versions, we no longer pass through custom headers. We recommend that you don't depend on custom headers in future system architectures.

Pricing and Billing

How is Azure AI model inference billed?

You're billed for inputs and outputs to the APIs, typically in tokens. There are no cost associated with the resource itself or the deployments.

The token price varies per each model and you're billed per 1,000 tokens. You can see the pricing details before deploying a given model. For more information about billing, see Manage cost.

Where can I see the bill details?

Billing and costs are displayed in Azure Cost Management + Billing. You can see the usage details in the Azure portal.

Billing isn't shown in Azure AI Foundry portal.

How can I place a spending limit to my bill?

You can set up a spending limit in the Azure portal under Azure Cost Management + Billing. This limit prevents you from spending more than the limit you set. Once spending limit is reached, the subscription will be disabled and you won't be able to use the endpoint until the next billing cycle.

Data and Privacy

Do you use my company data to train any of the models?

Azure AI model inference doesn't use customer data to retrain models, and customer data is never shared with model providers.

The Customer Copyright Commitment is a provision to be included in the December 1, 2023, Microsoft Product Terms that describes Microsoft’s obligation to defend customers against certain non-Microsoft intellectual property claims relating to Output Content. If the subject of the claim is Output Content generated from the Azure OpenAI Service (or any other Covered Product that allows customers to configure the safety systems), then to receive coverage, customers must have implemented all mitigations required by the Azure OpenAI Service documentation in the offering that delivered the Output Content. The required mitigations are documented here and updated on an ongoing basis. For new services, features, models, or use cases, new CCC requirements will be posted and take effect at or following the launch of such service, feature, model, or use case. Otherwise, customers will have six months from the time of publication to implement new mitigations to maintain coverage under the CCC. If a customer tenders a claim, the customer will be required to demonstrate compliance with the relevant requirements. These mitigations are required for Covered Products that allow customers to configure the safety systems, including Azure OpenAI Service; they don't impact coverage for customers using other Covered Products.