Region availability for models in serverless API endpoints

In this article, you learn which regions are available for each model that supports serverless API endpoint deployment.

Important

Models that are in preview are marked as preview on their model cards in the model catalog.

Certain models in the model catalog can be deployed as a serverless API with pay-as-you-go billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need. This deployment option doesn't require quota from your subscription.

Region availability

Pay-as-you-go billing is available only to users whose Azure subscription belongs to a billing account in a country/region where the model provider has made the offer available (see the "Offer Availability Region" column in the tables in the following sections). If the offer is available in the relevant country/region, the user must then have a Hub/Project in an Azure region where the model is available for deployment or fine-tuning, as applicable (see the "Hub/Project Region" columns in the same tables).
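
The two-step rule above can be sketched as a small check. This is only an illustration, not an Azure API: the `AvailabilityRow` class and the `is_eligible` function are hypothetical names, the region sets are copied by hand from the Llama 3.1 8B Instruct row in the Meta Llama table later in this article, and whether the offer is available for your billing account is passed in as a plain boolean because the list of Microsoft Managed countries/regions is not enumerated here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AvailabilityRow:
    """One table row (hypothetical structure, not an Azure SDK type)."""
    deployment_regions: frozenset[str]
    # An empty set stands for "Not available" in the fine-tuning column.
    fine_tuning_regions: frozenset[str] = frozenset()


# Values copied from the Llama 3.1 8B Instruct row of the Meta Llama table.
LLAMA_3_1_8B_INSTRUCT = AvailabilityRow(
    deployment_regions=frozenset({
        "East US", "East US 2", "North Central US", "South Central US",
        "West US", "West US 3", "Sweden Central",
    }),
    fine_tuning_regions=frozenset({"West US 3"}),
)


def is_eligible(row: AvailabilityRow, offer_available: bool,
                hub_region: str, fine_tuning: bool = False) -> bool:
    """Apply both conditions: offer availability, then Hub/Project region."""
    # Step 1: the billing account must be in a country/region with the offer.
    if not offer_available:
        return False
    # Step 2: the Hub/Project must be in a region listed for the operation.
    regions = row.fine_tuning_regions if fine_tuning else row.deployment_regions
    return hub_region in regions
```

With this row, a Hub/Project in Sweden Central qualifies for deployment but not for fine-tuning, which the table lists only for West US 3.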

Cohere models

| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine-tuning |
| --- | --- | --- | --- |
| Cohere Command R+ 08-2024 | Microsoft Managed countries/regions | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Cohere Command R 08-2024 | Microsoft Managed countries/regions | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Cohere Command R+ | Microsoft Managed countries/regions, Japan, Qatar | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Cohere Command R | Microsoft Managed countries/regions, Japan, Qatar | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Cohere Rerank v3 - English | Microsoft Managed countries/regions, Japan, Qatar | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Cohere Rerank v3 - Multilingual | Microsoft Managed countries/regions, Japan, Qatar | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Cohere Embed v3 - English | Microsoft Managed countries/regions, Japan, Qatar | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Cohere Embed v3 - Multilingual | Microsoft Managed countries/regions, Japan, Qatar | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |

Gretel models

| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine-tuning |
| --- | --- | --- | --- |
| Gretel-Navigator | Microsoft Managed Countries | East US 2 | Not available |

JAIS models

| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine-tuning |
| --- | --- | --- | --- |
| JAIS 30B Chat | Microsoft Managed countries/regions, Egypt | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |

Meta Llama models

| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine-tuning |
| --- | --- | --- | --- |
| Llama 2 7B, Llama 2 13B, Llama 2 70B | Microsoft Managed countries/regions | East US, East US 2, North Central US, South Central US, West US, West US 3 | West US 3 |
| Llama 2 7B Chat, Llama 2 70B Chat | Microsoft Managed countries/regions | East US, East US 2, North Central US, South Central US, West US, West US 3 | West US 3 |
| Llama 3 8B Instruct, Llama 3 70B Instruct, Llama-3.2-1B-Instruct, Llama-3.2-3B-Instruct, Llama-3.3-70B-Instruct, Llama-Guard-3-11B-Vision, Llama-Guard-3-1B, Llama-3.2-3B, Llama-3.2-1B | Microsoft Managed countries/regions | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Llama 3.1 8B Instruct, Llama 3.1 70B Instruct, Llama-3.2-11B-Vision-Instruct, Llama-3.2-90B-Vision-Instruct, Llama 3.3 70B Instruct | Microsoft Managed countries/regions | East US, East US 2, North Central US, South Central US, West US, West US 3, Sweden Central | West US 3 |
| Llama 3.1 405B Instruct | Microsoft Managed countries/regions | East US, East US 2, North Central US, South Central US, West US, West US 3 | Not available |

Microsoft Phi family models

| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine-tuning |
| --- | --- | --- | --- |
| Phi-4 | Not applicable | East US 2, Sweden Central | Not available |
| Phi-3.5-vision-Instruct | Not applicable | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Phi-3.5-MoE-Instruct | Not applicable | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | East US 2 |
| Phi-3.5-Mini-Instruct | Not applicable | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | East US 2 |
| Phi-3-Mini-4k-Instruct, Phi-3-Mini-128K-Instruct | Not applicable | East US 2, Sweden Central | East US 2 |
| Phi-3-Small-8K-Instruct, Phi-3-Small-128K-Instruct | Not applicable | East US 2, Sweden Central | Not available |
| Phi-3-Medium-4K-Instruct, Phi-3-Medium-128K-Instruct | Not applicable | East US 2, Sweden Central | East US 2 |

Mistral models

| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine-tuning |
| --- | --- | --- | --- |
| Codestral-2501 | Microsoft Managed Countries, Brazil, Hong Kong, Israel | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Mistral Nemo | Microsoft Managed Countries, Brazil, Hong Kong, Israel | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Ministral-3B | Microsoft Managed Countries, Brazil, Hong Kong, Israel | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Mistral Small | Microsoft Managed Countries, Brazil, Hong Kong, Israel | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |
| Mistral Large, Mistral-Large (2407), Mistral-Large (2411) | Microsoft Managed Countries, Brazil, Hong Kong, Israel | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |

Nixtla models

| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine-tuning |
| --- | --- | --- | --- |
| TimeGEN-1 | Microsoft Managed countries/regions, Mexico, Israel | East US, East US 2, North Central US, South Central US, Sweden Central, West US, West US 3 | Not available |

NTTDATA models

| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine-tuning |
| --- | --- | --- | --- |
| tsuzumi-7b | Microsoft Managed countries/regions | East US 2, South Central US, East US, West US 3, West US, North Central US | Not available |

AI21 models

| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine-tuning |
| --- | --- | --- | --- |
| AI21-Jamba-1.5-Mini | Microsoft Managed countries/regions | East US 2, South Central US, East US, West US 3, West US, North Central US | Not available |
| AI21-Jamba-1.5-Large | Microsoft Managed countries/regions | East US 2, South Central US, East US, West US 3, West US, North Central US | Not available |

Bria models

| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine-tuning |
| --- | --- | --- | --- |
| Bria-2.3-Fast | Microsoft Managed Countries | East US 2 | Not available |

Alternatives to region availability

If most of your infrastructure is in a particular region and you want to use models that are available only as serverless API endpoints in other regions, you can create a hub or project in a supported region and then consume the endpoint from your own region.

Read Consume serverless API endpoints from a different hub or project to learn how to configure an existing serverless API endpoint in a different hub or project than the one where it was deployed.
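
As a rough planning aid for the approach above, the following sketch decides whether a hub or project in a different region is needed. It makes no Azure calls: `choose_hub_region` is a hypothetical helper, the Phi-4 region set is copied by hand from the Microsoft Phi family table, and "West Europe" is used only as an example of a home region that does not appear in that row.

```python
def choose_hub_region(home_region: str, deployment_regions: set[str],
                      preferred: tuple[str, ...] = ()) -> tuple[str, bool]:
    """Return (region for the hub/project, whether consumption is cross-region)."""
    if not deployment_regions:
        raise ValueError("the model has no supported deployment region")
    # The home region is supported: deploy there, no cross-region setup needed.
    if home_region in deployment_regions:
        return home_region, False
    # Otherwise take the first supported region from the caller's preferences.
    for region in preferred:
        if region in deployment_regions:
            return region, True
    # No usable preference: fall back to the alphabetically first region.
    return sorted(deployment_regions)[0], True


# Deployment regions copied from the Phi-4 row of the Phi family table.
PHI_4_DEPLOYMENT_REGIONS = {"East US 2", "Sweden Central"}

region, cross_region = choose_hub_region(
    "West Europe", PHI_4_DEPLOYMENT_REGIONS, preferred=("Sweden Central",)
)
```

When `cross_region` is true, the endpoint lives in the chosen region and is consumed from the home region as described in the article referenced above.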