Model Capacities - List
Lists the ModelSkuCapacity entries available to a subscription for a given model format, name, and version.
GET https://management.azure.com/subscriptions/{subscriptionId}/providers/Microsoft.CognitiveServices/modelCapacities?api-version=2024-04-01-preview&modelFormat={modelFormat}&modelName={modelName}&modelVersion={modelVersion}
URI Parameters
Name | In | Required | Type | Description |
---|---|---|---|---|
subscriptionId | path | True | string | The ID of the target subscription. |
api-version | query | True | string | The API version to use for this operation. |
modelFormat | query | True | string | The format of the model. Regex pattern: |
modelName | query | True | string | The name of the model. Regex pattern: |
modelVersion | query | True | string | The version of the model. Regex pattern: |
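The request URI can be assembled from the parameters in the table above. The following is a minimal sketch rather than an official client: the function name `build_model_capacities_url` and its default `api_version` are assumptions made for illustration, and authentication (a bearer token in the `Authorization` header) is left out.

```python
from urllib.parse import quote, urlencode

# Azure Resource Manager endpoint, as shown in the request line of this page.
ARM_ENDPOINT = "https://management.azure.com"


def build_model_capacities_url(subscription_id, model_format, model_name,
                               model_version, api_version="2024-04-01-preview"):
    """Build the GET URL for Model Capacities - List (hypothetical helper)."""
    # The subscription ID is a path segment, so escape it fully.
    path = (
        f"/subscriptions/{quote(subscription_id, safe='')}"
        "/providers/Microsoft.CognitiveServices/modelCapacities"
    )
    # All four query parameters are marked Required in the table.
    query = urlencode({
        "api-version": api_version,
        "modelFormat": model_format,
        "modelName": model_name,
        "modelVersion": model_version,
    })
    return f"{ARM_ENDPOINT}{path}?{query}"
```

Called with the values from the sample request on this page (`OpenAI`, `ada`, `1`), the helper reproduces that sample URL.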
Responses
Name | Type | Description |
---|---|---|
200 OK | ModelCapacityListResult | OK. Successfully retrieved modelCapacities. |
Other Status Codes | ErrorResponse | Error response describing why the operation failed. |
Examples
ListModelCapacities
Sample request
GET https://management.azure.com/subscriptions/00000000-0000-0000-0000-000000000000/providers/Microsoft.CognitiveServices/modelCapacities?api-version=2024-04-01-preview&modelFormat=OpenAI&modelName=ada&modelVersion=1
Sample response
{
  "value": [
    {
      "id": "/subscriptions/{subscriptionContext.SubscriptionId}/providers/Microsoft.CognitiveServices/locations/WestUS/models/OpenAI.ada.1/skuCapacities/Standard",
      "type": "Microsoft.CognitiveServices/locations/models/skuCapacities",
      "name": "Standard",
      "location": "WestUS",
      "properties": {
        "model": {
          "format": "OpenAI",
          "name": "ada",
          "version": "1"
        },
        "skuName": "Standard",
        "availableCapacity": 300,
        "availableFinetuneCapacity": 20
      }
    }
  ]
}
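A response shaped like the sample above can be reduced to one row per location and SKU. This is an illustrative sketch only: `summarize_capacities` is a hypothetical helper, and `SAMPLE` is a trimmed copy of the sample response (the `id` and `type` fields are omitted for brevity).

```python
import json

# Trimmed copy of the sample response shown on this page.
SAMPLE = """
{
  "value": [
    {
      "name": "Standard",
      "location": "WestUS",
      "properties": {
        "model": {"format": "OpenAI", "name": "ada", "version": "1"},
        "skuName": "Standard",
        "availableCapacity": 300,
        "availableFinetuneCapacity": 20
      }
    }
  ]
}
"""


def summarize_capacities(response_body):
    """Flatten a ModelCapacityListResult JSON body into simple dict rows."""
    payload = json.loads(response_body)
    rows = []
    for item in payload.get("value", []):
        props = item.get("properties", {})
        model = props.get("model", {})
        rows.append({
            "location": item.get("location"),
            "sku": props.get("skuName"),
            # Join format, name, and version the same way the resource ID does.
            "model": ".".join(
                str(model.get(key)) for key in ("format", "name", "version")
            ),
            "availableCapacity": props.get("availableCapacity"),
            "availableFinetuneCapacity": props.get("availableFinetuneCapacity"),
        })
    return rows
```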
Definitions
Name | Description |
---|---|
CallRateLimit | The call rate limit Cognitive Services account. |
DeploymentModel | Properties of Cognitive Services account deployment model. |
ErrorAdditionalInfo | The resource management error additional info. |
ErrorDetail | The error detail. |
ErrorResponse | Error response |
ModelCapacityListResult | The list of cognitive services accounts operation response. |
ModelSkuCapacityProperties | Cognitive Services account ModelSkuCapacity. |
RequestMatchPattern | |
ThrottlingRule | |
Value | Gets the list of Cognitive Services accounts ModelSkuCapacity. |
CallRateLimit
The call rate limit Cognitive Services account.
Name | Type | Description |
---|---|---|
count | number | The count value of Call Rate Limit. |
renewalPeriod | number | The renewal period in seconds of Call Rate Limit. |
rules | ThrottlingRule[] | |
DeploymentModel
Properties of Cognitive Services account deployment model.
Name | Type | Description |
---|---|---|
callRateLimit | CallRateLimit | The call rate limit Cognitive Services account. |
format | string | Deployment model format. |
name | string | Deployment model name. |
source | string | Optional. Deployment model source ARM resource ID. |
version | string | Optional. Deployment model version. If no version is specified, a default version is assigned. The default version differs between models and might change when a new version becomes available for a model. The default version for a model can be found through the list models API. |
ErrorAdditionalInfo
The resource management error additional info.
Name | Type | Description |
---|---|---|
info | object | The additional info. |
type | string | The additional info type. |
ErrorDetail
The error detail.
Name | Type | Description |
---|---|---|
additionalInfo | ErrorAdditionalInfo[] | The error additional info. |
code | string | The error code. |
details | ErrorDetail[] | The error details. |
message | string | The error message. |
target | string | The error target. |
ErrorResponse
Error response
Name | Type | Description |
---|---|---|
error | ErrorDetail | The error object. |
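Because an ErrorDetail can carry nested `details`, a failed call is easier to read when the tree is flattened into indented lines. The sketch below assumes the error body follows the ErrorResponse and ErrorDetail shapes defined here; `describe_error` is a hypothetical helper, and the error codes used in the usage example are invented for illustration.

```python
def describe_error(error_response):
    """Render an ErrorResponse dict as indented 'code: message' lines."""

    def walk(detail, depth):
        line = "  " * depth + f"{detail.get('code', '?')}: {detail.get('message', '')}"
        if detail.get("target"):
            line += f" (target: {detail['target']})"
        lines = [line]
        # 'details' holds nested ErrorDetail objects; it may be absent or null.
        for child in detail.get("details") or []:
            lines.extend(walk(child, depth + 1))
        return lines

    return "\n".join(walk(error_response.get("error", {}), 0))


# Usage with an invented payload (codes and messages are not from the service):
example = {
    "error": {
        "code": "ExampleOuterCode",
        "message": "The request failed.",
        "details": [
            {"code": "ExampleInnerCode", "message": "Bad value.", "target": "modelName"}
        ],
    }
}
print(describe_error(example))
```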
ModelCapacityListResult
The list of cognitive services accounts operation response.
Name | Type | Description |
---|---|---|
nextLink | string | The link used to get the next page of ModelSkuCapacity. |
value | Value[] | Gets the list of Cognitive Services accounts ModelSkuCapacity. |
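Since results may span several pages, a caller would typically keep requesting `nextLink` until it is absent. The following sketch shows that loop under stated assumptions: `iter_model_capacities` and the `fetch_page` callable are hypothetical, and `fetch_page` stands in for whatever authenticated HTTP GET the caller uses, returning the decoded JSON body as a dict.

```python
from typing import Callable, Iterator, Optional


def iter_model_capacities(first_url: str,
                          fetch_page: Callable[[str], dict]) -> Iterator[dict]:
    """Yield every item in 'value' across all pages, following 'nextLink'."""
    url: Optional[str] = first_url
    while url:
        page = fetch_page(url)
        yield from page.get("value", [])
        # A missing or empty nextLink ends the iteration.
        url = page.get("nextLink")


# Usage with an in-memory stand-in for the service (URLs are placeholders):
fake_pages = {
    "page-1": {"value": [{"name": "Standard"}], "nextLink": "page-2"},
    "page-2": {"value": [{"name": "Other"}]},
}
names = [item["name"] for item in iter_model_capacities("page-1", fake_pages.__getitem__)]
```

Injecting `fetch_page` keeps the paging logic independent of any particular HTTP library and makes it straightforward to exercise without network access.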
ModelSkuCapacityProperties
Cognitive Services account ModelSkuCapacity.
Name | Type | Description |
---|---|---|
availableCapacity | number | The available capacity for deployment with this model and SKU. |
availableFinetuneCapacity | number | The available capacity for deployment with a fine-tuned version of this model and SKU. |
model | DeploymentModel | Properties of Cognitive Services account deployment model. |
skuName | string | |
RequestMatchPattern
Name | Type | Description |
---|---|---|
method | string | |
path | string | |
ThrottlingRule
Name | Type | Description |
---|---|---|
count | number | |
dynamicThrottlingEnabled | boolean | |
key | string | |
matchPatterns | RequestMatchPattern[] | |
minCount | number | |
renewalPeriod | number | |
Value
Gets the list of Cognitive Services accounts ModelSkuCapacity.
Name | Type | Description |
---|---|---|
id | string | Fully qualified resource ID for the resource. Ex - /subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/{resourceProviderNamespace}/{resourceType}/{resourceName} |
location | string | The location of the Model Sku Capacity. |
name | string | The name of the resource. |
properties | ModelSkuCapacityProperties | Cognitive Services account ModelSkuCapacity. |
type | string | The type of the resource. E.g. "Microsoft.Compute/virtualMachines" or "Microsoft.Storage/storageAccounts" |