Microsoft.MachineLearningServices workspaces/onlineEndpoints/deployments 2021-03-01-preview
Bicep resource definition
The workspaces/onlineEndpoints/deployments resource type can be deployed with operations that target:
- Resource groups - See resource group deployment commands
For a list of changed properties in each API version, see change log.
Resource format
To create a Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments resource, add the following Bicep to your template.
resource symbolicname 'Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments@2021-03-01-preview' = {
parent: resourceSymbolicName
identity: {
type: 'string'
userAssignedIdentities: {
{customized property}: {
clientId: 'string'
principalId: 'string'
}
}
}
kind: 'string'
location: 'string'
name: 'string'
properties: {
appInsightsEnabled: bool
codeConfiguration: {
codeId: 'string'
scoringScript: 'string'
}
description: 'string'
environmentId: 'string'
environmentVariables: {
{customized property}: 'string'
}
livenessProbe: {
failureThreshold: int
initialDelay: 'string'
period: 'string'
successThreshold: int
timeout: 'string'
}
model: {
referenceType: 'string'
// For remaining properties, see AssetReferenceBase objects
}
properties: {
{customized property}: 'string'
}
requestSettings: {
maxConcurrentRequestsPerInstance: int
maxQueueWait: 'string'
requestTimeout: 'string'
}
scaleSettings: {
maxInstances: int
minInstances: int
scaleType: 'string'
// For remaining properties, see OnlineScaleSettings objects
}
endpointComputeType: 'string'
// For remaining properties, see OnlineDeployment objects
}
tags: {
{customized property}: 'string'
}
}
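As a rough illustration of how the placeholders above might be filled in, the following sketch declares a managed deployment under an existing endpoint. The parameter names, the deployment name 'blue', and the instance type are assumptions chosen for illustration, not values taken from this reference. The API version on the two `existing` parent declarations is also an assumption.

```bicep
// Sketch only: every name and value below is a placeholder assumption.
param location string = resourceGroup().location
param workspaceName string
param endpointName string
param modelId string        // ARM resource ID of the model asset
param codeId string         // ARM resource ID of the code asset
param environmentId string  // ARM resource ID of the environment specification

// Parent resources are assumed to exist already.
resource workspace 'Microsoft.MachineLearningServices/workspaces@2021-03-01-preview' existing = {
  name: workspaceName
}

resource endpoint 'Microsoft.MachineLearningServices/workspaces/onlineEndpoints@2021-03-01-preview' existing = {
  parent: workspace
  name: endpointName
}

resource deployment 'Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments@2021-03-01-preview' = {
  parent: endpoint
  name: 'blue'
  location: location
  properties: {
    endpointComputeType: 'Managed'
    instanceType: 'Standard_DS3_v2' // illustrative size
    appInsightsEnabled: true
    environmentId: environmentId
    codeConfiguration: {
      codeId: codeId
      scoringScript: 'score.py'
    }
    model: {
      referenceType: 'Id'
      assetId: modelId
    }
    scaleSettings: {
      scaleType: 'Manual'
      instanceCount: 1
    }
  }
}
```

Because the deployment is declared outside its parent, the `parent` property is set to the endpoint's symbolic name.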
OnlineScaleSettings objects
Set the scaleType property to specify the type of object.
For Auto, use:
{
pollingInterval: 'string'
scaleType: 'Auto'
targetUtilizationPercentage: int
}
For Manual, use:
{
instanceCount: int
scaleType: 'Manual'
}
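For instance, an autoscale configuration could look like the fragment below. The numeric values and the polling interval are illustrative assumptions. The variable name is hypothetical, and the object would be assigned to `scaleSettings` in the deployment's properties.

```bicep
// Hypothetical autoscale settings; values are illustrative only.
var autoScaleSettings = {
  scaleType: 'Auto'
  minInstances: 1
  maxInstances: 3
  pollingInterval: 'PT10S'        // ISO 8601 duration, second precision
  targetUtilizationPercentage: 70 // target CPU usage for the autoscaler
}
```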
AssetReferenceBase objects
Set the referenceType property to specify the type of object.
For DataPath, use:
{
datastoreId: 'string'
path: 'string'
referenceType: 'DataPath'
}
For Id, use:
{
assetId: 'string'
referenceType: 'Id'
}
For OutputPath, use:
{
jobId: 'string'
path: 'string'
referenceType: 'OutputPath'
}
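The three reference shapes can be sketched side by side as follows. The parameter names, variable names, and file paths are hypothetical, and only one of these objects would be assigned to `model` in a given deployment.

```bicep
// Hypothetical inputs; supply real ARM resource IDs at deployment time.
param datastoreId string
param modelAssetId string
param jobId string

// Model stored as a file in a datastore.
var modelFromDatastore = {
  referenceType: 'DataPath'
  datastoreId: datastoreId
  path: 'models/model.pkl' // illustrative path
}

// Model referenced directly by its asset ID.
var modelFromAsset = {
  referenceType: 'Id'
  assetId: modelAssetId
}

// Model produced as the output of a job.
var modelFromJobOutput = {
  referenceType: 'OutputPath'
  jobId: jobId
  path: 'outputs/model' // illustrative path
}
```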
OnlineDeployment objects
Set the endpointComputeType property to specify the type of object.
For K8S, use:
{
containerResourceRequirements: {
cpu: int
cpuLimit: int
fpga: int
gpu: int
memoryInGB: int
memoryInGBLimit: int
}
endpointComputeType: 'K8S'
}
For Managed, use:
{
endpointComputeType: 'Managed'
instanceType: 'string'
readinessProbe: {
failureThreshold: int
initialDelay: 'string'
period: 'string'
successThreshold: int
timeout: 'string'
}
}
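As a sketch of the K8S variant, the fragment below sets resource requests and limits for each container instance. All numbers are illustrative assumptions and the variable name is hypothetical. In a real template these properties sit alongside the common OnlineDeployment properties.

```bicep
// Hypothetical K8S-specific deployment properties; values are illustrative.
var k8sDeploymentProperties = {
  endpointComputeType: 'K8S'
  containerResourceRequirements: {
    cpu: 1             // minimum CPU cores
    cpuLimit: 2        // maximum CPU cores
    memoryInGB: 2      // minimum memory
    memoryInGBLimit: 4 // maximum memory
    gpu: 0
    fpga: 0            // must be a multiple of 2
  }
}
```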
Property values
AssetReferenceBase
Name | Description | Value |
---|---|---|
referenceType | Set to 'DataPath' for type DataPathAssetReference. Set to 'Id' for type IdAssetReference. Set to 'OutputPath' for type OutputPathAssetReference. | 'DataPath' 'Id' 'OutputPath' (required) |
AutoScaleSettings
Name | Description | Value |
---|---|---|
pollingInterval | The polling interval in ISO 8601 format. Supports durations with precision no finer than seconds. | string |
scaleType | [Required] Type of deployment scaling algorithm | 'Auto' (required) |
targetUtilizationPercentage | Target CPU usage for the autoscaler. | int |
CodeConfiguration
Name | Description | Value |
---|---|---|
codeId | ARM resource ID of the code asset. | string |
scoringScript | [Required] The script to execute on startup, e.g. "score.py". | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
ContainerResourceRequirements
Name | Description | Value |
---|---|---|
cpu | The minimum amount of CPU cores to be used by the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ | int |
cpuLimit | The maximum amount of CPU cores allowed to be used by the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ | int |
fpga | The number of FPGA PCIE devices exposed to the container. Must be a multiple of 2. | int |
gpu | The number of GPU cores in the container. | int |
memoryInGB | The minimum amount of memory (in GB) to be used by the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ | int |
memoryInGBLimit | The maximum amount of memory (in GB) allowed to be used by the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ | int |
DataPathAssetReference
Name | Description | Value |
---|---|---|
datastoreId | ARM resource ID of the datastore where the asset is located. | string |
path | The path of the file/directory in the datastore. | string |
referenceType | [Required] Specifies the type of asset reference. | 'DataPath' (required) |
IdAssetReference
Name | Description | Value |
---|---|---|
assetId | [Required] ARM resource ID of the asset. | string Constraints: Pattern = [a-zA-Z0-9_] (required) |
referenceType | [Required] Specifies the type of asset reference. | 'Id' (required) |
K8SOnlineDeployment
Name | Description | Value |
---|---|---|
containerResourceRequirements | Resource requirements for each container instance within an online deployment. | ContainerResourceRequirements |
endpointComputeType | [Required] The compute type of the endpoint. | 'K8S' (required) |
ManagedOnlineDeployment
Name | Description | Value |
---|---|---|
endpointComputeType | [Required] The compute type of the endpoint. | 'Managed' (required) |
instanceType | Compute instance type. | string |
readinessProbe | Deployment container liveness/readiness probe configuration. | ProbeSettings |
ManualScaleSettings
Name | Description | Value |
---|---|---|
instanceCount | Fixed number of instances for this deployment. | int |
scaleType | [Required] Type of deployment scaling algorithm | 'Manual' (required) |
Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments
Name | Description | Value |
---|---|---|
identity | Service identity associated with a resource. | ResourceIdentity |
kind | Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type. | string |
location | The geo-location where the resource lives | string (required) |
name | The resource name | string Constraints: Pattern = ^[a-zA-Z0-9][a-zA-Z0-9\-_]{0,254}$ (required) |
parent | In Bicep, you can specify the parent resource for a child resource. You only need to add this property when the child resource is declared outside of the parent resource. For more information, see Child resource outside parent resource. | Symbolic name for resource of type: workspaces/onlineEndpoints |
properties | [Required] Additional attributes of the entity. | OnlineDeployment (required) |
tags | Resource tags | Dictionary of tag names and values. See Tags in templates |
OnlineDeployment
Name | Description | Value |
---|---|---|
appInsightsEnabled | If true, enables Application Insights logging. | bool |
codeConfiguration | Code configuration for the endpoint deployment. | CodeConfiguration |
description | Description of the endpoint deployment. | string |
endpointComputeType | Set to 'K8S' for type K8SOnlineDeployment. Set to 'Managed' for type ManagedOnlineDeployment. | 'K8S' 'Managed' (required) |
environmentId | ARM resource ID of the environment specification for the endpoint deployment. | string |
environmentVariables | Environment variables configuration for the deployment. | OnlineDeploymentEnvironmentVariables |
livenessProbe | Deployment container liveness/readiness probe configuration. | ProbeSettings |
model | Reference to the model asset for the endpoint deployment. | AssetReferenceBase |
properties | Property dictionary. Properties can be added, but not removed or altered. | OnlineDeploymentProperties |
requestSettings | Online deployment scoring requests configuration. | OnlineRequestSettings |
scaleSettings | Online deployment scaling configuration. | OnlineScaleSettings |
OnlineDeploymentEnvironmentVariables
Name | Description | Value |
---|
OnlineDeploymentProperties
Name | Description | Value |
---|
OnlineRequestSettings
Name | Description | Value |
---|---|---|
maxConcurrentRequestsPerInstance | The number of requests allowed to queue at once for this deployment. | int |
maxQueueWait | The maximum queue wait time in ISO 8601 format. Supports millisecond precision. | string |
requestTimeout | The request timeout in ISO 8601 format. Supports millisecond precision. | string |
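The duration-valued request settings might be written as in the fragment below. The specific limits and the variable name are assumptions for illustration. The fractional-second value shows the millisecond precision mentioned in the table.

```bicep
// Hypothetical request settings; values are illustrative only.
var requestSettings = {
  maxConcurrentRequestsPerInstance: 1
  maxQueueWait: 'PT0.5S' // 500 milliseconds, ISO 8601 duration
  requestTimeout: 'PT5S' // 5 seconds, ISO 8601 duration
}
```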
OnlineScaleSettings
Name | Description | Value |
---|---|---|
maxInstances | Maximum number of instances for this deployment. | int |
minInstances | Minimum number of instances for this deployment. | int |
scaleType | Set to 'Auto' for type AutoScaleSettings. Set to 'Manual' for type ManualScaleSettings. | 'Auto' 'Manual' (required) |
OutputPathAssetReference
Name | Description | Value |
---|---|---|
jobId | ARM resource ID of the job. | string |
path | The path of the file/directory in the job output. | string |
referenceType | [Required] Specifies the type of asset reference. | 'OutputPath' (required) |
ProbeSettings
Name | Description | Value |
---|---|---|
failureThreshold | The number of failures to allow before returning an unhealthy status. | int |
initialDelay | The delay before the first probe in ISO 8601 format. | string |
period | The length of time between probes in ISO 8601 format. | string |
successThreshold | The number of successful probes before returning a healthy status. | int |
timeout | The probe timeout in ISO 8601 format. | string |
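A probe configuration with ISO 8601 durations could be sketched as follows. The thresholds and timings are illustrative assumptions and the variable name is hypothetical. The same shape applies to both `livenessProbe` and `readinessProbe`.

```bicep
// Hypothetical probe settings; tune the values for your container.
var probeSettings = {
  initialDelay: 'PT30S' // wait 30 seconds before the first probe
  period: 'PT10S'       // probe every 10 seconds
  timeout: 'PT2S'       // each probe times out after 2 seconds
  failureThreshold: 3   // failures allowed before reporting unhealthy
  successThreshold: 1   // successes needed before reporting healthy
}
```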
ResourceIdentity
Name | Description | Value |
---|---|---|
type | Defines values for a ResourceIdentity's type. | 'None' 'SystemAssigned' 'SystemAssigned,UserAssigned' 'UserAssigned' |
userAssignedIdentities | Dictionary of the user assigned identities, key is ARM resource ID of the UAI. | ResourceIdentityUserAssignedIdentities |
ResourceIdentityUserAssignedIdentities
Name | Description | Value |
---|
TrackedResourceTags
Name | Description | Value |
---|
UserAssignedIdentityMeta
Name | Description | Value |
---|---|---|
clientId | Aka application ID, a unique identifier generated by Azure AD that is tied to an application and service principal during its initial provisioning. | string |
principalId | The object ID of the service principal object for your managed identity that is used to grant role-based access to an Azure resource. | string |
ARM template resource definition
The workspaces/onlineEndpoints/deployments resource type can be deployed with operations that target:
- Resource groups - See resource group deployment commands
For a list of changed properties in each API version, see change log.
Resource format
To create a Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments resource, add the following JSON to your template.
{
"type": "Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments",
"apiVersion": "2021-03-01-preview",
"name": "string",
"identity": {
"type": "string",
"userAssignedIdentities": {
"{customized property}": {
"clientId": "string",
"principalId": "string"
}
}
},
"kind": "string",
"location": "string",
"properties": {
"appInsightsEnabled": "bool",
"codeConfiguration": {
"codeId": "string",
"scoringScript": "string"
},
"description": "string",
"environmentId": "string",
"environmentVariables": {
"{customized property}": "string"
},
"livenessProbe": {
"failureThreshold": "int",
"initialDelay": "string",
"period": "string",
"successThreshold": "int",
"timeout": "string"
},
"model": {
"referenceType": "string"
// For remaining properties, see AssetReferenceBase objects
},
"properties": {
"{customized property}": "string"
},
"requestSettings": {
"maxConcurrentRequestsPerInstance": "int",
"maxQueueWait": "string",
"requestTimeout": "string"
},
"scaleSettings": {
"maxInstances": "int",
"minInstances": "int",
"scaleType": "string"
// For remaining properties, see OnlineScaleSettings objects
},
"endpointComputeType": "string"
// For remaining properties, see OnlineDeployment objects
},
"tags": {
"{customized property}": "string"
}
}
OnlineScaleSettings objects
Set the scaleType property to specify the type of object.
For Auto, use:
{
"pollingInterval": "string",
"scaleType": "Auto",
"targetUtilizationPercentage": "int"
}
For Manual, use:
{
"instanceCount": "int",
"scaleType": "Manual"
}
AssetReferenceBase objects
Set the referenceType property to specify the type of object.
For DataPath, use:
{
"datastoreId": "string",
"path": "string",
"referenceType": "DataPath"
}
For Id, use:
{
"assetId": "string",
"referenceType": "Id"
}
For OutputPath, use:
{
"jobId": "string",
"path": "string",
"referenceType": "OutputPath"
}
OnlineDeployment objects
Set the endpointComputeType property to specify the type of object.
For K8S, use:
{
"containerResourceRequirements": {
"cpu": "int",
"cpuLimit": "int",
"fpga": "int",
"gpu": "int",
"memoryInGB": "int",
"memoryInGBLimit": "int"
},
"endpointComputeType": "K8S"
}
For Managed, use:
{
"endpointComputeType": "Managed",
"instanceType": "string",
"readinessProbe": {
"failureThreshold": "int",
"initialDelay": "string",
"period": "string",
"successThreshold": "int",
"timeout": "string"
}
}
Property values
AssetReferenceBase
Name | Description | Value |
---|---|---|
referenceType | Set to 'DataPath' for type DataPathAssetReference. Set to 'Id' for type IdAssetReference. Set to 'OutputPath' for type OutputPathAssetReference. | 'DataPath' 'Id' 'OutputPath' (required) |
AutoScaleSettings
Name | Description | Value |
---|---|---|
pollingInterval | The polling interval in ISO 8601 format. Supports durations with precision no finer than seconds. | string |
scaleType | [Required] Type of deployment scaling algorithm | 'Auto' (required) |
targetUtilizationPercentage | Target CPU usage for the autoscaler. | int |
CodeConfiguration
Name | Description | Value |
---|---|---|
codeId | ARM resource ID of the code asset. | string |
scoringScript | [Required] The script to execute on startup, e.g. "score.py". | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
ContainerResourceRequirements
Name | Description | Value |
---|---|---|
cpu | The minimum amount of CPU cores to be used by the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ | int |
cpuLimit | The maximum amount of CPU cores allowed to be used by the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ | int |
fpga | The number of FPGA PCIE devices exposed to the container. Must be a multiple of 2. | int |
gpu | The number of GPU cores in the container. | int |
memoryInGB | The minimum amount of memory (in GB) to be used by the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ | int |
memoryInGBLimit | The maximum amount of memory (in GB) allowed to be used by the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ | int |
DataPathAssetReference
Name | Description | Value |
---|---|---|
datastoreId | ARM resource ID of the datastore where the asset is located. | string |
path | The path of the file/directory in the datastore. | string |
referenceType | [Required] Specifies the type of asset reference. | 'DataPath' (required) |
IdAssetReference
Name | Description | Value |
---|---|---|
assetId | [Required] ARM resource ID of the asset. | string Constraints: Pattern = [a-zA-Z0-9_] (required) |
referenceType | [Required] Specifies the type of asset reference. | 'Id' (required) |
K8SOnlineDeployment
Name | Description | Value |
---|---|---|
containerResourceRequirements | Resource requirements for each container instance within an online deployment. | ContainerResourceRequirements |
endpointComputeType | [Required] The compute type of the endpoint. | 'K8S' (required) |
ManagedOnlineDeployment
Name | Description | Value |
---|---|---|
endpointComputeType | [Required] The compute type of the endpoint. | 'Managed' (required) |
instanceType | Compute instance type. | string |
readinessProbe | Deployment container liveness/readiness probe configuration. | ProbeSettings |
ManualScaleSettings
Name | Description | Value |
---|---|---|
instanceCount | Fixed number of instances for this deployment. | int |
scaleType | [Required] Type of deployment scaling algorithm | 'Manual' (required) |
Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments
Name | Description | Value |
---|---|---|
apiVersion | The api version | '2021-03-01-preview' |
identity | Service identity associated with a resource. | ResourceIdentity |
kind | Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type. | string |
location | The geo-location where the resource lives | string (required) |
name | The resource name | string Constraints: Pattern = ^[a-zA-Z0-9][a-zA-Z0-9\-_]{0,254}$ (required) |
properties | [Required] Additional attributes of the entity. | OnlineDeployment (required) |
tags | Resource tags | Dictionary of tag names and values. See Tags in templates |
type | The resource type | 'Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments' |
OnlineDeployment
Name | Description | Value |
---|---|---|
appInsightsEnabled | If true, enables Application Insights logging. | bool |
codeConfiguration | Code configuration for the endpoint deployment. | CodeConfiguration |
description | Description of the endpoint deployment. | string |
endpointComputeType | Set to 'K8S' for type K8SOnlineDeployment. Set to 'Managed' for type ManagedOnlineDeployment. | 'K8S' 'Managed' (required) |
environmentId | ARM resource ID of the environment specification for the endpoint deployment. | string |
environmentVariables | Environment variables configuration for the deployment. | OnlineDeploymentEnvironmentVariables |
livenessProbe | Deployment container liveness/readiness probe configuration. | ProbeSettings |
model | Reference to the model asset for the endpoint deployment. | AssetReferenceBase |
properties | Property dictionary. Properties can be added, but not removed or altered. | OnlineDeploymentProperties |
requestSettings | Online deployment scoring requests configuration. | OnlineRequestSettings |
scaleSettings | Online deployment scaling configuration. | OnlineScaleSettings |
OnlineDeploymentEnvironmentVariables
Name | Description | Value |
---|
OnlineDeploymentProperties
Name | Description | Value |
---|
OnlineRequestSettings
Name | Description | Value |
---|---|---|
maxConcurrentRequestsPerInstance | The number of requests allowed to queue at once for this deployment. | int |
maxQueueWait | The maximum queue wait time in ISO 8601 format. Supports millisecond precision. | string |
requestTimeout | The request timeout in ISO 8601 format. Supports millisecond precision. | string |
OnlineScaleSettings
Name | Description | Value |
---|---|---|
maxInstances | Maximum number of instances for this deployment. | int |
minInstances | Minimum number of instances for this deployment. | int |
scaleType | Set to 'Auto' for type AutoScaleSettings. Set to 'Manual' for type ManualScaleSettings. | 'Auto' 'Manual' (required) |
OutputPathAssetReference
Name | Description | Value |
---|---|---|
jobId | ARM resource ID of the job. | string |
path | The path of the file/directory in the job output. | string |
referenceType | [Required] Specifies the type of asset reference. | 'OutputPath' (required) |
ProbeSettings
Name | Description | Value |
---|---|---|
failureThreshold | The number of failures to allow before returning an unhealthy status. | int |
initialDelay | The delay before the first probe in ISO 8601 format. | string |
period | The length of time between probes in ISO 8601 format. | string |
successThreshold | The number of successful probes before returning a healthy status. | int |
timeout | The probe timeout in ISO 8601 format. | string |
ResourceIdentity
Name | Description | Value |
---|---|---|
type | Defines values for a ResourceIdentity's type. | 'None' 'SystemAssigned' 'SystemAssigned,UserAssigned' 'UserAssigned' |
userAssignedIdentities | Dictionary of the user assigned identities, key is ARM resource ID of the UAI. | ResourceIdentityUserAssignedIdentities |
ResourceIdentityUserAssignedIdentities
Name | Description | Value |
---|
TrackedResourceTags
Name | Description | Value |
---|
UserAssignedIdentityMeta
Name | Description | Value |
---|---|---|
clientId | Aka application ID, a unique identifier generated by Azure AD that is tied to an application and service principal during its initial provisioning. | string |
principalId | The object ID of the service principal object for your managed identity that is used to grant role-based access to an Azure resource. | string |
Terraform (AzAPI provider) resource definition
The workspaces/onlineEndpoints/deployments resource type can be deployed with operations that target:
- Resource groups
For a list of changed properties in each API version, see change log.
Resource format
To create a Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments resource, add the following Terraform to your template.
resource "azapi_resource" "symbolicname" {
type = "Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments@2021-03-01-preview"
name = "string"
parent_id = "string"
identity = {
type = "string"
userAssignedIdentities = {
{customized property} = {
clientId = "string"
principalId = "string"
}
}
}
kind = "string"
location = "string"
tags = {
{customized property} = "string"
}
body = jsonencode({
properties = {
appInsightsEnabled = bool
codeConfiguration = {
codeId = "string"
scoringScript = "string"
}
description = "string"
environmentId = "string"
environmentVariables = {
{customized property} = "string"
}
livenessProbe = {
failureThreshold = int
initialDelay = "string"
period = "string"
successThreshold = int
timeout = "string"
}
model = {
referenceType = "string"
// For remaining properties, see AssetReferenceBase objects
}
properties = {
{customized property} = "string"
}
requestSettings = {
maxConcurrentRequestsPerInstance = int
maxQueueWait = "string"
requestTimeout = "string"
}
scaleSettings = {
maxInstances = int
minInstances = int
scaleType = "string"
// For remaining properties, see OnlineScaleSettings objects
}
endpointComputeType = "string"
// For remaining properties, see OnlineDeployment objects
}
})
}
OnlineScaleSettings objects
Set the scaleType property to specify the type of object.
For Auto, use:
{
pollingInterval = "string"
scaleType = "Auto"
targetUtilizationPercentage = int
}
For Manual, use:
{
instanceCount = int
scaleType = "Manual"
}
AssetReferenceBase objects
Set the referenceType property to specify the type of object.
For DataPath, use:
{
datastoreId = "string"
path = "string"
referenceType = "DataPath"
}
For Id, use:
{
assetId = "string"
referenceType = "Id"
}
For OutputPath, use:
{
jobId = "string"
path = "string"
referenceType = "OutputPath"
}
OnlineDeployment objects
Set the endpointComputeType property to specify the type of object.
For K8S, use:
{
containerResourceRequirements = {
cpu = int
cpuLimit = int
fpga = int
gpu = int
memoryInGB = int
memoryInGBLimit = int
}
endpointComputeType = "K8S"
}
For Managed, use:
{
endpointComputeType = "Managed"
instanceType = "string"
readinessProbe = {
failureThreshold = int
initialDelay = "string"
period = "string"
successThreshold = int
timeout = "string"
}
}
Property values
AssetReferenceBase
Name | Description | Value |
---|---|---|
referenceType | Set to 'DataPath' for type DataPathAssetReference. Set to 'Id' for type IdAssetReference. Set to 'OutputPath' for type OutputPathAssetReference. | 'DataPath' 'Id' 'OutputPath' (required) |
AutoScaleSettings
Name | Description | Value |
---|---|---|
pollingInterval | The polling interval in ISO 8601 format. Supports durations with precision no finer than seconds. | string |
scaleType | [Required] Type of deployment scaling algorithm | 'Auto' (required) |
targetUtilizationPercentage | Target CPU usage for the autoscaler. | int |
CodeConfiguration
Name | Description | Value |
---|---|---|
codeId | ARM resource ID of the code asset. | string |
scoringScript | [Required] The script to execute on startup, e.g. "score.py". | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
ContainerResourceRequirements
Name | Description | Value |
---|---|---|
cpu | The minimum amount of CPU cores to be used by the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ | int |
cpuLimit | The maximum amount of CPU cores allowed to be used by the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ | int |
fpga | The number of FPGA PCIE devices exposed to the container. Must be a multiple of 2. | int |
gpu | The number of GPU cores in the container. | int |
memoryInGB | The minimum amount of memory (in GB) to be used by the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ | int |
memoryInGBLimit | The maximum amount of memory (in GB) allowed to be used by the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/ | int |
DataPathAssetReference
Name | Description | Value |
---|---|---|
datastoreId | ARM resource ID of the datastore where the asset is located. | string |
path | The path of the file/directory in the datastore. | string |
referenceType | [Required] Specifies the type of asset reference. | 'DataPath' (required) |
IdAssetReference
Name | Description | Value |
---|---|---|
assetId | [Required] ARM resource ID of the asset. | string Constraints: Pattern = [a-zA-Z0-9_] (required) |
referenceType | [Required] Specifies the type of asset reference. | 'Id' (required) |
K8SOnlineDeployment
Name | Description | Value |
---|---|---|
containerResourceRequirements | Resource requirements for each container instance within an online deployment. | ContainerResourceRequirements |
endpointComputeType | [Required] The compute type of the endpoint. | 'K8S' (required) |
ManagedOnlineDeployment
Name | Description | Value |
---|---|---|
endpointComputeType | [Required] The compute type of the endpoint. | 'Managed' (required) |
instanceType | Compute instance type. | string |
readinessProbe | Deployment container liveness/readiness probe configuration. | ProbeSettings |
ManualScaleSettings
Name | Description | Value |
---|---|---|
instanceCount | Fixed number of instances for this deployment. | int |
scaleType | [Required] Type of deployment scaling algorithm | 'Manual' (required) |
Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments
Name | Description | Value |
---|---|---|
identity | Service identity associated with a resource. | ResourceIdentity |
kind | Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type. | string |
location | The geo-location where the resource lives | string (required) |
name | The resource name | string Constraints: Pattern = ^[a-zA-Z0-9][a-zA-Z0-9\-_]{0,254}$ (required) |
parent_id | The ID of the resource that is the parent for this resource. | ID for resource of type: workspaces/onlineEndpoints |
properties | [Required] Additional attributes of the entity. | OnlineDeployment (required) |
tags | Resource tags | Dictionary of tag names and values. |
type | The resource type | "Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments@2021-03-01-preview" |
OnlineDeployment
Name | Description | Value |
---|---|---|
appInsightsEnabled | If true, enables Application Insights logging. | bool |
codeConfiguration | Code configuration for the endpoint deployment. | CodeConfiguration |
description | Description of the endpoint deployment. | string |
endpointComputeType | Set to 'K8S' for type K8SOnlineDeployment. Set to 'Managed' for type ManagedOnlineDeployment. | 'K8S' 'Managed' (required) |
environmentId | ARM resource ID of the environment specification for the endpoint deployment. | string |
environmentVariables | Environment variables configuration for the deployment. | OnlineDeploymentEnvironmentVariables |
livenessProbe | Deployment container liveness/readiness probe configuration. | ProbeSettings |
model | Reference to the model asset for the endpoint deployment. | AssetReferenceBase |
properties | Property dictionary. Properties can be added, but not removed or altered. | OnlineDeploymentProperties |
requestSettings | Online deployment scoring requests configuration. | OnlineRequestSettings |
scaleSettings | Online deployment scaling configuration. | OnlineScaleSettings |
OnlineDeploymentEnvironmentVariables
Name | Description | Value |
---|
OnlineDeploymentProperties
Name | Description | Value |
---|
OnlineRequestSettings
Name | Description | Value |
---|---|---|
maxConcurrentRequestsPerInstance | The number of requests allowed to queue at once for this deployment. | int |
maxQueueWait | The maximum queue wait time in ISO 8601 format. Supports millisecond precision. | string |
requestTimeout | The request timeout in ISO 8601 format. Supports millisecond precision. | string |
OnlineScaleSettings
Name | Description | Value |
---|---|---|
maxInstances | Maximum number of instances for this deployment. | int |
minInstances | Minimum number of instances for this deployment. | int |
scaleType | Set to 'Auto' for type AutoScaleSettings. Set to 'Manual' for type ManualScaleSettings. | 'Auto' 'Manual' (required) |
OutputPathAssetReference
Name | Description | Value |
---|---|---|
jobId | ARM resource ID of the job. | string |
path | The path of the file/directory in the job output. | string |
referenceType | [Required] Specifies the type of asset reference. | 'OutputPath' (required) |
ProbeSettings
Name | Description | Value |
---|---|---|
failureThreshold | The number of failures to allow before returning an unhealthy status. | int |
initialDelay | The delay before the first probe in ISO 8601 format. | string |
period | The length of time between probes in ISO 8601 format. | string |
successThreshold | The number of successful probes before returning a healthy status. | int |
timeout | The probe timeout in ISO 8601 format. | string |
ResourceIdentity
Name | Description | Value |
---|---|---|
type | Defines values for a ResourceIdentity's type. | 'None' 'SystemAssigned' 'SystemAssigned,UserAssigned' 'UserAssigned' |
userAssignedIdentities | Dictionary of the user assigned identities, key is ARM resource ID of the UAI. | ResourceIdentityUserAssignedIdentities |
ResourceIdentityUserAssignedIdentities
Name | Description | Value |
---|
TrackedResourceTags
Name | Description | Value |
---|
UserAssignedIdentityMeta
Name | Description | Value |
---|---|---|
clientId | Aka application ID, a unique identifier generated by Azure AD that is tied to an application and service principal during its initial provisioning. | string |
principalId | The object ID of the service principal object for your managed identity that is used to grant role-based access to an Azure resource. | string |