Microsoft.MachineLearningServices workspaces/onlineEndpoints/deployments 2021-03-01-preview

Bicep resource definition

The workspaces/onlineEndpoints/deployments resource type can be deployed with operations that target:

For a list of changed properties in each API version, see change log.

Resource format

To create a Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments resource, add the following Bicep to your template.

resource symbolicname 'Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments@2021-03-01-preview' = {
  parent: resourceSymbolicName
  identity: {
    type: 'string'
    userAssignedIdentities: {
      {customized property}: {
        clientId: 'string'
        principalId: 'string'
      }
    }
  }
  kind: 'string'
  location: 'string'
  name: 'string'
  properties: {
    appInsightsEnabled: bool
    codeConfiguration: {
      codeId: 'string'
      scoringScript: 'string'
    }
    description: 'string'
    environmentId: 'string'
    environmentVariables: {
      {customized property}: 'string'
    }
    livenessProbe: {
      failureThreshold: int
      initialDelay: 'string'
      period: 'string'
      successThreshold: int
      timeout: 'string'
    }
    model: {
      referenceType: 'string'
      // For remaining properties, see AssetReferenceBase objects
    }
    properties: {
      {customized property}: 'string'
    }
    requestSettings: {
      maxConcurrentRequestsPerInstance: int
      maxQueueWait: 'string'
      requestTimeout: 'string'
    }
    scaleSettings: {
      maxInstances: int
      minInstances: int
      scaleType: 'string'
      // For remaining properties, see OnlineScaleSettings objects
    }
    endpointComputeType: 'string'
    // For remaining properties, see OnlineDeployment objects
  }
  tags: {
    {customized property}: 'string'
  }
}

OnlineScaleSettings objects

Set the scaleType property to specify the type of object.

For Auto, use:

{
  pollingInterval: 'string'
  scaleType: 'Auto'
  targetUtilizationPercentage: int
}

For Manual, use:

{
  instanceCount: int
  scaleType: 'Manual'
}

AssetReferenceBase objects

Set the referenceType property to specify the type of object.

For DataPath, use:

{
  datastoreId: 'string'
  path: 'string'
  referenceType: 'DataPath'
}

For Id, use:

{
  assetId: 'string'
  referenceType: 'Id'
}

For OutputPath, use:

{
  jobId: 'string'
  path: 'string'
  referenceType: 'OutputPath'
}

OnlineDeployment objects

Set the endpointComputeType property to specify the type of object.

For K8S, use:

{
  containerResourceRequirements: {
    cpu: int
    cpuLimit: int
    fpga: int
    gpu: int
    memoryInGB: int
    memoryInGBLimit: int
  }
  endpointComputeType: 'K8S'
}

For Managed, use:

{
  endpointComputeType: 'Managed'
  instanceType: 'string'
  readinessProbe: {
    failureThreshold: int
    initialDelay: 'string'
    period: 'string'
    successThreshold: int
    timeout: 'string'
  }
}

Property values

AssetReferenceBase

Name Description Value
referenceType Set to 'DataPath' for type DataPathAssetReference. Set to 'Id' for type IdAssetReference. Set to 'OutputPath' for type OutputPathAssetReference. 'DataPath'
'Id'
'OutputPath' (required)

AutoScaleSettings

Name Description Value
pollingInterval The polling interval in ISO 8691 format. Only supports duration with precision as low as Seconds. string
scaleType [Required] Type of deployment scaling algorithm 'Auto' (required)
targetUtilizationPercentage Target CPU usage for the autoscaler. int

CodeConfiguration

Name Description Value
codeId ARM resource ID of the code asset. string
scoringScript [Required] The script to execute on startup. eg. "score.py" string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)

ContainerResourceRequirements

Name Description Value
cpu The minimum amount of CPU cores to be used by the container. More info:
https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
int
cpuLimit The maximum amount of CPU cores allowed to be used by the container. More info:
https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
int
fpga The number of FPGA PCIE devices exposed to the container. Must be multiple of 2. int
gpu The number of GPU cores in the container. int
memoryInGB The minimum amount of memory (in GB) to be used by the container. More info:
https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
int
memoryInGBLimit The maximum amount of memory (in GB) allowed to be used by the container. More info:
https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
int

DataPathAssetReference

Name Description Value
datastoreId ARM resource ID of the datastore where the asset is located. string
path The path of the file/directory in the datastore. string
referenceType [Required] Specifies the type of asset reference. 'DataPath' (required)

IdAssetReference

Name Description Value
assetId [Required] ARM resource ID of the asset. string

Constraints:
Pattern = [a-zA-Z0-9_] (required)
referenceType [Required] Specifies the type of asset reference. 'Id' (required)

K8SOnlineDeployment

Name Description Value
containerResourceRequirements Resource requirements for each container instance within an online deployment. ContainerResourceRequirements
endpointComputeType [Required] The compute type of the endpoint. 'K8S' (required)

ManagedOnlineDeployment

Name Description Value
endpointComputeType [Required] The compute type of the endpoint. 'Managed' (required)
instanceType Compute instance type. string
readinessProbe Deployment container liveness/readiness probe configuration. ProbeSettings

ManualScaleSettings

Name Description Value
instanceCount Fixed number of instances for this deployment. int
scaleType [Required] Type of deployment scaling algorithm 'Manual' (required)

Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments

Name Description Value
identity Service identity associated with a resource. ResourceIdentity
kind Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type. string
location The geo-location where the resource lives string (required)
name The resource name string

Constraints:
Pattern = ^[a-zA-Z0-9][a-zA-Z0-9\-_]{0,254}$ (required)
parent In Bicep, you can specify the parent resource for a child resource. You only need to add this property when the child resource is declared outside of the parent resource.

For more information, see Child resource outside parent resource.
Symbolic name for resource of type: workspaces/onlineEndpoints
properties [Required] Additional attributes of the entity. OnlineDeployment (required)
tags Resource tags Dictionary of tag names and values. See Tags in templates

OnlineDeployment

Name Description Value
appInsightsEnabled If true, enables Application Insights logging. bool
codeConfiguration Code configuration for the endpoint deployment. CodeConfiguration
description Description of the endpoint deployment. string
endpointComputeType Set to 'K8S' for type K8SOnlineDeployment. Set to 'Managed' for type ManagedOnlineDeployment. 'K8S'
'Managed' (required)
environmentId ARM resource ID of the environment specification for the endpoint deployment. string
environmentVariables Environment variables configuration for the deployment. OnlineDeploymentEnvironmentVariables
livenessProbe Deployment container liveness/readiness probe configuration. ProbeSettings
model Reference to the model asset for the endpoint deployment. AssetReferenceBase
properties Property dictionary. Properties can be added, but not removed or altered. OnlineDeploymentProperties
requestSettings Online deployment scoring requests configuration. OnlineRequestSettings
scaleSettings Online deployment scaling configuration. OnlineScaleSettings

OnlineDeploymentEnvironmentVariables

Name Description Value

OnlineDeploymentProperties

Name Description Value

OnlineRequestSettings

Name Description Value
maxConcurrentRequestsPerInstance The number of requests allowed to queue at once for this deployment. int
maxQueueWait The maximum queue wait time in ISO 8601 format. Supports millisecond precision. string
requestTimeout The request timeout in ISO 8601 format. Supports millisecond precision. string

OnlineScaleSettings

Name Description Value
maxInstances Maximum number of instances for this deployment. int
minInstances Minimum number of instances for this deployment. int
scaleType Set to 'Auto' for type AutoScaleSettings. Set to 'Manual' for type ManualScaleSettings. 'Auto'
'Manual' (required)

OutputPathAssetReference

Name Description Value
jobId ARM resource ID of the job. string
path The path of the file/directory in the job output. string
referenceType [Required] Specifies the type of asset reference. 'OutputPath' (required)

ProbeSettings

Name Description Value
failureThreshold The number of failures to allow before returning an unhealthy status. int
initialDelay The delay before the first probe in ISO 8601 format. string
period The length of time between probes in ISO 8601 format. string
successThreshold The number of successful probes before returning a healthy status. int
timeout The probe timeout in ISO 8601 format. string

ResourceIdentity

Name Description Value
type Defines values for a ResourceIdentity's type. 'None'
'SystemAssigned'
'SystemAssigned,UserAssigned'
'UserAssigned'
userAssignedIdentities Dictionary of the user assigned identities, key is ARM resource ID of the UAI. ResourceIdentityUserAssignedIdentities

ResourceIdentityUserAssignedIdentities

Name Description Value

TrackedResourceTags

Name Description Value

UserAssignedIdentityMeta

Name Description Value
clientId Aka application ID, a unique identifier generated by Azure AD that is tied to an application and service principal during its initial provisioning. string
principalId The object ID of the service principal object for your managed identity that is used to grant role-based access to an Azure resource. string

ARM template resource definition

The workspaces/onlineEndpoints/deployments resource type can be deployed with operations that target:

For a list of changed properties in each API version, see change log.

Resource format

To create a Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments resource, add the following JSON to your template.

{
  "type": "Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments",
  "apiVersion": "2021-03-01-preview",
  "name": "string",
  "identity": {
    "type": "string",
    "userAssignedIdentities": {
      "{customized property}": {
        "clientId": "string",
        "principalId": "string"
      }
    }
  },
  "kind": "string",
  "location": "string",
  "properties": {
    "appInsightsEnabled": "bool",
    "codeConfiguration": {
      "codeId": "string",
      "scoringScript": "string"
    },
    "description": "string",
    "environmentId": "string",
    "environmentVariables": {
      "{customized property}": "string"
    },
    "livenessProbe": {
      "failureThreshold": "int",
      "initialDelay": "string",
      "period": "string",
      "successThreshold": "int",
      "timeout": "string"
    },
    "model": {
      "referenceType": "string"
      // For remaining properties, see AssetReferenceBase objects
    },
    "properties": {
      "{customized property}": "string"
    },
    "requestSettings": {
      "maxConcurrentRequestsPerInstance": "int",
      "maxQueueWait": "string",
      "requestTimeout": "string"
    },
    "scaleSettings": {
      "maxInstances": "int",
      "minInstances": "int",
      "scaleType": "string"
      // For remaining properties, see OnlineScaleSettings objects
    },
    "endpointComputeType": "string"
    // For remaining properties, see OnlineDeployment objects
  },
  "tags": {
    "{customized property}": "string"
  }
}

OnlineScaleSettings objects

Set the scaleType property to specify the type of object.

For Auto, use:

{
  "pollingInterval": "string",
  "scaleType": "Auto",
  "targetUtilizationPercentage": "int"
}

For Manual, use:

{
  "instanceCount": "int",
  "scaleType": "Manual"
}

AssetReferenceBase objects

Set the referenceType property to specify the type of object.

For DataPath, use:

{
  "datastoreId": "string",
  "path": "string",
  "referenceType": "DataPath"
}

For Id, use:

{
  "assetId": "string",
  "referenceType": "Id"
}

For OutputPath, use:

{
  "jobId": "string",
  "path": "string",
  "referenceType": "OutputPath"
}

OnlineDeployment objects

Set the endpointComputeType property to specify the type of object.

For K8S, use:

{
  "containerResourceRequirements": {
    "cpu": "int",
    "cpuLimit": "int",
    "fpga": "int",
    "gpu": "int",
    "memoryInGB": "int",
    "memoryInGBLimit": "int"
  },
  "endpointComputeType": "K8S"
}

For Managed, use:

{
  "endpointComputeType": "Managed",
  "instanceType": "string",
  "readinessProbe": {
    "failureThreshold": "int",
    "initialDelay": "string",
    "period": "string",
    "successThreshold": "int",
    "timeout": "string"
  }
}

Property values

AssetReferenceBase

Name Description Value
referenceType Set to 'DataPath' for type DataPathAssetReference. Set to 'Id' for type IdAssetReference. Set to 'OutputPath' for type OutputPathAssetReference. 'DataPath'
'Id'
'OutputPath' (required)

AutoScaleSettings

Name Description Value
pollingInterval The polling interval in ISO 8691 format. Only supports duration with precision as low as Seconds. string
scaleType [Required] Type of deployment scaling algorithm 'Auto' (required)
targetUtilizationPercentage Target CPU usage for the autoscaler. int

CodeConfiguration

Name Description Value
codeId ARM resource ID of the code asset. string
scoringScript [Required] The script to execute on startup. eg. "score.py" string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)

ContainerResourceRequirements

Name Description Value
cpu The minimum amount of CPU cores to be used by the container. More info:
https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
int
cpuLimit The maximum amount of CPU cores allowed to be used by the container. More info:
https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
int
fpga The number of FPGA PCIE devices exposed to the container. Must be multiple of 2. int
gpu The number of GPU cores in the container. int
memoryInGB The minimum amount of memory (in GB) to be used by the container. More info:
https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
int
memoryInGBLimit The maximum amount of memory (in GB) allowed to be used by the container. More info:
https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
int

DataPathAssetReference

Name Description Value
datastoreId ARM resource ID of the datastore where the asset is located. string
path The path of the file/directory in the datastore. string
referenceType [Required] Specifies the type of asset reference. 'DataPath' (required)

IdAssetReference

Name Description Value
assetId [Required] ARM resource ID of the asset. string

Constraints:
Pattern = [a-zA-Z0-9_] (required)
referenceType [Required] Specifies the type of asset reference. 'Id' (required)

K8SOnlineDeployment

Name Description Value
containerResourceRequirements Resource requirements for each container instance within an online deployment. ContainerResourceRequirements
endpointComputeType [Required] The compute type of the endpoint. 'K8S' (required)

ManagedOnlineDeployment

Name Description Value
endpointComputeType [Required] The compute type of the endpoint. 'Managed' (required)
instanceType Compute instance type. string
readinessProbe Deployment container liveness/readiness probe configuration. ProbeSettings

ManualScaleSettings

Name Description Value
instanceCount Fixed number of instances for this deployment. int
scaleType [Required] Type of deployment scaling algorithm 'Manual' (required)

Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments

Name Description Value
apiVersion The api version '2021-03-01-preview'
identity Service identity associated with a resource. ResourceIdentity
kind Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type. string
location The geo-location where the resource lives string (required)
name The resource name string

Constraints:
Pattern = ^[a-zA-Z0-9][a-zA-Z0-9\-_]{0,254}$ (required)
properties [Required] Additional attributes of the entity. OnlineDeployment (required)
tags Resource tags Dictionary of tag names and values. See Tags in templates
type The resource type 'Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments'

OnlineDeployment

Name Description Value
appInsightsEnabled If true, enables Application Insights logging. bool
codeConfiguration Code configuration for the endpoint deployment. CodeConfiguration
description Description of the endpoint deployment. string
endpointComputeType Set to 'K8S' for type K8SOnlineDeployment. Set to 'Managed' for type ManagedOnlineDeployment. 'K8S'
'Managed' (required)
environmentId ARM resource ID of the environment specification for the endpoint deployment. string
environmentVariables Environment variables configuration for the deployment. OnlineDeploymentEnvironmentVariables
livenessProbe Deployment container liveness/readiness probe configuration. ProbeSettings
model Reference to the model asset for the endpoint deployment. AssetReferenceBase
properties Property dictionary. Properties can be added, but not removed or altered. OnlineDeploymentProperties
requestSettings Online deployment scoring requests configuration. OnlineRequestSettings
scaleSettings Online deployment scaling configuration. OnlineScaleSettings

OnlineDeploymentEnvironmentVariables

Name Description Value

OnlineDeploymentProperties

Name Description Value

OnlineRequestSettings

Name Description Value
maxConcurrentRequestsPerInstance The number of requests allowed to queue at once for this deployment. int
maxQueueWait The maximum queue wait time in ISO 8601 format. Supports millisecond precision. string
requestTimeout The request timeout in ISO 8601 format. Supports millisecond precision. string

OnlineScaleSettings

Name Description Value
maxInstances Maximum number of instances for this deployment. int
minInstances Minimum number of instances for this deployment. int
scaleType Set to 'Auto' for type AutoScaleSettings. Set to 'Manual' for type ManualScaleSettings. 'Auto'
'Manual' (required)

OutputPathAssetReference

Name Description Value
jobId ARM resource ID of the job. string
path The path of the file/directory in the job output. string
referenceType [Required] Specifies the type of asset reference. 'OutputPath' (required)

ProbeSettings

Name Description Value
failureThreshold The number of failures to allow before returning an unhealthy status. int
initialDelay The delay before the first probe in ISO 8601 format. string
period The length of time between probes in ISO 8601 format. string
successThreshold The number of successful probes before returning a healthy status. int
timeout The probe timeout in ISO 8601 format. string

ResourceIdentity

Name Description Value
type Defines values for a ResourceIdentity's type. 'None'
'SystemAssigned'
'SystemAssigned,UserAssigned'
'UserAssigned'
userAssignedIdentities Dictionary of the user assigned identities, key is ARM resource ID of the UAI. ResourceIdentityUserAssignedIdentities

ResourceIdentityUserAssignedIdentities

Name Description Value

TrackedResourceTags

Name Description Value

UserAssignedIdentityMeta

Name Description Value
clientId Aka application ID, a unique identifier generated by Azure AD that is tied to an application and service principal during its initial provisioning. string
principalId The object ID of the service principal object for your managed identity that is used to grant role-based access to an Azure resource. string

Terraform (AzAPI provider) resource definition

The workspaces/onlineEndpoints/deployments resource type can be deployed with operations that target:

  • Resource groups

For a list of changed properties in each API version, see change log.

Resource format

To create a Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments resource, add the following Terraform to your template.

resource "azapi_resource" "symbolicname" {
  type = "Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments@2021-03-01-preview"
  name = "string"
  identity = {
    type = "string"
    userAssignedIdentities = {
      {customized property} = {
        clientId = "string"
        principalId = "string"
      }
    }
  }
  kind = "string"
  location = "string"
  tags = {
    {customized property} = "string"
  }
  body = jsonencode({
    properties = {
      appInsightsEnabled = bool
      codeConfiguration = {
        codeId = "string"
        scoringScript = "string"
      }
      description = "string"
      environmentId = "string"
      environmentVariables = {
        {customized property} = "string"
      }
      livenessProbe = {
        failureThreshold = int
        initialDelay = "string"
        period = "string"
        successThreshold = int
        timeout = "string"
      }
      model = {
        referenceType = "string"
        // For remaining properties, see AssetReferenceBase objects
      }
      properties = {
        {customized property} = "string"
      }
      requestSettings = {
        maxConcurrentRequestsPerInstance = int
        maxQueueWait = "string"
        requestTimeout = "string"
      }
      scaleSettings = {
        maxInstances = int
        minInstances = int
        scaleType = "string"
        // For remaining properties, see OnlineScaleSettings objects
      }
      endpointComputeType = "string"
      // For remaining properties, see OnlineDeployment objects
    }
  })
}

OnlineScaleSettings objects

Set the scaleType property to specify the type of object.

For Auto, use:

{
  pollingInterval = "string"
  scaleType = "Auto"
  targetUtilizationPercentage = int
}

For Manual, use:

{
  instanceCount = int
  scaleType = "Manual"
}

AssetReferenceBase objects

Set the referenceType property to specify the type of object.

For DataPath, use:

{
  datastoreId = "string"
  path = "string"
  referenceType = "DataPath"
}

For Id, use:

{
  assetId = "string"
  referenceType = "Id"
}

For OutputPath, use:

{
  jobId = "string"
  path = "string"
  referenceType = "OutputPath"
}

OnlineDeployment objects

Set the endpointComputeType property to specify the type of object.

For K8S, use:

{
  containerResourceRequirements = {
    cpu = int
    cpuLimit = int
    fpga = int
    gpu = int
    memoryInGB = int
    memoryInGBLimit = int
  }
  endpointComputeType = "K8S"
}

For Managed, use:

{
  endpointComputeType = "Managed"
  instanceType = "string"
  readinessProbe = {
    failureThreshold = int
    initialDelay = "string"
    period = "string"
    successThreshold = int
    timeout = "string"
  }
}

Property values

AssetReferenceBase

Name Description Value
referenceType Set to 'DataPath' for type DataPathAssetReference. Set to 'Id' for type IdAssetReference. Set to 'OutputPath' for type OutputPathAssetReference. 'DataPath'
'Id'
'OutputPath' (required)

AutoScaleSettings

Name Description Value
pollingInterval The polling interval in ISO 8691 format. Only supports duration with precision as low as Seconds. string
scaleType [Required] Type of deployment scaling algorithm 'Auto' (required)
targetUtilizationPercentage Target CPU usage for the autoscaler. int

CodeConfiguration

Name Description Value
codeId ARM resource ID of the code asset. string
scoringScript [Required] The script to execute on startup. eg. "score.py" string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)

ContainerResourceRequirements

Name Description Value
cpu The minimum amount of CPU cores to be used by the container. More info:
https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
int
cpuLimit The maximum amount of CPU cores allowed to be used by the container. More info:
https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
int
fpga The number of FPGA PCIE devices exposed to the container. Must be multiple of 2. int
gpu The number of GPU cores in the container. int
memoryInGB The minimum amount of memory (in GB) to be used by the container. More info:
https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
int
memoryInGBLimit The maximum amount of memory (in GB) allowed to be used by the container. More info:
https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
int

DataPathAssetReference

Name Description Value
datastoreId ARM resource ID of the datastore where the asset is located. string
path The path of the file/directory in the datastore. string
referenceType [Required] Specifies the type of asset reference. 'DataPath' (required)

IdAssetReference

Name Description Value
assetId [Required] ARM resource ID of the asset. string

Constraints:
Pattern = [a-zA-Z0-9_] (required)
referenceType [Required] Specifies the type of asset reference. 'Id' (required)

K8SOnlineDeployment

Name Description Value
containerResourceRequirements Resource requirements for each container instance within an online deployment. ContainerResourceRequirements
endpointComputeType [Required] The compute type of the endpoint. 'K8S' (required)

ManagedOnlineDeployment

Name Description Value
endpointComputeType [Required] The compute type of the endpoint. 'Managed' (required)
instanceType Compute instance type. string
readinessProbe Deployment container liveness/readiness probe configuration. ProbeSettings

ManualScaleSettings

Name Description Value
instanceCount Fixed number of instances for this deployment. int
scaleType [Required] Type of deployment scaling algorithm 'Manual' (required)

Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments

Name Description Value
identity Service identity associated with a resource. ResourceIdentity
kind Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type. string
location The geo-location where the resource lives string (required)
name The resource name string

Constraints:
Pattern = ^[a-zA-Z0-9][a-zA-Z0-9\-_]{0,254}$ (required)
parent_id The ID of the resource that is the parent for this resource. ID for resource of type: workspaces/onlineEndpoints
properties [Required] Additional attributes of the entity. OnlineDeployment (required)
tags Resource tags Dictionary of tag names and values.
type The resource type "Microsoft.MachineLearningServices/workspaces/onlineEndpoints/deployments@2021-03-01-preview"

OnlineDeployment

Name Description Value
appInsightsEnabled If true, enables Application Insights logging. bool
codeConfiguration Code configuration for the endpoint deployment. CodeConfiguration
description Description of the endpoint deployment. string
endpointComputeType Set to 'K8S' for type K8SOnlineDeployment. Set to 'Managed' for type ManagedOnlineDeployment. 'K8S'
'Managed' (required)
environmentId ARM resource ID of the environment specification for the endpoint deployment. string
environmentVariables Environment variables configuration for the deployment. OnlineDeploymentEnvironmentVariables
livenessProbe Deployment container liveness/readiness probe configuration. ProbeSettings
model Reference to the model asset for the endpoint deployment. AssetReferenceBase
properties Property dictionary. Properties can be added, but not removed or altered. OnlineDeploymentProperties
requestSettings Online deployment scoring requests configuration. OnlineRequestSettings
scaleSettings Online deployment scaling configuration. OnlineScaleSettings

OnlineDeploymentEnvironmentVariables

Name Description Value

OnlineDeploymentProperties

Name Description Value

OnlineRequestSettings

Name Description Value
maxConcurrentRequestsPerInstance The number of requests allowed to queue at once for this deployment. int
maxQueueWait The maximum queue wait time in ISO 8601 format. Supports millisecond precision. string
requestTimeout The request timeout in ISO 8601 format. Supports millisecond precision. string

OnlineScaleSettings

Name Description Value
maxInstances Maximum number of instances for this deployment. int
minInstances Minimum number of instances for this deployment. int
scaleType Set to 'Auto' for type AutoScaleSettings. Set to 'Manual' for type ManualScaleSettings. 'Auto'
'Manual' (required)

OutputPathAssetReference

Name Description Value
jobId ARM resource ID of the job. string
path The path of the file/directory in the job output. string
referenceType [Required] Specifies the type of asset reference. 'OutputPath' (required)

ProbeSettings

Name Description Value
failureThreshold The number of failures to allow before returning an unhealthy status. int
initialDelay The delay before the first probe in ISO 8601 format. string
period The length of time between probes in ISO 8601 format. string
successThreshold The number of successful probes before returning a healthy status. int
timeout The probe timeout in ISO 8601 format. string

ResourceIdentity

Name Description Value
type Defines values for a ResourceIdentity's type. 'None'
'SystemAssigned'
'SystemAssigned,UserAssigned'
'UserAssigned'
userAssignedIdentities Dictionary of the user assigned identities, key is ARM resource ID of the UAI. ResourceIdentityUserAssignedIdentities

ResourceIdentityUserAssignedIdentities

Name Description Value

TrackedResourceTags

Name Description Value

UserAssignedIdentityMeta

Name Description Value
clientId Aka application ID, a unique identifier generated by Azure AD that is tied to an application and service principal during its initial provisioning. string
principalId The object ID of the service principal object for your managed identity that is used to grant role-based access to an Azure resource. string