Microsoft.MachineLearningServices workspaces/endpoints 2024-10-01-preview

Bicep resource definition

The workspaces/endpoints resource type can be deployed with operations that target:

For a list of changed properties in each API version, see change log.

Resource format

To create a Microsoft.MachineLearningServices/workspaces/endpoints resource, add the following Bicep to your template.

resource symbolicname 'Microsoft.MachineLearningServices/workspaces/endpoints@2024-10-01-preview' = {
  name: 'string'
  properties: {
    associatedResourceId: 'string'
    deployments: [
      {
        properties: {
          failureReason: 'string'
          type: 'string'
          // For remaining properties, see EndpointDeploymentResourceProperties objects
        }
      }
    ]
    endpointUri: 'string'
    failureReason: 'string'
    location: 'string'
    name: 'string'
    shouldCreateAiServicesEndpoint: bool
    endpointType: 'string'
    // For remaining properties, see EndpointResourceProperties objects
  }
}

EndpointDeploymentResourceProperties objects

Set the type property to specify the type of object.

For Azure.ContentSafety, use:

{
  model: {
    format: 'string'
    name: 'string'
    source: 'string'
    version: 'string'
  }
  raiPolicyName: 'string'
  sku: {
    capacity: int
    family: 'string'
    name: 'string'
    size: 'string'
    tier: 'string'
  }
  type: 'Azure.ContentSafety'
  versionUpgradeOption: 'string'
}

For Azure.OpenAI, use:

{
  model: {
    format: 'string'
    name: 'string'
    source: 'string'
    version: 'string'
  }
  raiPolicyName: 'string'
  sku: {
    capacity: int
    family: 'string'
    name: 'string'
    size: 'string'
    tier: 'string'
  }
  type: 'Azure.OpenAI'
  versionUpgradeOption: 'string'
}

For Azure.Speech, use:

{
  model: {
    format: 'string'
    name: 'string'
    source: 'string'
    version: 'string'
  }
  raiPolicyName: 'string'
  sku: {
    capacity: int
    family: 'string'
    name: 'string'
    size: 'string'
    tier: 'string'
  }
  type: 'Azure.Speech'
  versionUpgradeOption: 'string'
}

For managedOnlineEndpoint, use:

{
  endpointComputeType: 'string'
  model: 'string'
  type: 'managedOnlineEndpoint'
}

EndpointResourceProperties objects

Set the endpointType property to specify the type of object.

For Azure.ContentSafety, use:

{
  endpointType: 'Azure.ContentSafety'
}

For Azure.OpenAI, use:

{
  endpointType: 'Azure.OpenAI'
}

For Azure.Speech, use:

{
  endpointType: 'Azure.Speech'
}

For managedOnlineEndpoint, use:

{
  authMode: 'string'
  compute: 'string'
  description: 'string'
  endpointType: 'managedOnlineEndpoint'
  mirrorTraffic: {
    {customized property}: int
  }
  scoringUri: 'string'
  traffic: {
    {customized property}: int
  }
}

For serverlessEndpoint, use:

{
  authMode: 'string'
  capacityReservation: {
    capacityReservationGroupId: 'string'
    endpointReservedCapacity: int
  }
  contentSafety: {
    contentSafetyStatus: 'string'
  }
  endpointState: 'string'
  endpointType: 'serverlessEndpoint'
  inferenceEndpoint: {}
  marketplaceSubscriptionId: 'string'
  metadata: any(Azure.Bicep.Types.Concrete.AnyType)
  modelSettings: {
    modelId: 'string'
  }
  offer: {
    offerName: 'string'
    publisher: 'string'
  }
}

Property values

CognitiveServicesSku

Name Description Value
capacity int
family string
name string
size string
tier string

ContentSafetyEndpointDeploymentResourceProperties

Name Description Value
model Model used for the endpoint deployment. EndpointDeploymentModel (required)
raiPolicyName The name of RAI policy. string
sku CognitiveServicesSku
type Kind of the deployment. 'Azure.ContentSafety' (required)
versionUpgradeOption Deployment model version upgrade option. 'NoAutoUpgrade'
'OnceCurrentVersionExpired'
'OnceNewDefaultVersionAvailable'

ContentSafetyEndpointResourceProperties

Name Description Value
endpointType Type of the endpoint. 'Azure.ContentSafety' (required)

EndpointDeploymentModel

Name Description Value
format Model format string
name Model name. string
source Optional. Deployment model source ARM resource ID. string
version Model version. string

EndpointDeploymentResourceProperties

Name Description Value
failureReason The failure reason if the creation failed. string
type Set to 'Azure.ContentSafety' for type ContentSafetyEndpointDeploymentResourceProperties. Set to 'Azure.OpenAI' for type OpenAIEndpointDeploymentResourceProperties. Set to 'Azure.Speech' for type SpeechEndpointDeploymentResourceProperties. Set to 'managedOnlineEndpoint' for type ManagedOnlineEndpointDeploymentResourceProperties. 'Azure.ContentSafety'
'Azure.OpenAI'
'Azure.Speech'
'managedOnlineEndpoint' (required)

EndpointDeploymentResourcePropertiesBasicResource

Name Description Value
properties EndpointDeploymentResourceProperties (required)

EndpointResourceProperties

Name Description Value
associatedResourceId Byo resource id for creating the built-in model service endpoints. string
deployments Deployments info. EndpointDeploymentResourcePropertiesBasicResource[]
endpointType Set to 'Azure.ContentSafety' for type ContentSafetyEndpointResourceProperties. Set to 'Azure.OpenAI' for type OpenAIEndpointResourceProperties. Set to 'Azure.Speech' for type SpeechEndpointResourceProperties. Set to 'managedOnlineEndpoint' for type ManagedOnlineEndpointResourceProperties. Set to 'serverlessEndpoint' for type ServerlessEndpointResourceProperties. 'Azure.ContentSafety'
'Azure.OpenAI'
'Azure.Speech'
'managedOnlineEndpoint'
'serverlessEndpoint' (required)
endpointUri Uri of the endpoint. string
failureReason The failure reason if the creation failed. string
location Location of the endpoint.
Since input dto and when parse endpoint resource share the same contract
this Location field is just for parse the endpoint resource info
we won't let customer specify the endpoint resource location since we will create it the same location as workspace
string
name Name of the endpoint. string
shouldCreateAiServicesEndpoint Whether the proxy (non-byo) endpoint is a regular endpoint or a OneKeyV2 AI services account endpoint. bool

ManagedOnlineEndpointDeploymentResourceProperties

Name Description Value
endpointComputeType Enum to determine endpoint compute type. 'AzureMLCompute'
'Kubernetes'
'Managed'
model string
type Kind of the deployment. 'managedOnlineEndpoint' (required)

ManagedOnlineEndpointResourceProperties

Name Description Value
authMode Enum to determine endpoint authentication mode. 'AADToken'
'AMLToken'
'Key'
compute string
description string
endpointType Type of the endpoint. 'managedOnlineEndpoint' (required)
mirrorTraffic Dictionary of <integer> ManagedOnlineEndpointResourcePropertiesMirrorTraffic
scoringUri string
traffic Dictionary of <integer> ManagedOnlineEndpointResourcePropertiesTraffic

ManagedOnlineEndpointResourcePropertiesMirrorTraffic

Name Description Value

ManagedOnlineEndpointResourcePropertiesTraffic

Name Description Value

Microsoft.MachineLearningServices/workspaces/endpoints

Name Description Value
name The resource name string

Constraints:
Pattern = ^[a-zA-Z0-9][a-zA-Z0-9_.-]{2,32}$ (required)
parent In Bicep, you can specify the parent resource for a child resource. You only need to add this property when the child resource is declared outside of the parent resource.

For more information, see Child resource outside parent resource.
Symbolic name for resource of type: workspaces
properties EndpointResourceProperties (required)

OpenAIEndpointDeploymentResourceProperties

Name Description Value
model Model used for the endpoint deployment. EndpointDeploymentModel (required)
raiPolicyName The name of RAI policy. string
sku CognitiveServicesSku
type Kind of the deployment. 'Azure.OpenAI' (required)
versionUpgradeOption Deployment model version upgrade option. 'NoAutoUpgrade'
'OnceCurrentVersionExpired'
'OnceNewDefaultVersionAvailable'

OpenAIEndpointResourceProperties

Name Description Value
endpointType Type of the endpoint. 'Azure.OpenAI' (required)

ServerlessEndpointCapacityReservation

Name Description Value
capacityReservationGroupId string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)
endpointReservedCapacity int

ServerlessEndpointContentSafety

Name Description Value
contentSafetyStatus Specifies the status of content safety. 'Disabled'
'Enabled' (required)

ServerlessEndpointInferenceEndpoint

Name Description Value

ServerlessEndpointModelSettings

Name Description Value
modelId string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)

ServerlessEndpointResourceProperties

Name Description Value
authMode 'Key'
capacityReservation ServerlessEndpointCapacityReservation
contentSafety ServerlessEndpointContentSafety
endpointState State of the Serverless Endpoint. 'Creating'
'CreationFailed'
'Deleting'
'DeletionFailed'
'Online'
'Reinstating'
'Suspended'
'Suspending'
'Unknown'
endpointType Type of the endpoint. 'serverlessEndpoint' (required)
inferenceEndpoint ServerlessEndpointInferenceEndpoint
marketplaceSubscriptionId string
metadata Anything any
modelSettings ServerlessEndpointModelSettings
offer ServerlessOffer

ServerlessOffer

Name Description Value
offerName string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)
publisher string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)

SpeechEndpointDeploymentResourceProperties

Name Description Value
model Model used for the endpoint deployment. EndpointDeploymentModel (required)
raiPolicyName The name of RAI policy. string
sku CognitiveServicesSku
type Kind of the deployment. 'Azure.Speech' (required)
versionUpgradeOption Deployment model version upgrade option. 'NoAutoUpgrade'
'OnceCurrentVersionExpired'
'OnceNewDefaultVersionAvailable'

SpeechEndpointResourceProperties

Name Description Value
endpointType Type of the endpoint. 'Azure.Speech' (required)

ARM template resource definition

The workspaces/endpoints resource type can be deployed with operations that target:

For a list of changed properties in each API version, see change log.

Resource format

To create a Microsoft.MachineLearningServices/workspaces/endpoints resource, add the following JSON to your template.

{
  "type": "Microsoft.MachineLearningServices/workspaces/endpoints",
  "apiVersion": "2024-10-01-preview",
  "name": "string",
  "properties": {
    "associatedResourceId": "string",
    "deployments": [
      {
        "properties": {
          "failureReason": "string",
          "type": "string"
          // For remaining properties, see EndpointDeploymentResourceProperties objects
        }
      }
    ],
    "endpointUri": "string",
    "failureReason": "string",
    "location": "string",
    "name": "string",
    "shouldCreateAiServicesEndpoint": "bool",
    "endpointType": "string"
    // For remaining properties, see EndpointResourceProperties objects
  }
}

EndpointDeploymentResourceProperties objects

Set the type property to specify the type of object.

For Azure.ContentSafety, use:

{
  "model": {
    "format": "string",
    "name": "string",
    "source": "string",
    "version": "string"
  },
  "raiPolicyName": "string",
  "sku": {
    "capacity": "int",
    "family": "string",
    "name": "string",
    "size": "string",
    "tier": "string"
  },
  "type": "Azure.ContentSafety",
  "versionUpgradeOption": "string"
}

For Azure.OpenAI, use:

{
  "model": {
    "format": "string",
    "name": "string",
    "source": "string",
    "version": "string"
  },
  "raiPolicyName": "string",
  "sku": {
    "capacity": "int",
    "family": "string",
    "name": "string",
    "size": "string",
    "tier": "string"
  },
  "type": "Azure.OpenAI",
  "versionUpgradeOption": "string"
}

For Azure.Speech, use:

{
  "model": {
    "format": "string",
    "name": "string",
    "source": "string",
    "version": "string"
  },
  "raiPolicyName": "string",
  "sku": {
    "capacity": "int",
    "family": "string",
    "name": "string",
    "size": "string",
    "tier": "string"
  },
  "type": "Azure.Speech",
  "versionUpgradeOption": "string"
}

For managedOnlineEndpoint, use:

{
  "endpointComputeType": "string",
  "model": "string",
  "type": "managedOnlineEndpoint"
}

EndpointResourceProperties objects

Set the endpointType property to specify the type of object.

For Azure.ContentSafety, use:

{
  "endpointType": "Azure.ContentSafety"
}

For Azure.OpenAI, use:

{
  "endpointType": "Azure.OpenAI"
}

For Azure.Speech, use:

{
  "endpointType": "Azure.Speech"
}

For managedOnlineEndpoint, use:

{
  "authMode": "string",
  "compute": "string",
  "description": "string",
  "endpointType": "managedOnlineEndpoint",
  "mirrorTraffic": {
    "{customized property}": "int"
  },
  "scoringUri": "string",
  "traffic": {
    "{customized property}": "int"
  }
}

For serverlessEndpoint, use:

{
  "authMode": "string",
  "capacityReservation": {
    "capacityReservationGroupId": "string",
    "endpointReservedCapacity": "int"
  },
  "contentSafety": {
    "contentSafetyStatus": "string"
  },
  "endpointState": "string",
  "endpointType": "serverlessEndpoint",
  "inferenceEndpoint": {
  },
  "marketplaceSubscriptionId": "string",
  "metadata": {},
  "modelSettings": {
    "modelId": "string"
  },
  "offer": {
    "offerName": "string",
    "publisher": "string"
  }
}

Property values

CognitiveServicesSku

Name Description Value
capacity int
family string
name string
size string
tier string

ContentSafetyEndpointDeploymentResourceProperties

Name Description Value
model Model used for the endpoint deployment. EndpointDeploymentModel (required)
raiPolicyName The name of RAI policy. string
sku CognitiveServicesSku
type Kind of the deployment. 'Azure.ContentSafety' (required)
versionUpgradeOption Deployment model version upgrade option. 'NoAutoUpgrade'
'OnceCurrentVersionExpired'
'OnceNewDefaultVersionAvailable'

ContentSafetyEndpointResourceProperties

Name Description Value
endpointType Type of the endpoint. 'Azure.ContentSafety' (required)

EndpointDeploymentModel

Name Description Value
format Model format string
name Model name. string
source Optional. Deployment model source ARM resource ID. string
version Model version. string

EndpointDeploymentResourceProperties

Name Description Value
failureReason The failure reason if the creation failed. string
type Set to 'Azure.ContentSafety' for type ContentSafetyEndpointDeploymentResourceProperties. Set to 'Azure.OpenAI' for type OpenAIEndpointDeploymentResourceProperties. Set to 'Azure.Speech' for type SpeechEndpointDeploymentResourceProperties. Set to 'managedOnlineEndpoint' for type ManagedOnlineEndpointDeploymentResourceProperties. 'Azure.ContentSafety'
'Azure.OpenAI'
'Azure.Speech'
'managedOnlineEndpoint' (required)

EndpointDeploymentResourcePropertiesBasicResource

Name Description Value
properties EndpointDeploymentResourceProperties (required)

EndpointResourceProperties

Name Description Value
associatedResourceId Byo resource id for creating the built-in model service endpoints. string
deployments Deployments info. EndpointDeploymentResourcePropertiesBasicResource[]
endpointType Set to 'Azure.ContentSafety' for type ContentSafetyEndpointResourceProperties. Set to 'Azure.OpenAI' for type OpenAIEndpointResourceProperties. Set to 'Azure.Speech' for type SpeechEndpointResourceProperties. Set to 'managedOnlineEndpoint' for type ManagedOnlineEndpointResourceProperties. Set to 'serverlessEndpoint' for type ServerlessEndpointResourceProperties. 'Azure.ContentSafety'
'Azure.OpenAI'
'Azure.Speech'
'managedOnlineEndpoint'
'serverlessEndpoint' (required)
endpointUri Uri of the endpoint. string
failureReason The failure reason if the creation failed. string
location Location of the endpoint.
Since input dto and when parse endpoint resource share the same contract
this Location field is just for parse the endpoint resource info
we won't let customer specify the endpoint resource location since we will create it the same location as workspace
string
name Name of the endpoint. string
shouldCreateAiServicesEndpoint Whether the proxy (non-byo) endpoint is a regular endpoint or a OneKeyV2 AI services account endpoint. bool

ManagedOnlineEndpointDeploymentResourceProperties

Name Description Value
endpointComputeType Enum to determine endpoint compute type. 'AzureMLCompute'
'Kubernetes'
'Managed'
model string
type Kind of the deployment. 'managedOnlineEndpoint' (required)

ManagedOnlineEndpointResourceProperties

Name Description Value
authMode Enum to determine endpoint authentication mode. 'AADToken'
'AMLToken'
'Key'
compute string
description string
endpointType Type of the endpoint. 'managedOnlineEndpoint' (required)
mirrorTraffic Dictionary of <integer> ManagedOnlineEndpointResourcePropertiesMirrorTraffic
scoringUri string
traffic Dictionary of <integer> ManagedOnlineEndpointResourcePropertiesTraffic

ManagedOnlineEndpointResourcePropertiesMirrorTraffic

Name Description Value

ManagedOnlineEndpointResourcePropertiesTraffic

Name Description Value

Microsoft.MachineLearningServices/workspaces/endpoints

Name Description Value
apiVersion The api version '2024-10-01-preview'
name The resource name string

Constraints:
Pattern = ^[a-zA-Z0-9][a-zA-Z0-9_.-]{2,32}$ (required)
properties EndpointResourceProperties (required)
type The resource type 'Microsoft.MachineLearningServices/workspaces/endpoints'

OpenAIEndpointDeploymentResourceProperties

Name Description Value
model Model used for the endpoint deployment. EndpointDeploymentModel (required)
raiPolicyName The name of RAI policy. string
sku CognitiveServicesSku
type Kind of the deployment. 'Azure.OpenAI' (required)
versionUpgradeOption Deployment model version upgrade option. 'NoAutoUpgrade'
'OnceCurrentVersionExpired'
'OnceNewDefaultVersionAvailable'

OpenAIEndpointResourceProperties

Name Description Value
endpointType Type of the endpoint. 'Azure.OpenAI' (required)

ServerlessEndpointCapacityReservation

Name Description Value
capacityReservationGroupId string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)
endpointReservedCapacity int

ServerlessEndpointContentSafety

Name Description Value
contentSafetyStatus Specifies the status of content safety. 'Disabled'
'Enabled' (required)

ServerlessEndpointInferenceEndpoint

Name Description Value

ServerlessEndpointModelSettings

Name Description Value
modelId string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)

ServerlessEndpointResourceProperties

Name Description Value
authMode 'Key'
capacityReservation ServerlessEndpointCapacityReservation
contentSafety ServerlessEndpointContentSafety
endpointState State of the Serverless Endpoint. 'Creating'
'CreationFailed'
'Deleting'
'DeletionFailed'
'Online'
'Reinstating'
'Suspended'
'Suspending'
'Unknown'
endpointType Type of the endpoint. 'serverlessEndpoint' (required)
inferenceEndpoint ServerlessEndpointInferenceEndpoint
marketplaceSubscriptionId string
metadata Anything any
modelSettings ServerlessEndpointModelSettings
offer ServerlessOffer

ServerlessOffer

Name Description Value
offerName string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)
publisher string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)

SpeechEndpointDeploymentResourceProperties

Name Description Value
model Model used for the endpoint deployment. EndpointDeploymentModel (required)
raiPolicyName The name of RAI policy. string
sku CognitiveServicesSku
type Kind of the deployment. 'Azure.Speech' (required)
versionUpgradeOption Deployment model version upgrade option. 'NoAutoUpgrade'
'OnceCurrentVersionExpired'
'OnceNewDefaultVersionAvailable'

SpeechEndpointResourceProperties

Name Description Value
endpointType Type of the endpoint. 'Azure.Speech' (required)

Terraform (AzAPI provider) resource definition

The workspaces/endpoints resource type can be deployed with operations that target:

  • Resource groups

For a list of changed properties in each API version, see change log.

Resource format

To create a Microsoft.MachineLearningServices/workspaces/endpoints resource, add the following Terraform to your template.

resource "azapi_resource" "symbolicname" {
  type = "Microsoft.MachineLearningServices/workspaces/endpoints@2024-10-01-preview"
  name = "string"
  body = jsonencode({
    properties = {
      associatedResourceId = "string"
      deployments = [
        {
          properties = {
            failureReason = "string"
            type = "string"
            // For remaining properties, see EndpointDeploymentResourceProperties objects
          }
        }
      ]
      endpointUri = "string"
      failureReason = "string"
      location = "string"
      name = "string"
      shouldCreateAiServicesEndpoint = bool
      endpointType = "string"
      // For remaining properties, see EndpointResourceProperties objects
    }
  })
}

EndpointDeploymentResourceProperties objects

Set the type property to specify the type of object.

For Azure.ContentSafety, use:

{
  model = {
    format = "string"
    name = "string"
    source = "string"
    version = "string"
  }
  raiPolicyName = "string"
  sku = {
    capacity = int
    family = "string"
    name = "string"
    size = "string"
    tier = "string"
  }
  type = "Azure.ContentSafety"
  versionUpgradeOption = "string"
}

For Azure.OpenAI, use:

{
  model = {
    format = "string"
    name = "string"
    source = "string"
    version = "string"
  }
  raiPolicyName = "string"
  sku = {
    capacity = int
    family = "string"
    name = "string"
    size = "string"
    tier = "string"
  }
  type = "Azure.OpenAI"
  versionUpgradeOption = "string"
}

For Azure.Speech, use:

{
  model = {
    format = "string"
    name = "string"
    source = "string"
    version = "string"
  }
  raiPolicyName = "string"
  sku = {
    capacity = int
    family = "string"
    name = "string"
    size = "string"
    tier = "string"
  }
  type = "Azure.Speech"
  versionUpgradeOption = "string"
}

For managedOnlineEndpoint, use:

{
  endpointComputeType = "string"
  model = "string"
  type = "managedOnlineEndpoint"
}

EndpointResourceProperties objects

Set the endpointType property to specify the type of object.

For Azure.ContentSafety, use:

{
  endpointType = "Azure.ContentSafety"
}

For Azure.OpenAI, use:

{
  endpointType = "Azure.OpenAI"
}

For Azure.Speech, use:

{
  endpointType = "Azure.Speech"
}

For managedOnlineEndpoint, use:

{
  authMode = "string"
  compute = "string"
  description = "string"
  endpointType = "managedOnlineEndpoint"
  mirrorTraffic = {
    {customized property} = int
  }
  scoringUri = "string"
  traffic = {
    {customized property} = int
  }
}

For serverlessEndpoint, use:

{
  authMode = "string"
  capacityReservation = {
    capacityReservationGroupId = "string"
    endpointReservedCapacity = int
  }
  contentSafety = {
    contentSafetyStatus = "string"
  }
  endpointState = "string"
  endpointType = "serverlessEndpoint"
  inferenceEndpoint = {
  }
  marketplaceSubscriptionId = "string"
  metadata = ?
  modelSettings = {
    modelId = "string"
  }
  offer = {
    offerName = "string"
    publisher = "string"
  }
}

Property values

CognitiveServicesSku

Name Description Value
capacity int
family string
name string
size string
tier string

ContentSafetyEndpointDeploymentResourceProperties

Name Description Value
model Model used for the endpoint deployment. EndpointDeploymentModel (required)
raiPolicyName The name of RAI policy. string
sku CognitiveServicesSku
type Kind of the deployment. 'Azure.ContentSafety' (required)
versionUpgradeOption Deployment model version upgrade option. 'NoAutoUpgrade'
'OnceCurrentVersionExpired'
'OnceNewDefaultVersionAvailable'

ContentSafetyEndpointResourceProperties

Name Description Value
endpointType Type of the endpoint. 'Azure.ContentSafety' (required)

EndpointDeploymentModel

Name Description Value
format Model format string
name Model name. string
source Optional. Deployment model source ARM resource ID. string
version Model version. string

EndpointDeploymentResourceProperties

Name Description Value
failureReason The failure reason if the creation failed. string
type Set to 'Azure.ContentSafety' for type ContentSafetyEndpointDeploymentResourceProperties. Set to 'Azure.OpenAI' for type OpenAIEndpointDeploymentResourceProperties. Set to 'Azure.Speech' for type SpeechEndpointDeploymentResourceProperties. Set to 'managedOnlineEndpoint' for type ManagedOnlineEndpointDeploymentResourceProperties. 'Azure.ContentSafety'
'Azure.OpenAI'
'Azure.Speech'
'managedOnlineEndpoint' (required)

EndpointDeploymentResourcePropertiesBasicResource

Name Description Value
properties EndpointDeploymentResourceProperties (required)

EndpointResourceProperties

Name Description Value
associatedResourceId Byo resource id for creating the built-in model service endpoints. string
deployments Deployments info. EndpointDeploymentResourcePropertiesBasicResource[]
endpointType Set to 'Azure.ContentSafety' for type ContentSafetyEndpointResourceProperties. Set to 'Azure.OpenAI' for type OpenAIEndpointResourceProperties. Set to 'Azure.Speech' for type SpeechEndpointResourceProperties. Set to 'managedOnlineEndpoint' for type ManagedOnlineEndpointResourceProperties. Set to 'serverlessEndpoint' for type ServerlessEndpointResourceProperties. 'Azure.ContentSafety'
'Azure.OpenAI'
'Azure.Speech'
'managedOnlineEndpoint'
'serverlessEndpoint' (required)
endpointUri Uri of the endpoint. string
failureReason The failure reason if the creation failed. string
location Location of the endpoint.
Since input dto and when parse endpoint resource share the same contract
this Location field is just for parse the endpoint resource info
we won't let customer specify the endpoint resource location since we will create it the same location as workspace
string
name Name of the endpoint. string
shouldCreateAiServicesEndpoint Whether the proxy (non-byo) endpoint is a regular endpoint or a OneKeyV2 AI services account endpoint. bool

ManagedOnlineEndpointDeploymentResourceProperties

Name Description Value
endpointComputeType Enum to determine endpoint compute type. 'AzureMLCompute'
'Kubernetes'
'Managed'
model string
type Kind of the deployment. 'managedOnlineEndpoint' (required)

ManagedOnlineEndpointResourceProperties

Name Description Value
authMode Enum to determine endpoint authentication mode. 'AADToken'
'AMLToken'
'Key'
compute string
description string
endpointType Type of the endpoint. 'managedOnlineEndpoint' (required)
mirrorTraffic Dictionary of <integer> ManagedOnlineEndpointResourcePropertiesMirrorTraffic
scoringUri string
traffic Dictionary of <integer> ManagedOnlineEndpointResourcePropertiesTraffic

ManagedOnlineEndpointResourcePropertiesMirrorTraffic

Name Description Value

ManagedOnlineEndpointResourcePropertiesTraffic

Name Description Value

Microsoft.MachineLearningServices/workspaces/endpoints

Name Description Value
name The resource name string

Constraints:
Pattern = ^[a-zA-Z0-9][a-zA-Z0-9_.-]{2,32}$ (required)
parent_id The ID of the resource that is the parent for this resource. ID for resource of type: workspaces
properties EndpointResourceProperties (required)
type The resource type "Microsoft.MachineLearningServices/workspaces/endpoints@2024-10-01-preview"

OpenAIEndpointDeploymentResourceProperties

Name Description Value
model Model used for the endpoint deployment. EndpointDeploymentModel (required)
raiPolicyName The name of RAI policy. string
sku CognitiveServicesSku
type Kind of the deployment. 'Azure.OpenAI' (required)
versionUpgradeOption Deployment model version upgrade option. 'NoAutoUpgrade'
'OnceCurrentVersionExpired'
'OnceNewDefaultVersionAvailable'

OpenAIEndpointResourceProperties

Name Description Value
endpointType Type of the endpoint. 'Azure.OpenAI' (required)

ServerlessEndpointCapacityReservation

Name Description Value
capacityReservationGroupId string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)
endpointReservedCapacity int

ServerlessEndpointContentSafety

Name Description Value
contentSafetyStatus Specifies the status of content safety. 'Disabled'
'Enabled' (required)

ServerlessEndpointInferenceEndpoint

Name Description Value

ServerlessEndpointModelSettings

Name Description Value
modelId string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)

ServerlessEndpointResourceProperties

Name Description Value
authMode 'Key'
capacityReservation ServerlessEndpointCapacityReservation
contentSafety ServerlessEndpointContentSafety
endpointState State of the Serverless Endpoint. 'Creating'
'CreationFailed'
'Deleting'
'DeletionFailed'
'Online'
'Reinstating'
'Suspended'
'Suspending'
'Unknown'
endpointType Type of the endpoint. 'serverlessEndpoint' (required)
inferenceEndpoint ServerlessEndpointInferenceEndpoint
marketplaceSubscriptionId string
metadata Anything any
modelSettings ServerlessEndpointModelSettings
offer ServerlessOffer

ServerlessOffer

Name Description Value
offerName string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)
publisher string

Constraints:
Min length = 1
Pattern = [a-zA-Z0-9_] (required)

SpeechEndpointDeploymentResourceProperties

Name Description Value
model Model used for the endpoint deployment. EndpointDeploymentModel (required)
raiPolicyName The name of RAI policy. string
sku CognitiveServicesSku
type Kind of the deployment. 'Azure.Speech' (required)
versionUpgradeOption Deployment model version upgrade option. 'NoAutoUpgrade'
'OnceCurrentVersionExpired'
'OnceNewDefaultVersionAvailable'

SpeechEndpointResourceProperties

Name Description Value
endpointType Type of the endpoint. 'Azure.Speech' (required)