Microsoft.MachineLearningServices workspaces/endpoints 2024-07-01-preview
Bicep resource definition
The workspaces/endpoints resource type can be deployed with operations that target:
- Resource groups - See resource group deployment commands
For a list of changed properties in each API version, see change log.
Resource format
To create a Microsoft.MachineLearningServices/workspaces/endpoints resource, add the following Bicep to your template.
resource symbolicname 'Microsoft.MachineLearningServices/workspaces/endpoints@2024-07-01-preview' = {
parent: resourceSymbolicName
name: 'string'
properties: {
associatedResourceId: 'string'
deployments: [
{
properties: {
failureReason: 'string'
type: 'string'
// For remaining properties, see EndpointDeploymentResourceProperties objects
}
}
]
endpointUri: 'string'
failureReason: 'string'
location: 'string'
name: 'string'
shouldCreateAiServicesEndpoint: bool
endpointType: 'string'
// For remaining properties, see EndpointResourceProperties objects
}
}
EndpointResourceProperties objects
Set the endpointType property to specify the type of object.
For Azure.ContentSafety, use:
{
endpointType: 'Azure.ContentSafety'
}
For Azure.OpenAI, use:
{
endpointType: 'Azure.OpenAI'
}
For Azure.Speech, use:
{
endpointType: 'Azure.Speech'
}
For managedOnlineEndpoint, use:
{
authMode: 'string'
compute: 'string'
description: 'string'
endpointType: 'managedOnlineEndpoint'
mirrorTraffic: {
{customized property}: int
}
scoringUri: 'string'
traffic: {
{customized property}: int
}
}
For serverlessEndpoint, use:
{
authMode: 'string'
capacityReservation: {
capacityReservationGroupId: 'string'
endpointReservedCapacity: int
}
contentSafety: {
contentSafetyStatus: 'string'
}
endpointState: 'string'
endpointType: 'serverlessEndpoint'
inferenceEndpoint: {}
marketplaceSubscriptionId: 'string'
metadata: any(Azure.Bicep.Types.Concrete.AnyType)
modelSettings: {
modelId: 'string'
}
offer: {
offerName: 'string'
publisher: 'string'
}
}
EndpointDeploymentResourceProperties objects
Set the type property to specify the type of object.
For Azure.ContentSafety, use:
{
model: {
format: 'string'
name: 'string'
source: 'string'
version: 'string'
}
raiPolicyName: 'string'
type: 'Azure.ContentSafety'
versionUpgradeOption: 'string'
}
For Azure.OpenAI, use:
{
model: {
format: 'string'
name: 'string'
source: 'string'
version: 'string'
}
raiPolicyName: 'string'
type: 'Azure.OpenAI'
versionUpgradeOption: 'string'
}
For Azure.Speech, use:
{
model: {
format: 'string'
name: 'string'
source: 'string'
version: 'string'
}
raiPolicyName: 'string'
type: 'Azure.Speech'
versionUpgradeOption: 'string'
}
For managedOnlineEndpoint, use:
{
endpointComputeType: 'string'
model: 'string'
type: 'managedOnlineEndpoint'
}
Property values
ContentSafetyEndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
model | Model used for the endpoint deployment. | EndpointDeploymentModel (required) |
raiPolicyName | The name of RAI policy. | string |
type | Kind of the deployment. | 'Azure.ContentSafety' (required) |
versionUpgradeOption | Deployment model version upgrade option. | 'NoAutoUpgrade' 'OnceCurrentVersionExpired' 'OnceNewDefaultVersionAvailable' |
ContentSafetyEndpointResourceProperties
Name | Description | Value |
---|---|---|
endpointType | Type of the endpoint. | 'Azure.ContentSafety' (required) |
EndpointDeploymentModel
Name | Description | Value |
---|---|---|
format | Model format | string |
name | Model name. | string |
source | Optional. Deployment model source ARM resource ID. | string |
version | Model version. | string |
EndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
failureReason | The failure reason if the creation failed. | string |
type | Set to 'Azure.ContentSafety' for type ContentSafetyEndpointDeploymentResourceProperties. Set to 'Azure.OpenAI' for type OpenAIEndpointDeploymentResourceProperties. Set to 'Azure.Speech' for type SpeechEndpointDeploymentResourceProperties. Set to 'managedOnlineEndpoint' for type ManagedOnlineEndpointDeploymentResourceProperties. | 'Azure.ContentSafety' 'Azure.OpenAI' 'Azure.Speech' 'managedOnlineEndpoint' (required) |
EndpointDeploymentResourcePropertiesBasicResource
Name | Description | Value |
---|---|---|
properties | EndpointDeploymentResourceProperties (required) |
EndpointResourceProperties
Name | Description | Value |
---|---|---|
associatedResourceId | Byo resource id for creating the built-in model service endpoints. | string |
deployments | Deployments info. | EndpointDeploymentResourcePropertiesBasicResource[] |
endpointType | Set to 'Azure.ContentSafety' for type ContentSafetyEndpointResourceProperties. Set to 'Azure.OpenAI' for type OpenAIEndpointResourceProperties. Set to 'Azure.Speech' for type SpeechEndpointResourceProperties. Set to 'managedOnlineEndpoint' for type ManagedOnlineEndpointResourceProperties. Set to 'serverlessEndpoint' for type ServerlessEndpointResourceProperties. | 'Azure.ContentSafety' 'Azure.OpenAI' 'Azure.Speech' 'managedOnlineEndpoint' 'serverlessEndpoint' (required) |
endpointUri | Uri of the endpoint. | string |
failureReason | The failure reason if the creation failed. | string |
location | Location of the endpoint. Since input dto and when parse endpoint resource share the same contract this Location field is just for parse the endpoint resource info we won't let customer specify the endpoint resource location since we will create it the same location as workspace |
string |
name | Name of the endpoint. | string |
shouldCreateAiServicesEndpoint | Whether the proxy (non-byo) endpoint is a regular endpoint or a OneKeyV2 AI services account endpoint. | bool |
ManagedOnlineEndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
endpointComputeType | Enum to determine endpoint compute type. | 'AzureMLCompute' 'Kubernetes' 'Managed' |
model | string | |
type | Kind of the deployment. | 'managedOnlineEndpoint' (required) |
ManagedOnlineEndpointResourceProperties
Name | Description | Value |
---|---|---|
authMode | Enum to determine endpoint authentication mode. | 'AADToken' 'AMLToken' 'Key' |
compute | string | |
description | string | |
endpointType | Type of the endpoint. | 'managedOnlineEndpoint' (required) |
mirrorTraffic | Dictionary of <integer> | ManagedOnlineEndpointResourcePropertiesMirrorTraffic |
scoringUri | string | |
traffic | Dictionary of <integer> | ManagedOnlineEndpointResourcePropertiesTraffic |
ManagedOnlineEndpointResourcePropertiesMirrorTraffic
Name | Description | Value |
---|
ManagedOnlineEndpointResourcePropertiesTraffic
Name | Description | Value |
---|
Microsoft.MachineLearningServices/workspaces/endpoints
Name | Description | Value |
---|---|---|
name | The resource name | string Constraints: Pattern = ^[a-zA-Z0-9][a-zA-Z0-9_.-]{2,32}$ (required) |
parent | In Bicep, you can specify the parent resource for a child resource. You only need to add this property when the child resource is declared outside of the parent resource. For more information, see Child resource outside parent resource. |
Symbolic name for resource of type: workspaces |
properties | EndpointResourceProperties (required) |
OpenAIEndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
model | Model used for the endpoint deployment. | EndpointDeploymentModel (required) |
raiPolicyName | The name of RAI policy. | string |
type | Kind of the deployment. | 'Azure.OpenAI' (required) |
versionUpgradeOption | Deployment model version upgrade option. | 'NoAutoUpgrade' 'OnceCurrentVersionExpired' 'OnceNewDefaultVersionAvailable' |
OpenAIEndpointResourceProperties
Name | Description | Value |
---|---|---|
endpointType | Type of the endpoint. | 'Azure.OpenAI' (required) |
ServerlessEndpointCapacityReservation
Name | Description | Value |
---|---|---|
capacityReservationGroupId | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
|
endpointReservedCapacity | int |
ServerlessEndpointContentSafety
Name | Description | Value |
---|---|---|
contentSafetyStatus | Specifies the status of content safety. | 'Disabled' 'Enabled' (required) |
ServerlessEndpointInferenceEndpoint
Name | Description | Value |
---|
ServerlessEndpointModelSettings
Name | Description | Value |
---|---|---|
modelId | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
ServerlessEndpointResourceProperties
Name | Description | Value |
---|---|---|
authMode | 'Key' | |
capacityReservation | ServerlessEndpointCapacityReservation | |
contentSafety | ServerlessEndpointContentSafety | |
endpointState | State of the Serverless Endpoint. | 'Creating' 'CreationFailed' 'Deleting' 'DeletionFailed' 'Online' 'Reinstating' 'Suspended' 'Suspending' 'Unknown' |
endpointType | Type of the endpoint. | 'serverlessEndpoint' (required) |
inferenceEndpoint | ServerlessEndpointInferenceEndpoint | |
marketplaceSubscriptionId | string | |
metadata | Anything | any |
modelSettings | ServerlessEndpointModelSettings | |
offer | ServerlessOffer |
ServerlessOffer
Name | Description | Value |
---|---|---|
offerName | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
|
publisher | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
SpeechEndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
model | Model used for the endpoint deployment. | EndpointDeploymentModel (required) |
raiPolicyName | The name of RAI policy. | string |
type | Kind of the deployment. | 'Azure.Speech' (required) |
versionUpgradeOption | Deployment model version upgrade option. | 'NoAutoUpgrade' 'OnceCurrentVersionExpired' 'OnceNewDefaultVersionAvailable' |
SpeechEndpointResourceProperties
Name | Description | Value |
---|---|---|
endpointType | Type of the endpoint. | 'Azure.Speech' (required) |
ARM template resource definition
The workspaces/endpoints resource type can be deployed with operations that target:
- Resource groups - See resource group deployment commands
For a list of changed properties in each API version, see change log.
Resource format
To create a Microsoft.MachineLearningServices/workspaces/endpoints resource, add the following JSON to your template.
{
"type": "Microsoft.MachineLearningServices/workspaces/endpoints",
"apiVersion": "2024-07-01-preview",
"name": "string",
"properties": {
"associatedResourceId": "string",
"deployments": [
{
"properties": {
"failureReason": "string",
"type": "string"
// For remaining properties, see EndpointDeploymentResourceProperties objects
}
}
],
"endpointUri": "string",
"failureReason": "string",
"location": "string",
"name": "string",
"shouldCreateAiServicesEndpoint": "bool",
"endpointType": "string"
// For remaining properties, see EndpointResourceProperties objects
}
}
EndpointResourceProperties objects
Set the endpointType property to specify the type of object.
For Azure.ContentSafety, use:
{
"endpointType": "Azure.ContentSafety"
}
For Azure.OpenAI, use:
{
"endpointType": "Azure.OpenAI"
}
For Azure.Speech, use:
{
"endpointType": "Azure.Speech"
}
For managedOnlineEndpoint, use:
{
"authMode": "string",
"compute": "string",
"description": "string",
"endpointType": "managedOnlineEndpoint",
"mirrorTraffic": {
"{customized property}": "int"
},
"scoringUri": "string",
"traffic": {
"{customized property}": "int"
}
}
For serverlessEndpoint, use:
{
"authMode": "string",
"capacityReservation": {
"capacityReservationGroupId": "string",
"endpointReservedCapacity": "int"
},
"contentSafety": {
"contentSafetyStatus": "string"
},
"endpointState": "string",
"endpointType": "serverlessEndpoint",
"inferenceEndpoint": {
},
"marketplaceSubscriptionId": "string",
"metadata": {},
"modelSettings": {
"modelId": "string"
},
"offer": {
"offerName": "string",
"publisher": "string"
}
}
EndpointDeploymentResourceProperties objects
Set the type property to specify the type of object.
For Azure.ContentSafety, use:
{
"model": {
"format": "string",
"name": "string",
"source": "string",
"version": "string"
},
"raiPolicyName": "string",
"type": "Azure.ContentSafety",
"versionUpgradeOption": "string"
}
For Azure.OpenAI, use:
{
"model": {
"format": "string",
"name": "string",
"source": "string",
"version": "string"
},
"raiPolicyName": "string",
"type": "Azure.OpenAI",
"versionUpgradeOption": "string"
}
For Azure.Speech, use:
{
"model": {
"format": "string",
"name": "string",
"source": "string",
"version": "string"
},
"raiPolicyName": "string",
"type": "Azure.Speech",
"versionUpgradeOption": "string"
}
For managedOnlineEndpoint, use:
{
"endpointComputeType": "string",
"model": "string",
"type": "managedOnlineEndpoint"
}
Property values
ContentSafetyEndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
model | Model used for the endpoint deployment. | EndpointDeploymentModel (required) |
raiPolicyName | The name of RAI policy. | string |
type | Kind of the deployment. | 'Azure.ContentSafety' (required) |
versionUpgradeOption | Deployment model version upgrade option. | 'NoAutoUpgrade' 'OnceCurrentVersionExpired' 'OnceNewDefaultVersionAvailable' |
ContentSafetyEndpointResourceProperties
Name | Description | Value |
---|---|---|
endpointType | Type of the endpoint. | 'Azure.ContentSafety' (required) |
EndpointDeploymentModel
Name | Description | Value |
---|---|---|
format | Model format | string |
name | Model name. | string |
source | Optional. Deployment model source ARM resource ID. | string |
version | Model version. | string |
EndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
failureReason | The failure reason if the creation failed. | string |
type | Set to 'Azure.ContentSafety' for type ContentSafetyEndpointDeploymentResourceProperties. Set to 'Azure.OpenAI' for type OpenAIEndpointDeploymentResourceProperties. Set to 'Azure.Speech' for type SpeechEndpointDeploymentResourceProperties. Set to 'managedOnlineEndpoint' for type ManagedOnlineEndpointDeploymentResourceProperties. | 'Azure.ContentSafety' 'Azure.OpenAI' 'Azure.Speech' 'managedOnlineEndpoint' (required) |
EndpointDeploymentResourcePropertiesBasicResource
Name | Description | Value |
---|---|---|
properties | EndpointDeploymentResourceProperties (required) |
EndpointResourceProperties
Name | Description | Value |
---|---|---|
associatedResourceId | Byo resource id for creating the built-in model service endpoints. | string |
deployments | Deployments info. | EndpointDeploymentResourcePropertiesBasicResource[] |
endpointType | Set to 'Azure.ContentSafety' for type ContentSafetyEndpointResourceProperties. Set to 'Azure.OpenAI' for type OpenAIEndpointResourceProperties. Set to 'Azure.Speech' for type SpeechEndpointResourceProperties. Set to 'managedOnlineEndpoint' for type ManagedOnlineEndpointResourceProperties. Set to 'serverlessEndpoint' for type ServerlessEndpointResourceProperties. | 'Azure.ContentSafety' 'Azure.OpenAI' 'Azure.Speech' 'managedOnlineEndpoint' 'serverlessEndpoint' (required) |
endpointUri | Uri of the endpoint. | string |
failureReason | The failure reason if the creation failed. | string |
location | Location of the endpoint. Since input dto and when parse endpoint resource share the same contract this Location field is just for parse the endpoint resource info we won't let customer specify the endpoint resource location since we will create it the same location as workspace |
string |
name | Name of the endpoint. | string |
shouldCreateAiServicesEndpoint | Whether the proxy (non-byo) endpoint is a regular endpoint or a OneKeyV2 AI services account endpoint. | bool |
ManagedOnlineEndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
endpointComputeType | Enum to determine endpoint compute type. | 'AzureMLCompute' 'Kubernetes' 'Managed' |
model | string | |
type | Kind of the deployment. | 'managedOnlineEndpoint' (required) |
ManagedOnlineEndpointResourceProperties
Name | Description | Value |
---|---|---|
authMode | Enum to determine endpoint authentication mode. | 'AADToken' 'AMLToken' 'Key' |
compute | string | |
description | string | |
endpointType | Type of the endpoint. | 'managedOnlineEndpoint' (required) |
mirrorTraffic | Dictionary of <integer> | ManagedOnlineEndpointResourcePropertiesMirrorTraffic |
scoringUri | string | |
traffic | Dictionary of <integer> | ManagedOnlineEndpointResourcePropertiesTraffic |
ManagedOnlineEndpointResourcePropertiesMirrorTraffic
Name | Description | Value |
---|
ManagedOnlineEndpointResourcePropertiesTraffic
Name | Description | Value |
---|
Microsoft.MachineLearningServices/workspaces/endpoints
Name | Description | Value |
---|---|---|
apiVersion | The api version | '2024-07-01-preview' |
name | The resource name | string Constraints: Pattern = ^[a-zA-Z0-9][a-zA-Z0-9_.-]{2,32}$ (required) |
properties | EndpointResourceProperties (required) | |
type | The resource type | 'Microsoft.MachineLearningServices/workspaces/endpoints' |
OpenAIEndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
model | Model used for the endpoint deployment. | EndpointDeploymentModel (required) |
raiPolicyName | The name of RAI policy. | string |
type | Kind of the deployment. | 'Azure.OpenAI' (required) |
versionUpgradeOption | Deployment model version upgrade option. | 'NoAutoUpgrade' 'OnceCurrentVersionExpired' 'OnceNewDefaultVersionAvailable' |
OpenAIEndpointResourceProperties
Name | Description | Value |
---|---|---|
endpointType | Type of the endpoint. | 'Azure.OpenAI' (required) |
ServerlessEndpointCapacityReservation
Name | Description | Value |
---|---|---|
capacityReservationGroupId | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
|
endpointReservedCapacity | int |
ServerlessEndpointContentSafety
Name | Description | Value |
---|---|---|
contentSafetyStatus | Specifies the status of content safety. | 'Disabled' 'Enabled' (required) |
ServerlessEndpointInferenceEndpoint
Name | Description | Value |
---|
ServerlessEndpointModelSettings
Name | Description | Value |
---|---|---|
modelId | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
ServerlessEndpointResourceProperties
Name | Description | Value |
---|---|---|
authMode | 'Key' | |
capacityReservation | ServerlessEndpointCapacityReservation | |
contentSafety | ServerlessEndpointContentSafety | |
endpointState | State of the Serverless Endpoint. | 'Creating' 'CreationFailed' 'Deleting' 'DeletionFailed' 'Online' 'Reinstating' 'Suspended' 'Suspending' 'Unknown' |
endpointType | Type of the endpoint. | 'serverlessEndpoint' (required) |
inferenceEndpoint | ServerlessEndpointInferenceEndpoint | |
marketplaceSubscriptionId | string | |
metadata | Anything | any |
modelSettings | ServerlessEndpointModelSettings | |
offer | ServerlessOffer |
ServerlessOffer
Name | Description | Value |
---|---|---|
offerName | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
|
publisher | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
SpeechEndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
model | Model used for the endpoint deployment. | EndpointDeploymentModel (required) |
raiPolicyName | The name of RAI policy. | string |
type | Kind of the deployment. | 'Azure.Speech' (required) |
versionUpgradeOption | Deployment model version upgrade option. | 'NoAutoUpgrade' 'OnceCurrentVersionExpired' 'OnceNewDefaultVersionAvailable' |
SpeechEndpointResourceProperties
Name | Description | Value |
---|---|---|
endpointType | Type of the endpoint. | 'Azure.Speech' (required) |
Terraform (AzAPI provider) resource definition
The workspaces/endpoints resource type can be deployed with operations that target:
- Resource groups
For a list of changed properties in each API version, see change log.
Resource format
To create a Microsoft.MachineLearningServices/workspaces/endpoints resource, add the following Terraform to your template.
resource "azapi_resource" "symbolicname" {
type = "Microsoft.MachineLearningServices/workspaces/endpoints@2024-07-01-preview"
name = "string"
body = jsonencode({
properties = {
associatedResourceId = "string"
deployments = [
{
properties = {
failureReason = "string"
type = "string"
// For remaining properties, see EndpointDeploymentResourceProperties objects
}
}
]
endpointUri = "string"
failureReason = "string"
location = "string"
name = "string"
shouldCreateAiServicesEndpoint = bool
endpointType = "string"
// For remaining properties, see EndpointResourceProperties objects
}
})
}
EndpointResourceProperties objects
Set the endpointType property to specify the type of object.
For Azure.ContentSafety, use:
{
endpointType = "Azure.ContentSafety"
}
For Azure.OpenAI, use:
{
endpointType = "Azure.OpenAI"
}
For Azure.Speech, use:
{
endpointType = "Azure.Speech"
}
For managedOnlineEndpoint, use:
{
authMode = "string"
compute = "string"
description = "string"
endpointType = "managedOnlineEndpoint"
mirrorTraffic = {
{customized property} = int
}
scoringUri = "string"
traffic = {
{customized property} = int
}
}
For serverlessEndpoint, use:
{
authMode = "string"
capacityReservation = {
capacityReservationGroupId = "string"
endpointReservedCapacity = int
}
contentSafety = {
contentSafetyStatus = "string"
}
endpointState = "string"
endpointType = "serverlessEndpoint"
inferenceEndpoint = {
}
marketplaceSubscriptionId = "string"
metadata = ?
modelSettings = {
modelId = "string"
}
offer = {
offerName = "string"
publisher = "string"
}
}
EndpointDeploymentResourceProperties objects
Set the type property to specify the type of object.
For Azure.ContentSafety, use:
{
model = {
format = "string"
name = "string"
source = "string"
version = "string"
}
raiPolicyName = "string"
type = "Azure.ContentSafety"
versionUpgradeOption = "string"
}
For Azure.OpenAI, use:
{
model = {
format = "string"
name = "string"
source = "string"
version = "string"
}
raiPolicyName = "string"
type = "Azure.OpenAI"
versionUpgradeOption = "string"
}
For Azure.Speech, use:
{
model = {
format = "string"
name = "string"
source = "string"
version = "string"
}
raiPolicyName = "string"
type = "Azure.Speech"
versionUpgradeOption = "string"
}
For managedOnlineEndpoint, use:
{
endpointComputeType = "string"
model = "string"
type = "managedOnlineEndpoint"
}
Property values
ContentSafetyEndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
model | Model used for the endpoint deployment. | EndpointDeploymentModel (required) |
raiPolicyName | The name of RAI policy. | string |
type | Kind of the deployment. | 'Azure.ContentSafety' (required) |
versionUpgradeOption | Deployment model version upgrade option. | 'NoAutoUpgrade' 'OnceCurrentVersionExpired' 'OnceNewDefaultVersionAvailable' |
ContentSafetyEndpointResourceProperties
Name | Description | Value |
---|---|---|
endpointType | Type of the endpoint. | 'Azure.ContentSafety' (required) |
EndpointDeploymentModel
Name | Description | Value |
---|---|---|
format | Model format | string |
name | Model name. | string |
source | Optional. Deployment model source ARM resource ID. | string |
version | Model version. | string |
EndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
failureReason | The failure reason if the creation failed. | string |
type | Set to 'Azure.ContentSafety' for type ContentSafetyEndpointDeploymentResourceProperties. Set to 'Azure.OpenAI' for type OpenAIEndpointDeploymentResourceProperties. Set to 'Azure.Speech' for type SpeechEndpointDeploymentResourceProperties. Set to 'managedOnlineEndpoint' for type ManagedOnlineEndpointDeploymentResourceProperties. | 'Azure.ContentSafety' 'Azure.OpenAI' 'Azure.Speech' 'managedOnlineEndpoint' (required) |
EndpointDeploymentResourcePropertiesBasicResource
Name | Description | Value |
---|---|---|
properties | EndpointDeploymentResourceProperties (required) |
EndpointResourceProperties
Name | Description | Value |
---|---|---|
associatedResourceId | Byo resource id for creating the built-in model service endpoints. | string |
deployments | Deployments info. | EndpointDeploymentResourcePropertiesBasicResource[] |
endpointType | Set to 'Azure.ContentSafety' for type ContentSafetyEndpointResourceProperties. Set to 'Azure.OpenAI' for type OpenAIEndpointResourceProperties. Set to 'Azure.Speech' for type SpeechEndpointResourceProperties. Set to 'managedOnlineEndpoint' for type ManagedOnlineEndpointResourceProperties. Set to 'serverlessEndpoint' for type ServerlessEndpointResourceProperties. | 'Azure.ContentSafety' 'Azure.OpenAI' 'Azure.Speech' 'managedOnlineEndpoint' 'serverlessEndpoint' (required) |
endpointUri | Uri of the endpoint. | string |
failureReason | The failure reason if the creation failed. | string |
location | Location of the endpoint. Since input dto and when parse endpoint resource share the same contract this Location field is just for parse the endpoint resource info we won't let customer specify the endpoint resource location since we will create it the same location as workspace |
string |
name | Name of the endpoint. | string |
shouldCreateAiServicesEndpoint | Whether the proxy (non-byo) endpoint is a regular endpoint or a OneKeyV2 AI services account endpoint. | bool |
ManagedOnlineEndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
endpointComputeType | Enum to determine endpoint compute type. | 'AzureMLCompute' 'Kubernetes' 'Managed' |
model | string | |
type | Kind of the deployment. | 'managedOnlineEndpoint' (required) |
ManagedOnlineEndpointResourceProperties
Name | Description | Value |
---|---|---|
authMode | Enum to determine endpoint authentication mode. | 'AADToken' 'AMLToken' 'Key' |
compute | string | |
description | string | |
endpointType | Type of the endpoint. | 'managedOnlineEndpoint' (required) |
mirrorTraffic | Dictionary of <integer> | ManagedOnlineEndpointResourcePropertiesMirrorTraffic |
scoringUri | string | |
traffic | Dictionary of <integer> | ManagedOnlineEndpointResourcePropertiesTraffic |
ManagedOnlineEndpointResourcePropertiesMirrorTraffic
Name | Description | Value |
---|
ManagedOnlineEndpointResourcePropertiesTraffic
Name | Description | Value |
---|
Microsoft.MachineLearningServices/workspaces/endpoints
Name | Description | Value |
---|---|---|
name | The resource name | string Constraints: Pattern = ^[a-zA-Z0-9][a-zA-Z0-9_.-]{2,32}$ (required) |
parent_id | The ID of the resource that is the parent for this resource. | ID for resource of type: workspaces |
properties | EndpointResourceProperties (required) | |
type | The resource type | "Microsoft.MachineLearningServices/workspaces/endpoints@2024-07-01-preview" |
OpenAIEndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
model | Model used for the endpoint deployment. | EndpointDeploymentModel (required) |
raiPolicyName | The name of RAI policy. | string |
type | Kind of the deployment. | 'Azure.OpenAI' (required) |
versionUpgradeOption | Deployment model version upgrade option. | 'NoAutoUpgrade' 'OnceCurrentVersionExpired' 'OnceNewDefaultVersionAvailable' |
OpenAIEndpointResourceProperties
Name | Description | Value |
---|---|---|
endpointType | Type of the endpoint. | 'Azure.OpenAI' (required) |
ServerlessEndpointCapacityReservation
Name | Description | Value |
---|---|---|
capacityReservationGroupId | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
|
endpointReservedCapacity | int |
ServerlessEndpointContentSafety
Name | Description | Value |
---|---|---|
contentSafetyStatus | Specifies the status of content safety. | 'Disabled' 'Enabled' (required) |
ServerlessEndpointInferenceEndpoint
Name | Description | Value |
---|
ServerlessEndpointModelSettings
Name | Description | Value |
---|---|---|
modelId | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
ServerlessEndpointResourceProperties
Name | Description | Value |
---|---|---|
authMode | 'Key' | |
capacityReservation | ServerlessEndpointCapacityReservation | |
contentSafety | ServerlessEndpointContentSafety | |
endpointState | State of the Serverless Endpoint. | 'Creating' 'CreationFailed' 'Deleting' 'DeletionFailed' 'Online' 'Reinstating' 'Suspended' 'Suspending' 'Unknown' |
endpointType | Type of the endpoint. | 'serverlessEndpoint' (required) |
inferenceEndpoint | ServerlessEndpointInferenceEndpoint | |
marketplaceSubscriptionId | string | |
metadata | Anything | any |
modelSettings | ServerlessEndpointModelSettings | |
offer | ServerlessOffer |
ServerlessOffer
Name | Description | Value |
---|---|---|
offerName | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
|
publisher | string Constraints: Min length = 1 Pattern = [a-zA-Z0-9_] (required) |
SpeechEndpointDeploymentResourceProperties
Name | Description | Value |
---|---|---|
model | Model used for the endpoint deployment. | EndpointDeploymentModel (required) |
raiPolicyName | The name of RAI policy. | string |
type | Kind of the deployment. | 'Azure.Speech' (required) |
versionUpgradeOption | Deployment model version upgrade option. | 'NoAutoUpgrade' 'OnceCurrentVersionExpired' 'OnceNewDefaultVersionAvailable' |
SpeechEndpointResourceProperties
Name | Description | Value |
---|---|---|
endpointType | Type of the endpoint. | 'Azure.Speech' (required) |