Supported metrics for Microsoft.MachineLearningServices/workspaces/onlineEndpoints

Article
11/19/2024

The following table lists the metrics available for the Microsoft.MachineLearningServices/workspaces/onlineEndpoints resource type.

Table headings

Metric - The metric display name as it appears in the Azure portal.
Name in Rest API - Metric name as referred to in the REST API.
Unit - Unit of measure.
Aggregation - The default aggregation type. Valid values: Average, Minimum, Maximum, Total, Count.
Dimensions - Dimensions available for the metric.
Time Grains - Intervals at which the metric is sampled. For example, PT1M indicates that the metric is sampled every minute, PT30M every 30 minutes, PT1H every hour, and so on.
DS Export- Whether the metric is exportable to Azure Monitor Logs via Diagnostic Settings.

For information on exporting metrics, see - Metrics export using data collection rules and Create diagnostic settings in Azure Monitor.

For information on metric retention, see Azure Monitor Metrics overview.

For a list of supported logs, see Supported log categories - Microsoft.MachineLearningServices/workspaces/onlineEndpoints

Category	Metric	Name in REST API	Unit	Aggregation	Dimensions	Time Grains	DS Export
Traffic	Connections Active The total number of concurrent TCP connections active from clients.	`ConnectionsActive`	Count	Average	<none>	PT1M	No
Traffic	Data Collection Errors Per Minute The number of data collection events dropped per minute.	`DataCollectionErrorsPerMinute`	Count	Minimum, Maximum, Average	`deployment`, `reason`, `type`	PT1M	No
Traffic	Data Collection Events Per Minute The number of data collection events processed per minute.	`DataCollectionEventsPerMinute`	Count	Minimum, Maximum, Average	`deployment`, `type`	PT1M	No
Traffic	Network Bytes The bytes per second served for the endpoint.	`NetworkBytes`	BytesPerSecond	Average	<none>	PT1M	No
Traffic	New Connections Per Second The average number of new TCP connections per second established from clients.	`NewConnectionsPerSecond`	CountPerSecond	Average	<none>	PT1M	No
Traffic	Request Latency The average complete interval of time taken for a request to be responded in milliseconds	`RequestLatency`	Milliseconds	Average	`deployment`	PT1M	Yes
Traffic	Request Latency P50 The average P50 request latency aggregated by all request latency values collected over the selected time period	`RequestLatency_P50`	Milliseconds	Average	`deployment`	PT1M	Yes
Traffic	Request Latency P90 The average P90 request latency aggregated by all request latency values collected over the selected time period	`RequestLatency_P90`	Milliseconds	Average	`deployment`	PT1M	Yes
Traffic	Request Latency P95 The average P95 request latency aggregated by all request latency values collected over the selected time period	`RequestLatency_P95`	Milliseconds	Average	`deployment`	PT1M	Yes
Traffic	Request Latency P99 The average P99 request latency aggregated by all request latency values collected over the selected time period	`RequestLatency_P99`	Milliseconds	Average	`deployment`	PT1M	Yes
Traffic	Requests Per Minute The number of requests sent to online endpoint within a minute	`RequestsPerMinute`	Count	Average	`deployment`, `statusCode`, `statusCodeClass`, `modelStatusCode`	PT1M	No

Share via

Supported metrics for Microsoft.MachineLearningServices/workspaces/onlineEndpoints

Next steps

Feedback

Additional resources