Supported metrics for Microsoft.MachineLearningServices/workspaces/onlineEndpoints
The following table lists the metrics available for the Microsoft.MachineLearningServices/workspaces/onlineEndpoints resource type.
Table headings
Metric - The metric display name as it appears in the Azure portal.
Name in REST API - Metric name as referred to in the REST API.
Unit - Unit of measure.
Aggregation - The default aggregation type. Valid values: Average, Minimum, Maximum, Total, Count.
Dimensions - Dimensions available for the metric.
Time Grains - Intervals at which the metric is sampled. For example, PT1M indicates that the metric is sampled every minute, PT30M every 30 minutes, PT1H every hour, and so on.
DS Export - Whether the metric is exportable to Azure Monitor Logs via Diagnostic Settings.
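The Time Grains values are ISO 8601 durations. As a minimal sketch, they can be converted to Python `timedelta` objects for use in client-side code. The helper name `time_grain_to_timedelta` is hypothetical, and the pattern only covers the hour, minute, and second forms such as PT1M, PT30M, and PT1H; other ISO 8601 forms are out of scope here.

```python
import re
from datetime import timedelta

# Matches time-only ISO 8601 durations such as PT1M, PT30M, PT1H, PT1H30M.
# Assumption: date components (for example P1D) are not needed for this table.
_TIME_GRAIN = re.compile(r"^PT(?:(\d+)H)?(?:(\d+)M)?(?:(\d+)S)?$")


def time_grain_to_timedelta(grain: str) -> timedelta:
    """Convert a time grain string like 'PT1M' into a timedelta."""
    match = _TIME_GRAIN.match(grain)
    # Reject strings that do not match, and a bare 'PT' with no components.
    if match is None or not any(match.groups()):
        raise ValueError(f"Unsupported time grain: {grain!r}")
    hours, minutes, seconds = (int(part or 0) for part in match.groups())
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


if __name__ == "__main__":
    for grain in ("PT1M", "PT30M", "PT1H"):
        print(grain, "->", time_grain_to_timedelta(grain))
```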
For information on exporting metrics, see Metrics export using data collection rules and Create diagnostic settings in Azure Monitor.
For information on metric retention, see Azure Monitor Metrics overview.
For a list of supported logs, see Supported log categories - Microsoft.MachineLearningServices/workspaces/onlineEndpoints.
Category | Metric | Name in REST API | Unit | Aggregation | Dimensions | Time Grains | DS Export |
---|---|---|---|---|---|---|---|
Traffic | Connections Active: The total number of concurrent TCP connections active from clients. | ConnectionsActive | Count | Average | <none> | PT1M | No |
Traffic | Data Collection Errors Per Minute: The number of data collection events dropped per minute. | DataCollectionErrorsPerMinute | Count | Minimum, Maximum, Average | deployment, reason, type | PT1M | No |
Traffic | Data Collection Events Per Minute: The number of data collection events processed per minute. | DataCollectionEventsPerMinute | Count | Minimum, Maximum, Average | deployment, type | PT1M | No |
Traffic | Network Bytes: The bytes per second served for the endpoint. | NetworkBytes | BytesPerSecond | Average | <none> | PT1M | No |
Traffic | New Connections Per Second: The average number of new TCP connections per second established from clients. | NewConnectionsPerSecond | CountPerSecond | Average | <none> | PT1M | No |
Traffic | Request Latency: The average total time, in milliseconds, taken to respond to a request. | RequestLatency | Milliseconds | Average | deployment | PT1M | Yes |
Traffic | Request Latency P50: The average P50 request latency, aggregated over all request latency values collected during the selected time period. | RequestLatency_P50 | Milliseconds | Average | deployment | PT1M | Yes |
Traffic | Request Latency P90: The average P90 request latency, aggregated over all request latency values collected during the selected time period. | RequestLatency_P90 | Milliseconds | Average | deployment | PT1M | Yes |
Traffic | Request Latency P95: The average P95 request latency, aggregated over all request latency values collected during the selected time period. | RequestLatency_P95 | Milliseconds | Average | deployment | PT1M | Yes |
Traffic | Request Latency P99: The average P99 request latency, aggregated over all request latency values collected during the selected time period. | RequestLatency_P99 | Milliseconds | Average | deployment | PT1M | Yes |
Traffic | Requests Per Minute: The number of requests sent to the online endpoint within a minute. | RequestsPerMinute | Count | Average | deployment, statusCode, statusCodeClass, modelStatusCode | PT1M | No |