How to trace your application with Azure AI Inference SDK

Important

Items marked (preview) in this article are currently in public preview. This preview is provided without a service-level agreement, and we don't recommend it for production workloads. Certain features might not be supported or might have constrained capabilities. For more information, see Supplemental Terms of Use for Microsoft Azure Previews.

In this article, you learn how to trace your application with the Azure AI Inference SDK, with your choice of Python, JavaScript, or C#. The Azure AI Inference client library provides support for tracing with OpenTelemetry.

Enable tracing in your application

Prerequisites

Installation

Install the package azure-ai-inference using your package manager, like pip:

  pip install azure-ai-inference[opentelemetry] 

Install the Azure Core OpenTelemetry Tracing plugin, the OpenTelemetry SDK, and the OTLP exporter for sending telemetry to your observability backend. To install the necessary packages for Python, use the following pip commands:

pip install azure-core-tracing-opentelemetry

pip install opentelemetry-sdk

pip install opentelemetry-exporter-otlp

To learn more about Azure AI Inference SDK for Python and observability, see Tracing via Inference SDK for Python.

To learn more, see the Inference SDK reference.

Configuration

Add the following configuration settings as needed for your use case (a snippet that applies both settings follows the list):

  • To capture prompt and completion contents, set the AZURE_TRACING_GEN_AI_CONTENT_RECORDING_ENABLED environment variable to true (case insensitive). By default, prompts, completions, function names, parameters, or outputs aren't recorded.

  • To enable Azure SDK tracing, set the AZURE_SDK_TRACING_IMPLEMENTATION environment variable to opentelemetry. Alternatively, you can configure it in the code with the following snippet:

    from azure.core.settings import settings 
    
    settings.tracing_implementation = "opentelemetry" 
    

    To learn more, see Azure Core Tracing OpenTelemetry client library for Python.
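As a convenience, both settings from the preceding list can also be applied from Python by setting the environment variables before any client is created. This minimal sketch uses only the standard library:

import os

# Record prompt and completion contents in traces (off by default).
os.environ["AZURE_TRACING_GEN_AI_CONTENT_RECORDING_ENABLED"] = "true"

# Route Azure SDK tracing through OpenTelemetry.
os.environ["AZURE_SDK_TRACING_IMPLEMENTATION"] = "opentelemetry"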

Enable Instrumentation

The final step is to enable Azure AI Inference instrumentation with the following code snippet:

from azure.ai.inference.tracing import AIInferenceInstrumentor

# Instrument the AI Inference API
AIInferenceInstrumentor().instrument()
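Putting the pieces together, the following sketch wires an OTLP exporter into the OpenTelemetry tracer provider and then instruments a chat completions call. The AZURE_AI_ENDPOINT and AZURE_AI_KEY environment variable names are placeholders for your own deployment details, not names defined by the SDK:

import os

from opentelemetry import trace
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import BatchSpanProcessor
from opentelemetry.exporter.otlp.proto.grpc.trace_exporter import OTLPSpanExporter

from azure.ai.inference import ChatCompletionsClient
from azure.ai.inference.models import UserMessage
from azure.ai.inference.tracing import AIInferenceInstrumentor
from azure.core.credentials import AzureKeyCredential

# Send spans to the OTLP endpoint of your observability backend.
provider = TracerProvider()
provider.add_span_processor(BatchSpanProcessor(OTLPSpanExporter()))
trace.set_tracer_provider(provider)

# Instrument the AI Inference API before making calls.
AIInferenceInstrumentor().instrument()

client = ChatCompletionsClient(
    endpoint=os.environ["AZURE_AI_ENDPOINT"],  # placeholder variable name
    credential=AzureKeyCredential(os.environ["AZURE_AI_KEY"]),  # placeholder variable name
)
response = client.complete(messages=[UserMessage(content="Hello!")])
print(response.choices[0].message.content)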

It's also possible to uninstrument the Azure AI Inference API by using the uninstrument call. After this call, traces are no longer emitted by the Azure AI Inference API until instrument is called again:

AIInferenceInstrumentor().uninstrument() 

Tracing your own functions

To trace your own custom functions, use OpenTelemetry: instrument your code with the OpenTelemetry SDK by setting up a tracer provider and creating spans around the code you want to trace. Each span represents a unit of work, and spans can be nested to form a trace tree. You can add attributes to spans to enrich the trace data with additional context. Once instrumented, configure an exporter to send the trace data to a backend for analysis and visualization. For detailed instructions and advanced usage, refer to the OpenTelemetry documentation. This helps you monitor the performance of your custom functions and gain insights into their execution.
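As a minimal sketch of this pattern, the function and attribute names below are hypothetical; the tracer, span, and attribute APIs are standard OpenTelemetry:

from opentelemetry import trace

tracer = trace.get_tracer(__name__)

def process_order(order_id: str) -> None:  # hypothetical function
    # Each "with" block creates a span; nesting them builds the trace tree.
    with tracer.start_as_current_span("process_order") as span:
        span.set_attribute("app.order_id", order_id)  # illustrative attribute
        with tracer.start_as_current_span("validate_order"):
            pass  # the work you want to trace goes here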

Attach User feedback to traces

To attach user feedback to traces and visualize it in the Azure AI Foundry portal using OpenTelemetry's semantic conventions, instrument your application to enable tracing and log user feedback. By correlating feedback traces with their respective chat request traces using the response ID, you can view and manage these traces in the Azure AI Foundry portal. OpenTelemetry's specification allows for standardized and enriched trace data, which can be analyzed in the Azure AI Foundry portal for performance optimization and user experience insights. This approach helps you use the full power of OpenTelemetry for enhanced observability in your applications.
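As a sketch of the correlation idea, the span name, helper function, and rating attribute below are illustrative assumptions; gen_ai.response.id is the OpenTelemetry generative-AI attribute that ties the feedback back to the original chat request:

from opentelemetry import trace

tracer = trace.get_tracer(__name__)

def record_user_feedback(response_id: str, rating: int) -> None:  # hypothetical helper
    # Correlate the feedback with the chat request that produced response_id.
    with tracer.start_as_current_span("user_feedback") as span:
        span.set_attribute("gen_ai.response.id", response_id)
        span.set_attribute("user_feedback.rating", rating)  # illustrative attribute name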

Related content

  • Python samples containing fully runnable Python code for tracing using synchronous and asynchronous clients.
  • JavaScript samples containing fully runnable JavaScript code for tracing using synchronous and asynchronous clients.
  • C# samples containing fully runnable C# code for doing inference using synchronous and asynchronous methods.