Azure OpenAI: o3-mini deployment has 1 minute hard timeout via API call
Andres da Silva Santos
140
Reputation points
When using o3-mini with stream: true the API cuts the connection after 1 minute waiting for some event.
Request example:
curl --location 'https://host.openai.azure.com/openai/deployments/o3-mini/chat/completions?api-version=2025-01-01-preview' \
--header 'api-key: 123' \
--header 'Content-Type: application/json' \
--data-raw '{
"messages": [
{
"role": "user",
"content": "complex task here"
}
],
"reasoning_effort": "high",
"max_completion_tokens": 60000,
"n": 1,
"response_format": {
"type": "json_schema",
"json_schema": {
"name": "responseFormat",
"schema": {
"type": "object",
"properties": {
"files": {
"type": "array",
"items": {
"type": "object",
"properties": {
"name": {
"type": "string"
},
"content": {
"type": "string"
}
},
"required": [
"name",
"content"
],
"additionalProperties": false
}
}
},
"required": [
"files"
],
"additionalProperties": false
},
"strict": true
}
},
"stream": true,
"stream_options": {
"include_usage": true
},
"user": "app_math_teacher"
}'
Response:
408 Timeout
HTTP/1.1 408 Timeout
Content-Length: 75
Content-Type: application/json
apim-request-id: 8e3359d8-b764-490c-9de8-94ae3a55343e
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
x-content-type-options: nosniff
x-ms-region: East US 2
x-ratelimit-remaining-requests: 1048
x-ratelimit-remaining-tokens: 10390000
Date: Tue, 04 Feb 2025 01:10:38 GMT
{ "error": { "code": "Timeout", "message": "The operation was timeout." } }
OBS: This only occurs with stream: true
Local: East US 2
Sign in to answer