Azure OpenAI: o3-mini deployment has 1 minute hard timeout via API call

Andres da Silva Santos 140 Reputation points
2025-02-03T21:22:30.89+00:00

When using o3-mini with stream: true the API cuts the connection after 1 minute waiting for some event.

Request example:

curl --location 'https://host.openai.azure.com/openai/deployments/o3-mini/chat/completions?api-version=2025-01-01-preview' \
--header 'api-key: 123' \
--header 'Content-Type: application/json' \
--data-raw '{
    "messages": [
        {
            "role": "user",
            "content": "complex task here"
        }
    ],
    "reasoning_effort": "high",
    "max_completion_tokens": 60000,
    "n": 1,
    "response_format": {
        "type": "json_schema",
        "json_schema": {
            "name": "responseFormat",
            "schema": {
                "type": "object",
                "properties": {
                    "files": {
                        "type": "array",
                        "items": {
                            "type": "object",
                            "properties": {
                                "name": {
                                    "type": "string"
                                },
                                "content": {
                                    "type": "string"
                                }
                            },
                            "required": [
                                "name",
                                "content"
                            ],
                            "additionalProperties": false
                        }
                    }
                },
                "required": [
                    "files"
                ],
                "additionalProperties": false
            },
            "strict": true
        }
    },
    "stream": true,
    "stream_options": {
        "include_usage": true
    },
    "user": "app_math_teacher"
}'

Response:

408 Timeout

HTTP/1.1 408 Timeout
Content-Length: 75
Content-Type: application/json
apim-request-id: 8e3359d8-b764-490c-9de8-94ae3a55343e
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
x-content-type-options: nosniff
x-ms-region: East US 2
x-ratelimit-remaining-requests: 1048
x-ratelimit-remaining-tokens: 10390000
Date: Tue, 04 Feb 2025 01:10:38 GMT
 
{ "error": { "code": "Timeout", "message": "The operation was timeout." } }

OBS: This only occurs with stream: true

Local: East US 2

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,709 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.