Azure OpenAI Service

0 answers

OpenAPI service is returning no response and no error

HI, we are using gpt-4o with the below config: For some of the requests it is giving no response and no error message. Wanted to understand the reason for this and also how to avoid it?

asked

Manaswita Datta 0 Microsoft Employee

commented

James McCarthy 0

1 answer

it is possible to retrieve the actual text that assistant is using from vector store?

I'm currently using azure-ai-openai-assistants 1.0.0-beta.4 and client.listMessages(threadId) returns ThreadMessages but it doesn't contain the actual text retrieved from the vector store but only contains file citations. is there any way to retrieve the…

asked

Jay Lee 0

answered

Sina Salam 15,396

0 answers

rate Limited exceeded error

I am getting the below error while using gpt-4o model "Error occurred with msg: HTTP 429 (429) Rate limit is exceeded. Try again in 1 seconds. and stacktrace : at Azure.AI.OpenAI.ClientPipelineExtensions.ProcessMessage(ClientPipeline pipeline,…

asked

Manaswita Datta 0 Microsoft Employee

commented

Manaswita Datta 0 Microsoft Employee

0 answers

Throttling issue with OpenAI API

Hi, I'm using the Azure OpenAI API in my RAG project and noticed something unusual. In one of the steps, I send 3-4 chat completion requests concurrently. However, the response time appears to be much longer compared to a single request, suggesting that…

asked

Tung Nguyen Xuan 40

commented

Pavankumar Purilla 2,595 Microsoft Vendor

1 answer

450K of 450K TPM

Hello, We are utilizing Azure Sponsorship and currently working with the GPT-4 model. It appears that our quota of 450K/450K TPM (Tokens Per Minute) has been fully utilized. However, I am a bit confused as TPM typically represents tokens per minute, and…

asked

Nico Vincent 0

commented

Marcin Policht 31,705 MVP

0 answers

gpt-o1 performance

Good Day Is that normal to have 180 seconds avg delay at gpt-o1 general available model? Sweden Zone Reason Effort - low Prompts like - How are you ? Knowledge cutoff Using the Playground

asked

Mohamed Hussein 570

commented

SriLakshmi C 1,770 Microsoft Vendor

0 answers

Im getting Error in Chat playground even though I have increase model tokens to 5K. Server responded with status 429. Error message: {'error': {'code': '429', 'message': 'Rate limit is exceeded. Try again in 86400 seconds.'}}

Im using GTP 4o mini and with 5K token limit. I have added own data with Azure search and while trying to query, each time it gives rate limit error.

asked

Yogesh Kulkarni 0

edited a comment

SriLakshmi C 1,770 Microsoft Vendor

0 answers

Azure Open AI Assistant Failure

Hello, Since yesterday we are facing failures regarding uploading files and using file search option of GPT and OpenAI Assistant. When I check the vector side, I can see files are getting failed to be vectorised. This is the problem in Sweden…

asked

AMOUALIAN Hesam 5

commented

AMOUALIAN Hesam 5

1 answer

GPT 4o assistant not accepting PDF attachment in message thread (accepts image)

Hi. We're having difficulty uploading PDF attachments to a message thread for GPT 4o based assistant. We're able to upload images but unable to upload PDF to the thread message. We've enabled File Search and Code Interpreter from Assistants…

asked

GenixPRO 41

answered

Pavankumar Purilla 2,595 Microsoft Vendor

2 answers

Azure AI Llama-3.2-11B-Vision-Instruct shows dramatically less context length than what it should

Hi community, Yesterday I deployed a serverless of Llama-3.2-11B-Vision-Instruct in my Azure AI studio project and then created a backend route to call it with context (for chat completion). I am not sure if I am missing anything but, Meta says the…

asked

JP 0

answered

JP 0

3 answers

What is the carbon cost of using the GPT-4o-mini Global Deployment in the Sweden Central region?

Hello, I am completing a university research project and would like to understand the carbon cost per token for the GPT-4o-mini model when hosted in Azure's Sweden Central region, ideally in terms of grams of CO2 equivalent per token (g…

asked

Hannah Mehr 0

answered

Hannah Mehr 0

0 answers

Azure OpenAI, vector stores stuck processing files

Hello, I'm experiencing inconsistent vector store processing behaviour with Azure OpenAI integration. Files are getting stuck in a "In progress" state with the following characteristics: Vector store files remain in "In progress"…

asked

Srdjan Brkic 0

commented

Wen 0

1 answer

openai.AuthenticationError: Error code: 401 - {'statusCode': 401, 'message': 'Unauthorized. Access token is missing, invalid, audience is incorrect (https://cognitiveservices.azure.com), or have expired.'}

I am trying to run the following code. I have obtained the key and endpoint from my free azure subscription after creating openai resource (see image). I downloaded the code from microsoft page to access api and test it with code below. I checked that…

asked

Chris Grenier 0

commented

santoshkc 11,710 Microsoft Vendor

1 answer

Inquiring on Azure OpenAI pricing for European Regions

Dear Azure OpenAI Support Team, I am writing to inquire about the pricing structure for Azure OpenAI services specifically in European regions (France Central and North Europe or relevant). While reviewing the Azure pricing calculator, I noticed that…

asked

Laurance 0

commented

kothapally Snigdha 870 Microsoft Vendor

2 answers

OpenAI 4o-realtime model latest version availability

Hi, I see that on the openai website the version of the 4o-realtime model is gpt-4o-realtime-preview-2024-12-17 from https://platform.openai.com/docs/guides/realtime As I understand this version has more number of voices and is also much better at…

asked

Akundy, Vyas 60

accepted

Akundy, Vyas 60

0 answers

Failing Azure Foundry Deployment - No Healthy Upstream Error

I am encountering an issue with the GPT-4o model deployed on Azure OpenAI Service. I have deployed the model and attempted to make chat calls using API requests and in the playground. However, as of this afternoon I consistently receive the following…

asked

James Shaw 0 Microsoft Employee

edited a comment

Manas Mohanty (Quadrant Resource LLC) 0 Microsoft Vendor

0 answers

New vector stores for Azure OpenAI Assistants fail after a few days

I am experiencing a recurring issue with newer vector stores created for Azure OpenAI Assistants after December 2024. Assistants linked to these vector stores stop working correctly after a few days, causing significant delays in responses and failure to…

asked

BenjaminS 16

edited the question

BenjaminS 16

1 answer

How to Fetch Azure OpenAl api_version and engine Dynamically After Resource Creation via Python?

Hello! I am using Python to automate the creation of Azure OpenAl resources via the Azure Management API. I am successfully able to create the resource, but I need to dynamically fetch the following details after the resource is created: API Version…

asked

Sudheer Kumar 0

answered

Sina Salam 15,396

1 answer

How to use the whisper openai model in a azure function for azure static webapp

Hi there dear reader, I am working on my first azure static webapp, With Next.js I already have a working version of it. The concept of the application is simple, it makes a transcript of the user (voice) audio input and returns it to the user…

asked

Ted 0

answered

Sina Salam 15,396

1 answer

Azure OpenAI Metrics Issue: Unable to Track Assistant Inference Tokens

Hello, I am encountering an issue with Azure OpenAI Metrics: I can no longer track the number of inference tokens generated by a gpt model when I use OpenAI assistants under the metric AZURE OPENAI - USAGE > Processed Inference Tokens. Previously,…

asked

Andrea ESPOSITO 5

commented

Andrea ESPOSITO 5

Filter

Content

3,539 questions with Azure OpenAI Service tags

OpenAPI service is returning no response and no error

it is possible to retrieve the actual text that assistant is using from vector store?

rate Limited exceeded error

Throttling issue with OpenAI API

450K of 450K TPM

gpt-o1 performance

Im getting Error in Chat playground even though I have increase model tokens to 5K. Server responded with status 429. Error message: {'error': {'code': '429', 'message': 'Rate limit is exceeded. Try again in 86400 seconds.'}}

Azure Open AI Assistant Failure

GPT 4o assistant not accepting PDF attachment in message thread (accepts image)

Azure AI Llama-3.2-11B-Vision-Instruct shows dramatically less context length than what it should

What is the carbon cost of using the GPT-4o-mini Global Deployment in the Sweden Central region?

Azure OpenAI, vector stores stuck processing files

openai.AuthenticationError: Error code: 401 - {'statusCode': 401, 'message': 'Unauthorized. Access token is missing, invalid, audience is incorrect (https://cognitiveservices.azure.com), or have expired.'}

Inquiring on Azure OpenAI pricing for European Regions

OpenAI 4o-realtime model latest version availability

Failing Azure Foundry Deployment - No Healthy Upstream Error

New vector stores for Azure OpenAI Assistants fail after a few days

How to Fetch Azure OpenAl api_version and engine Dynamically After Resource Creation via Python?

How to use the whisper openai model in a azure function for azure static webapp

Azure OpenAI Metrics Issue: Unable to Track Assistant Inference Tokens