OpenAI service is returning no response and no error
Hi, we are using gpt-4o with the below config: For some of the requests, it returns no response and no error message. We want to understand the reason for this and how to avoid it.
Is it possible to retrieve the actual text that the assistant is using from the vector store?
I'm currently using azure-ai-openai-assistants 1.0.0-beta.4, and client.listMessages(threadId) returns ThreadMessages, but they contain only file citations, not the actual text retrieved from the vector store. Is there any way to retrieve the…
Rate limit exceeded error
I am getting the error below while using the gpt-4o model: "Error occurred with msg: HTTP 429 (429) Rate limit is exceeded. Try again in 1 seconds. and stacktrace : at Azure.AI.OpenAI.ClientPipelineExtensions.ProcessMessage(ClientPipeline pipeline,…
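For readers hitting the same 429, here is a minimal sketch of one possible client-side workaround: read the "Try again in N seconds" hint from the error text and wait before retrying. It is not taken from any Azure SDK. `RateLimited`, `parse_retry_seconds`, and `call_with_retry` are hypothetical names, and the exponential fallback and the 60-second cap are assumptions made for illustration.

```python
import re
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")

# Matches the hint seen in the error text, e.g. "Try again in 1 seconds."
_RETRY_HINT = re.compile(r"Try again in (\d+) seconds?", re.IGNORECASE)


class RateLimited(Exception):
    """Hypothetical stand-in for whatever 429 error your SDK raises."""


def parse_retry_seconds(message: str) -> Optional[int]:
    """Return the wait hinted at in a 429 message, or None if there is no hint."""
    match = _RETRY_HINT.search(message)
    return int(match.group(1)) if match else None


def call_with_retry(
    call: Callable[[], T],
    max_attempts: int = 5,
    max_wait: float = 60.0,  # assumption: give up on hints longer than this
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run `call`, waiting and retrying whenever it raises RateLimited."""
    for attempt in range(1, max_attempts + 1):
        try:
            return call()
        except RateLimited as exc:
            hinted = parse_retry_seconds(str(exc))
            # Fall back to exponential backoff (1, 2, 4, ... seconds) without a hint.
            wait = float(hinted) if hinted is not None else float(2 ** (attempt - 1))
            # Re-raise on the last attempt, or when the hinted wait is too long
            # to be worth sleeping through (for example 86400 seconds).
            if attempt == max_attempts or wait > max_wait:
                raise
            sleep(wait)
    raise RuntimeError("max_attempts must be at least 1")
```

In real code you would catch your client library's own rate-limit exception instead of `RateLimited`. A hint as long as 86400 seconds is re-raised immediately rather than slept through, since it usually points to an exhausted quota rather than a short burst.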
Throttling issue with OpenAI API
Hi, I'm using the Azure OpenAI API in my RAG project and noticed something unusual. In one of the steps, I send 3-4 chat completion requests concurrently. However, the response time appears to be much longer compared to a single request, suggesting that…
450K of 450K TPM
Hello, we are utilizing Azure Sponsorship and currently working with the GPT-4 model. It appears that our quota of 450K/450K TPM (Tokens Per Minute) has been fully utilized. However, I am a bit confused as TPM typically represents tokens per minute, and…
gpt-o1 performance
Good day. Is it normal to see an average delay of 180 seconds with the generally available gpt-o1 model? Zone: Sweden. Reasoning effort: low. Prompts like: How are you? Knowledge cutoff. Using the Playground.
I'm getting an error in the Chat playground even though I have increased the model tokens to 5K. Server responded with status 429. Error message: {'error': {'code': '429', 'message': 'Rate limit is exceeded. Try again in 86400 seconds.'}}
I'm using GPT-4o mini with a 5K token limit. I have added my own data with Azure Search, and every time I try to query it, I get a rate limit error.
Azure OpenAI Assistant Failure
Hello, since yesterday we have been facing failures when uploading files and using the file search option of GPT and the OpenAI Assistant. When I check the vector side, I can see that files are failing to be vectorised. This is the problem in Sweden…
GPT 4o assistant not accepting PDF attachment in message thread (accepts image)
Hi. We're having difficulty uploading PDF attachments to a message thread for GPT 4o based assistant. We're able to upload images but unable to upload PDF to the thread message. We've enabled File Search and Code Interpreter from Assistants…
Azure AI Llama-3.2-11B-Vision-Instruct shows dramatically less context length than what it should
Hi community, yesterday I deployed a serverless deployment of Llama-3.2-11B-Vision-Instruct in my Azure AI Studio project and then created a backend route to call it with context (for chat completion). I am not sure if I am missing anything, but Meta says the…
What is the carbon cost of using the GPT-4o-mini Global Deployment in the Sweden Central region?
Hello, I am completing a university research project and would like to understand the carbon cost per token for the GPT-4o-mini model when hosted in Azure's Sweden Central region, ideally in terms of grams of CO2 equivalent per token (g…
Azure OpenAI, vector stores stuck processing files
Hello, I'm experiencing inconsistent vector store processing behaviour with Azure OpenAI integration. Files are getting stuck in an "In progress" state with the following characteristics: Vector store files remain in "In progress"…
openai.AuthenticationError: Error code: 401 - {'statusCode': 401, 'message': 'Unauthorized. Access token is missing, invalid, audience is incorrect (https://cognitiveservices.azure.com), or have expired.'}
I am trying to run the following code. I obtained the key and endpoint from my free Azure subscription after creating an OpenAI resource (see image). I downloaded the code from the Microsoft page to access the API and tested it with the code below. I checked that…
Inquiring on Azure OpenAI pricing for European Regions
Dear Azure OpenAI Support Team, I am writing to inquire about the pricing structure for Azure OpenAI services specifically in European regions (France Central and North Europe or relevant). While reviewing the Azure pricing calculator, I noticed that…
OpenAI 4o-realtime model latest version availability
Hi, I see on the OpenAI website that the version of the 4o-realtime model is gpt-4o-realtime-preview-2024-12-17, from https://platform.openai.com/docs/guides/realtime As I understand it, this version has more voices and is also much better at…
Failing Azure Foundry Deployment - No Healthy Upstream Error
I am encountering an issue with the GPT-4o model deployed on Azure OpenAI Service. I have deployed the model and attempted to make chat calls using API requests and in the playground. However, as of this afternoon I consistently receive the following…
New vector stores for Azure OpenAI Assistants fail after a few days
I am experiencing a recurring issue with newer vector stores created for Azure OpenAI Assistants after December 2024. Assistants linked to these vector stores stop working correctly after a few days, causing significant delays in responses and failure to…
How to Fetch Azure OpenAI api_version and engine Dynamically After Resource Creation via Python?
Hello! I am using Python to automate the creation of Azure OpenAI resources via the Azure Management API. I am successfully able to create the resource, but I need to dynamically fetch the following details after the resource is created: API Version…
How to use the Whisper OpenAI model in an Azure Function for an Azure Static Web App
Hi there, dear reader. I am working on my first Azure Static Web App. With Next.js, I already have a working version of it. The concept of the application is simple: it makes a transcript of the user's (voice) audio input and returns it to the user…
Azure OpenAI Metrics Issue: Unable to Track Assistant Inference Tokens
Hello, I am encountering an issue with Azure OpenAI Metrics: I can no longer track the number of inference tokens generated by a GPT model when I use OpenAI assistants under the metric AZURE OPENAI - USAGE > Processed Inference Tokens. Previously,…