Unable to login LUIS portal and CLUAI
Hi Microsoft Support Team, We previously used LUIS AI in our chatbot application, and now with the introduction of Language Studio (CLU AI), we are migrating our setup. However, we are facing an issue when trying to log in. Earlier, we were able to log…
Getting 500 in Azure AI Studio East US
I've been getting status 500 or "There was an issue processing the response" from the serverless mistral model endpoints. Is it a glitch on the server side?
o3-mini approval. how long must we wait? been over 30 days.
Hi Team, We've applied twice for o3-mini approval (gating services). Been over 30 days. How long must we wait? How do we know what's causing delays? Is it specific use-case based? At least we should know the reason for rejection if any. Appreciate any…
When will the o1 and o3 models be available in agents or assistants?
OpenAI have recently made o1 and o3 models available to use in their assistants/agents. Is this feature coming to azure soon? Quickly being able to connect vector stores to reasoning models like o1/o3 would be a game changer for…
How to upload a file to an agent thread and have the agent use the file?
I have tried to upload a file to a thread both through code and through the playground. While I can see that my Vector Storage was indeed created and the file is indeed uploaded there, the agent doesn't seem to be reading the files at all. My question,…
Principal does not have access to API/Operation. Error creating new vector store in Azure AI Studio "create vector store".
When I try uploading a PDF file to the "create vector store" in Azure AI Studio and press "create", I get the error "Principal does not have access to API/Operation." All I am trying to do is upload a file to the…
How to use App Insights with Prompt Flow
I'm working on implementing a new flow using Azure Prompt Flow in VSCode, and I'm trying to have the insights displayed in App Insights. Currently I have: An .env file inside the project and inside the file I have this:…
Azure AI Foundry Serverless Model Rate Limits
Hello, This page: https://learn.microsoft.com/en-us/azure/ai-foundry/model-inference/quotas-limits says that the Rate Limits for deployed serverless models is 200.000 tokens per minute and 1.000 requests per minute, but this page:…
Unable to login to Azure AI Foundry.
Hi, I am trying to log into Azure AI Foundry but I am getting this error: "Your account has been locked. Contact your support person to unlock it, then try again.". Interesting thing is that, I am able to login using my email on outlook.com and…
Azure OpenAI instance, modifying deployment capacity to permit another model deployment
I have an Azure OpenAI instance with multiple models. I want to deploy another model, so modified the deployment capacity of one of them (4o) downwards from 1000K TPM to 950K TPM to permit another 4o model deployment within quota. I selected Standard…
How to fix broken azure openai model deployment?
Our deployment, gpt-4o-data-zone, was unavailable for over four hours (we noticed it at Feb 27, 2025 at 10:28:30.825 am CET), impacting our operations significantly. All calls were failing with status 500 model_error. Do you know what could be a cause of…
Why is there mismatch between the json output supported models in practice and in documentation?
In documentation here, gpt-4o-mini ver 2024-07-18 is supposed to support json output. On ai foundry, the only available version for deployment is the gpt-4o-mini ver 2024-07-18 in every region, the tag being "gpt-4o-mini". But this version…
Issue with Fine-Tuning Microsoft Healthcare Multi-Model MedImageInsight for Classification
I have been working with the Microsoft Healthcare Multi-Model MedImageInsight and successfully deployed the endpoint online. I was able to consume it using both the default method provided in the code and a custom approach. However, I now want to…
Assistants API (base model: GPT4o) unable to parse uploaded image as attachment and answer questions related to info. in image.
When using Assistants Playground on Azure portal: We create an Assistant using Assistants Playground. Then upload a PNG image attachment of a table with some records/content. In our prompt (see sample below) we ask questions related to this info. We…
Slow and unstable responses from o1
As of February 18, 2025, I'm running into issues with o1 on Azure AI services that I wasn't previously. When running o1 on a series of inputs, response times are slower than I saw before (500 queries taking ~1.5 hours instead of 15 minutes seen…
Azure Open Ai Insert own data get error "Cracking and chunking - Data ingestion failed"
hi Expert, I am using Azure AI Foundry and have created a project and hub. The model I am using is GPT-4o along with text-embedding-ada-002 on the Chat Playground. I added my data from a storage account using Azure AI Services and want it to detect my…
Azure AI Foundry Completion Token Limit
Hello, I have deployed a Llama 3.3 70B model using Azure AI Foundry. As you can see in the image below from this page, the output limit should be 8192 tokens. The problem is that when I use the model with Azure AI Inference Completions, the max token…
How to fix: Backend returned unexpected response. Please contact Microsoft for help in Azure AI Foundry
Hi, I have been trying to deploy Llama-3.3-70B-Instruct model on azure ai foundry and use it on our RAG implementation. When I try to use chat completion service I am getting the following error: Error code: 500 - {'error': {'code':…
How to solve wrong citations in azure ai agents with bing grounding
Description: When asking question which require multiple web searches the citations in the answer only fit one of the web searches. Is there any fix to this issue? Example: Question: 'attractions in Berlin and Frankfurt' Answer: 'In Berlin, some of…
Upload documents not available for o1-mini model
I have created a project in Azure AI Foundry, using the o1-mini model. Upon deployment, I noticed recently that I am unable to set a template for chat responses or add data preview (upload documents) for this model during setup. Is this a feature not…