Hi Tamir Goldman,
Welcome to the Microsoft Q&A forum, and thanks for posting your query!
It seems you might be using a serverless API endpoint for your deployment. Could you please try a managed compute deployment instead?
I have reproduced the scenario and successfully deployed the Llama 3.2 90B Vision Instruct model. For more information, see: Meta Llama family of models.
Support for serverless API endpoints will be coming soon. For updates, please check the What's new page.
Hope this helps. Do let us know if you have any further queries.
If this answers your query, please click Accept Answer and select Yes for "Was this answer helpful".