what is difference between AKS and machine learning compute
Azure offers managed compute resources, including Azure Kubernetes Service (AKS) or Azure Machine Learning Compute. I want to upload fine-tuned LLM like Qwen2.5 or llama3.1 to Azure and online serving with these fine-tuned models, what is difference…
Issue with the sandbox being unavailable on Microsoft Azure Fundamentals
Hey everyone, I’ve been trying to use the MS Learn Sandbox for a module, but I keep running into this annoying error message: “The sandbox for this module is currently unavailable. We're working to resolve this as quickly as possible. In the meantime,…
"Error 'Resource provider not registered' when deploying model in Azure ML Tutorial (Free Trial)"
Hi, I'm following the Microsoft Learn tutorial "Create a classification model with Azure Machine Learning designer" (https://microsoftlearning.github.io/mslearn-ai-fundamentals/Instructions/Labs/01-machine-learning.html) with an Azure Free…
How to fix an error in tensorflow/keras for an exercise on creating a Convolutional Neural Networks
I'm following the Microsoft learning exercise and have the following problem. I would appreciate assistance so that i can continue on with my exercises. Note: I also had problems with a previous section exercise "Train a deep neural network…
Private Azure Machine Learning Workspace getting forbidden to reach private Storage Account for SAS
My Azure Machine learning workspace is configured with private endpoint , my aml is configured to a Storage account which is also configured private endpoint and disabled the public access, My Storage Account network will looks like below The…
Cannot Deploy AutoML Endpoint
Hello, Thanks in advance for the help here, I am pretty confused about what config could be missing! I am working through some tutorials on Azure Auto ML, I trained an Automl endpoint and then tried to deploy it and I get the following errors: Resource…
Automl Model Deploy Fails On Gunicorn Version Conflict
Hello, I am attempting to deploy an automl model to a real time endpoint, when I do I am getting a pip error about dependency conflicts between gunicorn and mlflow. It seems these install are dynamically added by azure? What can I do to get past this…
AZURE ML STUDIO (FACING PROBLEM IN DEPLOYMENT)
Not enough quota available for Standard_D2as_v4. Current usage/limit: 0/4. Additional requested: 8 (20% more quota than expected may be needed for redundancy purposes on some SKUs) I am facing a problem while deploying model in selecting Virtual machine…
Azure Machine Learning Studio creates v1 deployments that can't be tested in Studio
If you use AutoML in Azure Machine Learning Studio to train a model and then deploy that model to an Azure Container Instance, you can't test the deployment using Studio. When you click on the Test tab, it comes back with "This deployment is based…
How to rerun successful AzureML Pipeline
I have a successful pipeline that writes files to Blob storage. If I delete the files in blob storage and try to rerun the pipeline, the pipeline run is cached so the pipeline doesn't actually rerun. Is there a way to force rerun the pipeline?
I am trying to schedule my azure ml command job. It runs fine the first time but from second occurrence, it directly goes into completed state!
I am trying to schedule a command job using bicep. It gets executed as expected the first time but then from 2nd time the job directly goes into completed state. resource jobResource 'Microsoft.MachineLearningServices/workspaces/schedules@2023-10-01' =…
Not able to subscribe LLama-3.3-70B-Instruct with Microsoft Azure Sponsorship Account
Hello Team, Greetings of the day!! I would like to inform you that one of my customers is not able to subscribe LLama-3.3-70B-Instruct with Microsoft Azure Sponsorship Account. He attempted to subscribe to the LLama-3.3-70B-Instruct via both the…
Why can't I select Standard_NC6s_v3 as my endpoint Virtual Machine?
I'm trying to deploy my endpoint, and I wish to select Standard_NC6s_v3 as my VM, but the issue is that it only allows me to select up to these VMs. The NC6s falls into the following category (not enough quota). At first, I thought that it was related…
What is the latency and cost of A100 gpu
Hi team, I want to deploy fine-tuned model on Azure. If I use llama3.1-8b model, the input is 4k token and output is 512 token. What is the latency if I deploy the model to A100 gpu ? What is the economy cost for each A100 gpu ? Do we support vllm…
How to do the load balancer, tracking event and log
Hi Team, I want to deploy my fine-tuned LLM like Qwen2.5 model to Azure: How do you do the load balancer between different host or gpus ? How do you record the tracking and log on the Azure disk space ?
AzureMLException: Error in Streaming from Input Data Sources
An AzureMLException is encountered with the message "Error Code: ScriptExecution.StreamAccess.Unexpected Native Error: error in streaming from input data sources" when using the 'Apply SQL Transformation' component in the Azure ML designer…
How to do A/B test on Azure
Hi Team, I want to deploy fine-tuned Qwen2.5B model on Azure, how do I do A/B test or online experiment for model version A and model version B ?
In Azure ML studio deploy option is not there
Hi team im new to Azure ML studio in that i done trained data but i would like to deploy im in trial account i dont see option for deploy i can able to see only submit , share like that only please help
How to cancel a hanging Custom Vision training iteration
A Custom Vision training iteration was started 4 days ago with a 4-hour reserved budget, but it still shows as training. Retraining and exporting the project are not possible until this iteration stops hanging. What are the recommendations for canceling…