Trained yolov8 model on compute cluster and all metrics are flat
Hi, I have been training yolov8 model using compute cluster and after training is complete, I see all the metrics are flat as if model did not learn anything I don't know what went wrong but the training completed successfully with 50 epochs and…
Working with labeled dataset on azure ml studio and how to consume it to train yolo model
I am working on training yolov8 model using azure ml studio and I use data labeling tool to label my dataset but after this I have difficulty in understanding how this labeled dataset when exported (in MLTable format) can be consumed for training yolo…
Real-time endpoint Automated ML failed deploy
I did create a model with Automated ML. when i try to deploy the model i am getting the error: {"code":"SubscriptionNotRegistered","message":"Resource provider [N/A] isn't registered with Subscription [N/A]. Please see…
Deploy Azure Machine Learning extension on AKS is failed.
Execute below command, then get error.I don't know how to make my account has "Microsoft.KubernetesConfiguration/register/action" permission. az provider register --namespace Microsoft.KubernetesConfiguration Message: The client…
My team member deleted the labeled dataset in Azure ML Studio, is there a way to recover it with the exported COCO file?
Hi community. My team member deleted the labeled dataset in Azure ML Studio, but the good thing is we have exported the COCO file and MLTable for that dataset, may I ask is there a way to recover it?
How to recover a deleted data label from the exported coco file
Hi, I have accidentally deleted the resource on the Azure Machine Learning Studio, which has the data on the data labeling. Before the deletion, I exported the coco file from the data labeling, however, I can't seem to be able to reuse the coco file to…
Even though I have enough credits, Azure Machine Learning Studio does not let me create GPU based compute instances. How can I fix this?
Even though I have enough credits, Azure Machine Learning Studio does not let me create GPU based compute instances. How can I fix this?
How to fix AzureMLException: AzureMLException: Message: Execution failed in operation 'to_pandas_dataframe' for Dataset(id='id', name='TEMP_DATA_ASSET', version=1, error_code=ScriptExecution.Database.Unexpected,error_message and ErrorCode: ScriptExecution
Issue: AzureMLException When Consuming Datastore in AzureML via to_pandas_dataframe() Description of the Setup I have manually set up a datastore in AzureML, which is linked to an Azure Synapse Analytics server. During the creation of the datastore, I…
Azure ML studio designer python executor is outdated
Hey Team, Explored a lot on Azure ML studio service and found the designer is helpful. Still an issue when I am using the Execute Python Script in the designer it is import all libraries older version and when checked its python version it was also…
Microsoft Entra Hybrid Join (Azure Hybrid Join) error - 0x80072f78
Hello, So I got through multiple things and I would like to know if someone has any idea what to do here. I'm trying to join devices into tenant as hybrid joined, to deploy Windows Hello for Business. We're using Microsoft Entra Connect after recent…
TRANNING Related Isuues .
I am Getting Problem In Fetcing Data , which is formely working Good after tranning new model its accuracy is decreased suddenly . Example Transport Charge is Fecting Previsoly , and after tranning new model it stop fecting transport charges without any…
Authorization Failed Error
I am trying to run the AzureML SDK v2 code.While trying to authenticate MLClient via DefaultAzureCredential() getting Authorization failed Error. I have AzuremL Data Scientist role as well as full contrbutor role. Please check the log. File…
Seeking Azure Training for AI and Business Intelligence
Hello Microsoft Community! I am excited to share that my company has generously offered to cover the cost of formal training for me in the Azure platform. I am eager to enhance my skills, particularly in the areas of Artificial Intelligence and Business…
How to get permissions in order to deploy TimeGen-1?
Greetings, I am trying to deploy TimeGEN-1. so I've chosen it from model catalog, however when i clicked on "Deploy", i got the following error: Cannot create marketplace subscription UserError: Marketplace Subscription purchase eligibility…
Azure machine learning job is not able to utilize the GPU provisioned Standard_NC6s_v3 using python sdk
I am currently working on fine-tuning a model called Phi-3-small-instruct-128K using the Azure Machine Learning Python SDK. The training job is running successfully, and there are no errors or exceptions thrown. However, the issue I'm facing is that the…
Can't setting the scoring script when creating batch endpoint job
I have a deployment in the Azure Machine Learning batch endpoint. In this deployment, I set a custom scoring script batch_driver.py. (see below picture) However, when I try creating new job, the scoring name still is driver/amlbi_main.py. Is it a bug?
Azure AI Foundry: Private IP Compute Creation Confusion
Using Azure AI Foundry with the following settings: Public network access: Disabled Workspace managed outbound access: Disabled The goal is to create compute resources using private IPs from a virtual network (subnet) to connect with enterprise…
How do I configure my Azure ML Workspace such that when I delete experiments and jobs, the corresponding Blobs in Azure Blob Storage are also deleted
I regularly submit jobs and experiments to train ML models. With each experiment, all my files and outputs are uploaded to my blob storage account. I have been constantly deleting failed and cancelled experiments, but I found out that deleting…
Parent job is complete, but child job keeps running indefinitely, doing nothing
I have an Azure ML sweep that produces the outputs I expected and is marked as complete. However, when I dug deeper (and got pinged a few times), I see that the child jobs are still running and keep running indefinitely. They don't seem to be doing…
Azure Machine Learning is not calling the current compute cluster but a deleted one.
The Azure Machine Learning "job" fails because it cannot find a cluster I deleted. I have another cluster that I created and is "green" as in working, but I can't find where to change the backend. Background. I created the ML…