Azure batch node stays in starting for very long and moves to unusable state

VasanthKumar Kanneri 0 Reputation points
2025-01-13T18:18:31.68+00:00

I am trying to trigger an Azure batch process from ADF using custom activity. I did the following

  1. Created Azure batch account and attached a storage account to it.
  2. Allocated a Pool with the following configuration

User's image

  1. The nodes creation starts and moves to State: starting and stays there for a very long time and then changes to unusable state.
  2. I have added inbound and outbound rules as specified in the MS documentation with no success.

https://learn.microsoft.com/en-us/troubleshoot/azure/hpc/batch/azure-batch-node-unusable-state#virtual-network-configuration-issues

  1. Note: There are no Subnet delegation specified.

Not sure what is happening. Any help is appreciated.

Thank you.

VK

Azure Batch
Azure Batch
An Azure service that provides cloud-scale job scheduling and compute management.
348 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Sina Salam 16,536 Reputation points
    2025-01-14T10:08:38.9633333+00:00

    Hello VasanthKumar Kanneri,

    Welcome to the Microsoft Q&A and thank you for posting your questions here.

    I understand that your Azure batch node stays in starting for very long and moves to unusable state.

    Though, this s a common problem, check the best practices steps - https://learn.microsoft.com/en-us/azure/batch/best-practices and learn how to detect and avoid failures in pool and node background operations - https://learn.microsoft.com/en-us/azure/batch/batch-pool-node-error-checking

    Yes, as you mentioned, subnet delegation can indeed cause the issue you're experiencing with Azure Batch nodes becoming unusable. When a subnet is delegated to a specific service, it can prevent the Azure Batch service from allocating nodes properly. Your question did not explain either you specify subnet delegation or you're referring to the link in your question.

    However, if Subnet Delegation is Specified:

    Disable Subnet Delegation: You need to remove the subnet delegation. This can be done through the Azure portal, PowerShell, Azure CLI, or an Azure Resource Manager template. For more detailed steps, refer to the Microsoft documentation on adding or removing subnet delegation - https://learn.microsoft.com/en-us/azure/virtual-network/manage-subnet-delegation and https://learn.microsoft.com/en-us/troubleshoot/azure/hpc/batch/azure-batch-pool-resizing-failure

    And If Subnet Delegation is Not Specified:

    Check Other Configuration Issues:

    • Ensure that your Network Security Group (NSG) rules are correctly configured to allow necessary inbound and outbound traffic.
    • Verify that your subnet has enough IP addresses to allocate to the pool nodes.
    • Confirm that your Batch account has sufficient core quota and that the VM size you selected is compatible with the image you're using.

    If you continue to face issues after addressing the subnet delegation, it might be helpful to review other potential causes mentioned earlier or contact Azure support via your portal.

    I hope this is helpful! Do not hesitate to let me know if you have any other questions.


    Please don't forget to close up the thread here by upvoting and accept it as an answer if it is helpful.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.