Volume Snapshots Fail on AKS with 502 and/or timeout or HTML error page
Volume snapshots are failing repeatedly, sometimes with 502 or HTML-based error pages from Azure. Here's an example: Warning Error 6m24s cloudnative-pg-backup snapshot backup failed: Failed to create snapshot: failed to take snapshot of…
Use PVC instead of ephemeral storage in AKS
I am having a storage limit issue. We would like to use a PVC instead of ephemeral storage in AKS.
I would like to configure Service Connector between Key Vault and my private AKS cluster
Hello, I'm currently facing an issue trying to configure the CSI Driver with my private Kubernetes cluster. This is a private cluster running inside a VNet, and the Key Vault that I want to connect to is also registered on that VNet. However…
How to fully provision AKS backups using Bicep
I have an AKS cluster which I now want to add scheduled backups to using my Bicep-based provisioning pipeline. I haven't found much guidance for this online but I've put together the following Bicep definitions for the various components. In addition to…
Server-sent events and Azure Application Gateway for Containers
I have a server that supports server-sent events. When running this server locally, everything works as expected. But as soon as I deploy it in an AKS Kubernetes cluster in front of an Azure Application Gateway for Containers, the SSE connection times…
AKS is not able to pull images from ACR
I am unable to get our AKS cluster to pull images from our ACR. I get the following error on the pod: Failed to pull image "**REMOVE**.azurecr.io/**REMOVED**:dev": [rpc error: code = NotFound desc = failed to pull and unpack image…
AKS node pool is not creating nodes in availability zone 1
I have enabled zones 1, 2 and 3 and we are using availability zones in AKS. But the cluster is not provisioning nodes in zone 1. What could be the problem?
AKS auto upgrading on its own when it should not be
Hello, We disabled the auto upgrade schedule for Kubernetes, however the system seems to have performed an upgrade regardless of the setting. How come the upgrade still happened? We manually upgraded from 1.29 to 1.30.6 and now we are seeing 1.30.9. The…

Unable to deploy an AKS cluster in Azure
I'm trying to create an AKS cluster and the "Cluster configuration" is "Production Economy" but the deployment fails with this error message. I also tried the default "Standard_D8ds_v4" for the nodes and tried changing the…

Does AKS have a retry mechanism if a node exceeds drain timeout during node image upgrade?
AKS supports configuring a drain timeout. Per the documentation, the default value is 30 minutes. This means that if a long-running pod doesn't terminate within 30 minutes, AKS does not retry; the node upgrade simply fails, and thereby the…
Scheduled maintenance for Azure VMs
Dear support, we received a notification regarding scheduled maintenance for Azure Virtual Machines in a set of regions. Currently our workloads live in the mentioned regions. We have several AKS clusters based on node pools built on Virtual Machine Scale…
Private AKS with public internet egress access
I created a private AKS cluster using a private VNet address space. I need the pods on this cluster to be able to access the internet for monitoring purposes. Some of the documents I am reading say this is set up by default, but after doing a traceroute I am not…
Permissions for New-AzAKSCluster
Hello, I have 2 Azure environments (different tenant / subscription). In the first tenant the script runs New-AzKeyVault ... New-AzAKSCluster ... without any problems... In the 2nd environment, New-AzKeyVault ... runs perfectly and creates the…
Cannot reach API-server of private AKS cluster
Hi! I am experiencing timeouts on kubectl commands towards my private AKS cluster. The Azure portal states that the cluster is healthy and that the node pool is running with a single target node. There are, however, 0 ready nodes but the "Diagnose and…
How can I use Istio mTLS strict mode with the AKS application routing add-on
Hi, I'm using the application routing add-on in my AKS cluster and I installed the Istio add-on. I added sidecar injection to the Istio member pods. I want to use mTLS in strict mode. Could you please advise me on the best way of using mTLS in…
The VM size of standard_d4s_v3 is only allowed in zones [2] in your subscription in location 'westus2'.
Hello guys, I tried to create an AKS cluster in the westus2 location with all zones ([1, 2, 3]), but it reported the error below: Kubernetes Cluster Name: "azure-westus2-test-24dtt-snc"): performing CreateOrUpdate: unexpected status 400 (400 Bad Request) with…
Create a Kubernetes cluster on Azure Local
Can anyone share documentation or a link for setting up a Kubernetes cluster on Azure Stack HCI (Azure Local), excluding Azure Arc?
AKS/Docker: I am processing PDFs as large as 5,000 to 10,000 pages for OCR. I'm trying to run the pages concurrently in groups of 50 but running into a situation where the memory goes to 10GB for a 200MB file.
I am processing PDFs as large as 5,000 to 10,000 pages for OCR with 200 MB files. I'm trying to run the pages concurrently in groups of 50 but running into a situation where the memory goes to 10GB for a 200MB file. I've tried cleaning up the…
[AKS] A pod running on a Windows container in AKS is frequently restarting without a clear cause.
Issue Description: A pod running on a Windows container in AKS is frequently restarting without a clear cause. Error Details: The last recorded state shows that the pod terminated with exit code -1073741819 (0xC0000005). The pod started at Tue, Feb 4,…
AKS performing automatic upgrades when turned off
Hello, We want to do manual upgrades for our AKS environment and have disabled the automatic upgrade scheduler. We had manually upgraded to 1.30.6 and it seems Azure has force-upgraded to 1.30.9. Why is Azure performing the automatic upgrade if the…