Reliability guides by service
While Azure provides a set of reliability features, the resiliency of your workload is a shared responsibility between you and Microsoft and depends on how you have designed your business continuity plan to define your expectations for reliability. For this reason, it's important that you understand the reliability features of each service you use, and how to best implement them in your workload. This document provides links to the reliability guidance for each Azure service, detailing how each service supports your reliability requirements through its features and design.
Each service guide generally contains information on how the service supports:
- Availability zones such as zonal and zone-redundant deployment options, traffic routing and data replication between zones, what happens if a zone experiences an outage, failback, and how to configure your resources for availability zone support.
- Multi-region support such as how to configure multi-region or geo-disaster support, traffic routing and data replication between regions, region-down experience, failover and failback support, alternative multi-region support.
- Backup support such as who controls backups, where they are stored, how they can be recovered, and whether they are accessible only within a region or across regions.
Note
Some service documents are in the process of, or are not yet updated into a single reliability guide format. These may contain more than one document that references reliability guidance.
AI and machine learning
Product | Guidance |
---|---|
Azure AI Health Insights | Reliability in Azure AI Health Insights |
Azure AI Search | Reliability in Azure AI Search |
Azure Bot Service | Reliability in Azure Bot Service |
Azure Machine Learning Service | Failover for business continuity and disaster recovery |
Analytics
Product | Guidance |
---|---|
Azure HDInsight | Reliability in Azure HDInsight |
Azure HDInsight on AKS | Reliability in Azure HDInsight on AKS |
Azure Machine Learning Service | Failover for business continuity and disaster recovery |
Azure Stream Analytics | Achieve geo-redundancy for Azure Stream Analytics jobs |
Azure Event Hubs | Reliability in Azure Event Hubs |
Azure Data Explorer | Business continuity and disaster recovery overview |
Azure Data Share | Disaster recovery for Azure Data Share |
Azure Chaos Studio | Reliability in Azure Chaos Studio |
Microsoft Fabric | Reliability in Microsoft Fabric |
Microsoft Purview | Reliability in Microsoft Purview |
Compute
Product | Guidance |
---|---|
Azure App Service | Reliability in Azure App Service |
Azure Batch | Reliability in Azure Batch |
Azure Container Apps | Reliability in Azure Container Apps |
Azure Container Instances | Reliability in Azure Container Instances |
Azure Functions | Reliability in Azure Functions |
Azure Kubernetes Service (AKS) | Create an Azure Kubernetes Service (AKS) cluster that uses availability zones High availability and disaster recovery overview for Azure Kubernetes Service (AKS) |
Azure Service Fabric | Deploy an Azure Service Fabric cluster across Availability Zones Disaster recovery in Azure Service Fabric |
Azure Spring Apps | Reliability in Azure Spring Apps |
Azure Virtual Machines | Reliability in Azure Virtual Machines |
Azure Virtual Machine Image Builder | Reliability in Azure Virtual Machine Image Builder |
Azure Virtual Machine Scale Sets | Reliability in Azure Virtual Machine Scale Sets |
Azure VMware Solution | Deploy disaster recovery using VMware HCX |
Containers
Product | Guidance |
---|---|
Azure App Configuration | How does App Configuration ensure high data availability? Resiliency and disaster recovery |
Azure Container Apps | Reliability in Azure Container Apps |
Azure Container Instances | Reliability in Azure Container Instances |
Azure Container Registry | Enable zone redundancy in Azure Container Registry for resiliency and high availability Geo-replication in Azure Container Registry |
Azure Kubernetes Service (AKS) | Create an Azure Kubernetes Service (AKS) cluster that uses availability zones High availability and disaster recovery overview for Azure Kubernetes Service (AKS) |
Azure Service Fabric | Deploy an Azure Service Fabric cluster across Availability Zones Disaster recovery in Azure Service Fabric |
Databases
Product | Guidance |
---|---|
Azure SQL | Azure SQL - High availability Disaster recovery guidance - Azure SQL Database |
Azure SQL-Managed Instance | Failover groups overview & best practices - Azure SQL Managed Instance |
Azure Database for MySQL | Overview of business continuity with Azure Database for MySQL - Single Server |
Azure Database for MySQL - Flexible Server | Azure Database for MySQL Flexible Server High availability Azure Database for MySQL Flexible Server - Restore to latest restore point |
Azure Database for PostgreSQL - Flexible Server | Reliability in Azure Database for PostgreSQL - Flexible Server |
Azure Cosmos DB for NoSQL | Reliability in Azure Cosmos DB for NoSQL |
Azure Cosmos DB for MongoDB vCore | Reliability in Azure Cosmos DB for MongoDB vCore |
Azure Cache for Redis | Enable zone redundancy for Azure Cache for Redis Configure passive geo-replication for Premium Azure Cache for Redis instances |
Developer tools
Product | Guidance |
---|---|
Azure API Center | Reliability in Azure API Center |
DevOps
Product | Guidance |
---|---|
Azure Deployment Environments | Reliability in Azure Deployment Environments |
Azure DevOps | Data availability |
Azure Monitor-Log Analytics | Enhance data and service resilience in Azure Monitor Logs with availability zones Log Analytics workspace replication |
Hybrid + multicloud
Product | Guidance |
---|---|
Azure Operator Nexus | Reliability in Azure Operator Nexus |
Industry solutions
Product | Guidance |
---|---|
Microsoft Community Training | Reliability in Microsoft Community Training |
Sustainability Data Solutions in Fabric | Reliability in Sustainability Data Solutions in Fabric |
Integration
Product | Guidance |
---|---|
Azure API for FHIR® | Disaster recovery for Azure API for FHIR |
Azure API Management | Ensure API Management availability and reliability How to implement disaster recovery using service backup and restore |
Azure Data Manager for Energy | Reliability in Azure Data Manager for Energy |
Azure Data Factory | Azure Data Factory data redundancy |
Azure Event Grid | Reliability in Azure Event Grid |
Azure Functions | Reliability in Azure Functions |
Azure Health Data Services: De-identification service (preview) | Reliability in Azure Health Data Services: De-Identification service |
Azure Health Data Services: Workspace services (FHIR®, DICOM®, MedTech) | Business continuity and disaster recovery considerations |
Azure Logic Apps | Reliability in Azure Logic Apps |
Azure Service Bus | Best practices for insulating applications against Service Bus outages and disasters |
Internet of Things
Product | Guidance |
---|---|
Azure Device Registry | Reliability in Azure Device Registry |
Azure IoT Hub | IoT Hub high availability and disaster recovery |
Azure Notification Hubs | Reliability in Azure Notification Hubs |
Media
Product | Guidance |
---|---|
Azure Media Services | High Availability with Media Services and Video on Demand (VOD) |
Management and governance
Product | Guidance |
---|---|
Azure Backup | Reliability in Azure Backup |
Azure Guest Configuration | Azure Guest Configuration Availability |
Azure Monitor-Log Analytics | Enhance data and service resilience in Azure Monitor Logs with availability zones Log Analytics workspace replication |
Azure Site Recovery | Set up disaster recovery for Azure VMs |
Migration
Product | Guidance |
---|---|
Azure Migrate | Does Azure Migrate offer Backup and Disaster Recovery? |
Azure Site Recovery | Set up disaster recovery for Azure VMs |
Networking
Security
Product | Guidance |
---|---|
Azure Disk Encryption | Redundancy options for managed disks |
Azure Firewall | Deploy an Azure Firewall with Availability Zones using Azure PowerShell |
Azure Key Vault | Azure Key Vault availability and redundancy |
Azure Web Application Firewall | Deploy an Azure Firewall with Availability Zones using Azure PowerShell How do I achieve a disaster recovery scenario across datacenters by using Application Gateway? |
Storage
Product | Guidance |
---|---|
Azure Backup | Reliability in Azure Backup |
Azure Blob Storage | Choose the right redundancy optionAzure storage disaster recovery planning and failover |
Azure Databox | How can I recover my data if an entire region fails? |
Azure Elastic SAN | Reliability in Azure Elastic SAN |
Azure NetApp Files | Manage disaster recovery using Azure NetApp Files |
Azure Storage Actions | Reliability in Azure Storage Actions |
Azure Storage-Disk Storage | Best practices for achieving high availability with Azure virtual machines and managed disks |
Azure Storage Mover | Reliability in Azure Storage Mover |
Web
Product | Guidance |
---|---|
Azure AI Search | Reliability in Azure AI Search |
Azure API Management | Ensure API Management availability and reliability How to implement disaster recovery using service backup and restore |
Azure App Service | Reliability in Azure App Service |
Azure Container Apps | Reliability in Azure Container Apps |
Azure Notification Hubs | Reliability in Azure Notification Hubs |
Azure SignalR Service | Resiliency and disaster recovery in Azure SignalR Service |