Cloud latency challenges and solutions
There are a number of benefits in deploying an application to the cloud, but also some challenges that come with it. Added distance and components between the business users and applications in the cloud can cause performance and availability challenges.
Performance Challenges
- Increased latency and reduced bandwidth between client and services deployed to the cloud can cause performance issues in the application. Changes to the application may be necessary and measures may
- Shared environment and lack of control over the connectivity between servers in the cloud could mean connections between servers do not meet the needs of the application design. Some applications require highly specialized hardware as part of communication requirements between systems where even an additional 1/10th of (ms) of additional latency can have significant impact on performance.
Utilize cloud features and infrastructure
Azure ExpressRoute enables you to create private connections between Azure datacenters and infrastructure that's on your premises or in a colocation environment. ExpressRoute connections do not go over the public Internet, and offer more reliability, faster speeds, lower latencies and higher security than typical connections over the Internet.
Content Delivery Network (CDN) and near-site application caches and acceleration.
Internal networking features like VNET can also be used to reduce latency between systems within the Azure Cloud environment.
Application Optimizations
Design considerations and changes to the application may be necessary to accommodate increased latency between on-premises clients/services, and cloud deployed services.
Reduce chattiness across higher latency networks. Communications between some existing applications may have been implemented with a different set of assumptions, and may need to be re-designed and optimized for increases latency between clients and servers.
Reduce the amount of data over the network through caching of data that may be redundant across requests and/or users in the enterprise. This can be incorporated in the client application and/or an on-premises appliance or server.
Content Deliver Networks (CDN) can be used to bring some of the application dependencies closer to the site. Files are cached around the world for redundancy and availability.
Protocol proxy translation and IP application accelerators. Sometimes the protocol necessary for communicating with a server in the cloud may not be the most suitable over a lower bandwidth higher latency connection and a proxy place in the cloud in between can be used to simply proxy requests to one more suitable for this type of connectivity.
Asynchronous or background requests in the application.
Availability
Given the additional distance and components between the client and the solution deployed in the cloud there are also potentially more things to fail. Generally, availability is addressed through redundancy.
Site-To-Cloud Connectivity
Design on-premises clients or servers to temporarily work disconnected. Allows the business to continue operating with either all or a limited set of mission critical features necessary to maintain business continuity in the event of a network failure. A 99.99 available service is not that useful with a 99.9 network connection.
Redundant carrier connections and networking components.
Highly Available Cloud Solutions
Design and build the application with redundant components or services.
Azure Features
Fault Domain Isolation reduces the chances that a hardware failure will interrupt service. Highly available services; Azure Storage, Service Bus, SQL Database.
Multi Data Center Deployment
Applications can be deployed to multiple data centers for even higher levels of availability. A number of deployment topologies across multiple data centers with trade-offs in cost, complexity, and availability.
Azure features to help support this Multi-DC
Azure Storage BLOBs, Tables, and Queues are replicated to different data center with read access to the replica.
For Microsoft Azure SQL Database Premium databases, Active Geo-Replication can be used to create geographically redundant replicas of the source databases. In addition these active secondary databases can be used for read workloads.
Traffic Manager can redirect traffic to services in another data center in the event of a failure.
Resources
ExpressRoute
https://azure.microsoft.com/en-us/services/expressroute/
Guidance for Resilient Cloud Architectures
https://msdn.microsoft.com/en-us/library/jj853352.aspx
Cloud Design Patterns
https://msdn.microsoft.com/en-us/library/dn568099.aspx
Azure SQL Database Business Continuity
https://msdn.microsoft.com/en-us/library/azure/hh852669.aspx
Multiple Datacenter Deployment Guidance
https://msdn.microsoft.com/en-us/library/dn589779.aspx
Resiliency Patterns and Guidance
https://msdn.microsoft.com/en-us/library/dn600215.aspx
Windows Azure Business Continuity Technical Guidance
https://msdn.microsoft.com/library/azure/hh873027.aspx
Disaster Recovery and High Availability for Azure Applications
https://msdn.microsoft.com/en-us/library/dn251004.aspx
Azure SQL Database Business Continuity
https://msdn.microsoft.com/en-us/library/azure/hh852669.aspx