When using Azure as a modern data platform, you have your choice of platform as a service (PaaS) or infrastructure as a service (IaaS). PaaS provides cloud-native options for data ingestion, transformation, and storage. IaaS gives you greater control over your hybrid infrastructure, starting with the size and type of virtual machines (VM) you choose. With either approach, you can take advantage of a variety of fully managed relational, NoSQL, and in-memory databases, storage solutions, and analytics offerings that span proprietary and open-source engines. This example architecture shows both approaches.
Architecture
Download a Visio file of this architecture.
Workflow
IBM MQ is the middleware that acts as a loosely coupled link between a mainframe or midrange system and Azure services. Messages are received and sent according to application requirements to communicate with the mainframe application layer.
In a cloud-native approach, Azure Logic Apps uses the MQ connector to exchange messages with IBM MQ. The Scheduler feature orchestrates the Azure workflow, sending and receiving messages at recurring intervals of one second.
The MQ connector can send the messages it reads directly to storage through a connector or send them to be transformed first. Logic Apps includes several options for data transformation, such as an inline JavaScript runtime that you can use to run simple JavaScript code snippets for data transformation or data operations that perform transformations on JSON, CSV, and HTML table data. You can also create serverless, single-task functions using Azure Function.
Data is loaded into storage. Azure offers many managed data storage solutions, each providing different features and capabilities.
In an IaaS approach, a VM runs Microsoft Host Integration Server (HIS) with the BizTalk Adapter for WebSphere MQ. HIS exchanges messages with IBM MQ and exposes orchestration as web service to a custom .NET application.
A .NET application persists the data using any supported Azure data store. For example, the application can mask data or use private endpoints for security.
Data is loaded into storage. Azure offers many managed data storage solutions, each providing different features and capabilities.
Components
Azure Logic Apps provides tools for data orchestration, data integration, and data transformation. It includes hundreds of connectors for accessing data on-premises or in the cloud. Make sure to test throughput and performance before choosing a data storage connector.
Logic Apps Scheduler provides triggers for starting and running workflows based on the interval and frequency of recurrence that you specify.
Logic Apps MQ connector connects your Logic Apps workflows to an IBM MQ server on-premises or on Azure. Workflows receive and send messages stored in your MQ server. A Microsoft MQ client is also included for communicating with a remote MQ server across a TCP/IP network. You can use the client to connect to IBM WebSphere MQ 7.5, MQ 8.0, and MQ 9.0, 9.1, and 9.2.
Host Integration Server (HIS) can serve as a message integrator through the WebSphere MQ adapter in Microsoft BizTalk Server. A client and server adapter exchange messages between IBM MQ and BizTalk Server. HIS also serves as an MQ listener and can poll the MQ server for messages at intervals you specify.
.NET is a free, open-source development platform used in this example to create an app to pull the data through HIS to the data storage layer. It can also be used to access IBM WebSphere MQ Servers directly through the Microsoft Client for MQ.
Alternatives
For the data layer, you have your choice of managed services, including Azure Database for PostgreSQL, Azure Database for MySQL, Azure Cosmos DB, Azure Database for MariaDB, and Azure SQL.
For the storage layer, create an enterprise data lake using Azure Data Lake Storage.
For the data layer, create a big data analytics platform using Azure Synapse Analytics.
Scenario details
A popular approach in digital transformation scenarios is to see whether existing applications and middleware tiers can run as-is in a hybrid setup where Microsoft Azure serves as the scalable, distributed data platform. This example describes a data-first approach to middleware integration that enables IBM message queues (MQs) running on mainframe or midrange systems to work with Azure services so you can find the best data platform for your workload.
When using Azure as a modern data platform, you have your choice of platform as a service (PaaS) or infrastructure as a service (IaaS). PaaS provides cloud-native options for data ingestion, transformation, and storage. IaaS gives you greater control over your hybrid infrastructure, starting with the size and type of virtual machines (VM) you choose. With either approach, you can take advantage of a variety of fully managed relational, NoSQL, and in-memory databases, storage solutions, and analytics offerings that span proprietary and open-source engines.
This example architecture shows both approaches:
Cloud-native PaaS. Azure Logic Apps exchanges messages with IBM MQ through the MQ connector. Additional connectors provide quick access to events, data, and actions across other apps, services, systems, protocols, and platforms. Logic Apps also includes tools for transforming data from the queue if you need to modify the data format, structure, or values before storing it on Azure or sending it to the application layer.
VM-based IaaS. Running Microsoft Host Integration Server (HIS) on a VM, you can use a messaging integration component that connects to IBM MQ. You control the data transformation process by creating a .NET application to read and write messages. The application can persist data in the Azure data store of your choice, and you can choose the MQ server's polling interval.
Potential use cases
Either of these approaches can be used to:
Enable loosely coupled applications that communicate through messaging systems to use the Azure data platform.
Sync or replicate data incrementally between a mainframe or midrange system and Azure.
Flow event messages between mainframe or midrange systems and Azure.
Considerations
These considerations implement the pillars of the Azure Well-Architected Framework, which is a set of guiding tenets that can be used to improve the quality of a workload. For more information, see Microsoft Azure Well-Architected Framework.
A hybrid datacenter configuration make sense for organizations that are developing their cloud strategies. Connecting to Azure can help bridge the gaps in your datacenter, enhance performance, improve business continuity, and expand your reach globally.
For example, applications on-premises can communicate with a modern data platform on Azure and begin taking advantage of big data analytics or machine learning. If you need a cost-effective storage solution, you can replicate mainframe data, store it on Azure, and keep the data in sync. Azure can also add the scale needed to support online transaction processing (OLTP), batch, and data ingestion systems.
Reliability
Reliability ensures your application can meet the commitments you make to your customers. For more information, see Design review checklist for Reliability.
Azure service-level agreements (SLAs) describe your uptime guarantees. The SLAs for various components can vary. For example, Azure Logic Apps will be available at least 99.9 percent of the time. Configuration settings you choose can improve SLA.
Cost Optimization
Cost Optimization is about looking at ways to reduce unnecessary expenses and improve operational efficiencies. For more information, see Design review checklist for Cost Optimization.
This article outlines a wide range of Azure Services to demonstrate the various possibilities and you probably won't use them all for MQ integration.
Use the Azure pricing calculator to estimate costs for the Azure resources.
Use the BizTalk pricing to understand the pricing for the HIS solution.
Operational Excellence
Operational Excellence covers the operations processes that deploy an application and keep it running in production. For more information, see Design review checklist for Operational Excellence.
In this example architecture, Logic Apps connectors can be used to send messages directly to Azure Storage and Azure Data Lake Storage. For example, Logic Apps includes the Azure Blob Storage connector as the following figure shows. The connector makes it easy to store massive amounts of unstructured data in Azure Blob Storage. Your data becomes accessible from anywhere in the world via HTTP or HTTPS. Blob storage also supports Azure Data Lake Storage Gen2, a big data analytics solution for the cloud. Data is loaded into storage using an Azure service such as the AzCopy tool, Azure Data Factory, or another solution that can connect to storage.
Download a Visio file of this architecture.
Both the PaaS and IaaS architecture options support many popular managed database services. You can load data using a custom-built loader, a vendor solution, or a managed service such as Azure Data Factory. Database options include:
Azure SQL Database. Part of the Azure SQL family, Azure SQL Database is the intelligent, scalable, relational database service built for the cloud. Always up to date, it includes automated features that optimize performance, durability, and scalability, so you can focus on building new applications.
Azure SQL Managed Instance. Part of the Azure SQL service portfolio, SQL Managed Instance combines the broadest SQL Server engine compatibility with all the benefits of a fully managed PaaS.
Azure SQL on Azure Virtual Machines. Part of the Azure SQL family, this cost-effective option is designed for lifting and shifting SQL Server workloads to Azure. It combines the performance, security, and analytics of SQL Server with the flexibility and hybrid connectivity of Azure—with 100 percent code compatibility. Now includes SQL Server 2019 images.
Azure Database for PostgreSQL. This fully managed relational database service is based on the community edition of the open-source PostgreSQL database engine. You can focus on application innovation instead of database management and easily scale your workloads.
Azure Database for MySQL. This fully managed relational database service is based on the community edition of the open-source MySQL database engine.
Azure Cosmos DB. A globally distributed, multi-model database, Azure Cosmos DB provides throughput and storage that scales elastically and independently across any number of geographic regions. It is a fully managed NoSQL database service that guarantees single-digit-millisecond latencies at the 99th percentile anywhere in the world.
Azure Synapse Analytics. This enterprise analytics service accelerates time to insight across data warehouses and big data systems.
Performance Efficiency
Performance Efficiency is the ability of your workload to scale to meet the demands placed on it by users in an efficient manner. For more information, see Design review checklist for Performance Efficiency.
Make sure to test the throughput and performance of your data transformation layer before you finalize your architectural approach. Performance depends on several factors, including a workload's message size, latency, and the connectors that are used. Testing helps you find the most compatible target data platform.
Contributors
This article is maintained by Microsoft. It was originally written by the following contributors.
Principal author:
- Ashish Khandelwal | Principal Engineering Architecture Manager
To see non-public LinkedIn profiles, sign in to LinkedIn.
Next steps
For more information, email Azure Data Engineering - Mainframe & Midrange Modernization (datasqlninja@microsoft.com).
Read the Azure Database Migration Guides.