Migrate from Azure Data Factory to Data Factory in Microsoft Fabric

This article targets Azure Data Factory and Synapse Gen2 pipeline creators. It provides you with guidance and rationale to help modernize and migrate your pipelines to Data Factory for Microsoft Fabric.

Background

Microsoft Fabric is an integrated platform for both self-service and IT-managed enterprise data. With exponential growth in data volumes and complexity, Fabric customers demand enterprise solutions that scale, are secure, easy to manage, and accessible to all users across the largest of organizations.

In recent years, Microsoft invested significant work to deliver scalable cloud capabilities to customers. To that end, Data Factory in Fabric empowers a large ecosystem of data integration developers and data integration solutions built up over decades to use the full set of features, and capabilities that go far beyond comparable functionality available in previous generations.

Naturally, customers are asking whether there's an opportunity to consolidate by hosting their data integration solutions within Fabric. They often ask questions like:

  • Does all the functionality we depend on work in Fabric pipelines?
  • What capabilities are available only in Fabric pipelines?
  • How do we migrate existing pipelines to Fabric pipelines?
  • What's Microsoft's roadmap for enterprise data ingestion?

Answers to many of these questions are described in this article.

Note

The decision to migrate to Fabric capacities depends on the requirements of each customer. Customers should carefully evaluate other benefits in order to make an informed decision. We expect to see organic migration to Fabric pipelines over time, and our intention is that it happens on terms that each customer is comfortable with.

To be clear, currently there aren't any plans to deprecate Azure Data Factory or Synapse Gen2 for data ingestion. There's a priority to focus investment on Fabric pipelines for enterprise data ingestion, and so the extra value provided by Fabric capacities will increase over time. Customers who choose Fabric capacities can expect to benefit from alignment with the Microsoft Fabric product roadmap.

Convergence of enterprise and self-service data integration

The consolidation of items in Fabric simplifies discovery, collaboration, and management by colocating resources. This allows central IT teams to operationalize mission-critical data movement and transformation services aligned with corporate standards, including data lineage and monitoring, while more easily integrating popular self-service items.

To support the collaborative and scalable needs of organizations, Fabric pipelines introduce Office 365 and Teams activities for sending messages, seamless refreshes of Power BI semantic models, and robust governance features, helping organizations manage data lineage and pipeline monitoring. The integration of generative AI via Copilot further enhances the pipeline experience by offering intelligent pipeline generation and error resolution explanations, simplifying the creation and management of complex solutions.

By utilizing a common platform, the workflow is streamlined, enhancing solution development between business and IT. This empowers organizations to scale their data solutions to enterprise levels, ensuring high performance, flexibility, and efficiency in managing vast amounts of data.

Fabric capacities

Due to its distributed architecture, Fabric capacities are less sensitive to overall load, temporal spikes, and high concurrency. By consolidating capacities to larger Fabric capacity SKUs, customers can achieve increased performance and throughput.

Feature comparison

The following table lists features supported in Azure Data Factory and Fabric Data Factory.

Feature Azure Data Factory Fabric Data Factory
Pipeline activities
Office 365 and Teams activities enable you to seamlessly send messages, facilitating efficient communication and collaboration across your organization No Yes
Create connections to your Power BI semantic model and Dataflow Gen2 to ensure your data is consistently refreshed and up-to-date No Yes
Validation in a pipeline to ensure the pipeline only continues execution once it validates the attached dataset reference exists, that it meets the specified criteria, or times out Yes Yes1
Execute a SQL Server Integration Services (SSIS) package to perform data integration and transformation operations Yes Planned
Data transformation
Visually designed data transformations using Apache Spark clusters with Mapping Dataflows to create and manage data transformation processes through a graphical interface Yes No2
Visually designed data transformations using the Fabric compute engine with the intuitive graphical interface of Power Query in Dataflow Gen2 No Yes
Connectivity
Support for all Data Factory data sources Yes In progress3
Scalability
Ensure seamless execution of activities in a pipeline with scheduled runs Yes Yes
Schedule multiple runs for a single pipeline for flexible and efficient pipeline management Yes Planned
Utilize tumbling window triggers to schedule pipeline runs within distinct, nonoverlapping time windows Yes Planned
Event triggers to automate the execution of pipeline runs in response to specific or relevant event occurrences Yes Yes4
Artificial intelligence
Copilot for Data Factory, which provides intelligent pipeline generation to ingest data with ease and explanations to help better understand complex pipelines or to provide suggestions for error messages No Yes
Content management
Data lineage view, which help users understand and assess pipeline dependencies No Yes
Deployment pipelines, which manage the lifecycle of content No Yes
Platform scalability and resiliency
Premium capacity architecture, which supports increased scale and performance No Yes
Multi-Geo support, which helps multinational customers address regional, industry-specific, or organizational data residency requirements Yes Yes
Security
Virtual network (virtual network) data gateway connectivity, which allows Fabric to work seamlessly in an organization's virtual network No Planned
On-premises data gateway connectivity, which allows for secure access of data between an organization's on-premises data sources and Fabric items No Yes
Azure service tags support, which is a defined group of IP addresses that is automatically managed to minimize the complexity of updates or changes to network security rules Yes Yes
Governance
Content endorsement, to promote or certify valuable, high-quality Fabric items No Yes
Microsoft Purview integration, which helps customers manage and govern Fabric items Yes Yes
Microsoft Information Protection (MIP) sensitivity labels and integration with Microsoft Defender for Cloud Apps for data loss prevention No Yes
Monitoring and diagnostic logging
Logging pipeline execution events into an event store to monitor, analyze, and troubleshoot pipeline performance Yes Planned
Monitoring hub, which provides monitoring capabilities for Fabric items No Yes
Microsoft Fabric Capacity Metrics app, which provides monitoring capabilities for Fabric capacities No Yes
Audit log, which tracks user activities across Fabric and Microsoft 365 No Yes

1 Use the Get metadata, Until and If condition activities for equivalent metadata retrieval of the Validation activity.

2 Use the Invoke remote pipeline function to execute the mapping data flow activities, refer to the Invoke Pipeline activity.

3 To view the connectors that are currently supported for data pipelines, refer to Pipeline support.

4 Use Fabric eventstreams and Reflex to trigger execution run events, refer to Pipeline event triggers.

Considerations

There are some other considerations to factor into your planning before migrating to Fabric pipelines.

Licensing

Fabric pipelines require at minimum a Microsoft Fabric (Free) license to author in a premium capacity workspace, to learn more refer to Fabric licenses.

Roadmap

The Microsoft Fabric release plan documentation announces the latest updates and timelines to customers as features are prepared for future releases, including what's new and planned for Data Factory in Microsoft Fabric.

For more information, see Microsoft Fabric release plan documentation.

Learn how to plan for your ADF to Fabric Data Factory migration.