Copilot for Data Factory overview
Important
Copilot for Data Factory is generally available now, but its new Data pipeline capabilities are still in preview.
Copilot in Fabric enhances productivity, unlocks profound insights, and facilitates the creation of custom AI experiences tailored to your data. As a component of the Copilot in Fabric experience, Copilot in Data Factory empowers customers to use natural language to articulate their requirements for creating data integration solutions using Dataflow Gen2. Essentially, Copilot in Data Factory operates like a subject-matter expert (SME) collaborating with you to design your dataflows.
Copilot for Data Factory is an AI-enhanced toolset that supports both citizen and professional data wranglers in streamlining their workflow. It provides intelligent Mashup code generation to transform data using natural language input and generates code explanations to help you better understand earlier generated complex queries and tasks.
Before your business can start using Copilot capabilities in Fabric, your administrator needs to enable Copilot in Microsoft Fabric.
Note
- Your administrator needs to enable the tenant switch before you start using Copilot. See the article Copilot tenant settings for details.
- Your F64 or P1 capacity needs to be in one of the regions listed in this article, Fabric region availability.
- If your tenant or capacity is outside the US or France, Copilot is disabled by default unless your Fabric tenant admin enables the Data sent to Azure OpenAI can be processed outside your tenant's geographic region, compliance boundary, or national cloud instance tenant setting in the Fabric Admin portal.
- Copilot in Microsoft Fabric isn't supported on trial SKUs. Only paid SKUs (F64 or higher, or P1 or higher) are supported.
- Copilot in Fabric is currently rolling out in public preview and is expected to be available for all customers by end of March 2024.
- See the article Overview of Copilot in Fabric and Power BI for more information.
Supported capabilities
With Dataflow Gen2, you can:
- Generate new transformation steps for an existing query.
- Provide a summary of the query and the applied steps.
- Generate a new query that may include sample data or a reference to an existing query.
With Data pipelines, you can:
- Pipeline Generation: Using natural language, you can describe your desired pipeline, and Copilot will understand the intent and generate the necessary Data pipeline activities.
- Error message assistant: troubleshoot Data pipeline issues with clear error explanation capability and actionable troubleshooting guidance.
- Summarize Pipeline: Explain your complex pipeline with the summary of content and relations of activities within the Pipeline.
Get started
Data Factory Copilot is available in both Dataflow Gen2, and Data pipelines.
Get started with Copilot for Dataflow Gen2
Use the following steps to get started with Copilot for Dataflow Gen2:
Create a new Dataflows Gen2.
On the Home tab in Dataflows Gen2, select the Copilot button.
In the bottom left of the Copilot pane, select the starter prompt icon, then the Get data from option.
In the Get data window, search for OData and select the OData connector.
In the Connect to data source for the OData connector, input the following text into the URL field:
https://services.odata.org/V4/Northwind/Northwind.svc/
From the navigator, select the Orders table and then Select related tables. Then select Create to bring multiple tables into the Power Query editor.
Select the Customers query, and in the Copilot pane type this text:
Only keep European customers
, then press Enter or select the Send message icon.Your input is now visible in the Copilot pane along with a returned response card. You can validate the step with the corresponding step title in the Applied steps list and review the formula bar or the data preview window for accuracy of your results.
Select the Employees query, and in the Copilot pane type this text:
Count the total number of employees by City
, then press Enter or select the Send message icon. Your input is now visible in the Copilot pane along with a returned response card and an Undo button.Select the column header for the Total Employees column and choose the option Sort descending. The Undo button disappears because you modified the query.
Select the Order_Details query, and in the Copilot pane type this text:
Only keep orders whose quantities are above the median value
, then press Enter or select the Send message icon. Your input is now visible in the Copilot pane along with a returned response card.Either select the Undo button or type the text
Undo
(any text case) and press Enter in the Copilot pane to remove the step.To leverage the power of Azure OpenAI when creating or transforming your data, ask Copilot to create sample data by typing this text:
Create a new query with sample data that lists all the Microsoft OS versions and the year they were released
Copilot adds a new query to the Queries pane list, containing the results of your input. At this point, you can either transform data in the user interface, continue to edit with Copilot text input, or delete the query with an input such as
Delete my current query
.
Get started with Copilot for Data pipelines
You can use Copilot to generate, summarize, or even troubleshoot your Data pipelines.
Generate a Data pipeline with Copilot
Use these steps to generate a new pipeline with Copilot for Data Factory:
Create a new Data pipeline.
On the Home tab of the Data pipeline editor, select the Copilot button.
Then you can get started with Copilot to build your pipeline with the Ingest data option.
Copilot generates a Copy activity and you can interact with Copilot to complete the whole flow. You can type / to select the source and destination connection, and then add all the required content according to the prefilled started prompt context.
After everything is setup, simply select Run this pipeline to execute the new pipeline and ingest the data.
If you are already familiar with Data pipelines, you can complete everything with one prompt command, too.
Summarize a Data pipeline with Copilot
Use these steps to summarize a pipeline with Copilot for Data Factory:
Open an existing Data pipeline.
On the Home tab of the pipeline editor window, select the Copilot button.
Then you can get started with Copilot to summarize the content of the pipeline.
Select Summarize this pipeline and Copilot generates a summary.
Troubleshoot pipeline errors with Copilot
Copilot empowers you to troubleshoot any pipeline with error messages. You can either use Copilot for pipeline error messages assistant in the Fabric Monitor page, or in pipeline authoring page. The steps below show you how to access the pipeline Copilot to troubleshoot your pipeline from the Fabric Monitor page, but you can use the same steps from the pipeline authoring page.
Go to Fabric Monitor page and select filters to show pipelines with failures, as shown below:
Select the Copilot icon beside the failed pipeline.
Copilot provides a clear error message summary and actionable recommendations to fix it. In the recommendations, troubleshooting links are provided for you to efficiently investigate further.
Limitations of Copilot for Data Factory
Here are the current limitations of Copilot for Data Factory:
- Copilot can't perform transformations or explanations across multiple queries in a single input. For instance, you can't ask Copilot to "Capitalize all the column headers for each query in my dataflow."
- Copilot doesn't understand previous inputs and can't undo changes after a user commits a change when authoring, either via user interface or the chat pane. For example, you can't ask Copilot to "Undo my last 5 inputs." However, users can still use the existing user interface options to delete unwanted steps or queries.
- Copilot can't make layout changes to queries in your session. For example, if you tell Copilot to create a new group for queries in the editor, it doesn't work.
- Copilot may produce inaccurate results when the intent is to evaluate data that isn't present within the sampled results imported into the sessions data preview.
- Copilot doesn't produce a message for the skills that it doesn't support. For example, if you ask Copilot to "Perform statistical analysis and write a summary over the contents of this query", it doesn't complete the instruction successfully as mentioned previously. Unfortunately, it doesn't give an error message either.