Connect an Azure Synapse Analytics data source (preview)
[This article is prerelease documentation and is subject to change.]
Azure Synapse Analytics is an enterprise analytics service that accelerates time to insights across data warehouses and big data systems. Azure Synapse Analytics brings together the best of SQL technologies used in enterprise data warehousing, Spark technologies used for big data, Data Explorer for log and time series analytics, Pipelines for data integration and ETL/ELT, and deep integration with other Azure services such as Power BI, Cosmos DB, and AzureML.
For more information, see Azure Synapse overview.
Important
- This is a preview feature.
- Preview features aren't meant for production use and may have restricted functionality. These features are available before an official release so that customers can get early access and provide feedback.
Prerequisites
Note
- Azure Synapse workspaces which have firewall enabled are currently not supported.
- The Azure Synapse workspace used as data source must be in the same tenant as the Customer Insights environment. Cross-tenant scenarios are not supported by Customer Insights.
Important
Make sure to set all role assignments as described.
In Customer Insights - Data:
- You have an Administrator role. Learn more about user permissions in Customer Insights.
In Azure:
An active Azure subscription.
A user with an Administrator role in Customer Insights - Data needs the User Access Administrator role in the Azure subscription to grant the service principal access to Azure resources in the Customer Insights - Data UI. Otherwise, a user in Azure who has the User Access Administrator role must set following permissions. Learn more about the User Access Administrator role and how to choose between Owner vs User Access Administrator. The Owner implicitly has the User Access Administrator role.
If using a new Azure Data Lake Storage Gen2 account, the service principal for Customer Insights - Data that is "Dynamics 365 AI for Customer Insights" needs Storage Blob Data Contributor permissions. Learn more about connecting to an Azure Data Lake Storage with a service principal. The Data Lake Storage Gen2 must have hierarchical namespace enabled.
On the resource group the Azure Synapse workspace is located, the service principal that is "Dynamics 365 AI for Customer Insights" and the user for Customer Insights needs at least Reader permissions. For more information, see Assign Azure roles using the Azure portal.
The user needs Storage Blob Data Contributor permissions on the Azure Data Lake Storage Gen2 account where the data is located and linked to the Azure Synapse workspace. Learn more about using the Azure portal to assign an Azure role for access to blob and queue data and Storage Blob Data Contributor permissions.
The Azure Synapse workspace managed identity needs Storage Blob Data Contributor permissions on the Azure Data Lake Storage Gen2 account where the data is located and linked to the Azure Synapse workspace. Learn more on using the Azure portal to assign an Azure role for access to blob and queue data and Storage Blob Data Contributor permissions.
On the Azure Synapse workspace, the *service principal for Customer Insights - Data that is "Dynamics 365 AI for Customer Insights" needs Synapse Administrator role assigned. The user needs at least a Synapse Contributor role assigned for the workspace. For more information, see How to set up access control for your Synapse workspace.
If your Customer Insights environment stores data in your own Azure Data Lake Storage, the user who sets up the connection to Azure Synapse Analytics needs at least the built-in Reader role on the Data Lake Storage account. For more information, see Assign Azure roles using the Azure portal.
Connect to the data lake database in Azure Synapse Analytics
Data connection names, data paths such as folders within a container, and table names must use names that begin with a letter. Names can only contain letters, numbers, and underscore (_). Special characters aren't supported.
Go to Data > Data sources.
Select Add a data source.
Choose the Azure Synapse Analytics (Preview) method.
Enter a Name for the data source and an optional Description.
Choose an available connection to Azure Synapse Analytics or create a new one.
Choose a Database from the workspace connected in the selected Azure Synapse Analytics connection and select Next. Currently, we only support the database type Lake database.
Select the tables to ingest from the connected database and select Next.
Optionally, choose the data tables to allow data profiling on.
Select Save to apply your selection and start the ingestion of the data from your newly created data source linked to the Lake database tables in Azure Synapse Analytics. The Data sources page opens showing the new data source in Refreshing status.
Tip
There are statuses for tasks and processes. Most processes depend on other upstream processes, such as data sources and data profiling refreshes.
Select the status to open the Progress details pane and view the progress of the tasks. To cancel the job, select Cancel job at the bottom of the pane.
Under each task, you can select See details for more progress information, such as processing time, the last processing date, and any applicable errors and warnings associated with the task or process. Select the View system status at the bottom of the panel to see other processes in the system.
Loading data can take time. After a successful refresh, review the ingested data from the Data > Tables page.