Connect to Fivetran
Fivetran automated data integration adapts as schemas and APIs change, ensuring reliable data access and simplified analysis with ready-to-query schemas.
You can integrate your Databricks SQL warehouses (formerly Databricks SQL endpoints) and Azure Databricks clusters with Fivetran. The Fivetran integration with Azure Databricks helps you centralize data from disparate data sources into Delta Lake.
Connect to Fivetran using Partner Connect
This section describes how to connect to Fivetran using Partner Connect. Each user creates their own connection.
Note
The per-user connection experience is in Public Preview. You can disable the ability to create per-user connections by contacting your Azure Databricks account team.
Before you connect using Partner Connect
Before you connect to Fivetran using Partner Connect, make sure you have the following:
- The workspace admin user role or the following permissions:
  - For data managed by Unity Catalog, the following metastore object privileges for the catalog you want Fivetran to write to:
    - USE CATALOG and CREATE SCHEMA on the catalog.
    - (Optional) To specify a destination location, CREATE EXTERNAL TABLE on the external location and access to data in cloud object storage.
  - For data managed by the legacy Hive metastore, the following metastore object privileges for the catalog you want Fivetran to write to:
    - USAGE and CREATE on the catalog.
    - (Optional) To specify a destination location, access to data in cloud object storage.
Privileges for Unity Catalog metastore objects can be granted by a metastore admin, the owner of the object, or the owner of the catalog or schema that contains the object. For more information, see Unity Catalog privileges and securable objects.
Privileges for legacy Hive metastore objects can be granted by a workspace admin or the owner of the object. For more information, see Hive metastore privileges and securable objects (legacy).
Access to data in cloud object storage must be configured by a workspace admin with sufficient permissions in the cloud object storage account. For more information, see Enable data access configuration.
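The privileges listed above can be granted with SQL GRANT statements. The following Python sketch only assembles the statement text so that it can be reviewed before someone with sufficient rights runs it in a SQL editor or notebook. The function name `fivetran_grant_statements`, the catalog `main`, the principal `fivetran-user@example.com`, and the external location `fivetran_landing` are hypothetical placeholders, and the legacy statement assumes the built-in `hive_metastore` catalog.

```python
from typing import List, Optional


def _quote(identifier: str) -> str:
    """Wrap an identifier in backticks, doubling any embedded backticks."""
    return "`" + identifier.replace("`", "``") + "`"


def fivetran_grant_statements(
    principal: str,
    catalog: str,
    unity_catalog: bool = True,
    external_location: Optional[str] = None,
) -> List[str]:
    """Build GRANT statements for the prerequisites described above.

    All names passed in are placeholders chosen by the caller; nothing
    is executed here, the statements are only returned as text.
    """
    target = _quote(principal)
    if unity_catalog:
        statements = [
            f"GRANT USE CATALOG, CREATE SCHEMA ON CATALOG {_quote(catalog)} TO {target};"
        ]
        # Optional: only needed when a destination location is specified.
        if external_location is not None:
            statements.append(
                "GRANT CREATE EXTERNAL TABLE ON EXTERNAL LOCATION "
                f"{_quote(external_location)} TO {target};"
            )
        return statements
    # Legacy Hive metastore: USAGE and CREATE on the catalog.
    return [f"GRANT USAGE, CREATE ON CATALOG {_quote(catalog)} TO {target};"]


if __name__ == "__main__":
    # Hypothetical example values.
    for statement in fivetran_grant_statements(
        "fivetran-user@example.com", "main", external_location="fivetran_landing"
    ):
        print(statement)
```

Access to data in cloud object storage is not covered by these statements and still has to be configured separately by a workspace admin.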
Partner Connect steps
To connect your Azure Databricks workspace to Fivetran using Partner Connect, do the following:
In the sidebar, click Partner Connect.
Click the Fivetran tile.
The steps in this section create a user-level Fivetran trial account. To sign in to an existing workspace-level Fivetran trial account, click Use existing connection, complete the on-screen instructions to sign in to Fivetran, and skip the rest of the steps in this article.
Select a SQL warehouse. If the SQL warehouse is stopped, click Start.
If your workspace is enabled for Unity Catalog, select a catalog for Fivetran to write to, then click Next.
Partner Connect generates an Azure Databricks personal access token that is associated with your user.
Click Connect to Fivetran.
A new tab opens in your web browser that displays the Fivetran website.
Complete the on-screen instructions on the Fivetran website to create your trial partner account.
Connect to Fivetran manually
For an overview of the manual connection procedure, watch this YouTube video (2 minutes).
Note
To connect a SQL warehouse with Fivetran faster, use Partner Connect.
Before you connect manually
Before you connect to Fivetran manually, you must have the following:
A cluster or SQL warehouse in your Azure Databricks workspace.
The connection details for your cluster or SQL warehouse, specifically the Server Hostname, Port, and HTTP Path values.
An Azure Databricks personal access token or a Microsoft Entra ID (formerly Azure Active Directory) token. To create a personal access token, follow the steps in Azure Databricks personal access tokens for workspace users.
Note
As a security best practice, when you authenticate with automated tools, systems, scripts, and apps, Databricks recommends that you use personal access tokens belonging to service principals instead of workspace users. To create tokens for service principals, see Manage tokens for a service principal.
Tip
If the Fivetran tile in Partner Connect in your workspace has a check mark icon inside of it, you can get the connection details for the connected SQL warehouse by clicking the tile and then expanding Connection details. The Personal access token is hidden; you must create a replacement personal access token and enter that new token instead when Fivetran asks you for it.
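Before entering the connection details in Fivetran, it can help to check that each value has the expected shape. The sketch below is a standalone check that uses only the Python standard library and makes no network calls. The `ConnectionDetails` class and the `check_connection_details` function are hypothetical names, and the rules (a bare hostname without a URL scheme, a default port of 443, non-empty path and token) are assumptions about common copy-and-paste mistakes, not validation that Fivetran or Azure Databricks is known to perform.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ConnectionDetails:
    """Values collected from the cluster or SQL warehouse (illustrative container)."""
    server_hostname: str
    http_path: str
    token: str
    port: int = 443  # assumed default (HTTPS); use the Port value shown for your compute


def check_connection_details(details: ConnectionDetails) -> List[str]:
    """Return a list of human-readable problems; an empty list means none were found."""
    problems = []
    host = details.server_hostname.strip()
    if not host:
        problems.append("Server Hostname is empty")
    elif "://" in host or "/" in host:
        problems.append("Server Hostname should be a bare hostname, without a scheme or path")
    if not details.http_path.strip():
        problems.append("HTTP Path is empty")
    if not 1 <= details.port <= 65535:
        problems.append("Port is outside the range 1-65535")
    if not details.token.strip():
        problems.append("Token is empty")
    return problems


if __name__ == "__main__":
    # Hypothetical values; replace them with the details from your workspace.
    example = ConnectionDetails(
        server_hostname="https://adb-1234567890123456.7.azuredatabricks.net",
        http_path="/sql/1.0/warehouses/abc123",
        token="<personal-access-token>",
    )
    for problem in check_connection_details(example):
        print(problem)
```

A check like this only catches formatting slips. Whether the values actually work is confirmed by the Save & Test step in Fivetran.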
Manual steps
To connect to Fivetran manually, do the following:
Sign in to your Fivetran account, or create a new Fivetran account, at https://fivetran.com/login.
Important
If you sign in to your organization’s Fivetran account, a Choose Destination page may display, listing one or more existing destination entries with the Databricks logo. These entries might contain connection details for compute resources in workspaces that are separate from yours. If you still want to reuse one of these connections, and you trust the compute resource and have access to it, choose that destination and then skip ahead to next steps. Otherwise, choose any available destination to get past this page.
In your Dashboard page in Fivetran, click the Destinations tab.
Click Add Destination.
Enter a Destination name and click Add.
On the Fivetran is modern ELT page, click Set up a connector.
Click a data source, and then click Next.
Follow the on-screen instructions in the Setup Guide in Fivetran to finish setting up the connector.
Click Save & Test.
After the test succeeds, click Continue.
On the Select your data’s destination page, click Databricks on Azure.
Click Continue Setup.
Complete the on-screen instructions in Fivetran to enter the connection details for your existing Azure Databricks compute resource, specifically the Server Hostname and HTTP Path field values, and the token that you generated earlier.
Click Save & Test.
After the test succeeds, click Continue.
Continue to next steps.
Additional resources
Explore one or more of the following resources on the Fivetran website: