Connect to Dataiku

Dataiku is an end-to-end AI platform for data preparation, AutoML, and MLOps. You can integrate your Databricks SQL warehouses and Azure Databricks clusters with Dataiku.

Connect to Dataiku using Partner Connect

Note

Partner Connect only supports SQL warehouses for Dataiku. To connect a cluster to Dataiku, connect to Dataiku manually.

To connect your Azure Databricks workspace to Dataiku using Partner Connect, do the following:

  1. In the sidebar, click Partner Connect button Partner Connect.

  2. Click the partner tile.

    Note

    If the Dataiku tile has a check mark icon inside it, an administrator has already used Partner Connect to connect Dataiku to your workspace. Skip to step 5. The partner uses the email address for your Azure Databricks account to prompt you to sign in to your existing Dataiku account.

  3. Select a catalog from the drop-down list, and then click Next.

    Note

    If your workspace is Unity Catalog-enabled, but the partner doesn’t support Unity Catalog with Partner Connect, the workspace default catalog is used. If your workspace isn’t Unity Catalog-enabled, hive_metastore is used.

  4. Select a schema from the drop-down list, and then click Add. You can repeat this step to add multiple schemas.

  5. Click Next.

    Partner Connect creates the following resources in your workspace:

    Partner Connect also grants the following privileges to the <PARTNER>_USER service principal:

    • (Unity Catalog)USE CATALOG: Required to interact with objects within the selected catalog.
    • (Unity Catalog) USE SCHEMA: Grants the ability to read the schemas you selected.
    • (Unity Catalog)CREATE SCHEMA: Grants the ability to create schemas in the selected catalog.
    • (Legacy Hive metastore) USAGE: Required to interact with objects within hive_metastore and the selected schemas.
    • (Legacy Hive metastore) CREATE: Grants the ability to create a schema in hive_metastore.
    • (Legacy Hive metastore) READ_METADATA: Grants the ability to read metadata for the schemas you selected.
    • (Legacy Hive metastore) SELECT: Grants the ability to read the schemas you selected.
  6. Click Next.

    The Email box displays the email address for your Azure Databricks account. Dataiku uses this email address to prompt you to either create a new partner account or sign in to your existing partner account.

  7. Click Connect to Dataiku or Sign in.

    A new tab opens in your web browser, which displays the Dataiku website.

  8. Complete the on-screen instructions on the Dataiku website to create your trial Dataiku account or sign in to your existing Dataiku account.

Connect to Dataiku manually

This section describes how to connect an existing SQL warehouse or cluster in your Azure Databricks workspace to Dataiku manually.

Note

For Databricks SQL warehouses, you can connect to Dataiku using Partner Connect to simplify the experience.

Requirements

Before you connect to Dataiku manually, you need the following:

Steps to connect

To connect to Dataiku manually, follow Databricks in the Dataiku documentation.

Additional resources

Explore the following Dataiku resources: