Configure Azure Databricks
Before you can create a Striim for Databricks pipeline, you must do the following:
Create an Azure Databricks user that Striim for Databricks will use to connect to the target (see Learn / Manage users / Add users to your Azure Databricks account).
Give that user the following privileges
CREATE SCHEMA
if you are using Unity Catalog, also USE CATALOG (not required if you are using the Hive metastore)
Get a personal access token for that user, which you will need to provide when creating a pipeline. The following steps are subject to change by Microsoft but they worked in April 2023:
In Azure Databricks, at the top right corner, click your login name, select User Settings, and click Generate new token.
In the Comment field enter a description for the token (such as
for Striim
), optionally adjust its Lifetime, and click Generate.Copy the token and keep it in a safe place. If you lose it you will need to create a new token.
Get the JDBC URL for your Databricks cluster, which you will need to provide when creating a pipeline. The following steps are subject to change by Microsoft but they worked in April 2023:
In Azure Databricks, at the top of the left navigation panel, select Data Science & Engineering.
In the left navigation panel, click Compute, then click the name of your cluster.
At the bottom of the page, expand Advanced options, then select the jDBC/ODBC tap.
Copy the JDBC URL (it starts with
jdbc
and ends withPWD=<personal-access-token>
).