# Steps to create Azure Databricks Cluster

1. Login to Azure Databricks workspace.
2. Select Data Science & Engineering from sidebar.
3. Select create -> New cluster. Add below details:
   * Policy - Unrestricted
   * Cluster name - protecto
   * Cluster mode - Standard
   * Databricks runtime version - latest with LTS
   * Enable table access control and only allow Python and SQL commands
   * Worker type - Node Size - 4 Core, 14 GB RAM (Standard\_DS3\_v2)
   * Driver Type - same as worker
   * In Advance option

&#x20;Add the following in spark config:

&#x20;            spark.databricks.acl.dfAclsEnabled true

&#x20;            spark data brickss.repl.allowedLanguages python,sql

&#x20;            spark.databricks.delta.preview\.enabled true

Note : python notebook and job creation steps will be shared during Protecto product installation. Please find the attached files (protecto\_python\_notebook).

{% file src="<https://323347149-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2Fw6GKvSvsZfGhtiQWrONh%2Fuploads%2FkVijsXRAMf0LLiAcoje3%2Fprotecto_python_notebook.py?alt=media&token=42ee63d5-2ce2-44d7-bfc1-73aa47e25010>" %}

**Credentials needed to connect Databricks:**

* Service principal application id (client id)
* Service principal directory id (tenant id)
* Service principal application secret (client secret)
* Server hostname
* Port
* Sql endpoint http path
* Catalog name (eg: hive\_metastore)

&#x20;
