Create An Azure Databricks Workspace and Cluster
Create An Azure Databricks Workspace and Cluster
The first step to using Azure Databricks is to create and deploy a Databricks workspace,
which is the logical environment. You can do this in the Azure portal.
The workspace creation takes a few minutes. During workspace creation, the Submitting
deployment for Azure Databricks tile appears on the right side of the portal. You might
need to scroll right on your dashboard to see the tile. There's also a progress bar displayed
near the top of the screen. You can watch either area for progress.
What is a cluster?
The notebooks are backed by clusters, or networked computers, that work together to process
your data. The first step is to create a cluster.
Create a cluster
1. When your Azure Databricks workspace creation is complete, select the link to go
to the resource.
2. Select Launch Workspace to open your Databricks workspace in a new tab.
3. In the left-hand menu of your Databricks workspace, select Clusters.
4. Select Create Cluster to add a new cluster.
5. Enter a name for your cluster. Use your name or initials to easily differentiate
your cluster from your coworkers.
6. Select the Databricks RuntimeVersion. We recommend the latest runtime
and Scala 2.11.
7. Specify your cluster configuration. While on the 14 day free trial, the defaults will
be sufficient. When the trial is ended, you may prefer to change Min
Workers to zero. That will allow the compute resources to shut down when
you are not in a coding exercise and reduce your charges.
Hint: Check with your local system administrator to see if there is a
recommended default cluster at your company to use for the rest of the class. This
could save you some money!
8. Select Create Cluster.