0% found this document useful (0 votes)
109 views2 pages

Create An Azure Databricks Workspace and Cluster

The document discusses how to create an Azure Databricks workspace and cluster. It provides step-by-step instructions for creating a Databricks workspace in the Azure portal by selecting Azure Databricks, providing subscription, resource group, location and workspace name details. It then explains that a cluster is backed by networked computers that process data and provides instructions for creating a cluster by selecting Clusters from the Databricks workspace menu and specifying cluster name, runtime version and configuration.

Uploaded by

Asif Khan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
109 views2 pages

Create An Azure Databricks Workspace and Cluster

The document discusses how to create an Azure Databricks workspace and cluster. It provides step-by-step instructions for creating a Databricks workspace in the Azure portal by selecting Azure Databricks, providing subscription, resource group, location and workspace name details. It then explains that a cluster is backed by networked computers that process data and provides instructions for creating a cluster by selecting Clusters from the Databricks workspace menu and specifying cluster name, runtime version and configuration.

Uploaded by

Asif Khan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Short notes

Create an Azure Databricks workspace and cluster


When talking about the Azure Databricks workspace, we refer to two different things. The
first reference is the logical Azure Databricks environment in which clusters are created, data
is stored (via DBFS), and in which the server resources are housed. The second reference is
the more common one used within the context of Azure Databricks. That is the special root
folder for all of your organization's Databricks assets, including notebooks, libraries, and
dashboards, as shown below:

The first step to using Azure Databricks is to create and deploy a Databricks workspace,
which is the logical environment. You can do this in the Azure portal.

Deploy an Azure Databricks workspace


1. Open the Azure portal.
2. Click Create a Resource in the top left
3. Search for “Databricks”
4. Select Azure Databricks
5. On the Azure Databricks page select Create
6. Provide the required values to create your Azure Databricks workspace:
▪ Subscription: Choose the Azure subscription in which to deploy the
workspace.
▪ Resource Group: Use Create new and provide a name for the new
resource group.
▪ Location: Select a location near you for deployment. For the list of
regions that are supported by Azure Databricks, see Azure services
available by region.
▪ Workspace Name: Provide a unique name for your workspace.
▪ Pricing Tier: Trial (Premium - 14 days Free DBUs). You must
select this option when creating your workspace or you will be
charged. The workspace will suspend automatically after 14 days.
When the trial is over you can convert the workspace to Premium but
then you will be charged for your usage.
7. Select Review + Create.
8. Select Create.

The workspace creation takes a few minutes. During workspace creation, the Submitting
deployment for Azure Databricks tile appears on the right side of the portal. You might
need to scroll right on your dashboard to see the tile. There's also a progress bar displayed
near the top of the screen. You can watch either area for progress.

What is a cluster?
The notebooks are backed by clusters, or networked computers, that work together to process
your data. The first step is to create a cluster.
Create a cluster
1. When your Azure Databricks workspace creation is complete, select the link to go
to the resource.
2. Select Launch Workspace to open your Databricks workspace in a new tab.
3. In the left-hand menu of your Databricks workspace, select Clusters.
4. Select Create Cluster to add a new cluster.

5. Enter a name for your cluster. Use your name or initials to easily differentiate
your cluster from your coworkers.
6. Select the Databricks RuntimeVersion. We recommend the latest runtime
and Scala 2.11.
7. Specify your cluster configuration. While on the 14 day free trial, the defaults will
be sufficient. When the trial is ended, you may prefer to change Min
Workers to zero. That will allow the compute resources to shut down when
you are not in a coding exercise and reduce your charges.
Hint: Check with your local system administrator to see if there is a
recommended default cluster at your company to use for the rest of the class. This
could save you some money!
8. Select Create Cluster.

You might also like