
Working with Datastores


Although it's fairly common for data scientists to work with data on their local file system, in an
enterprise environment it can be more effective to store the data in a central location where
multiple data scientists can access it. In this lab, you'll store data in the cloud, and use an Azure
Machine Learning datastore to access it.

Important: The code in this notebook assumes that you have completed the first
two tasks in Lab 4A. If you have not done so, go and do them now!

Connect to Your Workspace


To access your datastore using the Azure Machine Learning SDK, you need to connect to your
workspace.

Note: If the authenticated session with your Azure subscription has expired since
you completed the previous exercise, you'll be prompted to reauthenticate.
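
If you need more control over how that sign-in happens (for example, to target a specific Azure AD tenant), you can pass an explicit authentication object to the workspace constructor. A minimal sketch, with a placeholder tenant ID:

from azureml.core.authentication import InteractiveLoginAuthentication
from azureml.core import Workspace

# Force an interactive sign-in against a specific tenant ('<your-tenant-id>' is a placeholder)
interactive_auth = InteractiveLoginAuthentication(tenant_id='<your-tenant-id>')
ws = Workspace.from_config(auth=interactive_auth)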

In [1]: import azureml.core
from azureml.core import Workspace

# Load the workspace from the saved config file
ws = Workspace.from_config()
print('Ready to use Azure ML {} to work with {}'.format(azureml.core.VERSION, ws.name))

Ready to use Azure ML 1.8.0 to work with workspace200623

View Datastores in the Workspace


The workspace contains several datastores, including the aml_data datastore you created in the
previous task (labdocs/Lab04A.md).

Run the following code to retrieve the default datastore, and then list all of the datastores indicating
which is the default.


In [2]: from azureml.core import Datastore

# Get the default datastore
default_ds = ws.get_default_datastore()

# Enumerate all datastores, indicating which is the default
for ds_name in ws.datastores:
    print(ds_name, "- Default =", ds_name == default_ds.name)

aml_data - Default = False
azureml_globaldatasets - Default = False
workspaceblobstore - Default = True
workspacefilestore - Default = False
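
Datastores like aml_data were registered for you (in this case through the studio UI in the previous task), but the SDK can register them too. A sketch of registering a blob container as a datastore, with placeholder account details:

from azureml.core import Datastore

# All names and the key below are placeholders - the lab's aml_data datastore
# was created through the studio UI rather than with this call
blob_ds = Datastore.register_azure_blob_container(workspace=ws,
                                                  datastore_name='aml_data',
                                                  container_name='<container-name>',
                                                  account_name='<storage-account-name>',
                                                  account_key='<storage-account-key>')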

Get a Datastore to Work With


You want to work with the aml_data datastore, so you need to get it by name:

In [3]: aml_datastore = Datastore.get(ws, 'aml_data')
print(aml_datastore.name, ":", aml_datastore.datastore_type + " (" + aml_datastore.account_name + ")")

aml_data : AzureBlob (200623lab4a)

Set the Default Datastore


You are primarily going to work with the aml_data datastore in this course, so for convenience
you can set it to be the default datastore:

In [4]: ws.set_default_datastore('aml_data')
default_ds = ws.get_default_datastore()
print(default_ds.name)

aml_data

Upload Data to a Datastore


Now that you have identified the datastore you want to work with, you can upload files from your
local file system so that they will be accessible to experiments running in the workspace,
regardless of where the experiment script is actually being run.


In [5]: default_ds.upload_files(files=['./data/diabetes.csv', './data/diabetes2.csv'], # Upload the diabetes csv files in /data
                        target_path='diabetes-data/', # Put it in a folder path in the datastore
                        overwrite=True, # Replace existing files of the same name
                        show_progress=True)

Uploading an estimated of 2 files
Uploading ./data/diabetes.csv
Uploading ./data/diabetes2.csv
Uploaded ./data/diabetes2.csv, 1 files out of an estimated total of 2
Uploaded ./data/diabetes.csv, 2 files out of an estimated total of 2
Uploaded 2 files

Out[5]: $AZUREML_DATAREFERENCE_b0c114d9481b4042b7d6639ffb65a9b5
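
The upload has a counterpart: if you ever need the files back on local disk, blob datastores expose a download method. A sketch, assuming a hypothetical local target folder:

# Pull the uploaded folder back to local storage
default_ds.download(target_path='./downloaded-data', # hypothetical local destination
                    prefix='diabetes-data', # restrict to the folder uploaded above
                    show_progress=True)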

Train a Model from a Datastore


When you uploaded the files in the code cell above, note that the code returned a data reference.
A data reference provides a way to pass the path to a folder in a datastore to a script, regardless of
where the script is being run, so that the script can access data in the datastore location.

The following code gets a reference to the diabetes-data folder where you uploaded the diabetes
CSV files, and specifically configures the data reference for download - in other words, it can be
used to download the contents of the folder to the compute context where the data reference is
being used. Downloading data works well for small volumes of data that will be processed on local
compute. When working with remote compute, you can also configure a data reference to mount
the datastore location and read data directly from the data source.

More Information: For more details about using datastores, see the Azure ML
documentation (https://docs.microsoft.com/azure/machine-learning/how-to-access-
data).

In [6]: data_ref = default_ds.path('diabetes-data').as_download(path_on_compute='diabetes_data')
print(data_ref)

$AZUREML_DATAREFERENCE_7e8f9a0a713747c9a7fc2063f505826c
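
For comparison, a mount-based reference to the same folder would look like the sketch below. Mounting generally requires a compute context that supports it (such as Docker-enabled remote compute), so it isn't used for this local run:

# Mount the datastore folder instead of copying it to the compute
data_ref_mount = default_ds.path('diabetes-data').as_mount()
print(data_ref_mount)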

To use the data reference in a training script, you must define a parameter for it. Run the following
two code cells to create:

1. A folder named diabetes_training_from_datastore
2. A script that trains a classification model by using the training data in all of the CSV files in the folder referenced by the data reference parameter passed to it.


In [7]: import os

# Create a folder for the experiment files
experiment_folder = 'diabetes_training_from_datastore'
os.makedirs(experiment_folder, exist_ok=True)
print(experiment_folder, 'folder created.')

diabetes_training_from_datastore folder created.


In [8]: %%writefile $experiment_folder/diabetes_training.py
# Import libraries
import os
import argparse
from azureml.core import Run
import pandas as pd
import numpy as np
import joblib
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.metrics import roc_curve

# Get parameters
parser = argparse.ArgumentParser()
parser.add_argument('--regularization', type=float, dest='reg_rate', default=0.01, help='regularization rate')
parser.add_argument('--data-folder', type=str, dest='data_folder', help='data folder reference')
args = parser.parse_args()
reg = args.reg_rate

# Get the experiment run context
run = Run.get_context()

# load the diabetes data from the data reference
data_folder = args.data_folder
print("Loading data from", data_folder)
# Load all files and concatenate their contents as a single dataframe
all_files = os.listdir(data_folder)
diabetes = pd.concat((pd.read_csv(os.path.join(data_folder, csv_file)) for csv_file in all_files))

# Separate features and labels
X, y = diabetes[['Pregnancies','PlasmaGlucose','DiastolicBloodPressure','TricepsThickness','SerumInsulin','BMI','DiabetesPedigree','Age']].values, diabetes['Diabetic'].values

# Split data into training set and test set
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.30, random_state=0)

# Train a logistic regression model
print('Training a logistic regression model with regularization rate of', reg)
run.log('Regularization Rate', np.float(reg))
model = LogisticRegression(C=1/reg, solver="liblinear").fit(X_train, y_train)

# calculate accuracy
y_hat = model.predict(X_test)
acc = np.average(y_hat == y_test)
print('Accuracy:', acc)
run.log('Accuracy', np.float(acc))

# calculate AUC
y_scores = model.predict_proba(X_test)
auc = roc_auc_score(y_test, y_scores[:,1])
print('AUC: ' + str(auc))
run.log('AUC', np.float(auc))

os.makedirs('outputs', exist_ok=True)
# note file saved in the outputs folder is automatically uploaded into experiment record
joblib.dump(value=model, filename='outputs/diabetes_model.pkl')

run.complete()

Writing diabetes_training_from_datastore/diabetes_training.py

The script will load the training data from the data reference passed to it as a parameter, so now
you just need to set up the script parameters to pass the data reference when you run the
experiment.


In [9]: from azureml.train.estimator import Estimator
from azureml.core import Experiment, Environment
from azureml.widgets import RunDetails

# Create a Python environment
env = Environment("env")
env.python.user_managed_dependencies = True
env.docker.enabled = False

# Set up the parameters
script_params = {
    '--regularization': 0.1, # regularization rate
    '--data-folder': data_ref # data reference to download files from datastore
}

# Create an estimator
estimator = Estimator(source_directory=experiment_folder,
                      entry_script='diabetes_training.py',
                      script_params=script_params,
                      compute_target='local',
                      environment_definition=env
                      )

# Create an experiment
experiment_name = 'diabetes-training'
experiment = Experiment(workspace=ws, name=experiment_name)

# Run the experiment
run = experiment.submit(config=estimator)
# Show the run details while running
RunDetails(run).show()
run.wait_for_completion()

Run Properties

Status: Completed
Start Time: 6/23/2020 2:39:01 PM
Duration: 0:00:31
Run Id: diabetes-training_1592948340_e4c0a246
Arguments: N/A
Accuracy: 0.7893333333333333
AUC: 0.8568655044545174
Regularization Rate: 0.1
Output Logs: logs/azureml/105135_azureml.log

...|INFO|Current working dir: /tmp/azureml_runs/diabetes-training_1592948340_e4c0a246
2020-06-23 21:39:28,612|azureml.history._tracking.PythonWorkingDirectory.workingdir|DEBUG|Reverting working dir from /tmp/azureml_runs/diabetes-training_1592948340_e4c0a246 to /tmp/azureml_runs/diabetes-training_1592948340_e4c0a246
2020-06-23 21:39:28,612|azureml.history._tracking.PythonWorkingDirectory|INFO|Working dir is already updated /tmp/azureml_runs/diabetes-training_1592948340_e4c0a246

Click here to see the run in Azure Machine Learning studio
(https://ml.azure.com/experiments/diabetes-training/runs/diabetes-training_1592948340_e4c0a246?wsid=/subscriptions/13f3f409-2802-42d9-a29c-f7b5775839d5/resourcegroups/200623/workspaces/workspace200623)

Out[9]: {'runId': 'diabetes-training_1592948340_e4c0a246',
'target': 'local',
'status': 'Finalizing',
'startTimeUtc': '2020-06-23T21:39:03.980418Z',
'properties': {'_azureml.ComputeTargetType': 'local',
'ContentSnapshotId': '9174c2d0-35bb-4ddf-99ce-f00be203eb26'},
'inputDatasets': [],
'runDefinition': {'script': 'diabetes_training.py',
'useAbsolutePath': False,
'arguments': ['--regularization',
'0.1',
'--data-folder',
'$AZUREML_DATAREFERENCE_7e8f9a0a713747c9a7fc2063f505826c'],
'sourceDirectoryDataStore': None,
'framework': 'Python',
'communicator': 'None',
'target': 'local',
'dataReferences': {'7e8f9a0a713747c9a7fc2063f505826c': {'dataStoreName': 'aml_data',
'mode': 'Download',
'pathOnDataStore': 'diabetes-data',
'pathOnCompute': 'diabetes_data',
'overwrite': False}},
'data': {},
'outputData': {},
'jobName': None,
'maxRunDurationSeconds': None,
'nodeCount': 1,
'environment': {'name': 'env',
'version': 'Autosave_2020-06-23T21:39:01Z_8a323df0',
'python': {'interpreterPath': 'python',
'userManagedDependencies': True,
'condaDependencies': {'channels': ['anaconda', 'conda-forge'],
'dependencies': ['python=3.6.2', {'pip': ['azureml-defaults']}],
'name': 'project_environment'},
'baseCondaEnvironment': None},
'environmentVariables': {'EXAMPLE_ENV_VAR': 'EXAMPLE_VALUE'},

'docker': {'baseImage': 'mcr.microsoft.com/azureml/intelmpi2018.3-ubuntu16.04:20200423.v1',
'platform': {'os': 'Linux', 'architecture': 'amd64'},
'baseDockerfile': None,
'baseImageRegistry': {'address': None, 'username': None, 'password': None},
'enabled': False,
'arguments': []},
'spark': {'repositories': [], 'packages': [], 'precachePackages': True},
'inferencingStackVersion': None},
'history': {'outputCollection': True,
'directoriesToWatch': ['logs'],
'snapshotProject': True},
'spark': {'configuration': {'spark.app.name': 'Azure ML Experiment',
'spark.yarn.maxAppAttempts': '1'}},
'parallelTask': {'maxRetriesPerWorker': 0,
'workerCountPerNode': 1,
'terminalExitCodes': None,
'configuration': {}},
'amlCompute': {'name': None,
'vmSize': None,
'retainCluster': False,
'clusterMaxNodeCount': 1},
'tensorflow': {'workerCount': 1, 'parameterServerCount': 1},
'mpi': {'processCountPerNode': 1},
'hdi': {'yarnDeployMode': 'Cluster'},
'containerInstance': {'region': None, 'cpuCores': 2, 'memoryGb': 3.5},
'exposedPorts': None,
'docker': {'useDocker': False,
'sharedVolumes': True,
'shmSize': '2g',
'arguments': []},
'cmk8sCompute': {'configuration': {}},
'itpCompute': {'configuration': {}},
'cmAksCompute': {'configuration': {}}},
'logFiles': {'azureml-logs/60_control_log.txt': 'https://workspace200620667670
440.blob.core.windows.net/azureml/ExperimentRun/dcid.diabetes-training_15929483
40_e4c0a246/azureml-logs/60_control_log.txt?sv=2019-02-02&sr=b&sig=AwatpEiJnS1r
KNBGuUYi1Un3M3HhUEMqTiM%2B2VNl3z8%3D&st=2020-06-23T21%3A29%3A28Z&se=2020-06-24T
05%3A39%3A28Z&sp=r',
'azureml-logs/70_driver_log.txt': 'https://workspace200620667670440.blob.cor
e.windows.net/azureml/ExperimentRun/dcid.diabetes-training_1592948340_e4c0a246/
azureml-logs/70_driver_log.txt?sv=2019-02-02&sr=b&sig=vqzsBXXgWb6feZ%2BdlUNILu0
jkaPEDvElLU5aUpS4xqA%3D&st=2020-06-23T21%3A29%3A28Z&se=2020-06-24T05%3A39%3A28Z
&sp=r',
'logs/azureml/105135_azureml.log': 'https://workspace200620667670440.blob.cor
e.windows.net/azureml/ExperimentRun/dcid.diabetes-training_1592948340_e4c0a246/
logs/azureml/105135_azureml.log?sv=2019-02-02&sr=b&sig=W3B7aXoUhgTTbIuJf9Qg4mqo
k8o5SfYJTqjO25z%2Fpu0%3D&st=2020-06-23T21%3A29%3A28Z&se=2020-06-24T05%3A39%3A28
Z&sp=r'}}

The first time the experiment is run, it may take some time to set up the Python environment;
subsequent runs will be quicker.
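
The environment in this lab is user-managed, so there is little to build; the slow first run is most noticeable with system-managed environments, where Azure ML resolves and builds a conda environment from a specification like the sketch below (the package choices here are illustrative):

from azureml.core import Environment
from azureml.core.conda_dependencies import CondaDependencies

# A system-managed environment: Azure ML builds this conda environment on first use
env = Environment('diabetes-env')
env.python.user_managed_dependencies = False
env.python.conda_dependencies = CondaDependencies.create(conda_packages=['scikit-learn', 'pandas'],
                                                         pip_packages=['azureml-defaults'])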


When the experiment has completed, in the widget, view the output log to verify that the data files
were downloaded.

As with all experiments, you can view the details of the experiment run in Azure ML studio
(https://ml.azure.com), and you can write code to retrieve the metrics and files generated:

In [10]: # Get logged metrics
metrics = run.get_metrics()
for key in metrics.keys():
    print(key, metrics.get(key))
print('\n')
for file in run.get_file_names():
    print(file)

Regularization Rate 0.1
Accuracy 0.7893333333333333
AUC 0.8568655044545174

azureml-logs/60_control_log.txt
azureml-logs/70_driver_log.txt
logs/azureml/105135_azureml.log
outputs/diabetes_model.pkl

Once again, you can register the model that was trained by the experiment.


In [11]: from azureml.core import Model

# Register the model
run.register_model(model_path='outputs/diabetes_model.pkl', model_name='diabetes_model',
                   tags={'Training context':'Using Datastore'},
                   properties={'AUC': run.get_metrics()['AUC'], 'Accuracy': run.get_metrics()['Accuracy']})

# List the registered models
print("Registered Models:")
for model in Model.list(ws):
    print(model.name, 'version:', model.version)
    for tag_name in model.tags:
        tag = model.tags[tag_name]
        print('\t', tag_name, ':', tag)
    for prop_name in model.properties:
        prop = model.properties[prop_name]
        print('\t', prop_name, ':', prop)
    print('\n')

Registered Models:
diabetes_model version: 3
Training context : Using Datastore
AUC : 0.8568655044545174
Accuracy : 0.7893333333333333

diabetes_model version: 2
Training context : Parameterized SKLearn Estimator
AUC : 0.8483904671874223
Accuracy : 0.7736666666666666

diabetes_model version: 1
Training context : Estimator
AUC : 0.8483377282451863
Accuracy : 0.774

amlstudio-predict-diabetes version: 1
CreatedByAMLStudio : true

In this exercise, you've explored some options for working with data in the form of datastores.

Azure Machine Learning offers a further level of abstraction for data in the form of datasets, which
you'll explore next.
