EMR Workshop Lab 0: Create VPC
EMR Workshop Lab 0: Create VPC
(Updated 27-Oct-16)
- Cluster Creation
This lab demonstrates the steps involved in cluster creation.
Create VPC
In AWS Mgmt Console
Click on VPC
In VPC Dashboard
Choose Start VPC Wizard
In Step 1: Select a VPC Configuration
Choose VPC with a Single Public Subnet
In Step 2: VPC with a Single Public Subnet
Enter a VPC name.
Keep the defaults on everything else.
Click Create VPC
EC2 key pair
Make sure you have an EC2 key pair in the region you are using.
http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-key-
pairs.html#having-ec2-create-your-key-pair
Launch EMR Cluster
Open the Amazon EMR console at
https://console.aws.amazon.com/elasticmapreduce/
In Create Cluster
Click ‘Go to advanced options’
• Hadoop
• Ganglia
• Hive
• Zeppelin
• Presto
• Tez
• Pig
• Hue
• Spark
Click ‘Next’
Step 2: Hardware Config
Network Choose previously created VPC
EC2 Subnet Choose the public subnet
Instances Set the cluster instances and counts as follows:
Click ‘Next’
Click ‘Next’
Step 4: Security
EC2 Key Pair Choose a key pair in the region
Cluster visible Leave checked
Permissions Choose Default
EC2 Security Groups Leave as default
Encryption Options Leave as default
Update Security Group
• In the details page for your cluster, scroll down and click on
the security group shown for ‘Security Group for Master’
• Click on the security group for ‘ElasticMapReduce-master’
• Click on the ‘Inbound’ tab.
• Click the ‘Edit’ button.
• Click the Add Rule button.
• Add a rule that allows SSH from your IP Address.
• Click Save.
This will you to SSH into the cluster when it comes up in about
10-15 mins.