0% found this document useful (0 votes)

44 views

Lecture04 Storage Services - UGv1.1

The document discusses cloud computing storage architectures and AWS storage services. It provides an overview of big data, file systems, and network-attached storage. It then describes AWS storage options like Amazon S3, EBS, EFS, and Glacier, comparing block-level versus object-level storage.

Uploaded by

cojiwo3794

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views

Lecture04 Storage Services - UGv1.1

Uploaded by

cojiwo3794

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 78

Cloud Computing Architecture

Lecture 04 Storage Services

1
Reminders
 Assignment 1A Demos in your tutorial this week (via Collaborate)
 Assignment 1B out now!!!
 Make sure you write your own PHP code.
 All assignments will be checked for plagiarism.

 Don’t forget to do the online AWS MCQ quizzes.

2
Last week – Network Design (inside the VPC)
 Choosing a Region and Selecting Availability Zones
 Creating a Virtual Private Cloud (VPC) and Subnets
 VPC components and network address - CIDR

 Private and Public Subnets

 Default VPCs and Default Subnets

 Controlling VPC Traffic

 Route tables, Security groups, Network ACLs, Internet gateways, NATs, Bastion Hosts

 Multiple VPCs and AWS Accounts

Quizzes: ACF 2.0.3 VPC
ACA Module 2 Designing your environment 3

3
This week
 Storage in the Cloud
 Big Data
 NFS
 Distributed File Systems (databases next week)

 AWS Storage services (ACF Module 7)

 Amazon Elastic Block Store (Amazon EBS)
 Plus some extra notes on Instance Storage

 Amazon Elastic File System (Amazon EFS)

 Amazon Simple Storage Service (Amazon S3)
4
 Amazon Glacier

4
Big Data – Centre of the Universe
 Drivers
 Internet commerce, Mobile, Social media,
IoT and sensors, Science (e.g. biology,
astronomy, meteorology), Health, Spooks, …
 Data sizes
 KB (103), MB (106), GB (109), TB (1012),
PB(1015), exabyte (EB, 1018),
zettabyte (ZB, 1021), yottabyte (YB,1024)
 Everyday: Facebook 10T, Twitter 7T,
Youtube 4.5T+
 4Vs: volume, variety, velocity, and veracity

5
Network Storage Began with File Servers
Years ago, local-area networks used special servers, called file
servers, to support file sharing, file replication, and storage for
large files.

6
Network-Attached Storage (NAS)

Plug directly into the network.

7
Advantages of NAS
 Reliability: A NAS device typically provides advanced data striping across
multiple volumes within the device. If one (or more) volumes fail, the data
striping would maintain the data and allow reconstruction of the file
contents.
 Performance: Because the NAS device did not run a complete operating
system, the hardware had less system overhead, which allowed it to
outperform a file server.
 Compatibility: NAS devices normally support common file systems, which,
in turn, make them fully compatible with common operating systems.
 Ease of performing backups: NAS devices are commonly used for backup
devices. Within a home, for example, all devices can easily access and
back up files to a NAS device.

8
Cloud-Based Storage
 Cloud-based data storage is the next step in the evolution of
NAS devices.
 Across the web (the cloud), many providers offer data storage
that resides in the cloud.
 Data may be accessible as follows:
 Through a web browser interface
 Through a mounted disk drive
 Through a set of API (application program interface) calls

9
Advantages of Cloud-Based Storage
 Scalability: Most cloud-based data storage providers let users scale their
storage capacity (up or down) to align with their storage needs.
 Pay for use: With most cloud-based data storage facilities, users pay only
for the storage (within a range) that they need.
 Reliability: Many cloud-based data storage facilities provide transparent
data replication.
 Ease of access: Most cloud-based data storage facilities support web-
based access to files from any place, at any time, using a variety of
devices.
 Ease of use: Many cloud-based data storage solutions let users map a
drive letter to the remote file storage area and then access the files
through the use of a logical drive.
10

10
Disadvantages of Cloud-Based Storage

 Performance: Because the cloud-based disk storage devices

are accessed over the Internet, they will never be as fast as
local drives.
 Security: Some users will never feel comfortable with their data
in the cloud.
 Data orphans: Users may abandon data in cloud storage
facilities, leaving confidential private or company data
at risk.

11
Cloud-Based Block Storage

 In the simplest sense, a block of data storage is a fixed-sized

sequence of bits. The size of the block normally corresponds to
an underlying unit of storage on the cloud-based block
storage device.
 Some applications work with very large blocks of data, the
format of which has meaning only to the application itself—
meaning that the data may not map well to storage within a file
system or database.

12
File Systems
 Operating systems exist to allow users to run programs and to store and
retrieve data (files) from one user session to the next.
 Within the operating system, special software, called the file system,
oversees the storage and retrieval of files to and from a disk.
 When you copy a file, delete a file, or create and move files between
folders, the file system is performing the work.
 Initially, file systems allowed users to manipulate only local files that
reside on one of the PC’s disk drives.
 As networks became more prevalent, so too did network operating
systems, which allow users and programs to manipulate files residing on a
device across the network.

13
Real World: Hadoop Distributed File System

 Apache Hadoop is an open source project, the goal of which is

to support reliable, scalable distributed computing.
 Part of the project includes the Hadoop Distributed File System
(HDFS), a Java-based file system that is well suited for cloud-
based storage.
 HDFS is designed to be highly fault tolerant and robust to
maintain operation in the event of a device failure.

14
AWS Storage Options: Block vs. Object Storage

What if you want to change one character in a 1‐GB file?

Block Storage Object Storage

Change one block (piece of the file) Entire file must be updated
that contains the character

One of the critical concepts to understanding the differences between some storage
types is whether they offer "block‐level" storage or "object‐level" storage. This
difference has a major impact on the throughput, latency, and cost of your storage
solution: block storage solutions are typically faster and use less bandwidth but cost
more than object‐level storage.
This week
 Storage in the Cloud
 Big Data
 NFS
 Distributed File Systems (databases next week)

 AWS Storage services (ACF Module 7)

 Amazon Elastic Block Store (Amazon EBS)
 Plus some extra notes on Instance Storage

 Amazon Elastic File System (Amazon EFS)

 Amazon Simple Storage Service (Amazon S3)
16
 Amazon Glacier

16
Introduction to Storage Services

Amazon S3 Amazon EBS

Amazon Amazon
EFS Glacier

Amazon Amazon EC2 Storage AWS IAM

VPC

Amazon RDS Amazon

DynamoDB

It is not surprising that storage is another AWS core service. There are three broad
categories of storage: instance store ("ephemeral"), Amazon EBS, and Amazon S3.

Instance store, or ephemeral storage, is a temporary storage that it is added to your

Amazon EC2 instance. Amazon EBS is persistent, mountable storage; it can be
mounted as a device to an Amazon EC2 instance. Amazon EBS can only be
mounted to an Amazon EC2 instance in the same Availability Zone. Like Amazon
EBS, Amazon S3 is persistent storage but it can be accessed from anywhere.
Amazon Elastic Block Store
(Amazon EBS)

We just finished discussing compute options to power your solution. Amazon Elastic
Block Store (Amazon EBS) is an AWS block storage system that is best used for
storing persistent data. Amazon EBS provides highly available block level storage
volumes for use with Amazon EC2 instances.
Storage

Amazon Elastic Block Store

(Amazon EBS)

Amazon EBS provides persistent block storage volumes for use with Amazon
EC2 instances in the cloud. Persistent storage is any data storage device that retains
data after power to that device is shut off. It is also sometimes referred to as non‐
volatile storage.

Each Amazon EBS volume is automatically replicated within its Availability Zone to
protect you from component failure, offering high availability and durability. Amazon EBS
volumes offer the consistent and low‐latency performance needed to run your
workloads. With Amazon EBS, you can scale your usage up or down within minutes – all
while paying a low price for only what you provision.
Amazon EBS Review
Amazon EBS allows you to create individual storage volumes and attach them to an
Amazon EC2 instance.
Amazon EBS offers block‐level storage
Volumes are automatically replicated within its Availability Zone
Can be backed up automatically to Amazon S3
Uses:
Boot volumes and storage for Amazon EC2 instances

Data storage with a file system

Database hosts

Enterprise applications

Amazon EBS volumes provide durable, detachable, block‐level storage (like an external
hard drive) for your Amazon EC2 instances. Because they are directly attached to the
instances, they can provide extremely low latency between where the data is stored and
where it might be used on the instance. For this reason, they can be used to run a
database with an Amazon EC2 instance. Amazon EBS volumes can also be used to back
up your instances into Amazon Machine Images (AMI), which are stored in Amazon S3
and can be reused to create new Amazon EC2 instances later.
CCA 2.01: Compute, Storage & Network ► Part 4: Storage Services ► EBS
Amazon EBS Lifecycle Call CreateVolume
1 GiB to 16 TiB
Vast amounts of
unused space Create
Call AttachVolume to affiliate with
Attach one Amazon EC2 instance

Attached • Format from Amazon EC2

and instance OS
In Use • Mount formatted drive

Deleted
CreateSnapshot
Snapshot to
Call DeleteVolume Amazon S3
Detach

Call DetachVolume

Amazon EBS provides block‐level storage volumes for use with Amazon EC2 instances.
Amazon EBS volumes are highly available and reliable storage volumes that can be
attached to any running instance in the same Availability Zone. The Amazon EBS
volumes attached to an Amazon EC2 instance are exposed as storage volumes that
persist independently from the life of the instance. When the volumes are not attached
to an EC2 instance, you pay only for the cost of storage.
Amazon EBS Volume Types

Solid‐State Drives (SSD) Hard Disk Drives (HDD)

General Purpose Provisioned IOPS Throughput‐Optimized Cold

Max volume size 16 TiB 16 TiB 16 TiB 16 TiB

Max IOPS/volume 10,000 32,000 500 250
Max throughput/volume 160 MiB/s 500 MiB/s 500 MiB/s 250 MiB/s

Matching the correct technology to your workload is a key best practice for reducing
storage costs. Provisioned IOPS SSD‐backed Amazon EBS volumes can give you the highest
performance, but if your application doesn't require or won't use performance that high,
one of the lower‐cost options might be a better solution.

For more information, see

http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/EBSVolumeTypes.html.
Amazon EBS Volume Types
Solid‐State Drives (SSD) Hard Disk Drives (HDD)
General Purpose Provisioned IOPS Throughput‐Optimized Cold

• Recommended for most • I/O‐intensive workloads • Streaming workloads • Throughput‐oriented

workloads requiring consistent, fast storage for large volumes
• Relational DBs throughput at a low price of data that is
• System boot volumes infrequently accessed
• NoSQL DBs • Big data
• Virtual desktops • Scenarios where the
• Data warehouses lowest storage cost is
Use Cases • Low‐latency interactive
important
apps • Log processing
• Cannot be a boot volume
• Development and test • Cannot be a boot volume
environments

© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon EBS
Snapshots
Point‐in‐time snapshots
Recreate a new volume at any time
Encryption
Encrypted Amazon EBS volumes
No additional cost
Elasticity
Increase capacity
Change to different types
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

To provide an even higher level of data durability, Amazon EBS gives you the ability to
create point‐in‐time snapshots of your volumes, and AWS allows you to recreate a new
volume from a snapshot at any time. Share snapshots or even copy snapshots to
different AWS Regions for even greater disaster recovery (DR) protection. You can, for
example, encrypt and share your snapshots from Virginia to Tokyo.

You could also have encrypted Amazon EBS volumes at no additional cost. The
encryption occurs on the Amazon EC2 side, so the data moving between the Amazon
EC2 instance and the Amazon EBS volume inside AWS data centers will be encrypted in
transit.

As your company grows, the amount of data stored on your Amazon EBS volumes will
likely also grow. Amazon EBS volumes have the ability to increase capacity and change to
different types, meaning that you can change from hard disk to SSD or increase from a
50‐gigabyte volume to a 16‐terabyte volume. For example, you can do this resize
operation on the fly without needing to stop the instances.

25
Amazon EBS: Volumes and IOPS
1. Volumes
Amazon EBS volumes persist independently from the instance
All volume types are charged by the amount provisioned per month
2. Input Output Operations per Second (IOPS)
General Purpose (SSD)
Charged by the amount your provision in GB per month until storage is released
Magnetic
Charged by the number of requests to volume
Provisioned IOPS (SSD)
Charged by the amount you provision in IOPS (by % of day / month used)

When you begin to estimate the cost for Amazon EBS, you need to consider the
following:

1. Volumes – Volume storage for all Amazon EBS volume types is charged by the
amount you provision in GB per month, until you release the storage.

2. Input Output Operations per Second (IOPS) – I/O is included in the price of General
Purpose (SSD) volumes, while for Amazon EBS Magnetic volumes, I/O is charged by
the number of requests you make to your volume. With Provisioned IOPS (SSD)
volumes, you are also charged by the amount you provision in IOPS (multiplied by
the percentage of days you provision for the month).
Amazon EBS: Snapshots and Data Transfer

3. Snapshots
Added cost of Amazon EBS snapshots to Amazon S3 is per GB‐
month of data stored

4. Data Transfer
Inbound data transfer is free
Outbound data transfer charges are tiered

3. Snapshot – Amazon EBS provides the ability to back up snapshots of your data to
Amazon S3 for durable recovery. If you opt for Amazon EBS snapshots, the added
cost is per GB‐month of data stored.

4. Data Transfer – Take into account the amount of data transferred out of your
application. Inbound data transfer is free, and outbound data transfer charges are
tiered.
In Review
Amazon EBS Features:
Persistent and customizable block storage for Amazon EC2
HDD and SSD types
Replicated in the same Availability Zone
Easy and transparent encryption
Elastic volumes
Back up using snapshots

Amazon EBS provides block level storage volumes for use with Amazon EC2 instances.
Amazon EBS volumes are off‐instance storage that persists independently from the life
of an instance. They are analogous to virtual disks in the cloud. Amazon EBS provides
three volume types: General Purpose (SSD), Provisioned IOPS (SSD), and Magnetic.

The three volume types differ in performance characteristics and cost, so you can
choose the right storage performance and price for the needs of your applications.

To learn more about Amazon EBS see, https://aws.amazon.com/ebs/.

28
Recorded Elastic Block Store demo
demo:
Amazon Elastic
Block Store

© 2019 Amazon Web Services, Inc. or its Affiliates.

29
All rights reserved.

Now, take a moment to watch the Elastic Block Store demo. The recording runs a little over 5
minutes, and it reinforces many of the concepts that were discussed in this section of the
module.

The demonstration shows how to configure the following resources by using the AWS
Management Console. The demonstration shows how to:

• Create an Amazon General Purpose (SSD) EBS volume

• Attach the EBS volume to an EC2 instance

The demonstration also shows how to interact with the EBS volume using the Amazon Command
Line Interface and how to mount the EBS volume to the EC2 instance.
Lab 4 Scenario
This lab focuses on Amazon EBS, a key underlying storage mechanism for Amazon EC2
instances. In this lab, you will create an Amazon EBS volume, attach it to an instance,
apply a file system to the volume, and then take a snapshot backup.

Amazon EBS

Amazon EBS provides persistent block storage volumes for use with Amazon
EC2 instances in the AWS cloud. Each Amazon EBS volume is automatically replicated
within its Availability Zone to protect you from component failure, offering high
availability and durability.

After completing this lab, you will be able to:

• Create an Amazon EBS volume
• Attach the volume to an instance
• Configure the instance to use the virtual disk
• Create an Amazon EBS snapshot
• Restore the snapshot

Duration: ~45 minutes

Lab 4: Tasks

Amazon EBS
Create a new EBS volume.

Attach the volume to an Amazon EC2 instance.

Amazon EC2
Instance

Create and configure your file system.

Amazon EBS

Create a snapshot.
snapshot

attached created
Amazon EC2 Amazon EBS Snapshot
Instance

In this lab, you:

• Created an Amazon EBS volume

• Attached that volume to an instance

• Configured the instance to use the virtual disk

• Created an Amazon EBS snapshot

• Restored the snapshot

~ 45 minutes

© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
CCA 2.01: Compute, Storage & Network ► Part 4: Storage Services ► EC2 Instance Storage
Amazon EC2 Instance Storage

 Local, complimentary direct attached block storage

 Includes availability, number of disks, and size based on EC2 instance type

 Optimized for up to 365,000 Read IOPS and 315,000 First Write IOPS

 SSD or magnetic

 Has no persistence

 Automatically deletes data when an EC2 instance stops, fails or is terminated

An instance store provides temporary block‐level storage for your instance. This storage
is located on disks that are physically attached to the host computer. Instance store is
ideal for temporary storage of information that changes frequently, such as buffers,
caches, scratch data, and other temporary content, or for data that is replicated across a
fleet of instances, such as a load‐balanced pool of web servers.
For more information, see:
• http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/InstanceStorage.html#insta
nce‐store‐volumes
• http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/i2‐instances.html
CCA 2.01: Compute, Storage & Network ► Part 4: Storage Services ► EC2 Instance Storage
Amazon EBS vs. Amazon EC2 Instance Store

Amazon EBS
 Data stored on an Amazon EBS volume can persist independently of the life of the instance.
 Storage is persistent.

Amazon EC2 Instance Store

 Data stored on a local instance store persists only as long as the instance is alive.
 Storage is ephemeral.

Use the local instance store only for temporary data. For data that requires a higher level
of durability, use Amazon EBS volumes or back up the data to Amazon S3. If you are
using an Amazon EBS volume as a root partition and want your Amazon EBS volume to
persist outside of the life of the instance, set the Delete on termination flag to “No”.
CCA 2.01: Compute, Storage & Network ► Part 4: Storage Services ► EC2 Instance Storage
Instance Lifecycle – Reboot vs. Stop vs. Terminate

Stop/Start
Characteristic Reboot (EBS-backed instances only) Terminate
The instance stays on the The instance runs on a
Host computer
same host computer new host computer

Public IP address No change New address assigned

Elastic IP addresses EIP remains associated EIP remains associated EIP is disassociated
(EIP) with the instance with the instance from the instance

Instance store
Preserved Erased Erased
volumes

Boot volume is
EBS volume Preserved Preserved
deleted by default

Stops incurring charges Stops incurring charges

Instance billing hour
Billing as soon as state is changed as soon as state is changed
doesn’t change
to stopping to shutting-down

The table shows the differences between rebooting, stopping, and terminating your
instance.
For more information, see:
http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2‐instance‐lifecycle.html
This week
 Storage in the Cloud
 Big Data
 NFS
 Distributed File Systems (databases next week)

 AWS Storage services (ACF Module 7)

 Amazon Elastic Block Store (Amazon EBS)
 Plus some extra notes on Instance Storage

 Amazon Elastic File System (Amazon EFS)

 Amazon Simple Storage Service (Amazon S3)
36
 Amazon Glacier

36
Part 2:
Amazon Elastic File System
(Amazon EFS)

Amazon Elastic File System (Amazon EFS) provides simple, scalable file storage for use
with Amazon EC2 instances in the AWS Cloud. Amazon EFS is easy to use and offers a
simple interface that allows you to create and configure file systems quickly and easily.
Storage

Amazon EFS

Amazon EFS provides simple, scalable, elastic file storage for use with AWS
services and on‐premises resources. It is easy to use and offers a simple interface
that allows you to create and configure file systems quickly and easily. Amazon
EFS is built to elastically scale on demand without disrupting applications,
growing and shrinking automatically as you add and remove files, so your
applications have the storage they need, when they need it.
Amazon EFS Features
File storage in the AWS cloud
Perfect for big data and analytics, media processing workflows, content
management, web serving and home directories
Petabyte‐scale, low latency file system
Shared storage
Elastic capacity
Supports the Network File System versions 4.0 and 4.1 (NFSv4) protocol
Compatible with all Linux‐based AMIs for Amazon EC2

Amazon EFS is a fully managed service that makes it easy to set up and scale file storage
in the AWS cloud. It is the easiest way to build a file system for big data and analytics,
media processing workflows, content management, web serving and home directories.
You can create file systems that are accessible to Amazon EC2 instances via a file system
interface (using standard operating system file I/O APIs) and that support full file system
access semantics (such as strong consistency and file locking).

Amazon EFS file systems can automatically scale from gigabytes to petabytes of data
without needing to provision storage. Thousands of Amazon EC2 instances can access an
Amazon EFS file system at the same time, and Amazon EFS provides consistent
performance to each Amazon EC2 instance. Amazon EFS is designed to be highly durable
and highly available. With Amazon EFS, there is no minimum fee or setup costs, and you
pay only for the storage you use.
Amazon EFS Architecture

Availability Zone A Availability Zone B Availability Zone C

NFS clients
EC2 Instance
EC2 Instance EC2 Instance
Network

Network Network
NFS NFS Interface

Interface Interface Clients Clients

Subnet

Network Mount Network Mount Network Mount

Interface Interface Interface
Target Target Target
Mount Target Mount Target Mount Target

Subnet Subnet
Subnet

Amazon
EFS
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

Amazon EFS provides file storage in the cloud. With Amazon EFS, you can create a file
system, mount the file system on an Amazon EC2 instance, and then read and write data
from to and from your file system. You can mount an Amazon EFS file system in your
VPC, through the Network File System versions 4.0 and 4.1 (NFSv4) protocol.

You can access your Amazon EFS file system concurrently from Amazon EC2 instances in
your Amazon VPC, so applications that scale beyond a single connection can access a file
system. Amazon EC2 instances running in multiple Availability Zones within the same
AWS Region can access the file system, so that many users can access and share a
common data source.

In this illustration, the VPC has three Availability Zones, and each has one mount target
created in it. We recommend that you access the file system from a mount target within
the same Availability Zone. Note that one of the Availability Zones has two subnets.
However, a mount target is created in only one of the subnets.
Amazon EFS Implementation

1 Create your Amazon EC2 resources and launch your Amazon EC2 instance.

2 Create your Amazon EFS file system.

3 Create your target mounts in appropriate subnets.

4 Connect your Amazon EC2 instances to target mounts.

5 Clean up resources and protect your AWS account.

There are five steps you need to perform to create and use your first Amazon EFS file
system, mount it on an Amazon EC2 instance in your VPC, and test the end‐to‐end setup.

1. Create your Amazon EC2 resources and launch your instance. (Before you can launch
and connect to an Amazon EC2 instance, you need to create a key pair, unless you
already have one.)
2. Create your Amazon EFS file system.
3. In the appropriate subnet, create your target mounts.
4. Next, connect to your Amazon EC2 instance and mount the Amazon EFS file system.
5. Finally, clean up your resources and protect your AWS account.
Amazon EFS Resources

File system
Mount target
Subnet ID
Security groups
One or more per file system
Create in a VPC subnet
One per Availability Zone
Must be in the same VPC
Tags
Key‐value pairs
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

In Amazon EFS, a file system is the primary resource. Each file system has properties
such as ID, creation token, creation time, file system size in bytes, number of mount
targets created for the file system, and the file system state.

Amazon EFS also supports other resources to configure the primary resource. These
include mount targets and tags.
Mount target: To access your file system, you must create mount targets in your VPC.
Each mount target has the following properties:
• The mount target ID
• The subnet ID in which it is created
• The file system ID for which it is created
• An IP address at which the file system may be mounted
• The mount target state.
You can use the IP address or the DNS name in your mount command. Each file system
has a DNS name of the following form.
Tags: To help organize your file systems, you can assign your own metadata to each of
the file systems you create. Each tag is a key‐value pair.

Think of mount targets and tags as subresources that don't exist without being
associated with a file system.
In Review
Amazon EFS provides file storage over a network
Perfect for big data and analytics, media processing workflows,
content management, web serving and home directories
Fully managed service that eliminates storage administration tasks
Accessible from the console, an API, or the CLI
Scales up or down as files are added or removed and you pay for
what you use.

We've covered an introduction to Amazon EFS, including key features and key resources.
It provides file storage in the cloud that is perfect for big data and analytics, media
processing workflows, content management, web serving and home directories.
Amazon EFS scales up or down as files are added or removed and you pay for only what
you are using.

Amazon EFS is a fully managed service that is accessible from the console, an API, or the
CLI.

For more information about Amazon S3, see https://aws.amazon.com/efs/.

43
Recorded demo:
Amazon Elastic File
System

Now, take a moment to watch the Amazon EFS demo. The recording runs a little over 6 minutes,
and it reinforces many of the concepts that were discussed in this section of the module.

The demonstration shows how to configure the following resources by using the AWS
Management Console. The demonstration shows how to:

• Create an Elastic File System (EFS) implementation in a Virtual Private Cloud

• Attach the EFS
• Configure security and performance settings for the EFS implementation

The demonstration also reviews .how to get specific instructions for how to validate your EFS
installation so you can connect to EC2 instances.
This week
 Storage in the Cloud
 Big Data
 NFS
 Distributed File Systems (databases next week)

 AWS Storage services (ACF Module 7)

 Amazon Elastic Block Store (Amazon EBS)
 Plus some extra notes on Instance Storage

 Amazon Elastic File System (Amazon EFS)

 Amazon Simple Storage Service (Amazon S3)
45
 Amazon Glacier

45
Part 3:
Amazon Simple Storage Service (S3)

Companies today need the ability to simply and securely collect, store, and analyze their
data at a massive scale. Amazon S3 is object storage built to store and retrieve any
amount of data from anywhere – web sites and mobile apps, corporate applications, and
data from IoT sensors or devices.
Storage

Amazon Simple Storage

Service (Amazon S3)

Amazon S3 is object‐level storage, which means that if you want to change a part
of a file, you have to make the change and then re‐upload the entire modified
file. Amazon S3 stores data as objects within resources called buckets.

Let’s take a closer look Amazon S3.

Amazon S3 Review
Managed cloud storage solution designed to scale seamlessly and provide 99.999999999% durability.

Store as many objects as you want.

Bucket names must be unique across all existing bucket names in Amazon S3.
Amazon S3 cannot be used as a bootable drive.
Data is stored redundantly.
Access Amazon S3 with the AWS Management Console, one of the AWS SDKs, or a third‐party solution.
Object uploads or deletes can trigger notifications, workflows, or even scripts.
Data in transit and at rest can be encrypted automatically.
Storage class analysis (Amazon S3 Analytics) to analyze storage access patterns and transition the right
data to the right storage class.

You can store as many objects as you want within a bucket, and write, read, and delete objects in your
bucket. Bucket names are universal and must be unique across all existing bucket
names in Amazon S3. Objects can be up to 5 terabytes in size. By default, data in Amazon S3 is
stored redundantly across multiple facilities and multiple devices in each facility.
Amazon S3 is a fully managed storage service that provides a simple API for storing and retrieving
data. This means that the data you store in Amazon S3 isn't associated with any particular server, and
you don't have to manage any infrastructure yourself. You can put as many objects into Amazon S3 as
you want. Amazon S3 holds trillions of objects and regularly peaks at millions of requests per second.
Objects can be almost any data file, such as images, videos, or server logs. Since Amazon S3 supports
objects as large as several terabytes in size, you could even store database snapshots as objects.
Amazon S3 also provides low‐latency access to the data over the internet by HTTP or HTTPS, so you
can retrieve data anytime from anywhere. You can also access Amazon S3 privately through a virtual
private cloud endpoint. You get fine‐grained control over who can access your data using identity and
access management policies, S3 bucket policies, and even per‐object access control lists.
By default, none of your data is shared publicly. You can also encrypt your data in transit and choose
to enable server‐side encryption on your objects.
Amazon S3 can be accessed via the web‐based AWS Management Console, programmatically via the
API and SDKs, or with third‐party solutions (which use the API/SDKs).
Amazon S3 includes event notifications that allow you to set up automatic notifications when certain
events occur, such as an object being uploaded to or deleted from a specific bucket. Those
notifications can be sent to you, or they can be used to trigger other processes, such as AWS Lambda
scripts.
With storage class analysis, you can analyze storage access patterns and transition the right data to the
right storage class. This new Amazon S3 Analytics feature automatically identifies the optimal lifecycle
policy to transition less frequently accessed storage to Amazon S3 Standard – Infrequent Access (S3
Standard‐IA). You can configure a storage class analysis policy to monitor an entire bucket, a prefix, or
object tag. Once an infrequent access pattern is observed, you can easily create a new lifecycle age
policy based on the results. Storage class analysis also provides daily visualizations of your storage
usage in the AWS Management Console. You can export these to an S3 bucket to analyze using the
business intelligence tools of your choice, such as Amazon QuickSight.
Amazon S3 storage classes

Amazon S3 offers a range of object‐level storage classes that are

designed for different use cases:
• Amazon S3 Standard
• Amazon S3 Intelligent‐Tiering
• Amazon S3 Standard‐Infrequent Access (Amazon S3 Standard‐IA)
• Amazon S3 One Zone‐Infrequent Access (Amazon S3 One Zone‐IA)
• Amazon S3 Glacier
• Amazon S3 Glacier Deep Archive
Amazon S3 offers a range of object‐level storage classes that are designed for different use
© 2019 Amazon Web Services, Inc. or its Affiliates. All rights reserved. 49
cases. These classes include:

• Amazon S3 Standard – Amazon S3 Standard is designed for high durability, availability,

and performance object storage for frequently accessed data. Because it delivers low
latency and high throughput, Amazon S3 Standard is appropriate for a variety of use
cases, including cloud applications, dynamic websites, content distribution, mobile and
gaming applications, and big data analytics.

• Amazon S3 Intelligent‐Tiering – The Amazon S3 Intelligent‐Tiering storage class is

designed to optimize costs by automatically moving data to the most cost‐effective access
tier, without performance impact or operational overhead. For a small monthly
monitoring and automation fee per object, Amazon S3 monitors access patterns of the
objects in Amazon S3 Intelligent‐Tiering, and moves the objects that have not been
accessed for 30 consecutive days to the infrequent access tier. If an object in the
infrequent access tier is accessed, it is automatically moved back to the frequent access
tier. There are no retrieval fees when you use the Amazon S3 Intelligent‐Tiering storage
class, and no additional fees when objects are moved between access tiers. It works well
for long‐lived data with access patterns that are unknown or unpredictable.
• Amazon S3 Standard‐Infrequent Access (Amazon S3 Standard‐IA) – The Amazon S3 Standard‐
IA storage class is used for data that is accessed less frequently, but requires rapid access
when needed. Amazon S3 Standard‐IA is designed to provide the high durability, high
throughput, and low latency of Amazon S3 Standard, with a low per‐GB storage price and per‐
GB retrieval fee. This combination of low cost and high performance makes Amazon S3
Standard‐IA good for long‐term storage and backups, and as a data store for disaster recovery
files.

• Amazon S3 One Zone‐Infrequent Access (Amazon S3 One Zone‐IA) – Amazon S3 One Zone‐IA
is for data that is accessed less frequently, but requires rapid access when needed. Unlike
other Amazon S3 storage classes, which store data in a minimum of three Availability Zones,
Amazon S3 One Zone‐IA stores data in a single Availability Zone and it costs less than Amazon
S3 Standard‐IA. Amazon S3 One Zone‐IA works well for customers who want a lower‐cost
option for infrequently accessed data, but do not require the availability and resilience of
Amazon S3 Standard or Amazon S3 Standard‐IA. It is a good choice for storing secondary
backup copies of on‐premises data or easily re‐creatable data. You can also use it as cost‐
effective storage for data that is replicated from another AWS Region by using Amazon S3
Cross‐Region Replication.

• Amazon S3 Glacier – Amazon S3 Glacier is a secure, durable, and low‐cost storage class for
data archiving. You can reliably store any amount of data at costs that are competitive with—
or cheaper than—on‐premises solutions. To keep costs low yet suitable for varying needs,
Amazon S3 Glacier provides three retrieval options that range from a few minutes to hours.
You can upload objects directly to Amazon S3 Glacier, or use Amazon S3 lifecycle policies to
transfer data between any of the Amazon S3 storage classes for active data (Amazon S3
Standard, Amazon S3 Intelligent‐Tiering, Amazon S3 Standard‐IA, and Amazon S3 One Zone‐IA)
and Amazon S3 Glacier.

• Amazon S3 Glacier Deep Archive – Amazon S3 Glacier Deep Archive is the lowest‐cost storage
class for Amazon S3. It supports long‐term retention and digital preservation for data that
might be accessed once or twice in a year. It is designed for customers — particularly
customers in highly regulated industries, such as financial services, healthcare, and public
sectors — that retain datasets for 7–10 years (or more) to meet regulatory compliance
requirements. Amazon S3 Glacier Deep Archive can also be used for backup and disaster
recovery use cases. It is a cost‐effective and easy‐to‐manage alternative to magnetic tape
systems, whether these tape systems are on‐premises libraries or off‐premises services.
Amazon S3 Glacier Deep Archive complements Amazon S3 Glacier, and it is also designed to
provide 11 9s of durability. All objects that are stored in Amazon S3 Glacier Deep Archive are
replicated and stored across at least three geographically dispersed Availability Zones, and
these objects can be restored within 12 hours.

For more information see, Amazon S3 storage classes.

49
Amazon S3 Review
Amazon S3
To upload your data (photos, videos, documents, etc.):
1. Create a bucket in one of the AWS Regions.
2. Upload any number of objects to the bucket.

Bucket
https://s3-ap-northeast-1.amazonaws.com/[bucket name]/
[bucket name]
Region code Bucket name

Object
https://s3-ap-northeast-1.amazonaws.com/[bucket name]/Preview2.mp4
Preview2.mp4
Tokyo Region Key
(ap-northeast-1)

To get the most out of Amazon S3, you need to understand a few simple concepts. First,
Amazon S3 stores data inside buckets. Buckets are essentially the prefix for a set of files,
and as such must be uniquely named across all of Amazon S3. Buckets are logical
containers for objects. You can have one or more buckets in your account. For each
bucket, you can control access, in other words, who can create, delete and list objects in
the bucket. You can also view access logs for the bucket, and its objects, and choose the
geographical region where Amazon S3 will store the bucket and its contents.

In the example, we've used Amazon S3 to create a bucket in the Tokyo region, which is
identified within AWS formally by its region code: "ap‐northeast‐1").

The URL for a bucket is structured as displayed here, with the region code first, followed
by amazonaws.com, followed by the bucket name.

Amazon S3 refers to files as objects. Once you have a bucket, you can store any number
of objects inside of it. An object is composed of data, and any metadata that describes
that file. To store an object in Amazon S3, you upload the file you want to store into a
bucket.

When you upload a file, you can set permission on the data as well as any metadata.
In this example, we're storing the object "Preview2.mp4" inside of our bucket.
The URL for the file includes the object name at the end.

50
Data Redundantly Stored in Region

media/welcome.mp4
Facility 1 Facility 2 Facility 3

my‐bucket‐name

media/welcome.mp4 media/welcome.mp4 media/welcome.mp4

When you create a bucket in Amazon S3, it's associated with a particular AWS Region.
Whenever you store data in the bucket, it is redundantly stored across multiple AWS
facilities within your selected region. Amazon S3 is designed to durably store your data,
even in the case of concurrent data loss in two AWS facilities.

51
Designed for Seamless Scaling

media/welcome.mp4 prod2.mp4 prod3.mp4 prod4.mp4

prod5.mp4 prod6.mp4 prod7.mp4 prod8.mp4

my‐bucket‐name

prod9.mp4 prod10.mp4 prod11.mp4 prod12.mp4

Amazon S3 will automatically manage the storage behind your bucket even as your data
grows. This allows you to get started immediately and to have your data storage grow
with your application needs. Amazon S3 will also scale to handle a high volume of
requests. You don't have to provision the storage or throughput, and you'll only be billed
for what you use.

52
Access the Data Anywhere

You can access Amazon S3 via the console, AWS CLI, or AWS SDK. Additionally, you can
also access the data in your bucket directly via the rest endpoints. These support HTTP
or HTTPS access. To support this type of URL‐based access, S3 bucket names must be
globally unique and DNS‐compliant. Also, object keys should be using characters that are
safe for URLs.

53
Common Use Cases
Storing application assets
Static web hosting
Backup and disaster recovery (DR)
Staging area for big data
Many more….

This flexibility to store a virtually unlimited amount of data and access that data from
anywhere makes Amazon S3 suitable for a wide range of scenarios. Let's look at some
use cases for Amazon S3:
• As a location for any application data, Amazon S3 buckets provide that shared
location for storing objects that any instances of your application can access,
including applications on Amazon EC2 or even traditional servers. This can be useful
for user‐generated media files, server logs, or other files your application needs to
store in a common location. Also, because the content can be fetched directly over
the web, you can offload serving of that content from your application and allow
clients to directly fetch the data themselves from Amazon S3.
• For static web hosting, Amazon S3 buckets can serve up the static contents of your
website, including HTML, CSS, JavaScript, and other files.
• The high durability of Amazon S3 makes it a good candidate to store backups of your
data. For even greater availability and disaster recovery capability, Amazon S3 can
even be configured to support cross‐region replication such that data put into an
Amazon S3 bucket in one region can be automatically replicated to another Amazon
S3 region. The scalable storage and performance of Amazon S3 make it a great
candidate for staging or long‐term storage of data you plan to analyze using a variety
of big data tools. Given how simple it is to store and access data with Amazon S3,
you'll find yourself using it frequently with AWS services and for other parts of your
application.

54
Amazon S3 Pricing
Pay only for what you use, including:
GBs per month
Transfer OUT to other regions
PUT, COPY, POST, LIST, and GET requests

You do NOT have to pay for:

Transfer IN to Amazon S3
Transfer OUT from Amazon S3 to Amazon CloudFront or Amazon EC2 in the same
region.

Specific costs may vary depending on region and the specific requests made.
As a general rule, you only pay for transfers that cross the boundary of your region,
which means you do not pay for transfers to Amazon CloudFront's edge locations within
that same region.
Amazon S3: Storage Pricing
To estimate Amazon S3 costs, consider the following:
1. Types of storage classes
Standard Storage
99.999999999% durability
99.99% availability
Standard‐Infrequent Access (SIA)
99.999999999% durability
99.9% availability
2. Amount of storage
The number and size of objects

When you begin to estimate the costs of Amazon S3, you need to consider the following:

1. Storage Class:
• Standard Storage is designed to provide 99.999999999% durability and
99.99% availability.
• Standard – Infrequent Access (SIA) is a storage option within Amazon S3 that
you can use to reduce your costs by storing less frequently accessed data at
slightly lower levels of redundancy than Amazon S3’s standard storage.
Standard – Infrequent Access is designed to provide the same
99.999999999% durability as Amazon S3 with 99.9% availability in a given
year. It’s important to note that each class has different rates.
2. Storage – The number and size of objects stored in your Amazon S3 buckets as well
as type of storage.
Amazon S3: Storage Pricing
3. Requests:
The number of requests (GET, PUT, COPY):
Type of requests
Different rates for GET requests than other requests
4. Data Transfer:
Pricing based on the amount of data transferred out of the
Amazon S3 region
Data transfer in is free, charge for data transfer out

3. Requests – The number and type of requests. GET requests incur charges at
different rates than other requests, such as PUT and COPY requests.
• Get: Retrieves an object from Amazon S3. You must have READ access to use
this operation.
• Put: Adds an object to a bucket. You must have WRITE permissions on a
bucket to add an object to it.
• Copy: Creates a copy of an object that is already stored in Amazon S3. A PUT
copy operation is the same as performing a GET and then a PUT.

4. Data Transfer – The amount of data transferred out of the Amazon S3 region.
Remember that data transfer in is free, but there is a charge for data transfer out.
In Review
Amazon S3 is a fully managed cloud storage service
Store a virtually unlimited number of objects
Pay for only what you use
Access at any time, from anywhere
Amazon S3 offers rich security controls

We've covered an introduction to Amazon S3 including key features and some common
use cases.

For more information about Amazon S3, see https://aws.amazon.com/s3/.

58
CCA 2.01: Compute, Storage & Network ► Part 4: Storage Services ► EBS
Amazon EBS and Amazon S3

Amazon EBS Amazon S3

Paradigm Block storage with file system Object store

Performance Very fast Fast

Across multiple servers in an Across multiple facilities

Redundancy
Availability Zone in a Region

EBS Encryption – Data volumes

Security Encryption
and Snapshots

Access from the

No (1) Yes (2)
Internet?

Typical use case It is a disk drive Online storage

(1) Accessible from the Internet if mounted to server and set up as FTP, etc.
(2) Only with proper credentials, unless ACLs are world-readable

This table illustrates the significant differences between Amazon S3 and Amazon EBS.
Amazon EBS volumes are network‐attached hard drives that can be written to or read
from at a block level. Amazon S3 is an object‐level storage medium.
This means that you must write whole objects at a time. If you change one small part of
a file, you must still rewrite the entire file in order to commit the change to Amazon S3.
This can be very time‐consuming if you have frequent writes to the same object.
Amazon S3 is optimized for write‐once/read‐many use cases. The other major difference
is cost. With Amazon S3 you pay for what you use, and with Amazon EBS you pay for
what you provision.
Recorded demo:
Amazon Simple
Storage System

Now, take a moment to watch the Amazon S3 demo. The recording runs a little over 4 minutes,
and it reinforces many of the concepts that were discussed in this section of the module.

The demonstration shows how to configure the following resources by using the AWS
Management Console. The demonstration shows how to:

• Create an Amazon S3 bucket

• Upload files and create folders
• Change bucket settings

The demonstration also reviews some of the more commonly used settings for an S3 bucket.
Lab Exercise - Create a publicly accessible S3 web page

 Lab Exercise - Create a publicly accessible S3 web page.

 In this exercise you will load up some content (web page or picture) to an
S3 bucket and make that bucket publicly available.You can use you own
account or Qwiklabs Sandbox to do this.
 Links on how to do this are listed below. Show your tutor once you have
finished.
 https://docs.aws.amazon.com/AmazonS3/latest/dev/UsingBucket.html (Li
nks to an external site.)
 https://docs.aws.amazon.com/AmazonS3/latest/dev/WebsiteHosting.html

61
This week
 Storage in the Cloud
 Big Data
 NFS
 Distributed File Systems (databases next week)

 AWS Storage services (ACF Module 7)

 Amazon Elastic Block Store (Amazon EBS)
 Plus some extra notes on Instance Storage

 Amazon Elastic File System (Amazon EFS)

 Amazon Simple Storage Service (Amazon S3)
62
 Amazon Glacier

62
Amazon Glacier

Amazon Glacier is a secure, durable, and extremely low‐cost cloud storage service for
data archiving and long‐term backup.
Storage

Amazon Glacier

Let’s take a closer look at Amazon Glacier.

Amazon Glacier Review
Amazon Glacier is a data archiving service designed for security, durability,
and an extremely low cost.
Designed for durability of 99.999999999% of objects
Supports SSL/TLS encryption of data in transit and at rest
The Vault Lock feature enforces compliance via a lockable policy
Extremely low‐cost design is ideal for long‐term archiving
Provides three options for access to archives (Expedited, Standard, and Bulk) from a few
minutes to several hours

Amazon Glacier's data archiving means that although you can store your data at an
extremely low cost (even in comparison to Amazon S3), you cannot retrieve your data
immediately when you want it. Data stored in Amazon Glacier takes several hours to
retrieve, which is why it's ideal for archiving.

There are three key Amazon Glacier terms that you should be familiar with:
• Archive: Any object such as a photo, video, file, or document that you store in
Amazon Glacier. It is the base unit of storage in Amazon Glacier. Each archive has its
own unique ID and can also have a description.
• Vault: A container for storing archives. When you create a vault, you specify the vault
name and the region in which you would like the vault located.
• Vault Access Policy: Determine who can and cannot access the data stored in the
vault as well as what operations users can and cannot perform. One vault access
permissions policy can be created for each vault to manage access permissions for
that vault. You can also use a vault lock policy to make sure a vault cannot be altered.
Each vault can have one vault access policy and one vault lock policy attached to it.

There are three options for retrieving data with varying access times and cost:
Expedited, Standard, and Bulk retrievals, as follows:
• Expedited retrievals are typically made available within 1 – 5 minutes (highest cost).
• Standard retrievals typically complete within 3 – 5 hours (less than expedited, more
than bulk).
• Bulk retrievals typically complete within 5 – 12 hours (lowest cost).
Think of it as being like choosing the cost to ship a package.

65
Using Amazon Glacier

RESTful
Web services

Java or .NET
SDKs

Amazon S3 with
lifecycle policies

To store and access data in Amazon Glacier, you can use the AWS Management Console;
however only a few operations, such as creating and deleting vaults and creating and
managing archive policies, are available in the console. Almost all other operations
require that you use either the Amazon Glacier REST API, or AWS Java or .NET SDKs to
interact with Amazon Glacier via the CLI.

You can also archive data into Amazon Glacier using lifecycle policies. Let’s take a closer
look at what that means.
Lifecycle Policies
Amazon S3 lifecycle policies allow you to delete or move objects based on
age.

Amazon S3 Amazon S3 Amazon Glacier Delete

Standard Standard ‐
Infrequent Access

30 Days 60 Days 365 Days

Preview2.mp4 Preview2.mp4 Preview2.mp4

You should automate the lifecycle of your data stored in Amazon S3. Using lifecycle
policies, you can have data cycled at regular intervals between different Amazon S3
storage types. This reduces your overall cost, because you are paying less for data as it
becomes less important with time.

In addition to being able to set lifecycle rules per object, you can also set lifecycle rules
per bucket.

Let’s take a look at an example of a lifecycle policy that moves data as it ages from
Amazon S3 Standard to Amazon S3 Standard – Infrequent Access and, finally, into
Amazon Glacier before it is deleted. Let’s say that the user uploads a video to your
application and your application generates a thumbnail preview of the video. This video
preview is stored to Amazon S3 Standard, because it is likely that the user will want to
access it right away.

Your usage data indicates that most thumbnail previews are not accessed after 30 days.
So, your lifecycle policy will take this previews and move them to Amazon S3 infrequent
access after 30 days. Once another 30 days have lapsed, it is highly unlikely that it the
preview will be accessed again, so it is moved to Amazon Glacier where it remains for 1
year. After one year, the preview is deleted. The important thing to note is that the
lifecycle policy manages all of this movement automatically.
For more information, see http://docs.aws.amazon.com/AmazonS3/latest/dev/object‐
lifecycle‐mgmt.html.

67
Storage Comparison
Amazon S3 Amazon Glacier
Data volume No limit No limit
Average latency ms min/hrs
Item size 5 TB max 40 TB max
Cost/GB per month ¢¢ ¢
Billed requests PUT, COPY, POST,
UPLOAD and retrieval
LIST, and GET
Retrieval pricing ¢ ¢¢
Per request Per request and per GB

While Amazon S3 and Amazon Glacier are both object storage solutions that allow you
to store an unlimited amount of data, there are some critical differences between them
that are outlined in this chart.

1. Be careful when deciding which storage solution is correct for your needs. These are
two very different services for storage needs. Amazon S3 is designed for frequent,
low‐latency access to your data, while Amazon Glacier is designed for low‐cost, long‐
term storage of infrequently accessed data.
2. The maximum item size in Amazon S3 is 5 TB, whereas Amazon Glacier can store
items up to 40 TB in size.
3. Because Amazon S3 gives you faster access to your data, the storage cost per
gigabyte is higher than it is with Amazon Glacier.
4. While both services have per request charges, Amazon S3 charges for PUT, COPY,
POST, LIST, GET while Amazon Glacier charges for UPLOAD and retrieval.
5. Because Amazon Glacier was designed for less frequent access to data, it costs more
for each retrieval request than Amazon S3. Both the cost per retrieval and the cost
per GB are higher for Amazon Glacier.
Server‐Side Encryption

Another important difference between Amazon S3 and Amazon Glacier is how data is
encrypted. Server‐side encryption is about protecting data at rest. With both solutions,
you can securely transfer your data over HTTPS. Any data archived in Amazon Glacier is
encrypted by default. With Amazon S3, your application must initiate server‐side
encryption. There are several ways to accomplish this:

• Server‐side encryption with Amazon S3‐managed encryption keys (SSE‐S3)

employs strong multi‐factor encryption. Amazon S3 encrypts each object with
a unique key. As an additional safeguard, it encrypts the key itself with a
master key that it regularly rotates. Amazon S3 server‐side encryption uses
one of the strongest block ciphers available, 256‐bit Advanced Encryption
Standard (AES‐256), to encrypt your data.
• AWS Key Management Service (AWS KMS) is a service that combines secure,
highly available hardware and software to provide a key management system
scaled for the cloud. AWS KMS uses customer master keys (CMKs) to encrypt
your Amazon S3 objects. You use AWS KMS via the Encryption Keys section in
the IAM console or via AWS KMS APIs to centrally create encryption keys,
define the policies that control how keys can be used, and audit key usage to
prove they are being used correctly. You can use these keys to protect your
data in Amazon S3 buckets.
• Using server‐side encryption with customer‐provided encryption keys (SSE‐C)
allows you to set your own encryption keys. With the encryption key you
provide as part of your request, Amazon S3 manages both encryption (as it
writes to disks) and decryption (when you access your objects).

69
Security with Amazon Glacier

Control access with

AWS IAM

Amazon Glacier encrypts

By default, only you can access your data. You can enable and control access to your
data in Amazon Glacier by using AWS IAM. You just set up an AWS IAM policy that
specifies user access.
Recorded demo:
Amazon S3 Glacier

Now, take a moment to watch the Amazon Glacier demo. The recording runs a little over 2
minutes, and it reinforces many of the concepts that were discussed in this section of the
module.

The demonstration shows how to configure the following resources by using the AWS
Management Console. The demonstration shows how to:

• Create an Amazon Glacier vault

• Upload archived items to the vault using a third‐party graphical interface tool
.
In Review
Amazon Glacier is a data archiving service designed for security, durability, and
an extremely low cost.
Amazon Glacier pricing is region‐based.
Extremely low‐cost design is ideal for long‐term archiving.
The service is designed for durability of 99.999999999% of objects.

We've covered an introduction to Amazon Glacier including key differences between

Amazon S3 and Amazon Glacier.

For more information about Amazon Glacier, see https://aws.amazon.com/glacier/.

72
Amazon Glacier Demo

Please review the Amazon Glacier demonstration: M2_S3_Glacier v2.0.mp4.

This video demonstration can be found in the learning management system.

HCIA - Storage Course
No ratings yet
HCIA - Storage Course
16 pages
SecurOS VMS Server 400 Datasheet en
No ratings yet
SecurOS VMS Server 400 Datasheet en
2 pages
Notes
No ratings yet
Notes
30 pages
Working With Cloud-Based Storage: CSE423 Virtualization and Cloud Computing
No ratings yet
Working With Cloud-Based Storage: CSE423 Virtualization and Cloud Computing
27 pages
W7 - CLO2 - File System and Storage
No ratings yet
W7 - CLO2 - File System and Storage
21 pages
UE20CS351 Unit3 Slides PDF
No ratings yet
UE20CS351 Unit3 Slides PDF
177 pages
Distributed Systems CH7-2022
No ratings yet
Distributed Systems CH7-2022
15 pages
CloudComputing Unit 3
No ratings yet
CloudComputing Unit 3
8 pages
cloud unit-4-2
No ratings yet
cloud unit-4-2
32 pages
Unit V - IOT
No ratings yet
Unit V - IOT
25 pages
Unit-4_Cloud Storage and Database Services
No ratings yet
Unit-4_Cloud Storage and Database Services
88 pages
UNIT-III CC
No ratings yet
UNIT-III CC
14 pages
KCS713 Unit 3 Lecture 6
No ratings yet
KCS713 Unit 3 Lecture 6
29 pages
Chapter 3
No ratings yet
Chapter 3
30 pages
Cloud For Beginners Course Slides
No ratings yet
Cloud For Beginners Course Slides
147 pages
Cloud Storage
No ratings yet
Cloud Storage
16 pages
CC Unit 4
No ratings yet
CC Unit 4
46 pages
Mca-305 (G2) - SM09
No ratings yet
Mca-305 (G2) - SM09
21 pages
CC Unit 04
No ratings yet
CC Unit 04
8 pages
Architecture & Consideration
No ratings yet
Architecture & Consideration
7 pages
Storage Lecture
No ratings yet
Storage Lecture
15 pages
Unit 4
No ratings yet
Unit 4
47 pages
Cloud Storage Seminar
No ratings yet
Cloud Storage Seminar
11 pages
05 Storage and Database Services
No ratings yet
05 Storage and Database Services
71 pages
storage-technologies-unit-3-part-1
No ratings yet
storage-technologies-unit-3-part-1
15 pages
Cloud Computing
No ratings yet
Cloud Computing
11 pages
10. Storage - Part 2
No ratings yet
10. Storage - Part 2
42 pages
Bluesky: A Cloud-Backed File System For The Enterprise
No ratings yet
Bluesky: A Cloud-Backed File System For The Enterprise
14 pages
#5 Cloud Computing
No ratings yet
#5 Cloud Computing
28 pages
Cloud Storage
No ratings yet
Cloud Storage
4 pages
Unit-II_CC_Notes
No ratings yet
Unit-II_CC_Notes
33 pages
Information Search
No ratings yet
Information Search
11 pages
question bank unit-3-4-5-answers
No ratings yet
question bank unit-3-4-5-answers
69 pages
Cloud Stroage
No ratings yet
Cloud Stroage
10 pages
CC Unit-5
No ratings yet
CC Unit-5
9 pages
Cloud Computing Unit 2 Notes
No ratings yet
Cloud Computing Unit 2 Notes
14 pages
UNIT 5 Storage Systems
No ratings yet
UNIT 5 Storage Systems
9 pages
Unit 3 Part 1
No ratings yet
Unit 3 Part 1
14 pages
CC UNIT-V
No ratings yet
CC UNIT-V
6 pages
Iot 5
No ratings yet
Iot 5
4 pages
UNITIV Cloud Computing AU - IV
No ratings yet
UNITIV Cloud Computing AU - IV
16 pages
Unit 4
No ratings yet
Unit 4
8 pages
Final Unit II CC 2022
No ratings yet
Final Unit II CC 2022
56 pages
Seminar Report ON Cloud Storage 1
100% (1)
Seminar Report ON Cloud Storage 1
25 pages
Storage - Block, Object File Storage
No ratings yet
Storage - Block, Object File Storage
1 page
05_Storage_and_Database_Services
No ratings yet
05_Storage_and_Database_Services
74 pages
Cloud Storage
No ratings yet
Cloud Storage
8 pages
Unit 1
No ratings yet
Unit 1
62 pages
Unit 3
No ratings yet
Unit 3
9 pages
Cloud Storage
No ratings yet
Cloud Storage
5 pages
History: Cloud Storage Is A Model of
No ratings yet
History: Cloud Storage Is A Model of
5 pages
Cloud Computing Chapter8
No ratings yet
Cloud Computing Chapter8
43 pages
Unit 1 - Data Access in the Internet Era (Part a)
No ratings yet
Unit 1 - Data Access in the Internet Era (Part a)
134 pages
How Cloud Storage Works
No ratings yet
How Cloud Storage Works
6 pages
Cloud Storage Seminar
No ratings yet
Cloud Storage Seminar
21 pages
Storage in AWS
No ratings yet
Storage in AWS
55 pages
Welcome To The New Era of Cloud Computing: The Web Is Replacing The Desktop
No ratings yet
Welcome To The New Era of Cloud Computing: The Web Is Replacing The Desktop
36 pages
IB & IT Notes
No ratings yet
IB & IT Notes
9 pages
Seminar Report New
No ratings yet
Seminar Report New
27 pages
Chapter 3 - Cloud Product and Services
No ratings yet
Chapter 3 - Cloud Product and Services
29 pages
The Ceph Handbook: Building and Managing Scalable Distributed Storage Systems
From Everand
The Ceph Handbook: Building and Managing Scalable Distributed Storage Systems
Robert Johnson
No ratings yet
Storage Area Networks For Dummies
From Everand
Storage Area Networks For Dummies
Christopher Poelker
3.5/5 (2)
VTL NetVault
No ratings yet
VTL NetVault
19 pages
h8224 Replication Isilon Synciq WP
No ratings yet
h8224 Replication Isilon Synciq WP
94 pages
Kaspersky Security For Windows Server Datasheet
No ratings yet
Kaspersky Security For Windows Server Datasheet
2 pages
DXi-Series Configuration and Best Practices Guide For Veeam Backup & Replication
No ratings yet
DXi-Series Configuration and Best Practices Guide For Veeam Backup & Replication
17 pages
Dell Powerprotect Data Manager Appliance Storagereview
No ratings yet
Dell Powerprotect Data Manager Appliance Storagereview
33 pages
Storage scv3000 - Deployment Guide - en Us PDF
No ratings yet
Storage scv3000 - Deployment Guide - en Us PDF
109 pages
CHAP - IV (Cloud Computing)
No ratings yet
CHAP - IV (Cloud Computing)
30 pages
Testdisk Documentation: Release 7.1
No ratings yet
Testdisk Documentation: Release 7.1
69 pages
Veeam PP Veeam Enhanced NAS Backup Scenario Card
No ratings yet
Veeam PP Veeam Enhanced NAS Backup Scenario Card
1 page
Cisco It Aci Design: Aci Plugin For Red Hat Openshift Container Architecture and Design Guide
No ratings yet
Cisco It Aci Design: Aci Plugin For Red Hat Openshift Container Architecture and Design Guide
29 pages
PowerScale Onefs 9.4 Backup and Recovery Guide
No ratings yet
PowerScale Onefs 9.4 Backup and Recovery Guide
88 pages
Radio-Frequency Identification (Rfid) : Internal Test I
No ratings yet
Radio-Frequency Identification (Rfid) : Internal Test I
6 pages
Praful Resume Storage Backup
No ratings yet
Praful Resume Storage Backup
5 pages
DiskSavvy Disk Space Analyzer
No ratings yet
DiskSavvy Disk Space Analyzer
39 pages
Naukri AnkushSonone9y 0m-1
No ratings yet
Naukri AnkushSonone9y 0m-1
5 pages
HUS VM Architecture
No ratings yet
HUS VM Architecture
28 pages
CCS367 - ST CIA 1 & CIA II QP
No ratings yet
CCS367 - ST CIA 1 & CIA II QP
5 pages
Final Report
No ratings yet
Final Report
6 pages
Components of System Administration
50% (2)
Components of System Administration
3 pages
DATA RECOVERY (kHATEEB)
100% (1)
DATA RECOVERY (kHATEEB)
46 pages
HCIA Storage Reviewer
No ratings yet
HCIA Storage Reviewer
56 pages
Emc E05-001
100% (3)
Emc E05-001
64 pages
GNS-1001 Spec V1.0
No ratings yet
GNS-1001 Spec V1.0
3 pages
Dell Storage Center Owner's Manual
No ratings yet
Dell Storage Center Owner's Manual
54 pages
Storage System 5500 V5
No ratings yet
Storage System 5500 V5
6 pages
OS Module 5 Notes (BISOS304)
No ratings yet
OS Module 5 Notes (BISOS304)
30 pages
Vsphere Esxi Vcenter Server 55 Storage Guide
No ratings yet
Vsphere Esxi Vcenter Server 55 Storage Guide
278 pages
DSTN Merged - Csi Zc447 Es Zc447is Zc447ss Zc447 CH 15-16
No ratings yet
DSTN Merged - Csi Zc447 Es Zc447is Zc447ss Zc447 CH 15-16
63 pages