OneFS 7.2 Web Administration Guide
OneFS Version 7.2
CONTENTS

Chapter 1    Introduction to this guide
Chapter 2    Isilon scale-out NAS
Chapter 3    General cluster administration
Chapter 4    Access zones
Chapter 5
Chapter 6    Identity management
Chapter 7    Auditing
             Auditing overview
             Protocol audit events
             Supported event types
             Supported audit tools
             Enable system configuration auditing
             Enable protocol access auditing
             Auditing settings
Chapter 8    File sharing
Chapter 9    Home directories
Chapter 10   Snapshots
Chapter 11   Deduplication with SmartDedupe
             Deduplication overview
             Deduplication jobs
             Data replication and backup with deduplication
             Snapshots with deduplication
             Deduplication considerations
             Shadow-store considerations
             SmartDedupe license functionality
             Managing deduplication
                 Assess deduplication space savings
                 Specify deduplication settings
                 View deduplication space savings
Chapter 12
Chapter 13   FlexProtect
             FlexProtect overview
             File striping
             Requested data protection
             FlexProtect data recovery
                 Smartfail
                 Node failures
             Requesting data protection
             Requested protection settings
             Requested protection disk space usage
Chapter 14   NDMP backup
Chapter 15
Chapter 16   Protection domains
Chapter 17   Data-at-rest-encryption
Chapter 18   SmartQuotas
Chapter 19   Storage Pools
Chapter 20   System jobs
Chapter 21   Networking
Chapter 22   Hadoop
Chapter 23   Antivirus
Chapter 24   VMware integration
Chapter 25
CHAPTER 1
Introduction to this guide

Where to go for support:
- Live Chat
- Create a Service Request
- Telephone Support
CHAPTER 2
Isilon scale-out NAS

Node series   Use case
S-Series      IOPS-intensive applications
X-Series
Isilon cluster
An Isilon cluster consists of three or more hardware nodes, up to 144. Each node runs the
Isilon OneFS operating system, the distributed file-system software that unites the nodes
into a cluster. A cluster's storage capacity ranges from a minimum of 18 TB to a maximum
of 15.5 PB.
Cluster administration
OneFS centralizes cluster management through a web administration interface and a
command-line interface. Both interfaces provide methods to activate licenses, check the
status of nodes, configure the cluster, upgrade the system, generate alerts, view client
connections, track performance, and change various settings.
In addition, OneFS simplifies administration by automating maintenance with a job
engine. You can schedule jobs that scan for viruses, inspect disks for errors, reclaim disk
space, and check the integrity of the file system. The engine manages the jobs to
minimize impact on the cluster's performance.
With SNMP versions 2c and 3, you can remotely monitor hardware components, CPU
usage, switches, and network interfaces. EMC Isilon supplies management information
bases (MIBs) and traps for the OneFS operating system.
OneFS also includes a RESTful application programming interface, known as the Platform
API, to automate access, configuration, and monitoring. For example, you can retrieve
performance statistics, provision users, and tap the file system. The Platform API
integrates with OneFS role-based access control to increase security. See the Isilon
Platform API Reference.
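For example, you might query cluster information over the Platform API with any HTTPS
client. The following is a minimal sketch; the port (8080) and the
/platform/1/cluster/config resource path are assumptions to verify against the Isilon
Platform API Reference:

# Retrieve basic cluster configuration through the Platform API (sketch)
curl -k -u admin https://cluster.example.com:8080/platform/1/cluster/config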
Quorum
An Isilon cluster must have a quorum to work properly. A quorum prevents data conflicts,
for example, conflicting versions of the same file, in case two groups of nodes become
unsynchronized. If a cluster loses its quorum for read and write requests, you cannot
access the OneFS file system.
For a quorum, more than half the nodes must be available over the internal network. A
seven-node cluster, for example, requires a four-node quorum. A 10-node cluster requires
a six-node quorum. If a node is unreachable over the internal network, OneFS separates
the node from the cluster, an action referred to as splitting. After a cluster is split, cluster
operations continue as long as enough nodes remain connected to have a quorum.
In a split cluster, the nodes that remain in the cluster are referred to as the majority
group. Nodes that are split from the cluster are referred to as the minority group.
When split nodes can reconnect with the cluster and resynchronize with the other nodes,
the nodes rejoin the cluster's majority group, an action referred to as merging.
A OneFS cluster contains two quorum properties:
- Read quorum (efs.gmp.has_quorum)
- Write quorum (efs.gmp.has_super_block_quorum)
By connecting to a node with SSH and running the sysctl command-line tool as root,
you can view the status of both types of quorum. Here is an example for a cluster that has
a quorum for both read and write operations, as the command's output indicates with a
1, for true:
sysctl efs.gmp.has_quorum
efs.gmp.has_quorum: 1
sysctl efs.gmp.has_super_block_quorum
efs.gmp.has_super_block_quorum: 1
Storage pools
Storage pools segment nodes and files into logical divisions to simplify the management
and storage of data.
A storage pool comprises node pools and tiers. Node pools group equivalent nodes to
protect data and ensure reliability. Tiers combine node pools to optimize storage by
need, such as a frequently used high-speed tier or a rarely accessed archive.
The SmartPools module groups nodes and files into pools. If you do not activate a
SmartPools license, the module provisions node pools and creates one file pool. If you
activate the SmartPools license, you receive more features. You can, for example, create
multiple file pools and govern them with policies. The policies move files, directories, and
file pools among node pools or tiers. You can also define how OneFS handles write
operations when a node pool or tier is full. SmartPools reserves a virtual hot spare to
reprotect data if a drive fails regardless of whether the SmartPools license is activated.
IP address pools
Within a subnet, you can partition a cluster's external network interfaces into pools of IP
address ranges. The pools empower you to customize your storage network to serve
different groups of users. Although you must initially configure the default external IP
subnet in IPv4 format, you can configure additional subnets in IPv4 or IPv6.
You can associate IP address pools with a node, a group of nodes, or NIC ports. For
example, you can set up one subnet for storage nodes and another subnet for accelerator
nodes. Similarly, you can allocate ranges of IP addresses on a subnet to different teams,
such as engineering and sales. Such options help you create a storage topology that
matches the demands of your network.
In addition, network provisioning rules streamline the setup of external connections.
After you configure the rules with network settings, you can apply the settings to new
nodes.
As a standard feature, the OneFS SmartConnect module balances connections among
nodes by using a round-robin policy with static IP addresses and one IP address pool for
each subnet. Activating a SmartConnect Advanced license adds features, such as
defining IP address pools to support multiple DNS zones.
Data-access protocols
With the OneFS operating system, you can access data with multiple file-sharing and
transfer protocols. As a result, Microsoft Windows, UNIX, Linux, and Mac OS X clients can
share the same directories and files.
OneFS supports the following protocols.
SMB
The Server Message Block (SMB) protocol enables Windows users to access the
cluster. OneFS works with SMB 1, SMB 2, and SMB 2.1, as well as SMB 3.0 for
Multichannel only. With SMB 2.1, OneFS supports client opportunity locks (oplocks)
and large (1 MB) MTU sizes. The default file share is /ifs.
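For example, a client with the Samba tools installed might browse the default share. This
is a generic sketch; the hostname is a placeholder, and the share name ifs is an
assumption based on the default /ifs share path:

# Connect to the default SMB share (hostname and share name are assumptions)
smbclient //cluster.example.com/ifs -U username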
NFS
The Network File System (NFS) protocol enables UNIX, Linux, and Mac OS X systems
to remotely mount any subdirectory, including subdirectories created by Windows
users. OneFS works with NFS versions 3 and 4. The default export is /ifs.
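Similarly, a UNIX or Linux client could mount the default export. This is a generic sketch
with a placeholder hostname and mount point:

# Mount the default NFS export (hostname and mount point are placeholders)
mount -t nfs cluster.example.com:/ifs /mnt/isilon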
HDFS
The Hadoop Distributed File System (HDFS) protocol enables a cluster to work with
Apache Hadoop, a framework for data-intensive distributed applications. HDFS
integration requires you to activate a separate license.
FTP
FTP allows systems with an FTP client to connect to the cluster and exchange files.
HTTP
HTTP gives systems browser-based access to resources. OneFS includes limited
support for WebDAV.
Identity management and access control
OneFS also supports a file provider for accounts in /etc/spwd.db and /etc/group
files. With the file provider, you can add an authoritative third-party source of user and
group information.
You can manage users with different identity management systems; OneFS maps the
accounts so that Windows and UNIX identities can coexist. A Windows user account
managed in Active Directory, for example, is mapped to a corresponding UNIX account in
NIS or LDAP.
To control access, an Isilon cluster works with both the access control lists (ACLs) of
Windows systems and the POSIX mode bits of UNIX systems. When OneFS must
transform a file's permissions from ACLs to mode bits or from mode bits to ACLs, OneFS
merges the permissions to maintain consistent security settings.
OneFS presents protocol-specific views of permissions so that NFS exports display mode
bits and SMB shares show ACLs. You can, however, manage not only mode bits but also
ACLs with standard UNIX tools, such as the chmod and chown commands. In addition,
ACL policies enable you to configure how OneFS manages permissions for networks that
mix Windows and UNIX systems.
Access zones
OneFS includes an access zones feature. Access zones allow users from different
authentication providers, such as two untrusted Active Directory domains, to access
different OneFS resources based on an incoming IP address. An access zone can
contain multiple authentication providers and SMB namespaces.
RBAC for administration
OneFS includes role-based access control (RBAC) for administration. In place of a
root or administrator account, RBAC lets you manage administrative access by role.
A role limits privileges to an area of administration. For example, you can create
separate administrator roles for security, auditing, storage, and backup.
It is recommended that you do not save data to the root /ifs path but in directories
below /ifs. Plan your data storage structure carefully. A well-designed directory
structure optimizes cluster performance and cluster administration.
Data layout
OneFS evenly distributes data among a cluster's nodes with layout algorithms that
maximize storage efficiency and performance. The system continuously reallocates data
to conserve space.
OneFS breaks data down into smaller sections called blocks, and then the system places
the blocks in a stripe unit. By referencing either file data or erasure codes, a stripe unit
helps safeguard a file from a hardware failure. The size of a stripe unit depends on the
file size, the number of nodes, and the protection setting. After OneFS divides the data
into stripe units, OneFS allocates, or stripes, the stripe units across nodes in the cluster.
When a client connects to a node, the client's read and write operations take place on
multiple nodes. For example, when a client connects to a node and requests a file, the
node retrieves the data from multiple nodes and rebuilds the file. You can optimize how
OneFS lays out data to match your dominant access pattern: concurrent, streaming, or
random.
Writing files
On a node, the input-output operations of the OneFS software stack split into two
functional layers: a top layer, or initiator, and a bottom layer, or participant. In read and
write operations, the initiator and the participant play different roles.
When a client writes a file to a node, the initiator on the node manages the layout of the
file on the cluster. First, the initiator divides the file into blocks of 8 KB each. Second, the
initiator places the blocks in one or more stripe units. At 128 KB, a stripe unit consists of
16 blocks; a 1 MB file, for example, divides into 128 blocks that fill eight stripe units.
Third, the initiator spreads the stripe units across the cluster until they span a width of
the cluster, creating a stripe. The width of the stripe depends on the number of nodes
and the protection setting.
After dividing a file into stripe units, the initiator writes the data first to non-volatile
random-access memory (NVRAM) and then to disk. NVRAM retains the information when
the power is off.
During the write transaction, NVRAM guards against failed nodes with journaling. If a
node fails mid-transaction, the transaction restarts without the failed node. When the
node returns, it replays the journal from NVRAM to finish the transaction. The node also
runs the AutoBalance job to check the file's on-disk striping. Meanwhile, uncommitted
writes waiting in the cache are protected with mirroring. As a result, OneFS eliminates
multiple points of failure.
Reading files
In a read operation, a node acts as a manager to gather data from the other nodes and
present it to the requesting client.
Because an Isilon cluster's coherent cache spans all the nodes, OneFS can store different
data in each node's RAM. By using the internal InfiniBand network, a node can retrieve
file data from another node's cache faster than from its own local disk. If a read operation
requests data that is cached on any node, OneFS pulls the cached data to serve it
quickly.
In addition, for files with an access pattern of concurrent or streaming, OneFS pre-fetches
in-demand data into a managing node's local cache to further improve sequential-read
performance.
Metadata layout
OneFS protects metadata by spreading it across nodes and drives.
Metadata, which includes information about where a file is stored, how it is protected,
and who can access it, is stored in inodes and protected with locks in a B+ tree, a
standard structure for organizing data blocks in a file system to provide instant lookups.
OneFS replicates file metadata across the cluster so that there is no single point of
failure.
Working together as peers, all the nodes help manage metadata access and locking. If a
node detects an error in metadata, the node looks up the metadata in an alternate
location and then corrects the error.
Striping
In a process known as striping, OneFS segments files into units of data and then
distributes the units across nodes in a cluster. Striping protects your data and improves
cluster performance.
To distribute a file, OneFS reduces it to blocks of data, arranges the blocks into stripe
units, and then allocates the stripe units to nodes over the internal network.
At the same time, OneFS distributes erasure codes that protect the file. The erasure codes
encode the file's data in a distributed set of symbols, adding space-efficient redundancy.
With only a part of the symbol set, OneFS can recover the original file data.
Taken together, the data and its redundancy form a protection group for a region of file
data. OneFS places the protection groups on different drives on different nodes, creating
data stripes.
Because OneFS stripes data across nodes that work together as peers, a user connecting
to any node can take advantage of the entire cluster's performance.
By default, OneFS optimizes striping for concurrent access. If your dominant access
pattern is streaming (that is, lower-concurrency, higher single-stream workloads, such as
video), you can change how OneFS lays out data to increase sequential-read
performance. To better handle streaming access, OneFS stripes data across more drives.
Streaming is most effective on clusters or subpools serving large files.
OneFS includes the following features to help protect data:

Feature                   Description
Antivirus                 OneFS can send files to servers running the Internet Content
                          Adaptation Protocol (ICAP) to scan for viruses and other threats.
Clones                    OneFS enables you to create clones that share blocks with other
                          files to save space.
NDMP backup and restore   OneFS can back up data to tape and other devices through the
                          Network Data Management Protocol. Although OneFS supports
                          both NDMP 3-way and 2-way backup, 2-way backup requires an
                          Isilon Backup Accelerator node.
Protection domains        You can apply protection domains to files and directories to
                          prevent changes.
The following software modules also help protect data, but they require you to activate a
separate license:

Licensed feature   Description
SyncIQ             SyncIQ replicates data on another Isilon cluster and automates failover
                   and failback operations between clusters. If a cluster becomes
                   unusable, you can fail over to another Isilon cluster.
SnapshotIQ         You can protect data with a snapshot, a logical copy of data stored on
                   a cluster.
SmartLock          The SmartLock tool prevents users from modifying and deleting files.
                   You can commit files to a write-once, read-many state: the file can
                   never be modified and cannot be deleted until after a set retention
                   period. SmartLock can help you comply with Securities and Exchange
                   Commission Rule 17a-4.
Data mirroring
You can protect on-disk data with mirroring, which copies data to multiple locations.
OneFS supports two to eight mirrors. You can use mirroring instead of erasure codes, or
you can combine erasure codes with mirroring.
Mirroring, however, consumes more space than erasure codes. Mirroring data three
times, for example, stores three full copies of the data, which requires more space than
erasure codes. Despite the extra space, mirroring suits transactions that require high
performance.
You can also mix erasure codes with mirroring. During a write operation, OneFS divides
data into redundant protection groups. For files protected by erasure codes, a protection
group consists of data blocks and their erasure codes. For mirrored files, a protection
group contains all the mirrors of a set of blocks. OneFS can switch the type of protection
group as it writes a file to disk. By changing the protection group dynamically, OneFS can
continue writing data despite a node failure that prevents the cluster from applying
erasure codes. After the node is restored, OneFS automatically converts the mirrored
protection groups to erasure codes.
VMware integration
OneFS integrates with several VMware products, including vSphere, vCenter, and ESXi.
For example, OneFS works with the VMware vSphere API for Storage Awareness (VASA) so
that you can view information about an Isilon cluster in vSphere. OneFS also works with
the VMware vSphere API for Array Integration (VAAI) to support the following features for
block storage: hardware-assisted locking, full copy, and block zeroing. VAAI for NFS
requires an ESXi plug-in.
With the Isilon for vCenter plug-in, you can back up and restore virtual machines on an
Isilon cluster. With the Isilon Storage Replication Adapter, OneFS integrates with the
VMware vCenter Site Recovery Manager to recover virtual machines that are replicated
between Isilon clusters.
Software modules
You can access advanced features by activating licenses for EMC Isilon software
modules.
SmartLock
SmartLock protects critical data from malicious, accidental, or premature alteration
or deletion to help you comply with SEC 17a-4 regulations. You can automatically
commit data to a tamper-proof state and then retain it with a compliance clock.
SyncIQ automated failover and failback
SyncIQ replicates data on another Isilon cluster and automates failover and failback
between clusters. If a cluster becomes unusable, you can fail over to another Isilon
cluster. Failback restores the original source data after the primary cluster becomes
available again.
File clones
OneFS provides provisioning of full read/write copies of files, LUNs, and other
clones. OneFS also provides virtual machine linked cloning through VMware API
integration.
SnapshotIQ
SnapshotIQ protects data with a snapshot, a logical copy of data stored on a
cluster. A snapshot can be restored to its top-level directory.
SmartPools
SmartPools enable you to create multiple file pools governed by file-pool policies.
The policies move files and directories among node pools or tiers. You can also
define how OneFS handles write operations when a node pool or tier is full.
SmartConnect
If you activate a SmartConnect Advanced license, you can balance policies to evenly
distribute CPU usage, client connections, or throughput. You can also define IP
address pools to support multiple DNS zones in a subnet. In addition, SmartConnect
supports IP failover, also known as NFS failover.
InsightIQ
The InsightIQ virtual appliance monitors and analyzes the performance of your Isilon
cluster to help you optimize storage resources and forecast capacity.
Aspera for Isilon
Aspera moves large files over long distances fast. Aspera for Isilon is a cluster-aware
version of Aspera technology for non-disruptive, wide-area content delivery.
HDFS
OneFS works with the Hadoop Distributed File System protocol to help clients
running Apache Hadoop, a framework for data-intensive distributed applications,
analyze big data.
SmartQuotas
The SmartQuotas module tracks disk usage with reports and enforces storage limits
with alerts.
CHAPTER 3
General cluster administration

User interfaces
OneFS provides several interfaces for managing the EMC Isilon cluster.
These interfaces include the OneFS web administration interface and the OneFS API.
Licensing
Advanced cluster features are available when you activate licenses for OneFS software
modules. Each optional OneFS software module requires you to activate a separate
license.
For more information about the following optional software modules, contact your EMC
Isilon sales representative.
- HDFS
- InsightIQ
- SmartConnect Advanced
- SmartDedupe
- SmartLock
- SmartPools
- SmartQuotas
- SnapshotIQ
- SyncIQ
License status
The status of a OneFS module license indicates whether the functionality provided by the
module is available on the cluster.
Licenses exist in one of the following states:

Status       Description
Inactive     The license has not been activated on the cluster. You cannot access the
             features provided by the corresponding module.
Evaluation   The license has been temporarily activated on the cluster. You can access
             the features provided by the corresponding module for a limited period of
             time. After the license expires, the features become unavailable unless the
             license is reactivated.
Activated    The license has been activated on the cluster. You can access the features
             provided by the corresponding module.
Expired      The evaluation license has expired on the cluster. You can no longer access
             the features provided by the corresponding module. The features remain
             unavailable unless you reactivate the license.
The following table describes what functionality is available for each license depending
on the license's status:

License                Inactive
HDFS                   Clients cannot access the cluster through HDFS.
InsightIQ
Isilon for vCenter     You cannot back up or restore virtual machines that are stored on
                       an Isilon cluster with Isilon for vCenter.
SmartPools
SmartConnect Advanced  Client connections are balanced by using a round-robin policy.
                       IP address allocation is static. Each external network subnet can
                       be assigned only one IP address pool.
SmartDedupe            You cannot deduplicate data with SmartDedupe.
SmartLock
SnapshotIQ
SmartQuotas
SyncIQ
License configuration
You can configure or unconfigure some OneFS module licenses.
You can configure a license by performing specific operations through the corresponding
module. Not all actions that require you to activate a license will configure the license.
Also, not all licenses can be configured. Configuring a license does not add or remove
access to any features provided by a module.
You can unconfigure a license only through the isi license unconfigure
command. You may want to unconfigure a license for a OneFS software module if, for
example, you enabled an evaluation version of a module but later decided not to
purchase a permanent license. Unconfiguring a module license does not deactivate the
license. Unconfiguring a license does not add or remove access to any features provided
by a module.
The following table describes both the actions that cause each license to be configured
and the results of unconfiguring each license:
License       Cause of configuring                    Result of unconfiguring
HDFS                                                  No system impact.
InsightIQ                                             No system impact.
SmartPools    Create a file pool policy (other than   OneFS deletes all file pool policies
              the default file pool policy).          (except the default file pool policy).
SmartConnect
SmartDedupe                                           No system impact.
SmartLock                                             No system impact.
SnapshotIQ
SmartQuotas   Create a quota.                         No system impact.
SyncIQ                                                No system impact.
Activate a license
To access a OneFS module, you must activate a license.
Before you begin
Before you can activate a license, you must obtain a valid license key, and you must have
root user privileges on your cluster. To obtain a license key, contact your EMC Isilon sales
representative.
Procedure
1. Click Help > About This Cluster.
2. In the Licensed Modules section, click Activate license.
3. In the License key field, type the license key for the module that you want to enable.
4. Read the end user license agreement, click I have read and agree, and then click
Submit.
Unconfigure a license
You can unconfigure a licensed module through the command-line interface.
You must have root user privileges on your Isilon cluster to unconfigure a module license.
This procedure is available only through the command-line interface (CLI).
Procedure
1. Open a secure shell (SSH) connection to any node in the cluster.
You must log in as root.
2. Run the isi license unconfigure command.
The following command unconfigures the license for SmartConnect:
isi license unconfigure -m smartconnect
If you do not know the module name, run the isi license command for a list of
OneFS modules and their status.
OneFS returns a confirmation message similar to the following text: The
SmartConnect module has been unconfigured. The license is
unconfigured, and any processes enabled for the module are disabled.
Certificates
You can renew the Secure Sockets Layer (SSL) certificate for the Isilon web administration
interface or replace it with a third-party SSL certificate.
All Platform API communication, which includes communication through the web
administration interface, is over SSL. You can replace or renew the self-signed certificate
with a certificate that you generate. To replace or renew an SSL certificate, you must be
logged in as root.
Procedure
1. Establish an SSH connection to any node in the cluster.
2. At the command prompt, run the following command to create the appropriate
directory.
mkdir /ifs/local/
3. At the command prompt, run the following command to change to the directory.
cd /ifs/local/
4. Generate the new certificate. The option depends on whether you use a third-party
(public or private) CA-issued certificate or renew the existing self-signed certificate.
For example, to generate a new self-signed certificate from the existing server key, run
the openssl command:

openssl req -new \
-days 730 -nodes -x509 -key server.key -out server.crt
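After the command completes, you can inspect the resulting certificate with a standard
openssl invocation (not part of the original procedure):

openssl x509 -in server.crt -noout -text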
Note
The steps to view security details vary by browser. For example, in some browsers, you
can click the padlock icon in the address bar to view the security details for the web
page. Follow the steps that are specific to your browser.
In addition, you should add the appropriate attributes, such as your organization's
identifying information, to be sent with your certificate request.
Cluster identity
You can specify identity attributes for the EMC Isilon cluster.
Cluster name
The cluster name appears on the login page, and it makes the cluster and its nodes
more easily recognizable on your network. Each node in the cluster is identified by
the cluster name plus the node number. For example, the first node in a cluster
named Images may be named Images-1.
Cluster description
The cluster description appears below the cluster name on the login page. The
cluster description is useful if your environment has multiple clusters.
Login message
The login message appears as a separate box on the login page. The login message
can convey cluster information, login instructions, or warnings that a user should
know before logging into the cluster.
Procedure
1. Click Cluster Management > General Settings > Cluster Identity.
2. Optional: In the Cluster Name and Description area, type a name for the cluster in the
Cluster Name field and type a description in the Cluster Description field.
3. Optional: In the Login Message area, type a title in the Message Title field and a
message in the Message Body field.
4. Click Submit.
After you finish
You must add the cluster name to your DNS servers.
Note
If the cluster and Active Directory become out of sync by more than 5 minutes,
authentication will not work.
You can test your configuration by sending a test event notification.
Mode     Description
Manual   Allows you to manually add a node to the cluster without requiring authorization.
Secure   Requires authorization of every node added to the cluster; the node must be
         added through the web administration interface or through the isi devices -a
         add -d <unconfigured_node_serial_no> command in the command-line interface.

Note
If you specify a secure join mode, you cannot join a node to the cluster through the
serial console wizard option [2] Join an existing cluster.

If the cluster character encoding is not set to UTF-8, SMB share names are case-sensitive.

CAUTION
You must restart the cluster to apply character encoding changes.
Results
After the cluster restarts, the web administration interface reflects your change.
To enable access time tracking, click Enabled, and then specify in the Precision
fields how often to update the last-accessed time by typing a numeric value and by
selecting a unit of measure, such as Seconds, Minutes, Hours, Days, Weeks,
Months, or Years.
For example, if you configure a Precision setting of 1 day, the cluster updates the
last-accessed time once each day, even if some files were accessed more often
than once during the day.
3. Click Submit.
Cluster monitoring
You can monitor the health, performance, and status of your EMC Isilon cluster.
Using the OneFS dashboard from the web administration interface, you can monitor the
status and health of the OneFS system. Information is available for individual nodes,
including node-specific network traffic, internal and external network interfaces, and
details about node pools, tiers, and overall cluster health. You can monitor the following
areas of your EMC Isilon cluster health and performance:
Node status
Health and performance statistics for each node in the cluster, including hard disk
drive (HDD) and solid-state drive (SSD) usage.
Client connections
Number of clients connected per node.
New events
List of event notifications generated by system events, including the severity, unique
instance ID, start time, alert message, and scope of the event.
Cluster size
Current view: Used and available HDD and SSD space and space reserved for the
virtual hot spare (VHS). Historical view: Total used space and cluster size for a
one-year period.
- Status: To view details about a node, click the ID number of the node.
- Client connection summary: To view a list of current connections, click Dashboard >
  Cluster Overview > Client Connections Status.
- New events: To view more information about an event, click View details in the
  Actions column.
- Cluster size: To switch between current and historical views, click Historical or
  Current near the Monitoring section heading. In historical view, click Used or
  Cluster size to change the display.
- Cluster throughput (file system): To switch between current and historical views,
  click Historical or Current next to the Monitoring section heading. To view
  throughput statistics for a specific period within the past two weeks, click
  Dashboard > Cluster Overview > Throughput Distribution.
- CPU usage: To switch between current and historical views, click Historical or
  Current near the Monitoring section heading.

Note
You can hide or show a plot by clicking System, User, or Total in the chart legend.
To view maximum usage, next to Show, select Maximum.

- Status: To view network settings for a node interface, subnet, or pool, click the
  link in the Status area.
- Client connections: To view current clients connected to this node, review the list
  in this area.
- Chassis and drive status: To view the state of drives in this node, review this area.
  To view details about a drive, click the name link of the drive; for example, Bay1.
- Node size: To switch between current and historical views, click Historical or
  Current next to the Monitoring area heading. In historical view, click Used or
  Cluster size to change the display accordingly.
- Node throughput (file system): To switch between current and historical views,
  click Historical or Current next to the Monitoring area heading. To view throughput
  statistics for a period within the past two weeks, click Dashboard > Cluster
  Overview > Throughput Distribution.
- CPU usage: To switch between current and historical views, click Historical or
  Current next to the Monitoring area heading.

Note
You can hide or show a plot by clicking System, User, or Total in the chart legend.
To view maximum usage, next to Show, select Maximum.
Chassis and drive states

State                      Interface
HEALTHY                    Command-line interface, web administration interface
SMARTFAIL or Smartfail
or restripe in progress    Command-line interface, web administration interface
NOT AVAILABLE              Command-line interface, web administration interface
SUSPENDED                  Command-line interface, web administration interface
NOT IN USE                 Command-line interface, web administration interface
REPLACE
STALLED
NEW
USED
PREPARING                  Command-line interface only
EMPTY                      Command-line interface only
WRONG_TYPE                 Command-line interface only
BOOT_DRIVE                 Command-line interface only
SED_ERROR                  Command-line interface, web administration interface
ERASE                      Command-line interface only
INSECURE                   Command-line interface only
UNENCRYPTED SED            Web administration interface only
SNMP monitoring
You can use SNMP to remotely monitor the EMC Isilon cluster hardware components,
such as fans, hardware sensors, power supplies, and disks. The default Linux SNMP tools
or a GUI-based SNMP tool of your choice can be used for this purpose.
You can enable SNMP monitoring on individual nodes on your cluster, and you can also
monitor cluster information from any node. Generated SNMP traps are sent to your SNMP
network. You can configure an event notification rule that specifies the network station
where you want to send SNMP traps for specific events, so that when an event occurs, the
cluster sends the trap to that server. OneFS supports SNMP in read-only mode. OneFS
supports SNMP version 2c, which is the default value, and SNMP version 3.
Note
OneFS does not support SNMP v1. Although an option for v1/v2c may be displayed, if you
select the v1/v2c pair, OneFS will only monitor through SNMP v2c.
You can configure settings for SNMP v3 alone or for both SNMP v2c and v3.
Note
If you configure SNMP v3, OneFS requires the SNMP-specific security level of AuthNoPriv
as the default value when querying the cluster. The security level AuthPriv is not
supported.
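For example, an SNMP v3 query at the AuthNoPriv security level might look like the
following sketch, which assumes the default read-only user (general) and MD5
authentication; verify both against your configuration before use:

snmpwalk -m All -v3 -l authNoPriv -u general -a MD5 -A password <node IP> isilon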
Elements in an SNMP hierarchy are arranged in a tree structure, similar to a directory tree.
As with directories, identifiers move from general to specific as the string progresses from
left to right. Unlike a file hierarchy, however, each element is not only named, but also
numbered.
For example, the SNMP entity
.iso.org.dod.internet.private.enterprises.isilon.oneFSss.ssLocalNodeId.0
maps to .1.3.6.1.4.1.12124.3.2.0. The part of the name that refers to the OneFS
SNMP namespace is the 12124 element. Anything further to the right of that number is
related to OneFS-specific monitoring.
Management Information Base (MIB) documents define human-readable names for
managed objects and specify their data types and other properties. You can download
MIBs that are created for SNMP monitoring of an Isilon cluster from the web
administration interface or manage them using the command-line interface. MIBs are
stored in /usr/local/share/snmp/mibs/ on a OneFS node. The OneFS ISILON
MIBs serve two purposes:
- Augment the information available in standard MIBs
- Provide OneFS-specific information that is unavailable in standard MIBs
ISILON-MIB is a registered enterprise MIB. Isilon clusters have two separate MIBs:
ISILON-MIB
Defines a group of SNMP agents that respond to queries from a network monitoring
system (NMS). These agents, called OneFS Statistics Snapshot agents, snapshot the
state of the OneFS file system at the time they receive a request and report this
information back to the NMS.
ISILON-TRAP-MIB
Generates SNMP traps to send to an SNMP monitoring station when the circumstances
defined in the trap protocol data units (PDUs) occur.
The OneFS MIB files map the OneFS-specific object IDs with descriptions. Download or
copy MIB files to a directory where your SNMP tool can find them, such as /usr/share/
snmp/mibs/ or /usr/local/share/snmp/mibs, depending on the tool that you
use.
To enable Net-SNMP tools to read the MIBs to provide automatic name-to-OID mapping,
add -m All to the command, as in the following example:
snmpwalk -v2c -c public -m All <node IP> isilon
If the MIB files are not in the default Net-SNMP MIB directory, you may need to specify the
full path, as in the following example. Note that all three lines are a single command.
snmpwalk -m /usr/local/share/snmp/mibs/ISILON-MIB.txt:/usr/local\
/share/snmp/mibs/ISILON-TRAP-MIB.txt:/usr/local/share/snmp/mibs\
/ONEFS-TRAP-MIB.txt -v2c -C c -c public <node IP> enterprises.onefs
Note
The previous examples are run from the snmpwalk command on a cluster. Your SNMP
version may require different arguments.
The Isilon cluster does not generate SNMP traps unless you configure an event
notification rule to send events.
Procedure
1. Click Cluster Management > General Settings > SNMP Monitoring.
2. In the Service area of the SNMP Monitoring page, enable or disable SNMP monitoring.
a. To disable SNMP monitoring, click Disable, and then click Submit.
b. To enable SNMP monitoring, click Enable, and then continue with the following
steps to configure your settings.
3. In the Downloads area, click Download for the MIB file that you want to download.
Follow the download process that is specific to your browser.
4. Optional: If you are using Internet Explorer as your browser, right-click the Download
link, select Save As from the menu, and save the file to your local drive.
You can save the text in the file format that is specific to your Net-SNMP tool.
5. Copy MIB files to a directory where your SNMP tool can find them, such as /usr/
share/snmp/mibs/ or /usr/local/share/snmp/mibs, depending on the
SNMP tool that you use.
To have Net-SNMP tools read the MIBs to provide automatic name-to-OID mapping,
add -m All to the command, as in the following example: snmpwalk -v2c -c
public -m All <node IP> isilon
6. Navigate back to the SNMP Monitoring page and configure General Settings.
a. In the Settings area, configure protocol access by selecting the version that you
want.
OneFS does not support writable OIDs; therefore, no write-only community string
setting is available.
b. In the System location field, type the system name.
This setting is the value that the node reports when responding to queries. Type a
name that helps to identify the location of the node.
Note
OneFS no longer supports SNMP v1. Although an option for v1/v2c may be displayed,
if you select the v1/v2c pair, OneFS will only monitor through SNMP v2c.
8. Configure SNMP v3 Settings.
a. In the Read-only user field, type the SNMP v3 security name to change the name of
the user with read-only privileges.
The default read-only user is general.
b. In the SNMP v3 password field, type the new password for the read-only user to
set a new SNMP v3 authentication password.
The default password is password. It is recommended that you change the
password to improve security. The password must contain at least eight characters
and no spaces.
c. Type the new password in the Confirm password field to confirm the new
password.
9. Click Submit.
Coalesced events
OneFS coalesces related events into a single group event, and repeated, duplicate
events into a single event.
In the following example, a single connection issue might generate several related
events. Because the events are triggered by a single occurrence, OneFS creates a group
event and combines the related messages under the new group event numbered 24.924.
Instead of seeing four events, you will see a single group event alerting you to storage
transport issues. You can still view all the grouped events individually if you choose.
To view this coalesced event, run the following command:
isi events show 24.924
The system displays the following example output of the coalesced group event:

ID:             24.924
Type:           199990001
Severity:       critical
Value:          0.0
Message:        Disk Errors detected (Bay 1)
Node:           21
Lifetime:       Sun Jun 17 23:29:29 2012 - Now
Quieted:        Not quieted
Specifiers:     disk: 35
                val: 0.0
                devid: 24
                drive_serial: 'XXXXXXXXXXXXX'
                lba: 1953520064L
                lnn: 21
                drive_type: 'HDD'
                device: 'da1'
                bay: 1
                unit: 805306368
Coalesced by:   --
Coalescer Type: Group
Coalesced events:
ID      STARTED     ENDED SEV LNN MESSAGE
24.911  06/17 23:29 --    I   21  Disk stall: Bay 1, Type HDD, LNUM 35. Disk ...
24.912  06/17 23:29 --    I   21  Sector error: da1 block 1953520064
24.913  06/17 23:29 --    I   21  Sector error: da1 block 2202232
24.914  06/17 23:29 --    I   21  Sector error: da1 block 2202120
24.915  06/17 23:29 --    I   21  Sector error: da1 block 2202104
24.916  06/17 23:29 --    I   21  Sector error: da1 block 2202616
24.917  06/17 23:29 --    I   21  Sector error: da1 block 2202168
24.918  06/17 23:29 --    I   21  Sector error: da1 block 2202106
24.919  06/17 23:29 --    I   21  Sector error: da1 block 2202105
24.920  06/17 23:29 --    I   21  Sector error: da1 block 1048670
24.921  06/17 23:29 --    I   21  Sector error: da1 block 223
24.922  06/17 23:29 --    C   21  Disk Repair Initiated: Bay 1, Type HDD, LNUM...
The system displays the following example output of the coalesced duplicate event:

ID:             1.3035
Type:           500010001
Severity:       info
Value:          0.0
Message:        SmartQuotas threshold violation on quota violated, domain direc...
Node:           All
Lifetime:       Thu Jun 14 01:00:00 2012 - Now
Quieted:        Not quieted
Specifiers:     enforcement: 'advisory'
                domain: 'directory /ifs/quotas'
                name: 'violated'
                val: 0.0
                devid: 0
                lnn: 0
Coalesced by:   --
Coalescer Type: Duplicate
Coalesced events:
ID     STARTED     ENDED SEV LNN MESSAGE
18.621 06/14 01:00 --    I   All SmartQuotas threshold violation on quota vio...
18.630 06/15 01:00 --    I   All SmartQuotas threshold violation on quota vio...
18.638 06/16 01:00 --    I   All SmartQuotas threshold violation on quota vio...
18.647 06/17 01:00 --    I   All SmartQuotas threshold violation on quota vio...
18.655 06/18 01:00 --    I   All SmartQuotas threshold violation on quota vio...
The event log uses the following labels:

Label                      Description
COALESCED
CREATOR EV COALID UPDATED  A group was created and the placeholder first event label
                           was updated to include actual group information.
DROPPED                    An event did not include any new information and was not
                           stored in the master event database.
FORWARDED_TO_MASTER
DB: STORED
DB: PURGED
Responding to events
You can view event details and respond to cluster events.
You can view and manage new events, open events, and recently ended events. You can
also view coalesced events and additional, more-detailed information about a specific
event. You also can quiet or cancel events.
Quiet
Acknowledges and removes the event from the list of new events, and adds the event
to a list of quieted events. If a new event of the same event type is triggered, it is a
separate new event and must be quieted.
Unquiet
Returns a quieted event to an unacknowledged state in the list of new events and
removes the event from the list of quieted events.
Cancel
Permanently ends an occurrence of an event. The system cancels an event when
conditions are met that end its duration, which is bounded by a start time and an
end time, or when you cancel the event manually.
Most events are canceled automatically by the system when the event reaches the end of
its duration. The event remains in the system until you manually acknowledge or quiet
the event. You can acknowledge events through either the web administration interface
or the command-line interface.
Respond to an event
You can change the status of an event by quieting, unquieting, or canceling it.
Procedure
1. Click Dashboard > Events > Summary.
2. Perform one of the following actions:
Setting                       Options
Notification batch mode       Batch all, Batch by severity, Batch by category, or No
                              batching.
Custom notification template  No custom notification template is set, or a custom
                              template file is selected.

To select a custom notification template, click Browse, navigate to and select the
template file that you want to use, and then click OK. Alternatively, in the Set custom
notification template field, type the path and file name of the template file that you
want to use.
5. Click Submit.
3. When you have finished viewing the rule details, click Cancel.
Cluster maintenance
Trained service personnel can replace or upgrade components in Isilon nodes.
Isilon Technical Support can assist you with replacing node components or upgrading
components to increase performance.
The following components can be replaced or upgraded:
- battery
- SATA/SAS drive
- memory (DIMM)
- fan
- front panel
- intrusion switch
- IB/NVRAM card
- SAS controller
- power supply
If you configure your cluster to send alerts to Isilon, Isilon Technical Support will contact
you if a component needs to be replaced. If you do not configure your cluster to send
alerts to Isilon, you must initiate a service request.
Components that can be upgraded include drives and memory (DIMMs). If you want to
upgrade components in your nodes, contact Isilon Technical Support.
It is recommended that you contact EMC Isilon Technical Support before updating the
drive firmware.
OneFS automatically updates the drive firmware for new and replacement drives to the
latest revision before those drives are formatted and used in a cluster. This is applicable
only for clusters running OneFS 7.2 and later.
To view the drive firmware status of all the nodes, run the following command:
isi drivefirmware status
To view the drive firmware status of drives on a specific node, run the isi
devices command with the -a fwstatus option. Run the following command
to view the drive firmware status of each drive on node 1:
isi devices -a fwstatus -d 1
Model                 FW        Desired FW
HGST HUS724030ALA640  MF80AAC0
HGST HUS724030ALA640  MF80AAC0
HGST HUS724030ALA640  MF80AAC0
Run the following command to view the drive firmware status on node 1 and disk
12:
isi devices -a fwstatus -d 1:12
Power cycling drives during a firmware update might return unexpected results. As a best
practice, do not restart or power off nodes when the drive firmware is being updated in a
cluster.
Procedure
1. Run the isi devices command with the -a fwupdate option.
The following command updates the drive firmware for all drives on node 1:
isi devices -a fwupdate -d 1
You must run the command once for each node that requires a drive firmware update.
Updating the drive firmware of a single drive takes approximately 15 seconds,
depending on the drive model. OneFS updates drives sequentially.
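Because the command must be run once per node, you can loop over the node numbers
from a shell; a minimal sketch that assumes a three-node cluster:

# Update drive firmware on nodes 1 through 3, one node at a time
for n in 1 2 3; do isi devices -a fwupdate -d $n; done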
The isi devices -a fwstatus command returns output with the following columns:
Model
Displays the name of the drive model.
FW
Displays the version number of the firmware currently running on the drives.
Desired FW
If the drive firmware should be upgraded, displays the version number of the drive
firmware that the firmware should be updated to.
Count
Displays the number of drives of this model that are currently running the specified
drive firmware.
Nodes
Displays the LNNs of nodes that the specified drives exist in.
The following example shows the output of the isi devices command with the -a
fwstatus option:

Node 1
       Model                 FW        Desired FW
Bay 1  HGST HUS724030ALA640  MF80AAC0
Bay 2  HGST HUS724030ALA640  MF80AAC0
Bay 3  HGST HUS724030ALA640  MF80AAC0
Where:
Drive
Displays the number of the bay that the drive is in.
Note
This column is not labeled in the output. The information appears under the node
number.
Model
Displays the name of the drive model.
FW
Displays the version number of the firmware currently running on the drive.
Desired FW
Displays the version number of the drive firmware that the drive should be updated
to. If a drive firmware update is not required, the Desired FW column is empty.
New and replacement drives added to a cluster are formatted regardless of the status of
their firmware revision. You can identify a firmware update failure by viewing the firmware
status for the drives on a specific node. In case of a failure, run the isi devices
command with the fwupdate action on the node to update the firmware manually. For
example, run the following command to manually update the firmware on node 1:
isi devices -a fwupdate -d 1
If you remove a storage node, the Cluster Status area displays smartfail progress. If
you remove a non-storage accelerator node, it is immediately removed from the
cluster.
To shut down or restart a node, select the Shut down or Reboot action, and then click
Submit.
Upgrading OneFS
Two options are available for upgrading the OneFS operating system: a rolling upgrade or
a simultaneous upgrade. Before upgrading OneFS software, a pre-upgrade check must be
performed.
A rolling upgrade individually upgrades and restarts each node in the EMC Isilon cluster
sequentially. During a rolling upgrade, the cluster remains online and continues serving
clients with no interruption in service, although some connection resets may occur on
SMB clients. Rolling upgrades are performed sequentially by node number, so a rolling
upgrade takes longer to complete than a simultaneous upgrade. The final node in the
upgrade process is the node that you used to start the upgrade process.
Note
Rolling upgrades are not available for all clusters. For instructions on how to plan an
upgrade, prepare the cluster for upgrade, and perform an upgrade of the operating
system, see the OneFS Upgrade Planning and Process Guide.
A simultaneous upgrade installs the new operating system and restarts all nodes in the
cluster at the same time. Simultaneous upgrades are faster than rolling upgrades but
require a temporary interruption of service during the upgrade process. Your data is
inaccessible during the time that it takes to complete the upgrade process.
Before beginning either a simultaneous or rolling upgrade, OneFS compares the current
cluster and operating system with the new version to ensure that the cluster meets
certain criteria, such as configuration compatibility (SMB, LDAP, SmartPools), disk
availability, and the absence of critical cluster events. If upgrading puts the cluster at
risk, OneFS warns you, provides information about the risks, and prompts you to confirm
whether to continue the upgrade.
If the cluster does not meet the pre-upgrade criteria, the upgrade does not proceed, and
the unsupported statuses are listed.
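On OneFS 7.x clusters, upgrades are typically started from the command line. The
following is a minimal sketch; verify the isi update command and its options against
the OneFS Upgrade Planning and Process Guide before use:

# Start the upgrade from a node (sketch; verify syntax for your release)
isi update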
Remote support
Isilon Technical Support personnel can remotely manage your Isilon cluster to
troubleshoot an open support case with your permission.
You can enable remote customer service support through SupportIQ or the EMC Secure
Remote Support (ESRS) Gateway.
The SupportIQ scripts are based on the Isilon isi_gather_info log-gathering tool.
The SupportIQ module is included with the OneFS operating system and does not require
you to activate a separate license. You must enable and configure the SupportIQ module
before SupportIQ can run scripts to gather data. The feature may have been enabled
when the cluster was first set up, but you can enable or disable SupportIQ through the
Isilon web administration interface.
In addition to enabling the SupportIQ module to allow the SupportIQ agent to run scripts,
you can enable remote access, which allows Isilon Technical Support personnel to
monitor cluster events and remotely manage your cluster using SSH or the web
administration interface. Remote access helps Isilon Technical Support to quickly identify
and troubleshoot cluster issues. Other diagnostic tools are available for you to use in
conjunction with Isilon Technical Support to gather and upload information such as
packet capture metrics.
Note
If you enable remote access, you must also share cluster login credentials with Isilon
Technical Support personnel. Isilon Technical Support personnel remotely access your
cluster only in the context of an open support case and only after receiving your
permission.
Configuring SupportIQ
OneFS logs contain data that Isilon Technical Support personnel can securely upload,
with your permission, and then analyze to troubleshoot cluster problems. The SupportIQ
technology must be enabled and configured for this process.
When SupportIQ is enabled, Isilon Technical Support personnel can request logs through
scripts that gather cluster data and then upload the data to a secure location. You must
enable and configure the SupportIQ module before SupportIQ can run scripts to gather
data. The feature may have been enabled when the cluster was first set up.
You can also enable remote access, which allows Isilon Technical Support personnel to
troubleshoot your cluster remotely and run additional data-gathering scripts. Remote
access is disabled by default. To enable remote SSH access to your cluster, you must
provide the cluster password to a Technical Support engineer.
Send alerts via SupportIQ agent (HTTPS) and by email (SMTP)
SupportIQ delivers notifications to Isilon through the SupportIQ agent over HTTPS and
by email over SMTP.
Disable SupportIQ
You can disable SupportIQ so the SupportIQ agent does not run scripts to gather and
upload data about your EMC Isilon cluster.
Procedure
1. Click Cluster Management > General Settings > SupportIQ.
2. Clear the Enable SupportIQ check box.
3. Click Submit.
SupportIQ scripts
When SupportIQ is enabled, Isilon Technical Support personnel can request logs with
scripts that gather cluster data and then upload the data. The SupportIQ scripts are
located in the /usr/local/SupportIQ/Scripts/ directory on each node.
The following table lists the data-gathering activities that SupportIQ scripts perform.
These scripts can be run automatically, at the request of an Isilon Technical Support
representative, to collect information about your cluster's configuration settings and
operations. The SupportIQ agent then uploads the information to a secure Isilon FTP site,
so that it is available for Isilon Technical Support personnel to analyze. The SupportIQ
scripts do not affect cluster services or the availability of your data.
Action            Description
Get /ifs/ data    Collects and uploads information about the state and health of the
                  OneFS /ifs/ file system.
Get IB data
Get logs data     Collects and uploads only the most recent cluster log information.
Get messages
Get network data  Collects and uploads information about cluster-wide and node-
                  specific network configuration settings and operations.
                  Warns if the chassis is open and uploads a text file of the event
                  information.

These actions run variations of the isi_gather_info tool, such as isi_gather_info,
isi_gather_info -incremental, and single-node gathers.
The ESRS Gateway also enables support personnel to run the same scripts used by SupportIQ to gather data from your devices.
An important difference between SupportIQ and the ESRS Gateway is that SupportIQ
management is cluster-wide; SupportIQ manages all nodes. The ESRS Gateway manages
nodes individually; you select which nodes should be managed.
You can enable only one remote support system on your Isilon cluster. The EMC products you use and your type of environment determine which system is most appropriate for your Isilon cluster:
- If your environment comprises one or more EMC products that can be monitored, use the ESRS Gateway.
- If your use of ESRS requires the ESRS Client, use SupportIQ. Isilon nodes do not support ESRS Client connectivity.
- If the only EMC products in your environment are Isilon nodes, use SupportIQ.
See the most recent version of the document titled EMC Secure Remote Support Technical
Description for a complete description of EMC Secure Remote Support features and
functionality.
Select which nodes you want managed through the ESRS Gateway with the ESRS
Configuration Tool.
Create rules for remote support connection to Isilon nodes with the ESRS Policy
Manager.
See the most recent version of the document titled EMC Secure Remote Site Planning Guide
for a complete description of ESRS Gateway server requirements, installation, and
configuration.
See the most recent version of the document titled EMC Secure Remote Support Gateway
for Windows Operations Guide for a complete description of the ESRS Configuration Tool.
See the most recent version of the document titled EMC Secure Remote Support Policy Manager Operations Guide for a complete description of the ESRS Policy Manager.
Additional documentation on ESRS can be found on the EMC Online Support site.
The following command enables ESRS Gateway support and specifies a primary gateway, a remote support subnet, and that SMTP failover should be used:

isi remotesupport connectemc modify --enabled yes \
--primary-esrs-gateway gw-serv-esrs1 --use-smtp-failover yes \
--remote-support-subnet subnet0

The resulting connection settings resemble the following:

Enabled: yes
Primary ESRS gateway: gw-serv-esrs1
Secondary ESRS gateway: gw-serv-esrs2
Use SMTP failover: yes
Email customer on failure: no
Remote support subnet: subnet0
CHAPTER 4
Access zones
The following rules apply to an access zone's base directory:
- The base directory cannot be identical to the base directory of any other access zone, except the System zone. For example, you cannot assign /ifs/data/hr to both the zone2 and zone3 access zones.
- The base directory cannot overlap with the file system tree of a base directory in any other access zone, except the System zone. For example, if /ifs/data/hr is assigned to zone2, you cannot assign /ifs/data/hr/personnel to zone3.
- The base directory of the default System access zone is /ifs and cannot be modified.

Note
Assigning a base directory that is identical to or overlaps with the System zone is allowed, but it is recommended only as a temporary measure while you modify the base directory path and migrate data to the new directory.
OneFS does not support one DNS server per access zone. It is recommended that all access zones point to a single DNS server.

The following access zone limits apply:

Entity                        Limit
Access zones                  20
LDAP providers                20
NIS providers                 20
Local providers               20
File providers                20
Kerberos providers            20
SMB shares per access zone    A cluster cannot exceed a total of 30,000 SMB shares across all the access zones.
Note
- SSH
- HTTP
- FTP
- WebHDFS
- RAN API
Quality of service
You can set upper bounds on quality of service by assigning specific physical resources
to each access zone.
Quality of service addresses physical hardware performance characteristics that can be
measured, improved, and sometimes guaranteed. Characteristics measured for quality of
service include but are not limited to throughput rates, CPU usage, and disk capacity.
When you share physical hardware in an EMC Isilon cluster across multiple virtual instances, competition exists for the following services:
- CPU
- Memory
- Network bandwidth
- Disk I/O
- Disk capacity
Access zones do not provide logical quality of service guarantees to these resources, but you can partition these resources between access zones on a single cluster. The following table describes a few ways to partition resources to improve quality of service:
Use            Notes

NICs           You can assign specific NICs on specific nodes to an IP address pool that is
               associated with an access zone. By assigning these NICs, you can determine the
               nodes and interfaces that are associated with an access zone. This enables the
               separation of CPU, memory, and network bandwidth.

SmartPools

SmartQuotas    Through SmartQuotas, you can limit disk capacity by a user or a group or in a
               directory. By applying a quota to an access zone's base directory, you can limit
               disk capacity used in that access zone.
CHAPTER 5
Authentication and access control
In most situations, the default settings are sufficient. You can configure additional access
zones, custom roles, and permissions policies as necessary for your particular
environment.
Role-based access
You can assign role-based access to delegate administrative tasks to selected users.
Role-based access control (RBAC) allows the right to perform particular administrative
actions to be granted to any user who can authenticate to a cluster. Roles are created by
a Security Administrator, assigned privileges, and then assigned members. All
administrators, including those given privileges by a role, must connect to the System
zone to configure the cluster. When these members log in to the cluster through a
configuration interface, they have these privileges. All administrators can configure
settings for access zones, and they always have control over all access zones on the
cluster.
Roles also give you the ability to assign privileges to member users and groups. By
default, only the root user and the admin user can log in to the web administration
interface through HTTP or the command-line interface through SSH. Using roles, the root
and admin users can assign others to built-in or customer roles that have login and
administrative privileges to perform specific administrative tasks.
Note
As a best practice, assign users to roles that contain the minimum set of necessary
privileges. For most purposes, the default permission policy settings, system access
zone, and built-in roles are sufficient. You can create role-based access management
policies as necessary for your particular environment.
Roles
You can permit and limit access to administrative areas of your EMC Isilon cluster on a
per-user basis through roles.
OneFS includes built-in administrator roles with predefined sets of privileges that cannot
be modified. The following list describes what you can and cannot do through roles:
- You can add any user to a role as long as the user can authenticate to the cluster.
- You can create custom roles and assign privileges to those roles.
- You can add a group to a role, which grants to all users who are members of that group all of the privileges associated with the role.
Note
When OneFS is first installed, only users with root- or admin-level access can log in and assign users to roles.
Built-in roles
Built-in roles include privileges to perform a set of administrative functions.
The following tables describe each of the built-in roles from most powerful to least
powerful. The tables include the privileges and read/write access levels, if applicable,
that are assigned to each role. You can assign users and groups to built-in roles and to
roles that you create.
Table 1 SecurityAdmin role

Privileges                 Read/write access
ISI_PRIV_LOGIN_CONSOLE     N/A
ISI_PRIV_LOGIN_PAPI        N/A
ISI_PRIV_LOGIN_SSH         N/A
ISI_PRIV_AUTH              Read/write
ISI_PRIV_ROLE              Read/write

Table 2 SystemAdmin role

Privileges                 Read/write access
ISI_PRIV_LOGIN_CONSOLE     N/A
ISI_PRIV_LOGIN_PAPI        N/A
ISI_PRIV_LOGIN_SSH         N/A
ISI_PRIV_SYS_SHUTDOWN      N/A
ISI_PRIV_SYS_SUPPORT       N/A
ISI_PRIV_SYS_TIME          N/A
ISI_PRIV_ANTIVIRUS         Read/write
ISI_PRIV_AUDIT             Read/write
ISI_PRIV_CLUSTER           Read/write
ISI_PRIV_DEVICES           Read/write
ISI_PRIV_EVENT             Read/write
ISI_PRIV_FTP               Read/write
ISI_PRIV_HDFS              Read/write
ISI_PRIV_HTTP              Read/write
ISI_PRIV_ISCSI             Read/write
ISI_PRIV_JOB_ENGINE        Read/write
ISI_PRIV_LICENSE           Read/write
ISI_PRIV_NDMP              Read/write
ISI_PRIV_NETWORK           Read/write
ISI_PRIV_NFS               Read/write
ISI_PRIV_NTP               Read/write
ISI_PRIV_QUOTA             Read/write
ISI_PRIV_REMOTE_SUPPORT    Read/write
ISI_PRIV_SMARTPOOLS        Read/write
ISI_PRIV_SMB               Read/write
ISI_PRIV_SNAPSHOT          Read/write
ISI_PRIV_STATISTICS        Read/write
ISI_PRIV_SYNCIQ            Read/write
ISI_PRIV_VCENTER           Read/write
ISI_PRIV_WORM              Read/write
ISI_PRIV_NS_TRAVERSE       N/A
ISI_PRIV_NS_IFS_ACCESS     N/A

Table 3 AuditAdmin role

Privileges                 Read/write access
ISI_PRIV_LOGIN_CONSOLE     N/A
ISI_PRIV_LOGIN_PAPI        N/A
ISI_PRIV_LOGIN_SSH         N/A
ISI_PRIV_ANTIVIRUS         Read-only
ISI_PRIV_AUDIT             Read-only
ISI_PRIV_CLUSTER           Read-only
ISI_PRIV_DEVICES           Read-only
ISI_PRIV_EVENT             Read-only
ISI_PRIV_FTP               Read-only
ISI_PRIV_HDFS              Read-only
ISI_PRIV_HTTP              Read-only
ISI_PRIV_ISCSI             Read-only
ISI_PRIV_JOB_ENGINE        Read-only
ISI_PRIV_LICENSE           Read-only
ISI_PRIV_NDMP              Read-only
ISI_PRIV_NETWORK           Read-only
ISI_PRIV_NFS               Read-only
ISI_PRIV_NTP               Read-only
ISI_PRIV_QUOTA             Read-only
ISI_PRIV_REMOTE_SUPPORT    Read-only
ISI_PRIV_SMARTPOOLS        Read-only
ISI_PRIV_SMB               Read-only
ISI_PRIV_SNAPSHOT          Read-only
ISI_PRIV_STATISTICS        Read-only
ISI_PRIV_SYNCIQ            Read-only
ISI_PRIV_VCENTER           Read-only
ISI_PRIV_WORM              Read-only

Table 4 VMwareAdmin role

Privileges                 Read/write access
ISI_PRIV_LOGIN_PAPI        N/A
ISI_PRIV_ISCSI             Read/write
ISI_PRIV_NETWORK           Read/write
ISI_PRIV_SMARTPOOLS        Read/write
ISI_PRIV_SNAPSHOT          Read/write
ISI_PRIV_SYNCIQ            Read/write
ISI_PRIV_VCENTER           Read/write
ISI_PRIV_NS_TRAVERSE       N/A
ISI_PRIV_NS_IFS_ACCESS     N/A

Table 5 BackupAdmin role

Privileges                 Read/write access
ISI_PRIV_IFS_BACKUP        Read-only
ISI_PRIV_IFS_RESTORE       Read-only
Custom roles
Custom roles supplement built-in roles.
You can create custom roles and assign privileges mapped to administrative areas in your
EMC Isilon cluster environment. For example, you can create separate administrator roles
for security, auditing, storage provisioning, and backup.
You can designate certain privileges as read-only or read/write when adding the privilege
to a role. You can modify this option at any time.
You can add or remove privileges as user responsibilities grow and change.
Privileges
Privileges permit users to complete tasks on an EMC Isilon cluster.
Privileges are associated with an area of cluster administration such as Job Engine, SMB,
or statistics.
Privileges have one of two forms:
Action
Allows a user to perform a specific action on a cluster. For example, the
ISI_PRIV_LOGIN_SSH privilege allows a user to log in to a cluster through an SSH
client.
Read/Write
Allows a user to view or modify a configuration subsystem such as statistics,
snapshots, or quotas. For example, the ISI_PRIV_SNAPSHOT privilege allows an
administrator to create and delete snapshots and snapshot schedules. A read/write
privilege can grant either read-only or read/write access. Read-only access allows a
user to view configuration settings; read/write access allows a user to view and
modify configuration settings.
Privileges are granted to the user on login to a cluster through the OneFS API, the web
administration interface, SSH, or a console session. A token is generated for the user,
which includes a list of all privileges granted to the user. Each URI, web-administration
interface page, and command requires a specific privilege to view or modify the
information available through any of these interfaces.
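Because the token carries the full list of granted privileges, you can verify what a user has been granted from the command line. The following sketch uses the isi auth mapping token command that is described later in this chapter; the user name jsmith is a hypothetical placeholder:

isi auth mapping token jsmith

The output includes the user's identities as well as the privileges granted through role membership.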
Note
Privileges are not granted to users that do not connect to the System zone during login, or to users that connect through the deprecated Telnet service, even if they are members of a role.
OneFS privileges
Privileges in OneFS are assigned through role membership; privileges cannot be assigned
directly to users and groups.
Table 6 Login privileges

OneFS privilege            User right                                    Privilege type
ISI_PRIV_LOGIN_CONSOLE                                                   Action
ISI_PRIV_LOGIN_PAPI                                                      Action
ISI_PRIV_LOGIN_SSH                                                       Action

Table 7 System privileges

OneFS privilege            User right                                    Privilege type
ISI_PRIV_SYS_SHUTDOWN                                                    Action
ISI_PRIV_SYS_SUPPORT                                                     Action
ISI_PRIV_SYS_TIME                                                        Action

Table 8 Security privileges

OneFS privilege            User right                                    Privilege type
ISI_PRIV_AUTH              Configure external authentication providers   Read/write
ISI_PRIV_ROLE                                                            Read/write

Table 9 Configuration privileges

OneFS privilege            User right                                    Privilege type
ISI_PRIV_ANTIVIRUS         Configure antivirus scanning                  Read/write
ISI_PRIV_AUDIT             Configure audit capabilities                  Read/write
ISI_PRIV_CLUSTER                                                         Read/write
ISI_PRIV_DEVICES                                                         Read/write
ISI_PRIV_EVENT                                                           Read/write
ISI_PRIV_FTP                                                             Read/write
ISI_PRIV_HDFS                                                            Read/write
ISI_PRIV_HTTP                                                            Read/write
ISI_PRIV_ISCSI                                                           Read/write
ISI_PRIV_JOB_ENGINE        Schedule cluster-wide jobs                    Read/write
ISI_PRIV_LICENSE                                                         Read/write
ISI_PRIV_NDMP                                                            Read/write
ISI_PRIV_NETWORK           Configure network interfaces                  Read/write
ISI_PRIV_NFS                                                             Read/write
ISI_PRIV_NTP               Configure NTP                                 Read/write
ISI_PRIV_QUOTA                                                           Read/write
ISI_PRIV_REMOTE_SUPPORT                                                  Read/write
ISI_PRIV_SMARTPOOLS                                                      Read/write
ISI_PRIV_SMB                                                             Read/write
ISI_PRIV_SNAPSHOT                                                        Read/write
ISI_PRIV_SNMP                                                            Read/write
ISI_PRIV_STATISTICS                                                      Read/write
ISI_PRIV_SYNCIQ            Configure SyncIQ                              Read/write
ISI_PRIV_VCENTER                                                         Read/write
ISI_PRIV_WORM              Configure SmartLock directories               Read/write

Table 10 Event, license, and statistics privileges

OneFS privilege            User right                                    Privilege type
ISI_PRIV_EVENT                                                           Read/write
ISI_PRIV_LICENSE                                                         Read/write
ISI_PRIV_STATISTICS                                                      Read/write

Table 11 Data backup and restore privileges

OneFS privilege            User right                                    Privilege type
ISI_PRIV_IFS_BACKUP                                                      Action
ISI_PRIV_IFS_RESTORE                                                     Action

Note
If you are on the sudoers list because you are a member of a role that has the ISI_PRIV_EVENT privilege, the following command succeeds:
sudo isi alert list

The following tables list all available OneFS commands, the associated privilege or root-access requirement, and whether sudo is required to run the command.
Note
If you are running in compliance mode, additional sudo commands are available.
Table 12 Privileges sorted by CLI command

isi command          Privilege                                    Requires sudo
isi alert            ISI_PRIV_EVENT
isi audit            ISI_PRIV_AUDIT
isi auth             ISI_PRIV_AUTH, ISI_PRIV_ROLE
isi avscan           ISI_PRIV_ANTIVIRUS
isi batterystatus    ISI_PRIV_STATISTICS
isi config           root
isi dedupe           ISI_PRIV_JOB_ENGINE, ISI_PRIV_STATISTICS
isi devices          ISI_PRIV_DEVICES
isi drivefirmware    root
isi domain           root
isi email            ISI_PRIV_CLUSTER
isi events           ISI_PRIV_EVENT
isi exttools         root
isi fc               root
isi filepool         ISI_PRIV_SMARTPOOLS
isi firmware         root
isi ftp              ISI_PRIV_FTP
isi get              root
isi hdfs             ISI_PRIV_HDFS
isi iscsi            ISI_PRIV_ISCSI
isi job              ISI_PRIV_JOB_ENGINE
isi license          ISI_PRIV_LICENSE
isi lun              ISI_PRIV_ISCSI
isi ndmp             ISI_PRIV_NDMP
isi networks         ISI_PRIV_NETWORK
isi nfs              ISI_PRIV_NFS
isi perfstat         ISI_PRIV_STATISTICS
isi pkg              root
isi quota            ISI_PRIV_QUOTA
isi readonly         root
isi remotesupport    ISI_PRIV_REMOTE_SUPPORT
isi servicelight     ISI_PRIV_DEVICES
isi services         root
isi set              root
isi smb              ISI_PRIV_SMB
isi snapshot         ISI_PRIV_SNAPSHOT
isi snmp             ISI_PRIV_SNMP
isi stat             ISI_PRIV_STATISTICS
isi statistics       ISI_PRIV_STATISTICS
isi status           ISI_PRIV_STATISTICS
isi storagepool      ISI_PRIV_SMARTPOOLS
isi sync             ISI_PRIV_SYNCIQ
isi tape             ISI_PRIV_NDMP
isi target           ISI_PRIV_ISCSI
isi update           root
isi version          ISI_PRIV_CLUSTER
isi worm             ISI_PRIV_WORM
isi zone             ISI_PRIV_AUTH
Table 13 Privileges sorted by privilege

Privilege                   isi commands                                 Requires sudo
ISI_PRIV_ANTIVIRUS          isi avscan
ISI_PRIV_AUDIT              isi audit
ISI_PRIV_AUTH               isi auth, isi zone
ISI_PRIV_IFS_BACKUP         N/A
ISI_PRIV_CLUSTER            isi email, isi version
ISI_PRIV_DEVICES            isi devices, isi servicelight
ISI_PRIV_EVENT              isi alert, isi events
ISI_PRIV_FTP                isi ftp
ISI_PRIV_HDFS               isi hdfs
ISI_PRIV_ISCSI              isi iscsi, isi lun, isi target
ISI_PRIV_JOB_ENGINE         isi job
ISI_PRIV_LICENSE            isi license
ISI_PRIV_NDMP               isi ndmp, isi tape
ISI_PRIV_NETWORK            isi networks
ISI_PRIV_NFS                isi nfs
ISI_PRIV_QUOTA              isi quota
ISI_PRIV_ROLE               isi auth
ISI_PRIV_REMOTE_SUPPORT     isi remotesupport
ISI_PRIV_IFS_RESTORE        N/A
ISI_PRIV_SMARTPOOLS         isi filepool, isi storagepool
ISI_PRIV_SMB                isi smb
ISI_PRIV_SNAPSHOT           isi snapshot
ISI_PRIV_SNMP               isi snmp
ISI_PRIV_STATISTICS         isi batterystatus, isi perfstat, isi stat, isi statistics, isi status
ISI_PRIV_SYNCIQ             isi sync
ISI_PRIV_WORM               isi worm
root                        isi config, isi domain, isi drivefirmware, isi exttools, isi fc, isi firmware, isi get, isi pkg, isi readonly, isi services, isi set, isi update
These privileges circumvent traditional file access checks, such as mode bits or NTFS ACLs.
Most cluster privileges allow changes to cluster configuration in some manner. The backup and restore privileges allow access to cluster data from the System zone, the traversing of all directories, and reading of all file data and metadata regardless of file permissions.
Users assigned these privileges use the protocol as a backup protocol to another machine without generating access-denied errors and without connecting as the root user. These two privileges are supported over the following client-side protocols:
- SMB
- RAN API
- FTP
- SSH
Authentication
OneFS supports local and remote authentication providers to verify that users attempting
to access an EMC Isilon cluster are who they claim to be. Anonymous access, which does
not require authentication, is supported for protocols that allow it.
OneFS supports multiple concurrent authentication provider types, which are analogous to directory services. For example, OneFS is often configured to authenticate Windows clients with Active Directory and to authenticate UNIX clients with LDAP. You can also configure NIS, designed by Sun Microsystems, to authenticate users and groups when they access a cluster.
OneFS supports the following authentication provider types: Active Directory, LDAP, NIS, Local, File, and MIT Kerberos. Each provider type supports a different combination of capabilities, such as authentication and netgroups.
Kerberos authentication
Kerberos is a network authentication provider that negotiates encryption tickets for
securing a connection. OneFS supports Active Directory Kerberos and MIT Kerberos
authentication providers on an EMC Isilon cluster. If you configure an Active Directory
provider, Kerberos authentication is provided automatically. MIT Kerberos works
independently of Active Directory.
For MIT Kerberos authentication, you define an administrative domain known as a realm.
Within this realm, an authentication server has the authority to authenticate a user, host,
or service. You can optionally define a Kerberos domain to allow additional domain
extensions to be associated with a realm.
The authentication server in a Kerberos environment is called the Key Distribution Center
(KDC) and distributes encrypted tickets. When a user authenticates with an MIT Kerberos
provider within a realm, an encrypted ticket with the user's service principal name (SPN)
is created and validated to securely pass the user's identification for the requested
service.
You can include an MIT Kerberos provider in specific access zones for authentication.
Each access zone may include at most one MIT Kerberos provider. You can discontinue
authentication through an MIT Kerberos provider by removing the provider from all the
referenced access zones.
SPNs must match the SmartConnect zone name and the FQDN hostname of the cluster. If
the SmartConnect zone settings are changed, you must update the SPNs on the cluster to
match the changes.
LDAP
The Lightweight Directory Access Protocol (LDAP) is a networking protocol that enables
you to define, query, and modify directory services and resources.
OneFS can authenticate users and groups against an LDAP repository in order to grant
them access to the cluster. OneFS supports Kerberos authentication for an LDAP provider.
The LDAP service supports the following features:
- Users, groups, and netgroups
- Configurable LDAP schemas. For example, the ldapsam schema allows NTLM authentication over the SMB protocol for users with Windows-like attributes.
- Simple bind authentication, with and without SSL
- Redundancy and load balancing across servers with identical directory data
- Multiple LDAP provider instances for accessing servers with different user data
- Encrypted passwords
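Although the procedures in this guide use the web administration interface, an LDAP provider can also be created from the OneFS command line. The following sketch assumes the isi auth ldap create command; the provider name, server URI, and base distinguished name are hypothetical values that you would replace for your environment:

isi auth ldap create MyLDAP \
    --server-uris=ldap://ldap.example.com \
    --base-dn="dc=example,dc=com"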
Active Directory
The Active Directory directory service is a Microsoft implementation of Lightweight
Directory Access Protocol (LDAP), Kerberos, and DNS technologies that can store
information about network resources. Active Directory can serve many functions, but the
primary reason for joining the cluster to an Active Directory domain is to perform user and
group authentication.
When the cluster joins an Active Directory domain, a single Active Directory machine
account is created. The machine account establishes a trust relationship with the domain
and enables the cluster to authenticate and authorize users in the Active Directory forest.
By default, the machine account is named the same as the cluster. If the cluster name is
more than 15 characters long, the name is hashed and displayed after joining the
domain.
Note
Configure a single Active Directory instance if all domains have a trust relationship. Configure multiple Active Directory instances only to grant access to multiple sets of mutually untrusted domains.
NIS
The Network Information Service (NIS) provides authentication and identity uniformity
across local area networks. OneFS includes an NIS authentication provider that enables
you to integrate the cluster with your NIS infrastructure.
NIS, designed by Sun Microsystems, can authenticate users and groups when they
access the cluster. The NIS provider exposes the passwd, group, and netgroup maps from
an NIS server. Hostname lookups are also supported. You can specify multiple servers for
redundancy and load balancing.
File provider
A file provider enables you to supply an authoritative third-party source of user and group information to an EMC Isilon cluster. A third-party source is useful in UNIX and Linux environments that synchronize /etc/passwd, /etc/group, and /etc/netgroup files across multiple servers.
Standard BSD /etc/spwd.db and /etc/group database files serve as the file
provider backing store on a cluster. You generate the spwd.db file by running the
pwd_mkdb command in the OneFS command-line interface (CLI). You can script updates
to the database files.
On a cluster, a file provider hashes passwords with libcrypt. The Modular Crypt Format is parsed to determine the hashing algorithm. The following algorithms are supported:
- MD5
- Blowfish
- NT-Hash
- SHA-256
- SHA-512
Note
The built-in System file provider includes services to list, manage, and authenticate
against system accounts such as root, admin, and nobody. It is recommended that you
do not modify the System file provider.
Local provider
The local provider provides authentication and lookup facilities for user accounts added
by an administrator.
Local authentication is useful when Active Directory, LDAP, or NIS directory services are
not configured or when a specific user or application needs access to the cluster. Local
groups can include built-in groups and Active Directory groups as members.
In addition to configuring network-based authentication sources, you can manage local users and groups by configuring a local password policy for each node in the cluster. OneFS settings specify password complexity, password age and re-use, and password-attempt lockout policies.
Authorization
OneFS supports two types of authorization data on a file: Windows-style access control
lists (ACLs) and POSIX mode bits (UNIX permissions). Authorization type is based on the
ACL policies that are set and on the file-creation method.
Access to a file or directory is governed by either a Windows access control list (ACL) or
UNIX mode bits. Regardless of the security model, OneFS enforces access rights
consistently across access protocols. A user is granted or denied the same rights to a file
when using SMB for Windows file sharing as when using NFS for UNIX file sharing.
An EMC Isilon cluster includes global policy settings that enable you to customize the
default ACL and UNIX permissions to best support your environment. Generally, files that
are created over SMB or in a directory that has an ACL receive an ACL; otherwise, OneFS
relies on the POSIX mode bits that define UNIX permissions. In either case, the owner is
represented by a UNIX identifier (UID or GID) or by its Windows identifier (SID). The
primary group is represented by a GID or SID. Although mode bits are present when a file
has an ACL, the mode bits are provided for only protocol compatibility, not for access
checks.
Note
Although you can configure ACL policies to optimize a cluster for UNIX or Windows, you
should do so only if you understand how ACL and UNIX permissions interact.
The OneFS file system installs with UNIX permissions as the default. By using Windows
Explorer or OneFS administrative tools, you can give a file or directory an ACL. In addition
to Windows domain users and groups, ACLs in OneFS can include local, NIS, and LDAP
users and groups. After you give a file an ACL, OneFS stops enforcing the file's mode bits,
which remain only as an estimate of the effective permissions.
SMB
You can configure SMB shares to provide Windows clients network access to file system
resources on the cluster. You can grant permissions to users and groups to carry out
operations such as reading, writing, and setting access permissions on SMB shares.
ACLs
In Windows environments, file and directory permissions, referred to as access rights, are
defined in access control lists (ACLs). Although ACLs are more complex than mode bits,
ACLs can express much more granular sets of access rules. OneFS checks the ACL
processing rules commonly associated with Windows ACLs.
A Windows ACL contains zero or more access control entries (ACEs), each of which
represents the security identifier (SID) of a user or a group as a trustee. In OneFS, an ACL
can contain ACEs with a UID, GID, or SID as the trustee. Each ACE contains a set of rights
that allow or deny access to a file or folder. An ACE can optionally contain an inheritance
flag to specify whether the ACE should be inherited by child folders and files.
Note
Instead of the standard three permissions available for mode bits, ACLs have 32 bits of
fine-grained access rights. Of these, the upper 16 bits are general and apply to all object
types. The lower 16 bits vary between files and directories but are defined in a way that
allows most applications to apply the same bits for files and directories.
Rights grant or deny access for a given trustee. You can block user access explicitly
through a deny ACE or implicitly by ensuring that a user does not directly, or indirectly
through a group, appear in an ACE that grants the right.
NFS
You can configure NFS exports to provide UNIX clients network access to file system
resources on the cluster.
UNIX permissions
In a UNIX environment, file and directory access is controlled by POSIX mode bits, which
grant read, write, or execute permissions to the owning user, the owning group, and
everyone else.
OneFS supports the standard UNIX tools for viewing and changing permissions: ls, chmod, and chown. For more information, run the man ls, man chmod, and man chown commands.
All files contain 16 permission bits, which provide information about the file or directory type and the permissions. The lower 9 bits are grouped as three 3-bit sets, called triples, which contain the read, write, and execute (rwx) permissions for each class of users: owner, group, and other. You can set permissions flags to grant permissions to each of these classes.
Unless the user is root, OneFS checks the class to determine whether to grant or deny access to the file. The classes are not cumulative: The first class matched is applied. It is therefore common to grant permissions in decreasing order.
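For example, the following standard commands show the three triples in action; the file name example.txt and the listing output are hypothetical:

chmod 754 example.txt
ls -l example.txt
-rwxr-xr--  1 jdoe  staff  0 Jan  1 12:00 example.txt

Here the owner class is granted rwx (7), the group class r-x (5), and the other class r-- (4); because the first class matched is applied, a user who matches the group class receives only r-x even though the owner bits are more permissive.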
Mixed-permission environments
When a file operation requests an objects authorization data, for example, with the ls l command over NFS or with the Security tab of the Properties dialog box in Windows
Explorer over SMB, OneFS attempts to provide that data in the requested format. In an
environment that mixes UNIX and Windows systems, some translation may be required
when performing create file, set security, get security, or access operations.
SID-to-UID and SID-to-GID mappings are cached in both the OneFS ID mapper and the
stat cache. If a mapping has recently changed, the file might report inaccurate
information until the file is updated or the cache is flushed.
contains the corresponding rights that are denied. In both cases, the trustee of the
ACE corresponds to the file owner, group, or everyone. After all of the ACEs are
generated, any that are not needed are removed before the synthetic ACL is returned.
Managing roles
You can view, add, or remove members of any role. Except for built-in roles, whose
privileges you cannot modify, you can add or remove OneFS privileges on a role-by-role
basis.
Note
Roles take both users and groups as members. If a group is added to a role, all users who
are members of that group are assigned the privileges associated with the role. Similarly,
members of multiple roles are assigned the combined privileges of each role.
View a role
You can view information about built-in and custom roles.
Procedure
1. Click Access > Membership & Roles > Roles.
2. In the Roles area, select a role and click View / Edit.
3. In the View Role Details dialog box, view information about the role.
4. Click Close to return to the Membership & Roles page.
View privileges
You can view user privileges.
This procedure must be performed through the command-line interface (CLI). You can
view a list of your privileges or the privileges of another user using the following
commands:
Procedure
1. Establish an SSH connection to any node in the cluster.
2. To view privileges, run one of the following commands.
- To view a list of privileges for another user, run the following command, where <user> is a placeholder for another user by name:
isi auth mapping token <user>
Modify a role
You can modify the description and the user or group membership of any role, including
built-in roles. However, you can modify the name and privileges only for custom roles.
Procedure
1. Click Access > Membership & Roles > Roles.
2. In the Roles area, select a role and click View / Edit.
The View Role Details dialog box appears.
3. Click Edit Role and modify the settings as needed in the Edit Role Details dialog box.
4. Click Save Changes to return to the View Role Details dialog box.
5. Click Close.
Copy a role
You can copy an existing role and add or remove privileges and members for that role as
needed.
Procedure
1. Click Access > Membership & Roles > Roles.
2. In the Roles area, select a role and click More > Copy.
3. Modify the role name, description, members, and privileges as needed.
4. Click Copy Role.
You can search for one of the following object types:
- Users
- Groups
- Well-known SIDs
3. If you selected User or Group, locate the user or group through one of the following methods:
- Type the Username or Group Name you want to search for in the text field.
- Select the authentication provider you want to search for from the Provider list. Only providers that are currently configured and enabled on the cluster are listed.
4. Click Search.
5. Select a user name, group name, or a well-known SID from the search results to add as members to the role.
6. Click Select.
To delete a single role, click More > Delete from the Actions column against the selected role.
To delete multiple roles, select Delete Selection from the Select a bulk action list.
If you do not specify a port, the default port is used. The default port for non-secure
LDAP (ldap://) is 389; for secure LDAP (ldaps://), it is 636. If you specify non-secure
LDAP, the bind password is transmitted to the server in clear text.
5. Optional: Configure the following settings as needed.
Load balance servers
Select the check box to connect to a random server, or clear the check box to
connect according to the order in which the servers are listed in the Servers field.
Base Distinguished Name
Type the distinguished name (DN) of the entry at which to start LDAP searches.
Base DNs can include cn (Common Name), l (Locality), dc (Domain Component),
ou (Organizational Unit), or other components. For example, dc=emc,dc=com is
a base DN for emc.com.
Bind to
Type the distinguished name of the entry at which to bind to the LDAP server.
Password
Specify the password to use when binding to the LDAP server. Use of this
password does not require a secure connection; if the connection is not using
Transport Layer Security (TLS), the password is sent in clear text.
6. Optional: To modify the default settings for user, group, and netgroup queries, click
Default Query Settings.
7. Optional: To modify the settings for user queries and home directory provisioning,
click User Query Settings.
8. Optional: To modify the settings for group queries, click Group Query Settings.
9. Optional: To modify the settings for netgroup queries, click Netgroup Query Settings.
10. Optional: To modify the default LDAP attributes that contain user information or to modify LDAP security settings, click Advanced LDAP Settings.
Search Timeout
Specifies the number of seconds after which to stop retrying and fail a search. The
default value is 100. This setting is available only in the default query settings.
Query Filter
Specifies the LDAP filter for user, group, or netgroup objects. This setting is not
available in the default query settings.
Authenticate users from this provider
Specifies whether to allow the provider to respond to authentication requests. This
setting is available only in the user query settings.
Home Directory Naming
Specifies the path to use as a template for naming home directories. The path must
begin with /ifs and can contain variables, such as %U, that are expanded to
generate the home directory path for the user. This setting is available only in the
user query settings.
Create home directories on first login
Specifies whether to create a home directory the first time a user logs in, if a home
directory does not already exist for the user. This setting is available only in the user
query settings.
UNIX Shell
Specifies the path to the user's login shell, for users who access the file system
through SSH. This setting is available only in the user query settings.
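For example, a Home Directory Naming template that uses the %U variable expands to a per-user path; the user name jdoe below is a hypothetical example:

Template: /ifs/home/%U
Resulting home directory for user jdoe: /ifs/home/jdoe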
Consider the following information when you configure an Active Directory provider:
- When you join Active Directory from OneFS, cluster time is updated from the Active Directory server, as long as an NTP server has not been configured for the cluster.
- If you migrate users to a new or different Active Directory domain, you must re-set the ACL domain information after you configure the new provider. You can use third-party tools such as Microsoft SubInACL.
Procedure
1. Click Access > Authentication Providers > Active Directory.
2. Click Join a domain.
3. In the Domain Name field, type a fully qualified Active Directory domain name.
The domain name will also be used as the provider name.
4. In the User field, type the username of an account that is authorized to join the Active
Directory domain.
5. In the Password field, type the password of the user account.
6. Optional: In the Organizational Unit field, type the name of the organizational unit
(OU) to connect to on the Active Directory server. Specify the OU in the format OuName
or OuName1/SubName2.
7. Optional: In the Machine Account field, type the name of the machine account.
Note
If you specified an OU to connect to, the domain join will fail if the machine account
does not reside in the OU.
8. Optional: To enable Active Directory authentication for NFS, select the Enable Secure
NFS check box.
If you enable this setting, OneFS registers NFS service principal names (SPNs) during
the domain join.
9. Optional: To configure advanced settings, click Advanced Active Directory Settings.
10.Click Join.
2. In the Active Directory Providers table, click View details for the provider whose
settings you want to modify.
3. Click Advanced Active Directory Settings.
4. For each setting that you want to modify, click Edit, make the change, and then click
Save.
5. Optional: Click Close.
Setting                    Description

Trusted Domains

Use enhanced privacy and endcryption
Encrypts communication to and from the domain controller.

Home Directory Naming
Creates a home directory the first time that a user logs in if a home directory does not already exist for the user.

UNIX Shell
Specifies the path to the login shell to use if the Active Directory server does not provide login-shell information. This setting applies only to users who access the file system through SSH.

Auto-assign UIDs

Auto-assign GIDs

Make UID/GID assignments for users and groups in these specific domains
Restricts user and group lookups to the specified domains.
If the Load balance servers option is not selected, servers are accessed in the order in
which they are listed.
5. In the NIS Domain field, type the domain name.
6. Optional: Configure the Load balance servers setting:
- To connect to a random server, select the check box.
- To connect according to the order in which the servers are listed in the Servers field, clear the check box.
7. Optional: Specify the Default Query Settings.
a. In the Search Timeout field, specify the number of seconds after which to stop retrying and fail a search. The default value is 100.
b. In the Retry Frequency field, specify the timeout period in seconds after which a request will be retried. The default value is 5.
8. Optional: Specify the User Query Settings.
a. Select the Authenticate users from this provider check box to allow the provider to
respond to authentication requests.
b. Type a path in the Home Directory Naming field to use as a template for naming
home directories. The path must begin with /ifs and can contain expansion
variables, such as %U, which expand to generate the home directory path for the
user. For more information, see the Home directories section.
c. Select the Create home directories on first login check box to specify whether to
create a home directory the first time a user logs in, if a home directory does not
already exist for the user.
d. Select a path from the UNIX Shell list to specify the path to the user's login shell
for users who access the file system through SSH.
9. Optional: Click Host Name Query Settings and then configure the Resolve hosts from this provider setting.
If the replacement files are located outside the /ifs directory tree, you must distribute
them manually to every node in the cluster. Changes that are made to the system
provider's files are automatically distributed across the cluster.
- To specify a user replacement file, in the Users File field, type or browse to the location of the spwd.db file.
- To specify a group replacement file, in the Groups File field, type or browse to the location of the group file.
- To specify a netgroup replacement file, in the Netgroups File field, type or browse to the location of the netgroup file.
Option                     Description

Authenticate users from this provider
Specifies whether to allow the provider to respond to authentication requests.

Home Directory Naming
Specifies the path to use as a template for naming home directories. The path must begin with /ifs and can contain expansion variables, such as %U, that are expanded to generate the home directory path for the user.

Create home directories on first login
Specifies whether to create a home directory the first time a user logs in, if a home directory does not already exist for the user.

UNIX Shell
Specifies the path to the user's login shell, for users who access the file system through SSH.
By default, the binary password file, spwd.db, is created in the /etc directory. You
can override the location to store the spwd.db file by specifying the -d option with a
different target directory.
The following command generates an spwd.db file in the /etc directory from a
password file that is located at /ifs/test.passwd:
pwd_mkdb /ifs/test.passwd
The following command generates an spwd.db file in the /ifs directory from a
password file that is located at /ifs/test.passwd:
pwd_mkdb -d /ifs /ifs/test.passwd
The fields are defined below in the order in which they appear in the file.
Note
UNIX systems often define the passwd format as a subset of these fields, omitting the
Class, Change, and Expiry fields. To convert a file from passwd to master.passwd
format, add :0:0: between the GID field and the Gecos field.
Username
The user name. This field is case-sensitive. OneFS does not limit the length; many
applications truncate the name to 16 characters, however.
Password
The user's encrypted password. If authentication is not required for the user, you can substitute an asterisk (*) for a password. The asterisk character is guaranteed to not match any password.
UID
The UNIX user identifier. This value must be a number in the range 0-4294967294 that is not reserved or already assigned to a user. Compatibility issues occur if this value conflicts with an existing account's UID.
GID
The group identifier of the user's primary group. All users are a member of at least one group, which is used for access checks and can also be used when creating files.
Class
This field is not supported by OneFS and should be left empty.
Change
OneFS does not support changing the passwords of users in the file provider. This field is ignored.
Expiry
OneFS does not support the expiration of user accounts in the file provider. This field is ignored.
Gecos
This field can store a variety of information but is usually used to store the user's full name.
Home
The absolute path to the user's home directory, beginning at /ifs.
Shell
The absolute path to the user's shell. If this field is set to /sbin/nologin, the user is denied command-line access.
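Putting the fields together, a master.passwd-format entry might look like the following line; the user name, identifiers, and paths are hypothetical examples:

jdoe:*:2000:2000::0:0:Jane Doe:/ifs/home/jdoe:/bin/zsh

Here the Class field is empty, the Change and Expiry fields are ignored by OneFS, and the asterisk is guaranteed to not match any password.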
The fields are defined below in the order in which they appear in the file.
Group name
The name of the group. This field is case-sensitive. Although OneFS does not limit
the length of the group name, many applications truncate the name to 16 characters.
Password
This field is not supported by OneFS and should contain an asterisk (*).
GID
The UNIX group identifier. Valid values are any number in the range 0-4294967294
that is not reserved or already assigned to a group. Compatibility issues occur if this
value conflicts with an existing group's GID.
Group members
A comma-delimited list of user names.
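Similarly, a group file entry consists of the four fields described above; the group name and members in this line are hypothetical examples:

staff:*:2000:jdoe,mwilson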
Each netgroup member is a triple of the form (<host>, <user>, <domain>), where <host> is a placeholder for a machine name, <user> is a placeholder for a user name, and <domain> is a placeholder for a domain name. Any combination is valid except an empty triple: (,,).
The following sample file contains two netgroups. The rootgrp netgroup contains four
hosts: two hosts are defined in member triples and two hosts are contained in the nested
othergrp netgroup, which is defined on the second line.
rootgrp (myserver, root, somedomain.com) (otherserver, root,
somedomain.com) othergrp
othergrp (other-win,, somedomain.com) (other-linux,, somedomain.com)
Note
A new line signifies a new netgroup. You can continue a long netgroup entry to the next
line by typing a backslash character (\) in the right-most position of the first line.
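For example, the rootgrp entry from the sample file above could also be written across two lines by ending the first line with a backslash:

rootgrp (myserver, root, somedomain.com) (otherserver, root, \
somedomain.com) othergrp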
Option                     Description

Allow password to expire

UID

Full Name

Email Address

Primary Group
Click Select group to specify the owner group using the Select a Primary Group dialog box.
a. To locate a group under the selected local provider, type a group name or click Search.
b. Select a group to return to the Manage Users window.

Additional Groups
Click Add group to specify any additional groups to make this user a member of.

Home Directory
Type the path to the user's home directory. If you do not specify a path, a directory is automatically created at /ifs/home/<username>.

UNIX Shell
This setting applies only to users who access the file system through SSH. From the list, select a shell. By default, the /bin/zsh shell is selected.

Account Expiration Date
Click the calendar icon to select the expiration date, or type the expiration date in the field in the format <mm>/<dd>/<yyyy>.

Enable the account
Select this check box to allow the user to authenticate against the local database for SSH, FTP, HTTP, and Windows file sharing through SMB. This setting is not used for UNIX file sharing through NFS.
8. Click Create.
You cannot assign a GID that is in use by another group. It is recommended that you
leave this field blank to allow the system to automatically generate the GID.
7. Optional: For each member that you want to add to the group, click Add user and
perform the following tasks in the Select a User dialog box:
a. Search for either Users or Well-known SIDs.
b. If you selected Users, specify values for the following fields:
Username
Type all or part of a user name, or leave the field blank to return all users. Wildcard
characters are accepted.
Access Zone
Select the access zone that contains the authentication provider that you want to
search.
Provider
Select an authentication provider.
c. Click Search.
d. In the Search Results table, select a user and then click Select.
The dialog box closes.
8. Click Create.
The maximum name length is 104 characters. It is recommended that names do not exceed 64 characters. Names can contain any special character that is not in the list of invalid characters. It is recommended that names do not contain spaces.
4. In the list of groups, click View details for the local group whose settings you want to
modify.
5. For the Members setting, click Edit.
6. Add or remove the users that you want, and then click Save.
7. Click Close.
The maximum name length is 104 characters. It is recommended that names do not exceed 64 characters. Names can contain any special character that is not in the list of invalid characters. It is recommended that names do not contain spaces.
configuration changes to the Kerberos provider. You can include the provider in one or
more access zones.
9. Click Close.
- To delete a single realm, select the realm and click More > Delete from the Actions column.
- To delete multiple realms, select the realms and then select Delete Selection from the Select a bulk action list.
a. In the Realm Name field, type a realm name, preferably in uppercase characters. For example, type CLUSTER-NAME.COMPANY.COM.
b. Select the Set as the default realm check box to set the realm as the default.
c. In the Key Distribution Centers (KDCs) field, add one or more KDCs by specifying
the hostnames or IP addresses of the authentication servers.
d. In the Admin Server field, type the hostname or IP address of the authentication
server to use as the master KDC. If you omit this step, the first KDC that you added
previously is used as the default admin server.
e. Optional: In the Default Domain field, specify the domain name to use for
translating the service principal names (SPNs).
4. Optional: In the Create Domain(s) section, specify one or more domain names to
associate with the realm.
5. In the Authenticate to Realm section, specify the following parameters:
a. In the User field, type a user name who has the permission to create the SPNs in
the Kerberos realm.
b. In the Password field, type the password for the user.
6. In the Create Provider section, select one of the following options for managing the SPNs.
- To delete a single provider, select the provider and click More > Delete from the Actions column.
- To delete multiple providers, select the providers and then select Delete Selection from the Select a bulk action list.
- To delete a single domain mapping, select the mapping and click More > Delete from the Actions column.
- To delete multiple domain mappings, select the mappings and then select Delete Selection from the Select a bulk action list.
3. View mode-bits permissions for a user by running the isi auth access command.
The following command displays verbose-mode file permissions information in /ifs/ for the user that you specify in place of <username>:
isi auth access <username> /ifs/ -v
4. View expected ACL user permissions on a file for a user by running the isi auth access command.
The following command displays verbose-mode ACL file permissions for the file file_with_acl.tx in /ifs/data/ for the user that you specify in place of <username>:
isi auth access <username> /ifs/data/file_with_acl.tx -v
Setting                    Description

Send NTLMv2

On-Disk Identity
Controls the preferred identity to store on disk. If OneFS is unable to convert an identity to the preferred format, it is stored as is. This setting does not affect identities that are currently stored on disk. Select one of the following settings:
- native: Allow OneFS to determine the identity to store on disk. This is the recommended setting.
- unix: Always store incoming UNIX identifiers (UIDs and GIDs) on disk.
- sid: Store incoming Windows security identifiers (SIDs) on disk, unless the SID was generated from a UNIX identifier; in that case, convert it back to the UNIX identifier and store it on disk.

Workgroup

Space Replacement
For clients that have difficulty parsing spaces in user and group names, specifies a substitute character.
3. Click Save.
After you finish
If you changed the on-disk identity selection, it is recommended that you run the
PermissionRepair job with the Convert repair type to prevent potential permissions
errors. For more information, see Update cluster permissions on page 139.
Because ACL policies change the behavior of permissions throughout the system, they
should be modified only as necessary by experienced administrators with advanced
knowledge of Windows ACLs. This is especially true for the advanced settings, which are
applied regardless of the cluster's environment.
For UNIX, Windows, or balanced environments, the optimal permission policy settings are
selected and cannot be modified. However, you can choose to manually configure the
cluster's default permission settings if necessary to support your particular environment.
Note
You must be logged in to the web administration interface to perform this task.
Procedure
1. Click Protocols > ACLs > ACL Policies.
2. In the Standard Settings area, under Environment, select the option that best
describes your environment, or select Configure permission policies manually to
configure individual permission policies.
3. If you selected the Configure permission policies manually option, configure the
settings as needed.
For more information about these settings, see ACL policy settings options.
4. In the Advanced Settings area, configure the settings as needed.
Inheritable ACLs on the system take precedence over this setting: If inheritable ACLs
are set on a folder, any new files and folders created in that folder will inherit the
folder's ACL. Disabling this setting does not remove ACLs currently set on files. If you
want to clear an existing ACL, run the chmod -b <mode> <file> command to remove
the ACL and set the correct permissions.
If you try to run the chmod command on the same permissions that are currently set on a file with an ACL, the operation may silently fail: The operation appears to be successful, but if you were to examine the permissions on the cluster, you would notice that the chmod command had no effect. As a workaround, you can first run the chmod command with permissions that differ from the current ones, and then run a second chmod command to revert to the original permissions. For example, if your file shows 755 UNIX permissions and you want to confirm this number, you could run chmod 700 file; chmod 755 file.
Over NFS, the chown or chgrp operation changes the permissions and the owner or
owning group. For example, consider a file owned by user Joe with rwx------ (700)
permissions, signifying rwx permissions for the owner, but no permissions for
anyone else. If you run the chown command to change ownership of the file to user
Bob, the owner permissions are still rwx but they now represent the permissions for
Bob, rather than for Joe, who lost all of his permissions. This setting does not affect
UNIX chown or chgrp operations performed on files with UNIX permissions, and it
does not affect Windows chown or chgrp operations, which do not change any
permissions.
The take ownership right only gives users the ability to take file ownership, not to
give ownership away.
Select one of the following options.
Allow only owners to chmod or chown
Causes chmod and chown access checks to operate with UNIX-like behavior.
Allow owner and users with 'take ownership' right to chown, and owner and users
with 'change permissions' right to chmod
Causes chmod and chown access checks to operate with Windows-like
behavior.
Advanced settings
Treatment of 'rwx' permissions
In UNIX environments, rwx permissions signify two things: A user or group has read,
write, and execute permissions; and a user or group has the maximum possible level
of permissions.
When you assign UNIX permissions to a file, no ACLs are stored for that file. A
Windows system processes only ACLs; Windows does not process UNIX permissions.
Therefore, when you view a file's permissions on a Windows system, the cluster
must translate the UNIX permissions into an ACL. This type of ACL is called a
synthetic ACL. Synthetic ACLs are not stored anywhere; instead, they are dynamically
generated as needed and then they are discarded. If a file has UNIX permissions,
you may notice synthetic ACLs when you run the ls command on the cluster to view a file's ACL.
When you generate a synthetic ACL, the cluster maps UNIX permissions to Windows
rights. Windows supports a more granular permissions model than UNIX does, and it
specifies rights that cannot easily be mapped from UNIX permissions. If the cluster
maps rwx permissions to Windows rights, you must enable one of the following
options. The main difference between rwx and Full Control is the broader set of
permissions with Full Control.
Select one of the following options.
Retain 'rwx' permissions
Generates an ACE that provides only read, write, and execute permissions.
Treat 'rwx' permissions as Full Control
Generates an ACE that provides the maximum Windows permissions for a user
or a group by adding the change permissions right, the take ownership right,
and the delete right.
Group permissions
Select one of the following options for group permissions:
Approximate group mode bits using all possible group ACEs
Makes the group permissions appear more permissive than the actual
permissions on the file.
Approximate group mode bits using only the ACE with the group ID
Makes the group permissions appear more accurate, in that you see only the
permissions for a particular group and not the more permissive set. This may
cause access-denied problems for UNIX clients, however.
No "deny" ACEs
The Windows ACL user interface cannot display an ACL if any deny ACEs are out of
canonical ACL order. To correctly represent UNIX permissions, deny ACEs may be
required to be out of canonical ACL order.
Select one of the following options.
Do not modify synthetic ACLs and mode bit approximations
Specifies to not modify synthetic ACL generation; deny ACEs will be generated
when necessary.
The following repair task options are available:
- Clone permissions
- Inherit permissions
- Convert permissions
The remaining settings options differ depending on the selected repair task.
7. In the Template File or Directory field, type or browse to the directory in /ifs that you
want to copy permissions from. This setting applies to only the Clone and Inherit
repair types.
8. Optional: From the Mapping type list, select the preferred on-disk identity type to
apply. This setting applies to only the Convert permissions repair task.
The Mapping type options include the following:
- Global
- Native
9. Optional: From the Access Zone list, select an access zone to use for ID mapping. This
setting applies to only the Convert permissions repair task.
CHAPTER 6
Identity management
For example, the user mapping service makes it possible to:
- Authenticate a user with Active Directory but give the user a UNIX identity.
- Disallow login of users that do not exist in both Active Directory and LDAP.
For more information about identity management, see the white paper Managing identities with the Isilon OneFS user mapping service at EMC Online Support.
Identity types
OneFS supports three primary identity types, each of which you can store directly on the
file system. Identity types are user identifier and group identifier for UNIX, and security
identifier for Windows.
When you log on to an EMC Isilon cluster, the user mapper expands your identity to
include your other identities from all the directory services, including Active Directory,
LDAP, and NIS. After OneFS maps your identities across the directory services, it
generates an access token that includes the identity information associated with your
accounts. A token includes the following identifiers:
- A UNIX user identifier (UID) and a group identifier (GID). A UID or GID is a 32-bit number with a maximum value of 4,294,967,295.
- A security identifier (SID) for a Windows user account. A SID is a series of authorities and sub-authorities ending with a 32-bit relative identifier (RID). Most SIDs have the form S-1-5-21-<A>-<B>-<C>-<RID>, where <A>, <B>, and <C> are specific to a domain or computer and <RID> denotes the object in the domain.
- A list of supplemental identities, including all groups in which the user is a member.
The token also contains privileges that stem from administrative role-based access
control.
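To inspect the identities that are expanded into a token, you can view a user from the
command line. For example, the following command (shown with the illustrative YORK
domain and user name that also appear later in this chapter) displays a user's identities
and group memberships:
isi auth users view --user=YORK\\stand --show-groups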
On an Isilon cluster, a file contains permissions, which appear as an access control list
(ACL). The ACL controls access to directories, files, and other securable system objects.
When a user tries to access a file, OneFS compares the identities in the user's access
token with the file's ACL. OneFS grants access when the file's ACL includes an access
control entry (ACE) that allows the identity in the token to access the file and that does
not include an ACE that denies the identity access. OneFS compares the access token of
a user with the ACL of a file.
Note
For more information about access control lists, including a description of the
permissions and how they correspond to POSIX mode bits, see the white paper titled EMC
Isilon multiprotocol data access with a unified security model on the EMC Online Support
web site.
When a name is provided as an identifier, it is converted into the corresponding user or
group object and the correct identity type. You can enter or display a name in various
ways:
UNIX assumes unique case-sensitive namespaces for users and groups. For example,
Name and name represent different objects.
Windows provides a single, case-insensitive namespace for all objects and also
specifies a prefix to target an Active Directory domain; for example, domain\name.
Kerberos and NFSv4 define principals, which require names to be formatted the same
way as email addresses; for example, [email protected].
Multiple names can reference the same object. For example, given the name support and
the domain example.com, support, EXAMPLE\support and [email protected] are all
names for a single object in Active Directory.
Access tokens
An access token is created when the user first makes a request for access.
Access tokens represent who a user is when performing actions on the cluster and supply
the primary owner and group identities during file creation. Access tokens are also
compared against the ACL or mode bits during authorization checks.
During user authorization, OneFS compares the access token, which is generated during
the initial connection, with the authorization data on the file. All user and identity
mapping occurs during token generation; no mapping takes place during permissions
evaluation.
An access token includes all UIDs, GIDs, and SIDs for an identity, in addition to all OneFS
privileges. OneFS reads the information in the token to determine whether a user has
access to a resource. It is important that the token contains the correct list of UIDs, GIDs,
and SIDs. An access token is created from one of the following sources:
Source: Username
Authentication: Kerberized NFSv3; Kerberized NFSv4; HTTP; FTP; HDFS; SMB NTLM

After the initial token is created, it is processed through the following stages:

User identity lookup
ID mapping
The user's identifiers are associated across directory services. All SIDs
are converted to their equivalent UID/GID and vice versa. These ID
mappings are also added to the access token.
User mapping
The user's identities from different directory services are combined into a
single access token, which is then modified according to any user-mapping rules.
On-disk identity calculation
The default on-disk identity is calculated from the final token and the
global setting. These identities are used for newly created files.
ID mapping
The Identity (ID) mapping service maintains relationship information between mapped
Windows and UNIX identifiers to provide consistent access control across file sharing
protocols within an access zone.
Note
ID mapping and user mapping are different services, despite the similarity in names.
During authentication, the authentication daemon requests identity mappings from the
ID mapping service in order to create access tokens. Upon request, the ID mapping
service returns Windows identifiers mapped to UNIX identifiers or UNIX identifiers
mapped to Windows identifiers. When a user authenticates to a cluster over NFS with a
UID or GID, the ID mapping service returns the mapped Windows SID, allowing access to
files that another user stored over SMB. When a user authenticates to the cluster over
SMB with a SID, the ID mapping service returns the mapped UNIX UID and GID, allowing
access to files that a UNIX client stored over NFS.
Mappings between UIDs or GIDs and SIDs are stored according to access zone in a
cluster-distributed database called the ID map. Each mapping in the ID map is stored as a
one-way relationship from the source to the target identity type. Two-way mappings are
stored as complementary one-way mappings.
User and group lookups may be disabled or limited, depending on the Active Directory
settings. You enable user and group lookup settings through the isi auth ads
modify command.
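For example, a minimal sketch of enabling lookups for an Active Directory provider; the
--lookup-users and --lookup-groups option names are assumptions here, so verify them
with isi auth ads modify --help before use:
# Hypothetical option names; verify with --help before running
isi auth ads modify YOUR.DOMAIN.NAME.COM --lookup-users=yes --lookup-groups=yes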
If the ID mapping service does not locate and return a mapped UID or GID in the ID map,
the authentication daemon searches other external authentication providers configured
in the same access zone for a user that matches the same name as the Active Directory
user.
If a matching user name is found in another external provider, the authentication daemon
adds the matching user's UID or GID to the access token for the Active Directory user, and
the ID mapping service creates a mapping between the UID or GID and the Active
Directory user's SID in the ID map. This is referred to as an external mapping.
Note
When an external mapping is stored in the ID map, the UID is specified as the on-disk
identity for that user. When the ID mapping service stores a generated mapping, the SID
is specified as the on-disk identity.
If a matching user name is not found in another external provider, the authentication
daemon assigns a UID or GID from the ID mapping range to the Active Directory user's
SID, and the ID mapping service stores the mapping in the ID map. This is referred to as a
generated mapping. The ID mapping range is a pool of UIDs and GIDs allocated in the
mapping settings.
After a mapping has been created for a user, the authentication daemon retrieves the UID
or GID stored in the ID map upon subsequent lookups for the user.
For UIDs, the ID mapping service generates a UNIX SID with a domain of S-1-22-1 and
a relative identifier (RID) matching the UID. For example, the UNIX SID for UID 600 is
S-1-22-1-600.
For GIDs, the ID mapping service generates a UNIX SID with a domain of S-1-22-2 and
an RID matching the GID. For example, the UNIX SID for GID 800 is S-1-22-2-800.
ID mapping ranges
In access zones with multiple external authentication providers, such as Active Directory
and LDAP, it is important that the UIDs and GIDs from different providers that are
configured in the same access zone do not overlap. Overlapping UIDs and GIDs between
providers within an access zone might result in some users gaining access to other users'
directories and files.
The range of UIDs and GIDs that can be allocated for generated mappings is configurable
in each access zone through the isi auth settings mappings modify
command. The default range for both UIDs and GIDs is 1000000 to 2000000 in each
access zone.
Do not include commonly used UIDs and GIDs in your ID ranges. For example, UIDs and
GIDs below 1000 are reserved for system accounts and should not be assigned to users
or groups.
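For example, a sketch of moving the generated-mapping range for one access zone; the
document names only the command itself, so the range option names below are
assumptions to confirm against the command's --help output:
# Hypothetical option names; verify with --help before running
isi auth settings mappings modify --zone=zone3 \
  --uid-range-min=3000000 --uid-range-max=4000000 \
  --gid-range-min=3000000 --gid-range-max=4000000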
User mapping
User mapping provides a way to control permissions by specifying a user's security
identifiers, user identifiers, and group identifiers. OneFS uses the identifiers to check file
or group ownership.
With the user-mapping feature, you can apply rules to modify which user identity OneFS
uses, add supplemental user identities, and modify a user's group membership. The
user-mapping service combines a user's identities from different directory services into a
single access token and then modifies it according to the rules that you create.
Note
You can configure mapping rules on a per-zone basis. Mapping rules must be configured
separately in each access zone that uses them. OneFS maps users only during login or
protocol access.
The user's groups come from Active Directory and LDAP, with the LDAP groups and the
autogenerated group GID added to the list. To pull groups from LDAP, the mapping
service queries the memberUid attribute. The user's home directory, gecos, and shell
come from Active Directory.
Stop all processing before applying a default deny rule. To do so, create a rule
that matches allowed users but does nothing, such as an add operator with no
field options, and has the break option. After enumerating the allowed users,
you can place a catchall deny at the end to replace anybody unmatched with an
empty user.
To prevent explicit rules from being skipped, in each group of rules, order explicit
rules before rules that contain wildcard characters.
Add the LDAP or NIS primary group to the supplemental groups
When an Isilon cluster is connected to Active Directory and LDAP, a best practice is
to add the LDAP primary group to the list of supplemental groups. This lets OneFS
honor group permissions on files created over NFS or migrated from other UNIX
storage systems. The same practice is advised when an Isilon cluster is connected to
both Active Directory and NIS.
On-disk identity
After the user mapper resolves a user's identities, OneFS determines an authoritative
identifier for it, which is the preferred on-disk identity.
OneFS stores either UNIX or Windows identities in file metadata on disk. On-disk identity
types are UNIX, SID, and native. Identities are set when a file is created or a file's access
control data is modified. Almost all protocols require some level of mapping to operate
correctly, so choosing the preferred identity to store on disk is important. You can
configure OneFS to store either the UNIX or the Windows identity, or you can allow OneFS
to determine the optimal identity to store.
Although you can change the type of on-disk identity, the native identity is best for a
network with both UNIX and Windows systems. In native on-disk identity mode, setting
the UID as the on-disk identity improves NFS performance.
Note
The SID on-disk identity is for a homogeneous network of Windows systems managed
only with Active Directory. When you upgrade from a version earlier than OneFS 6.5, the
on-disk identity is set to UNIX. When you upgrade from OneFS 6.5 or later, the on-disk
identity setting is preserved. On new installations, the on-disk identity is set to native.
The native on-disk identity type allows the OneFS authentication daemon to select the
correct identity to store on disk by checking for the identity mapping types in the
following order:
1. Algorithmic mapping
An SID that matches S-1-22-1-<UID> or S-1-22-2-<GID> is converted back to the
corresponding UNIX identity, and that UID or GID is set as the on-disk identity.

2. External mapping
A user with an explicit UID and GID defined in a directory service (such as
Active Directory with RFC 2307 attributes, LDAP, NIS, or the OneFS file
provider or local provider) has the UNIX identity set as the on-disk
identity.

3. Persistent mapping
A user with a mapping stored persistently in the ID map uses the target of that
mapping as the on-disk identity.

4. No mapping
If a user lacks a UID or GID even after querying the other directory
services and identity databases, its SID is set as the on-disk identity. In
addition, to make sure a user can access files over NFS, OneFS allocates
a UID and GID from a preset range of 1,000,000 to 2,000,000. In native
on-disk identity mode, a UID or GID that OneFS generates is never set as
the on-disk identity.
Note
If you change the on-disk identity type, you should run the PermissionRepair job in
convert mode to make sure that the disk representation of all files is consistent with the
changed setting.
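A hedged sketch of starting that job from the command line follows; the job name comes
from the note above, but the subcommand form and the --mode option are assumptions to
verify against your release's CLI reference:
# Hypothetical syntax; verify against the OneFS CLI reference
isi job jobs start PermissionRepair --mode=convert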
Managing ID mappings
You can create, modify, and delete identity mappings and configure ID mapping settings.
The following command deletes all identity mappings in the zone3 access zone that
were both created automatically and include a UID or GID from an external
authentication source:
isi auth mapping delete --all --only-external --zone=zone3
The following command deletes the identity mapping of the user with UID 4236 in the
zone3 access zone:
isi auth mapping delete --source-uid=4236 --zone=zone3
The following command flushes the mapping of the user with UID 4236 in the zone3
access zone:
isi auth mapping flush --source-uid=4236 --zone=zone3
You can only create user-mapping rules if you are connected to the EMC Isilon cluster
through the System zone; however, you can apply user-mapping rules to specific
access zones. If you create a user-mapping rule for a specific access zone, the rule
applies only in the context of its zone.
When you change user-mapping on one node, OneFS propagates the change to the
other nodes.
Identity management
After you make a user-mapping change, the OneFS authentication service reloads the
configuration.
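Because rules are zone-specific, one way to apply them is through the zone-modification
command. The following sketch assumes the --user-mapping-rules option and uses an
illustrative YORK domain with the join operator described later in this chapter:
# Assumes the --user-mapping-rules option; YORK is an illustrative domain
isi zone zones modify System --user-mapping-rules="YORK\\* &= *"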
The OneFS user access token contains a combination of identities from Active Directory
and LDAP if both directory services are configured. You can run the following commands
to discover the identities that are within each specific directory service.
Procedure
1. Establish an SSH connection to any node in the cluster.
2. View a user identity from Active Directory only by running the isi auth users
view command.
The following command displays the identity of a user named stand in the Active
Directory domain named YORK:
isi auth users view --user=YORK\\stand --show-groups
3. View a user identity from LDAP only by running the isi auth users view
command.
The following command displays the identity of an LDAP user named stand:
isi auth users view --user=stand --show-groups
The system displays output similar to the following example:
UID: 4326
SID: S-1-22-1-4326
Primary Group ID: GID:7222
Name: stand
Additional Groups: stand
                   sd-group
                   sd-group2
Generate a primary UID or GID from the reserved range of UIDs and GIDs
It is recommended that you assign a user from a well-known account that has
read-only access.
9. Click Save Changes.
3. In the Test User Mapping section, type or select a user, group, or a well-known SID
from the Select a User dialog box.
4. Click Test Mapping.
The token results appear in the Results section as shown:
User
    Name: krb_user_002
    UID: 1002
    SID: S-1-22-1-1001
    On disk: 1001
ZID: 1
Zone: System
Privileges: -
Primary Group
    Name: krb_user_001
    GID: 1000
    SID: S-1-22-2-1001
    On disk: 1000
Supplemental Identities
    Name: Authenticated Users
    SID: S-1-5-11
Depending on your selection, the Create a User Mapping Rule dialog box refreshes to
display additional fields.
Merge Windows and UNIX tokens
Rules are called in the order they are listed. To ensure that each rule gets processed,
list replacements first and allow/deny rules last. You can change the order in which a
rule is listed by clicking its title bar and dragging it to a new position.
7. Click Save Changes.
Rules are called in the order they are listed. To ensure that each rule gets processed,
list the replacements first and the allow or deny rules at the end. You can change the
order in which a rule is listed by clicking its title bar and dragging it to a new position.
9. Click Save Changes.
username
unix_name
primary_uid
primary_user_sid
primary_gid
primary_group_sid
Options control how a rule combines identity information in a token. The break option is
the exception: It stops OneFS from processing additional rules.
Although several options can apply to a rule, not all options apply to all operators. The
following table describes the effect of each option and the operators that they work with.
Option: user
Operators: insert, append
Copies the primary UID and primary user SID, if they exist, to the token.

Option: groups
Operators: insert, append
Copies the primary GID and primary group SID, if they exist, to the token.

Option: default_user
Operators: all operators except remove groups
If the mapping service fails to find the second user in a rule, the
service tries to find the username of the default user. The name
of the default user cannot include wildcards. When you set the
option for the default user in a rule with the command-line
interface, you must set it with an underscore: default_user.
Option: break
Operators: all operators
Stops the mapping service from applying rules that follow the
insertion point of the break option. The mapping service
generates the final token at the point of the break.
Operator: append
Web interface: Append fields from a user
CLI: ++
Direction: Left-to-right

Operator: insert
Web interface: Insert fields from a user
CLI: +=
Direction: Left-to-right

Operator: replace
Web interface: Replace one user with a different user
CLI: =>
Direction: Left-to-right

Operator: remove groups
Web interface: Remove supplemental groups from a user
CLI: --
Direction: Unary

Operator: join
CLI: &=
Direction: Bidirectional
Inserts the new identity into the token. If the
new identity is the second user, the
mapping service inserts it after the existing
identity; otherwise, the service inserts it
before the existing identity. The location of
the insertion point is relevant when the
existing identity is already the first in the list
because OneFS uses the first identity to
determine the ownership of new file system
objects.
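To illustrate the CLI forms above, the following rule strings are hypothetical examples
(YORK and the user names are placeholders): the first replaces the Windows user with
the UNIX user jane, and the second joins each YORK user with the same-named UNIX user.
YORK\jane => jane
YORK\* &= *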
CHAPTER 7
Auditing
Auditing overview................................................................................................162
Protocol audit events.......................................................................................... 162
Supported event types........................................................................................ 162
Supported audit tools......................................................................................... 163
Enable system configuration auditing..................................................................164
Enable protocol access auditing..........................................................................164
Auditing settings................................................................................................. 165
Auditing overview
You can audit system configuration changes and SMB and NFS protocol activity on an
EMC Isilon cluster. All audit data is stored and protected in the cluster file system and
organized by audit topics.
When you enable or disable system configuration auditing, no additional configuration is
required. If you enable configuration auditing, all configuration events that are handled
by the API, including writes, modifications, and deletions, are tracked and recorded in
the config audit topic directories.
You can enable and configure protocol auditing for one or more access zones in a cluster.
If you enable protocol auditing for an access zone, file-access events through the SMB
and NFS protocols are recorded in the protocol audit topic directories. The protocol
audit log file is consumable by auditing applications that support the EMC Common Event
Enabler (CEE), such as Varonis DatAdvantage for Windows. By default, OneFS logs only
the events that are handled by Varonis, but you can specify which events to log in each
access zone. For example, you might want to audit the default set of protocol events in
the System access zone but audit only successful attempts to delete files in a different
access zone.
The following event types are audited by default:

create
Example protocol activity: Mount a share; Delete a file
close
Example protocol activity: Close a directory
rename
delete
set_security
The following event types are available for forwarding through CEE but are unsupported
by Varonis DatAdvantage:
read
write
get_security
The following protocol audit events are not exported through CEE and are unsupported by
Varonis DatAdvantage:
logon
logoff
Application: Varonis DatAdvantage
Audit events: create, close, delete, rename, set_security
Note
It is recommended that you install and configure third-party auditing applications before
you enable the OneFS auditing feature. Otherwise, the backlog consumed by the tool may
be so large that results may be stale for a prolonged time.
For the most current list of supported auditing tools, see the Isilon Third-Party Software &
Hardware Compatibility Guide.
Enable system configuration auditing
It is recommended that you install and configure third-party auditing applications before
you enable the OneFS auditing feature. Otherwise, the backlog consumed by the tool may
be so large that results may be stale for a prolonged time.
Procedure
1. Click Cluster Management > Auditing.
2. In the Settings area, select the Enable Configuration Change Auditing checkbox.
3. Click Save Changes.
Enable protocol access auditing
It is recommended that you install and configure third-party auditing applications before
you enable the OneFS auditing feature. Otherwise, the backlog consumed by the tool may
become so large that results might be stale for a prolonged time.
Procedure
1. Click Cluster Management > Auditing.
2. In the Settings area, select the Enable Protocol Access Auditing checkbox.
3. In the Audited Zones area, click Add Zones.
4. In the Select Zones dialog box, select the checkbox for one or more access zones, and
then click Add Zones.
5. Optional: In the Event Forwarding area, specify one or more CEE servers to forward
logged events to.
a. In the CEE Server URIs field, type the URI of each CEE server in the CEE server pool.
The OneFS CEE export service uses round robin load-balancing when exporting
events to multiple CEE servers. Valid URIs start with http:// and include the port
number and path to the CEE server if necessary, for example,
http://example.com:12228/cee.
b. In the Storage Cluster Name field, specify the name of the storage cluster to use
when forwarding protocol events.
This name value is typically the SmartConnect zone name, but in cases where
SmartConnect is not implemented, the value must match the hostname of the
cluster as Varonis recognizes it. If the field is left blank, events from each node are
filled with the node name (clustername + lnn). This setting is required only if
needed by your third-party audit application.
Note
Although this step is optional, be aware that a backlog of events will accumulate
regardless of whether CEE servers have been configured. When configured, CEE
forwarding begins with the oldest events in the backlog and moves toward newest
events in a first-in-first-out sequence.
6. Click Save Changes.
Results
The following protocol events, which are the only events supported by Varonis
DatAdvantage, are collected for audited access zones by default: create, close,
delete, rename, and set_security. You can modify the set of events that are
audited in an access zone by running the isi zone zones modify command in the
command-line interface. Because each audited event consumes system resources, it is
recommended that you only configure zones for events that are needed by your auditing
application.
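For example, a sketch of narrowing the audited events in one zone; the document names
the isi zone zones modify command but not its audit options, so the --audit-success
option name below is an assumption to verify with --help:
# Hypothetical option name; verify with --help before running
isi zone zones modify zone3 --audit-success=create,delete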
Auditing settings
You can view or modify basic settings for configuration change auditing and protocol
access auditing.
Enable Configuration Change Auditing
Audits requests that are made through the API for system configuration changes.
Enable Protocol Access Auditing
Audits requests that are made through the SMB protocol to access data.
Audited Zones
Specifies one or more access zones to audit. This setting applies only to protocol
access auditing.
CEE Server URIs
Specifies one or more CEE server URIs where audit events will be forwarded. The
OneFS CEE export service uses round robin load-balancing when exporting events to
multiple CEE servers. This setting applies only to protocol access auditing.
Storage Cluster Name
Specifies the name of the storage cluster to use when forwarding protocol events,
typically the SmartConnect zone name. This setting is required only if needed by
your third-party audit application.
your third-party audit application.
CHAPTER 8
File sharing
It is recommended that you do not save data to the root /ifs path, but rather in
directories below /ifs. The design of your data storage structure should be planned
carefully. A well-designed directory structure optimizes cluster performance and
administration.
You can set Windows- and UNIX-based permissions on OneFS files and directories. Users
who have the required permissions and administrative privileges can create, modify, and
read data on the cluster through one or more of the supported file sharing protocols.
SMB. Allows Microsoft Windows and Mac OS X clients to access files that are stored
on the cluster.
NFS. Allows Linux and UNIX clients that adhere to the RFC1813 (NFSv3) and RFC3530
(NFSv4) specifications to access files that are stored on the cluster.
HTTP and HTTPS (with optional DAV). Allows clients to access files that are stored on
the cluster through a web browser.
FTP. Allows any client that is equipped with an FTP client program to access files that
are stored on the cluster through the FTP protocol.
SMB
OneFS includes a configurable SMB service to create and manage SMB shares. SMB
shares provide Windows clients network access to file system resources on the cluster.
You can grant permissions to users and groups to carry out operations such as reading,
writing, and setting access permissions on SMB shares.
The /ifs directory is configured as an SMB share and is enabled by default. OneFS
supports both user and anonymous security modes. If the user security mode is enabled,
users who connect to a share from an SMB client must provide a valid user name with
proper credentials.
The SMB protocol uses security identifiers (SIDs) for authorization data. All identities are
converted to SIDs during retrieval and are converted back to their on-disk representation
before they are stored on the cluster.
When a file or directory is created, OneFS checks the access control list (ACL) of its parent
directory. If the ACL contains any inheritable access control entries (ACEs), a new ACL is
generated from those ACEs. Otherwise, OneFS creates an ACL from the combined file and
directory create mask and create mode settings.
OneFS supports SMB client versions 1.0, 2.0, and 2.1. SMB 2.1 is supported for clients
running Windows 7 or later and Windows Server 2008 R2 or later.
You can migrate multiple SMB servers, such as Windows file servers or NetApp filers,
to a single Isilon cluster. You can then configure a separate access zone for each SMB
server.
You can configure each access zone with a unique set of SMB share names that do
not conflict with share names in other access zones, and then join each access zone
to a different Active Directory domain.
You can reduce the number of available and accessible shares to manage by
associating an IP address pool with an access zone to restrict authentication to the
zone.
You can configure default SMB share settings that apply to all shares in an access
zone.
The Isilon cluster includes a built-in access zone named System, where you manage all
aspects of the cluster and other access zones. If you don't specify an access zone when
managing SMB shares, OneFS will default to the System zone.
SMB Multichannel
SMB Multichannel supports establishing a single SMB session over multiple network
connections.
SMB Multichannel is a feature of the SMB 3.0 protocol that provides the following
capabilities:
Increased throughput
OneFS can transmit more data to a client through multiple connections over
high-speed network adapters or over multiple network adapters.
Connection failure tolerance
When an SMB Multichannel session is established over multiple network
connections, the session is not lost if one of the connections has a network fault,
which enables the client to continue to work.
Automatic discovery
SMB Multichannel automatically discovers supported hardware configurations on
the client that have multiple available network paths and then negotiates and
establishes a session over multiple network connections. You are not required to
install components, roles, role services, or features.
SMB Multichannel must be enabled on both the EMC Isilon cluster and the Windows
client computer. It is enabled on the Isilon cluster by default.
SMB Multichannel establishes a single SMB session over multiple network connections
only on supported network interface card (NIC) configurations. SMB Multichannel
requires at least one of the following NIC configurations on the client computer:
One or more network interface cards that support Receive Side Scaling (RSS).
One or more network interface cards configured with link aggregation. Link
aggregation enables you to combine the bandwidth of multiple NICs on a node into a
single logical interface.
The client-side NIC configuration can be either a single RSS-capable NIC or multiple
NICs. Multiple NICs allow SMB Multichannel to leverage the combined bandwidth of the
NICs and provide connection fault tolerance if a connection or a NIC fails.
Note
When you connect to a zone through the MMC Shared Folders snap-in, you can view and
manage all SMB shares assigned to that zone; however, you can only view active SMB
sessions and open files on the specific node that you are connected to in that zone.
Changes you make to shares through the MMC Shared Folders snap-in are propagated
across the cluster.
You must run the Microsoft Management Console (MMC) from a Windows workstation
that is joined to the domain of an Active Directory (AD) provider configured on the
cluster.
Role-based access control (RBAC) privileges do not apply to the MMC. A role with
SMB privileges is not sufficient to gain access.
local to local
local to remote
remote to local
remote to remote
To allow Windows SMB clients to traverse each type of symbolic link, the type must be
enabled on the client. The following Windows command enables all four link types:
fsutil behavior set SymlinkEvaluation L2L:1 R2R:1 L2R:1 R2L:1
In another example, the following options must be enabled in your Samba configuration
file (smb.conf) to allow Samba clients to traverse symbolic links and wide links:
follow symlinks=yes
wide links=yes
For more information, refer to the symbolic link configuration information for your specific
SMB client and version.
NFS
OneFS provides an NFS server so you can share files on your cluster with NFS clients that
adhere to the RFC1813 (NFSv3) and RFC3530 (NFSv4) specifications.
In OneFS, the NFS server is fully optimized as a multi-threaded service running in user
space instead of the kernel. This architecture load balances the NFS service across all
nodes of the cluster, providing the stability and scalability necessary to manage up to
thousands of connections across multiple NFS clients.
NFS mounts execute and refresh quickly, and the server constantly monitors fluctuating
demands on NFS services and makes adjustments across all nodes to ensure continuous,
reliable performance. Using a built-in process scheduler, OneFS ensures fair allocation of
node resources so that no client can seize more than its fair share of NFS services.
The NFS server also supports access zones defined in OneFS, so that clients can access
only the exports appropriate to their zone. For example, if NFS exports are specified for
Zone 2, only clients assigned to Zone 2 can access these exports.
To simplify client connections, especially for exports with large pathnames, the NFS
server also supports aliases, which are shortcuts to mount points that clients can specify
directly.
For secure NFS file sharing, OneFS supports NIS and LDAP authentication providers.
NFS exports
You can manage individual NFS export rules that define mount-points (paths) available to
NFS clients and how the server should perform with these clients.
In OneFS, you can create, delete, list, view, modify, and reload NFS exports.
NFS export rules are zone-aware. Each export is associated with a zone, can only be
mounted by clients on that zone, and can only expose paths below the zone root. By
default, any export command applies to the client's current zone.
Each rule must have at least one path (mount-point), and can include additional paths.
You can also specify that all subdirectories of the given path or paths are mountable.
Otherwise, only the specified paths are exported, and child directories are not
mountable.
An export rule can specify a particular set of clients, enabling you to restrict access to
certain mount-points or to apply a unique set of options to these clients. If the rule does
not specify any clients, then the rule applies to all clients that connect to the server. If the
rule does specify clients, then that rule is applied only to those clients.
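For example, a minimal sketch of creating a zone-aware export from the command line;
the option forms below are assumptions to verify against your release's CLI reference:
# Hypothetical option forms; verify against the OneFS CLI reference
isi nfs exports create --paths=/ifs/data/projects --zone=zone3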
NFS aliases
You can create and manage aliases as shortcuts for directory pathnames in OneFS. If
those pathnames are defined as NFS exports, NFS clients can specify the aliases as NFS
mount points.
NFS aliases are designed to give functional parity with SMB share names within the
context of NFS. Each alias maps a unique name to a path on the file system. NFS clients
can then use the alias name in place of the path when mounting.
Aliases must be formed as top-level UNIX pathnames, having a single forward slash
followed by a name. For example, you could create an alias named /q4 that mapped
to /ifs/data/finance/accounting/winter2013 (a path in OneFS). An NFS
client could mount that directory through either of:
mount cluster_ip:/q4
mount cluster_ip:/ifs/data/finance/accounting/winter2013
Aliases and exports are completely independent. You can create an alias without
associating it with an NFS export. Similarly, an NFS export does not require an alias.
Each alias must point to a valid path on the file system. While this path is absolute, it
must point to a location beneath the zone root (/ifs on the System zone). If the alias
points to a path that does not exist on the file system, any client trying to mount the alias
would be denied in the same way as attempting to mount an invalid full pathname.
NFS aliases are zone-aware. By default, an alias applies to the client's current access
zone. To change this, you can specify an alternative access zone as part of creating or
modifying an alias.
Each alias can only be used by clients on that zone, and can only apply to paths below
the zone root. Alias names are unique per-zone, but the same name can be used in
different zones, for example, /home.
When you create an alias in the Web Administration interface, the alias list displays the
status of the alias. Similarly, using the --check option of the isi nfs aliases
command, you can check the status of an NFS alias. Status can be one of good, illegal
path, name conflict, not exported, or path not found.
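Continuing the /q4 example above, a sketch of creating an alias and checking its status
from the command line; the document cites only the isi nfs aliases command and its
--check option, so the exact subcommand forms below are assumptions:
# Hypothetical subcommand forms; verify against the OneFS CLI reference
isi nfs aliases create /q4 /ifs/data/finance/accounting/winter2013
isi nfs aliases list --check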
OneFS writes log messages associated with NFS events to the following log files:
nfs.log, rpc_lockd.log, rpc_statd.log, and isi_netgroup_d.log.
FTP
OneFS includes a secure FTP service called vsftpd, which stands for Very Secure FTP
Daemon, that you can configure for standard FTP and FTPS file transfers.
It is recommended that you configure ACL and UNIX permissions only if you fully
understand how they interact with one another.
Note
It is recommended that you keep write caching enabled. You should also enable write
caching for all file pool policies.
OneFS interprets writes to the cluster as either synchronous or asynchronous, depending
on a client's specifications. The impacts and risks of write caching depend on what
protocols clients use to write to the cluster, and whether the writes are interpreted as
synchronous or asynchronous. If you disable write caching, client specifications are
ignored and all writes are performed synchronously.
The impact and risk of write caching depend on the protocol and on whether writes are
interpreted as synchronous or asynchronous:

NFS
If a node fails, no data will be lost except in the unlikely event that a client of that
node also crashes before it can reconnect to the cluster. In that situation,
asynchronous writes that have not been committed to disk will be lost.

SMB
If a node fails, asynchronous writes that have not been committed to disk will be lost.
It is recommended that you do not disable write caching, regardless of the protocol that
you are writing with. If you are writing to the cluster with asynchronous writes, and you
decide that the risks of data loss are too great, it is recommended that you configure your
clients to use synchronous writes, rather than disable write caching.
Modifying the advanced settings could result in operational failures. Be aware of the
potential consequences before committing changes to these settings.
Procedure
1. Click Protocols > Windows Sharing (SMB) > SMB Server Settings.
2. From the SMB service setting, select Enabled.
3. From the Snapshot Directory Settings box, choose the system default or a custom
configuration for the following settings:
Visible at root
Accessible at root
Visible in subdirectories
Accessible in subdirectories
4. Click Save.
If you modify the default settings, the changes are applied to all existing shares in the
access zone unless the setting was configured at the SMB share level.
Procedure
1. Click Protocols > Windows Sharing (SMB) > SMB Default Share Settings.
2. From the Current Access Zones drop-down list, select the access zone that the default
settings will apply to.
3. From the File and Directory Permissions Settings box, choose the system default or a
custom configuration for the following settings:
Create Permissions
4. From the Performance Settings box, choose the system default or a custom
configuration for the following settings:
Change Notify
Oplocks
5. From the Security Settings box, choose the system default or a custom configuration
for the following settings:
Impersonate Guest
Impersonate User
NTFS ACL
6. Click Save.
The following command disables SMB Multichannel on the EMC Isilon cluster:
isi smb settings global modify --support-multichannel=no
These settings affect the behavior of the SMB service. Changes to these settings can
affect all current and future SMB shares.
The snapshot directory settings are Visible at Root, Accessible at Root, Visible in
Subdirectories, and Accessible in Subdirectories.
Changes that are made directly to an SMB share override the default settings configured
from the Default SMB Share Settings tab.
If the mask and mode bits match the default values, a green check mark next to a setting
appears, indicating that the specified read (R), write (W), or execute (X) permission is
enabled at the user, group, or "other" level. The "other" level includes all users who are
not listed as the owner of the share, and are not part of the group level that the file
belongs to.
Create Permissions
Sets the default source permissions to apply when a file or directory is
created. The default value is Default ACL.
Create Mask (Dir)
Specifies UNIX mode bits that are removed when a directory is created,
restricting permissions. Mask bits are applied before mode bits are applied.
Create Mode (Dir)
Specifies UNIX mode bits that are added when a directory is created,
enabling permissions. Mode bits are applied after mask bits are applied.
Create Mask (File)
Specifies UNIX mode bits that are removed when a file is created, restricting
permissions. Mask bits are applied before mode bits are applied.
Create Mode (File)
Specifies UNIX mode bits that are added when a file is created, enabling
permissions. Mode bits are applied after mask bits are applied.
Changes that are made directly to an SMB share override the default settings configured
from the Default SMB Share Settings tab.
Change Notify
Configures notification of clients when files or directories change. This helps
prevent clients from seeing stale content, but requires server resources. The
default value is Norecurse.
Oplocks
Indicates whether an opportunistic lock (oplock) request is allowed. The
default value is Yes.
Changes that are made directly to an SMB share override the default settings configured
from the Default SMB Share Settings tab.
Setting
Setting value
Impersonate Guest Determines guest access to a share. The default value is Never.
Impersonate User
Allows all file access to be performed as a specific user. This must be a fully
qualified user name. The default value is No value.
NTFS ACL
Allows ACLs to be stored and edited from SMB clients. The default value is
Yes.
It is recommended that you configure advanced SMB share settings only if you have a
solid understanding of the SMB protocol.
You can use expansion variables in the share path to automatically create and redirect
users to their own home directories.
Before you begin
You must specify a path to use as the SMB share; create the directory before you create
an SMB share. Shares are specific to access zones and the share path must exist under
the zone path. Create access zones before you create SMB shares.
Procedure
1. Click Protocols > Windows Sharing (SMB) > SMB Shares.
2. From the Current Access Zones drop-down list, select the access zone the share will
belong to.
3. Click Add a share.
4. In the Share Name field, type a name for the share.
Share names can contain up to 80 characters, and can only contain alphanumeric
characters, hyphens, and spaces. Also, if the cluster character encoding is not set to
UTF-8, SMB share names are case-sensitive.
5. Optional: In the Description field, type a comment about the share.
A description is optional, but can be helpful if you are managing multiple shares. This
field is limited to 255 characters.
6. In the Directory to be Shared field, type the full path of the share, beginning with /
ifs, or click Browse to locate the share.
Note
You can specify one or more of the following variables in the directory path but you
must select the Allow Variable Expansion check box or the string is interpreted
literally by the system.
The available variables are %D, %U, %Z, %L, %0, %1, and %2.
For example, if a user is in a domain named DOMAIN and has a username of user_1,
the path /ifs/home/%D/%U expands to /ifs/home/DOMAIN/user_1.
7. Apply the initial ACL settings for the directory. You can modify these settings later.
To apply a default ACL to the shared directory, click Apply Windows default ACLs.
Note
If the Auto-Create Directories setting is enabled, OneFS creates an ACL with the
equivalent of UNIX 700 mode bit permissions for any directory that is created
automatically.
To maintain the existing permissions on the shared directory, click Do not change
existing permissions.
To expand path variables such as %U in the share directory path, select Allow
Variable Expansion.
To automatically create home directories when users access the share for the first
time, select Auto-Create Directories. This option is available only if the Allow
Variable Expansion option is enabled.
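An equivalent share can also be created from the command line. This sketch reuses the
exact options shown in the home-directories chapter later in this guide; HOMEDIR and
the path are illustrative values:
isi smb shares create HOMEDIR --path=/ifs/home/%U \
--allow-variable-expansion=yes --auto-create-directory=yes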
Any changes made to these settings will only affect the settings for this share. If you need
to make changes to default SMB share values, that can be done from the Default SMB
Share Settings tab.
Procedure
1. Click Protocols > Windows Sharing (SMB) > SMB Shares.
2. From the Current Access Zones drop-down list, select the access zone that contains
the share you want to modify.
3. From the list of SMB shares, locate the share you want to modify and then click View
details.
4. For each setting that you want to modify, click Edit, make the change, and then click
Save.
5. To modify the settings for file and directory permissions, performance, or security,
click Advanced SMB Share Settings.
6. Click Save.
You can delete all of the shares on the cluster by selecting the Name/Path option, and
then selecting Delete from the drop-down menu.
4. Click Delete.
5. In the confirmation dialog box, click Delete to confirm the deletion.
5. Click Add a User or Group. Then select the option you want to search for.
Users
Groups
Well-known SIDs
6. If you selected User or Group, you can locate the user or group through one of the
following methods:
Type the Username or Group Name you want to search for in the text field, and
then click Search.
Select the authentication provider you want to search in the text field, and then
click Search. Only providers that are currently configured and enabled on the
cluster are listed.
Type the Username or Group Name and select an authentication provider and click
Search.
NFSv3
NFSv4
3. Select the Requested Lock Protection Level setting from the drop-down list.
Because the NFS service is distributed across all nodes on the cluster, you can select
the number of node failures that would be tolerated and still keep the service running.
The default of [+2] is the optimal value. It is recommended that you not change it.
4. Click the Reload button for the Cached Export Configuration setting.
The cached NFS export settings are reloaded to ensure that changes to DNS or NIS are
applied.
5. Click Save.
6. Click Close.
Results
With these settings, regardless of the users' credentials on the NFS client, they would not
be able to gain root privileges on the NFS server.
NFSv3
Enables or disables the NFSv3 protocol.
NFSv4
Enables or disables the NFSv4 protocol.
Requested Lock Protection Level
Determines the number of node failures that can happen before a lock
might be lost. The default value is [+2] Tolerate failure of 2
nodes.
Cached Export Configuration
Enables you to reload cached NFS exports to ensure that any domain or
network changes take effect immediately.
It is recommended that you modify the default export to limit access only to trusted
clients, or to restrict access completely. To ensure that sensitive data is not
compromised, other exports that you create should be lower in the OneFS file hierarchy,
and can be protected by access zones or limited to specific clients with either root,
read-write, or read-only access, as appropriate.
Note
If no clients are specified, all clients on your network are allowed to access the export.
If you add the same client to more than one list and the client is entered in the same
format for each entry, the client is normalized to a single list in the following order of
priority: Root Clients, Always Read-Write Clients, Always Read-Only
Clients, Clients.
You can add clients to the following lists: Clients, Always Read-Write Clients,
Always Read-Only Clients, and Root Clients.
5. For the Directory Paths setting, type or browse to the directory that you want to export.
You can add multiple directory paths by clicking Add another directory path for each
additional path.
6. Specify export permissions:
UNIX (system)
Kerberos5
Kerberos5 Integrity
Kerberos5 Privacy
Results
The new NFS export is created and shown at the top of the NFS Exports list.
You can delete all the exports on a cluster at once. Click the Export ID/Path check box at
the top of the NFS Exports list, and then select Delete from the drop-down list to the right.
Procedure
1. Select Protocols > UNIX Sharing (NFS) > NFS Exports.
2. In the NFS Exports list, click the check box to the left of the export that you want to
delete.
3. Click Delete.
4. In the confirmation dialog box, click Delete to confirm the operation.
In the following example output, export 1 contains a directory path that does not
currently exist:
ID   Message
-----------------------------------
1    '/ifs/test' does not exist
-----------------------------------
Total: 1
Changes to default export settings affect all current and future NFS exports that use
default settings, and, if specified incorrectly, could impact the availability of the NFS file
sharing service. It is recommended that you not make changes to default settings,
particularly advanced settings, unless you have experience working with NFS. Instead,
you should change settings as needed for individual NFS exports as you create them.
Procedure
1. Select Protocols > UNIX Sharing (NFS) > Export Settings.
Four common NFS export settings are displayed: Map Root User, Map Non Root User,
Map Failure, and Security Types. In addition, a link to advanced default export
settings is provided.
2. Modify any of the default settings that you want to apply to all new NFS exports, or to
existing exports that use any of the default values.
3. When you are finished modifying settings, click Save.
Default values
The default settings result in a root squashing rule whereby no user on the NFS
client, even a root user, can gain root privileges on the NFS server.
Block Size
The block size used to calculate block counts for NFSv3 FSSTAT and
NFSv4 GETATTR requests. The default value is 8192 bytes.
Commit Asynchronous
Directory Transfer Size
The preferred directory read transfer size reported to NFSv3 and NFSv4
clients. The default value is 131072 bytes.
Read Transfer Max Size
The maximum read transfer size reported to NFSv3 and NFSv4 clients.
The default value is 1048576 bytes.
Read Transfer Multiple
The preferred read transfer size reported to NFSv3 and NFSv4 clients.
The default value is 1048576 bytes.
Setattr Asynchronous
Write Datasync Reply
The reply to send for DATASYNC writes. The default value is DATASYNC.
Write Filesync Reply
The reply to send for FILESYNC writes. The default value is FILESYNC.
Write Transfer Max Size
The maximum write transfer size reported to NFSv3 and NFSv4 clients.
The default value is 1048576 bytes.
Write Transfer Multiple
The preferred write transfer size reported to NFSv3 and NFSv4 clients.
The default value is 524288 bytes.
Write Unstable Reply
The reply to send for UNSTABLE writes. The default value is UNSTABLE.
Max File Size
Specifies the maximum file size to allow. This setting is advisory in nature
and is returned to the client in a reply to an NFSv3 FSINFO or NFSv4 GETATTR
request. The default value is 9223372036854776000 bytes.
Readdirplus Enable
Enables the use of the NFSv3 readdirplus service, whereby a client can send a
request and receive extended information about the directory and files in
the export. The default is Yes.
Return 32 bit File IDs
Specifies whether to return 32-bit file IDs to the client. The default is No.
Can Set Time
When this setting is enabled, OneFS allows the NFS client to set various time
attributes on the NFS server. The default value is Yes.
Encoding
Overrides the general encoding settings the cluster has for the export. The
default value is DEFAULT.
Map Lookup UID
Looks up incoming user identifiers (UIDs) in the local authentication database.
The default value is No.
Symlinks
Informs the NFS client that the file system supports symbolic link file types. The
default value is Yes.
Time Delta
Sets the server clock granularity. The default value is 1e-9 seconds
(0.000000001 second).
Procedure
1. Select Protocols > UNIX Sharing (NFS).
The UNIX Sharing (NFS) page is displayed with four tabs across the top.
2. Click NFS Aliases.
The NFS Aliases list appears, displaying all aliases created for the current access
zone.
3. Next to the alias that you intend to delete, select More > Delete Alias.
A Confirm Delete dialog box appears.
4. Click Delete.
The alias is removed from the NFS Aliases list.
Procedure
1. Click Protocols > FTP Settings.
2. Click Enable.
3. Select one or more of the following settings:
Server-to-server transfers
Anonymous access
Local access
4. Click Submit.
Enable HTTP
Disable HTTP entirely
Closes the HTTP port used for file access. Users can continue
to access the web administration interface by specifying the
port number in the URL. The default port is 8080.
3. In the Document root directory field, type or click Browse to navigate to an existing
directory in /ifs, or click File System Explorer to create a new directory and set its
permissions.
Note
The HTTP server runs as the daemon user and group. To properly enforce access
controls, you must grant the daemon user or group read access to all files under the
document root, and allow the HTTP server to traverse the document root.
4. In the Server hostname field, type the HTTP server name. The server hostname must
be a fully-qualified, SmartConnect zone name and valid DNS name. The name must
begin with a letter and contain only letters, numbers, and hyphens (-).
5. In the Administrator email address field, type an email address to display as the
primary contact for issues that occur while serving files.
6. From the Active Directory Authentication list, select an authentication setting:
Option
Description
Off
Integrated Authentication
Only
7. Click the Enable DAV check box. This allows multiple users to manage and modify files
collaboratively across remote web servers.
8. Click the Disable access logging check box.
9. Click Submit.
CHAPTER 9
Home directories
Share permissions are checked when files are accessed, before the underlying file
system permissions are checked. Either of these permissions can prevent access to the
file or directory.
Home directory share paths must begin with /ifs/ and must be in the root path of the
access zone in which the home directory SMB share is created.
In the following commands, the --allow-variable-expansion option is enabled to
indicate that %U should be expanded to the user name, which is user411 in this
example. The --auto-create-directory option is enabled to create the directory if
it does not exist:
isi smb shares create HOMEDIR --path=/ifs/home/%U \
--allow-variable-expansion=yes --auto-create-directory=yes
isi smb shares permission modify HOMEDIR --wellknown Everyone \
--permission-type allow --permission full
isi smb shares view HOMEDIR
When user411 connects to the share with the net use command, the user's home
directory is created at /ifs/home/user411. On user411's Windows client, the net
use m: command connects /ifs/home/user411 through the HOMEDIR share:
net use m: \\cluster.company.com\HOMEDIR /u:user411
Procedure
1. Run the following commands on the cluster with the --allow-variable-expansion
option enabled. The %U expansion variable expands to the user name, and the
--auto-create-directory option is enabled to create the directory if it
does not exist:
isi smb shares create HOMEDIR --path=/ifs/home/%U \
--allow-variable-expansion=yes --auto-create-directory=yes
isi smb shares permission modify HOMEDIR --wellknown Everyone \
--permission-type allow --permission full
If user411 connects to the share with the net use command, user411's home
directory is created at /ifs/home/user411. On user411's Windows client, the net
use m: command connects /ifs/home/user411 through the HOMEDIR share,
mapping the connection similar to the following example:
net use m: \\cluster.company.com\HOMEDIR /u:user411
2. Run a net use command, similar to the following example, on a Windows client to
map the home directory for user411:
net use q: \\cluster.company.com\HOMEDIR_ACL /u:user411
3. Run a command similar to the following example on the cluster to view the inherited
ACL permissions for the user411 share:
cd /ifs/home/user411
ls -lde .
If another SMB share exists that matches the user's name, the user connects to the
explicitly named share rather than to the %U share.
Procedure
1. Run the following command to create a share that matches the authenticated user
login name when the user connects to the share:
isi smb shares create %U /ifs/home/%U \
--allow-variable-expansion=yes --auto-create-directory=yes \
--zone=System
After running this command, user Zachary will see a share named 'zachary' rather
than '%U', and when Zachary tries to connect to the share named 'zachary', he will be
directed to /ifs/home/zachary. On a Windows client, if Zachary runs the
following commands, he sees the contents of his /ifs/home/zachary directory:
net use m: \\cluster.ip\zachary /u:zachary
cd m:
dir
Similarly, if user Claudia runs the following commands on a Windows client, she sees
the directory contents of /ifs/home/claudia:
net use m: \\cluster.ip\claudia /u:claudia
cd m:
dir
Zachary and Claudia cannot access one another's home directory because only the
share 'zachary' exists for Zachary and only the share 'claudia' exists for Claudia.
2. Run the following command to set the default login shell for all Active Directory users
in your domain to /bin/bash:
isi auth ads modify YOUR.DOMAIN.NAME.COM --login-shell /bin/bash
The system displays output similar to the following example:
Name: System
Path: /ifs
Home Directory Umask: 0077
Skeleton Directory: /usr/share/skel
Audit Success: create, delete, rename, set_security, close
Home directories
In the command result, you can see that the default Home Directory Umask setting is 0077, which yields created home directory permissions of 0700 (equivalent to 0755 & ~(0077)). You can modify the Home Directory Umask setting for a zone with the --home-directory-umask option, specifying an octal number as the umask value. This value indicates the permissions that are to be disabled, so larger mask values indicate fewer permissions. For example, a umask value of 000 or 022 yields created home directory permissions of 0755, whereas a umask value of 077 yields created home directory permissions of 0700.
2. Run a command similar to the following example to allow group/others read/execute permissions in a home directory:
isi zone zones modify System --home-directory-umask=022
In this example, user home directories will be created with mode bits 0755 masked by the umask value of 022, yielding permissions of 0755 (equivalent to 0755 & ~(022)).
2. Run the isi auth ads modify command with the --home-directory-template and --create-home-directory options.
isi auth ads modify YOUR.DOMAIN.NAME.COM \
--home-directory-template=/ifs/home/ADS/%D/%U \
--create-home-directory=yes
3. Run the isi auth ads view command with the --verbose option.
The system displays output similar to the following example:
Name: YOUR.DOMAIN.NAME.COM
NetBIOS Domain: YOUR
...
Create Home Directory: Yes
Set SSH/FTP home directory creation options
5. Optional: To verify this information, run the ssh command from an external UNIX node.
For example, the following command would create /ifs/home/ADS/<your-domain>/user_100 if it did not previously exist:
ssh <your-domain>\\user_100@cluster.company.com
2. Run the isi zone zones modify command to modify the default skeleton
directory.
The following command sets the skeleton directory to /usr/share/skel2 in the System access zone:
isi zone zones modify System --skeleton-directory=/usr/share/skel2
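To confirm the change, you can view the zone's current settings with a command similar to the following example; the Skeleton Directory field in the output should show the new path:
isi zone zones view System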
The following list shows the default home directory settings for each authentication provider. The home directory entries are the provider's default option values, and home directory creation indicates whether home directories are created at login by default.

Local
l Home directory: --home-directory-template=/ifs/home/%U, --create-home-directory=yes, --login-shell=/bin/sh
l Home directory creation: Enabled
l UNIX login shell: /bin/sh

File
l Home directory: --home-directory-template="", --create-home-directory=no
l Home directory creation: Disabled
l UNIX login shell: None

Active Directory
l Home directory: --home-directory-template=/ifs/home/%D/%U, --create-home-directory=no, --login-shell=/bin/sh
l Home directory creation: Disabled
l UNIX login shell: /bin/sh
Note
If available, provider information overrides this value.

LDAP
l Home directory: --home-directory-template="", --create-home-directory=no
l Home directory creation: Disabled
l UNIX login shell: None

NIS
l Home directory: --home-directory-template="", --create-home-directory=no
l Home directory creation: Disabled
l UNIX login shell: None
When you create an SMB share through the web administration interface, you must select
the Allow Variable Expansion check box or the string is interpreted literally by the system.
The following expansion variables are valid in a home directory path:

Variable  Description

%U        Expands to the user name (for example, user001). This variable is typically included at the end of the path. For example, for a user named user1, the path /ifs/home/%U is mapped to /ifs/home/user1.

%D        Expands to the NetBIOS domain name (for example, YORK for the YORK.EAST.EXAMPLE.COM domain).

%Z        Expands to the name of the access zone (for example, ZoneABC).

%L        Expands to the host name of the cluster, normalized to lowercase.

%0        Expands to the first character of the user name.

%1        Expands to the second character of the user name.

%2        Expands to the third character of the user name.
Note
If the user name includes fewer than three characters, the %0, %1, and %2 variables wrap around. For example, for a user named ab, the variables map to a, b, and a, respectively. For a user named a, all three variables map to a.
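For example, you could combine these variables to spread home directories across subdirectories keyed by the first letter of the user name. The following command is a hypothetical sketch of such a layout; adjust the share name and path for your environment:
isi smb shares create HOMEDIR --path=/ifs/home/%0/%U \
--allow-variable-expansion=yes --auto-create-directory=yes
With this template, the home directory for user411 would be created at /ifs/home/u/user411.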
CHAPTER 10
Snapshots
Snapshots overview
A OneFS snapshot is a logical pointer to data that is stored on a cluster at a specific point
in time.
A snapshot references a directory on a cluster, including all data stored in the directory
and its subdirectories. If the data referenced by a snapshot is modified, the snapshot
stores a physical copy of the data that was modified. Snapshots are created according to
user specifications or are automatically generated by OneFS to facilitate system
operations.
To create and manage snapshots, you must activate a SnapshotIQ license on the cluster.
Some applications must generate snapshots to function but do not require you to
activate a SnapshotIQ license; by default, these snapshots are automatically deleted
when OneFS no longer needs them. However, if you activate a SnapshotIQ license, you
can retain these snapshots. You can view snapshots generated by other modules without
activating a SnapshotIQ license.
You can identify and locate snapshots by name or ID. A snapshot name is specified by a
user and assigned to the virtual directory that contains the snapshot. A snapshot ID is a
numerical identifier that OneFS automatically assigns to a snapshot.
To reduce disk-space usage, snapshots that reference the same directory reference each
other, with older snapshots referencing newer snapshots. If a file is deleted, and several
snapshots reference the file, a single snapshot stores a copy of the file, and the other
snapshots reference the file from the snapshot that stored the copy. The reported size of
a snapshot reflects only the amount of data stored by the snapshot and does not include
the amount of data referenced by the snapshot.
Because snapshots do not consume a set amount of storage space, there is no available-space requirement for creating a snapshot. The size of a snapshot grows according to
how the data referenced by the snapshot is modified. A cluster cannot contain more than
20,000 snapshots.
Snapshot schedules
You can automatically generate snapshots according to a snapshot schedule.
With snapshot schedules, you can periodically generate snapshots of a directory without
having to manually create a snapshot every time. You can also assign an expiration
period that determines when SnapshotIQ deletes each automatically generated
snapshot.
Snapshot aliases
A snapshot alias is a logical pointer to a snapshot. If you specify an alias for a snapshot
schedule, the alias will always point to the most recent snapshot generated by that
schedule. Assigning a snapshot alias allows you to quickly identify and access the most
recent snapshot generated according to a snapshot schedule.
If you allow clients to access snapshots through an alias, you can reassign the alias to
redirect clients to other snapshots. In addition to assigning snapshot aliases to
snapshots, you can also assign snapshot aliases to the live version of the file system.
This can be useful if clients are accessing snapshots through a snapshot alias, and you
want to redirect the clients to the live version of the file system.
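For example, a command similar to the following example assigns the alias latestWeekly to an existing snapshot; the alias and snapshot names are hypothetical, and the exact argument order may vary by OneFS version:
isi snapshot aliases create latestWeekly WeeklySnapshot_07-12-2014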
It is recommended that you do not disable the snapshot delete job. Disabling the
snapshot delete job prevents unused disk space from being freed and can also cause
performance degradation.
The following list shows suggested snapshot schedule configurations for ordered and unordered deletion:

Ordered deletion (for mostly static data)
l Snapshot frequency: Every hour
l Snapshot time: Beginning at 12:00 AM, ending at 11:59 PM
l Snapshot expiration: 1 month
l Max snapshots retained: 720

Unordered deletion (for frequently modified data)
l Snapshot frequency: Every other hour; snapshot time: Beginning at 12:00 AM, ending at 11:59 PM; snapshot expiration: 1 day
l Snapshot frequency: Every day; snapshot time: At 12:00 AM; snapshot expiration: 1 week
l Snapshot frequency: Every week; snapshot time: Saturday at 12:00 AM; snapshot expiration: 1 month
l Snapshot frequency: Every month; snapshot time: The first Saturday of the month at 12:00 AM; snapshot expiration: 3 months
l Max snapshots retained: 27 in total
File clones
SnapshotIQ enables you to create file clones that share blocks with existing files in order
to save space on the cluster. A file clone usually consumes less space and takes less
time to create than a file copy. Although you can clone files from snapshots, clones are
primarily used internally by OneFS.
The blocks that are shared between a clone and cloned file are contained in a hidden file
called a shadow store. Immediately after a clone is created, all data originally contained
in the cloned file is transferred to a shadow store. Because both files reference all blocks
from the shadow store, the two files consume no more space than the original file; the
clone does not take up any additional space on the cluster. However, if the cloned file or
clone is modified, the file and clone will share only blocks that are common to both of
them, and the modified, unshared blocks will occupy additional space on the cluster.
Over time, the shared blocks contained in the shadow store might become useless if
neither the file nor clone references the blocks. The cluster routinely deletes blocks that
are no longer needed. You can force the cluster to delete unused blocks at any time by
running the ShadowStoreDelete job.
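For example, a command similar to the following example starts the job from the command line:
isi job jobs start ShadowStoreDelete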
Clones cannot contain alternate data streams (ADS). If you clone a file that contains
alternate data streams, the clone will not contain the alternate data streams.
Shadow-store considerations
Shadow stores are hidden files that are referenced by cloned and deduplicated files. Files
that reference shadow stores behave differently than other files.
l When files that reference shadow stores are replicated to another Isilon cluster or backed up to a Network Data Management Protocol (NDMP) backup device, the shadow stores are not transferred to the target Isilon cluster or backup device. The files are transferred as if they contained the data that they reference from shadow stores. On the target Isilon cluster or backup device, the files consume the same amount of space as if they had not referenced shadow stores.
l When OneFS creates a shadow store, OneFS assigns the shadow store to a storage pool of a file that references the shadow store. If you delete the storage pool that a shadow store resides on, the shadow store is moved to a pool occupied by another file that references the shadow store.
l OneFS does not delete a shadow-store block immediately after the last reference to the block is deleted. Instead, OneFS waits until the ShadowStoreDelete job is run to delete the unreferenced block. If a large number of unreferenced blocks exist on the cluster, OneFS might report a negative deduplication savings until the ShadowStoreDelete job is run.
l A shadow store is protected at least as much as the most protected file that references it. For example, if one file that references a shadow store resides in a storage pool with +2 protection and another file that references the shadow store resides in a storage pool with +3 protection, the shadow store is protected at +3.
l Quotas account for files that reference shadow stores as if the files contained the data referenced from shadow stores; from the perspective of a quota, shadow-store references do not exist. However, if a quota includes data protection overhead, the quota does not account for the data protection overhead of shadow stores.
Snapshot locks
A snapshot lock prevents a snapshot from being deleted. If a snapshot has one or more
locks applied to it, the snapshot cannot be deleted and is referred to as a locked
snapshot. If the duration period of a locked snapshot expires, OneFS will not delete the
snapshot until all locks on the snapshot have been deleted.
OneFS applies snapshot locks to ensure that snapshots generated by OneFS applications
are not deleted prematurely. For this reason, it is recommended that you do not delete
snapshot locks or modify the duration period of snapshot locks.
A limited number of locks can be applied to a snapshot at a time. If you create snapshot
locks, the limit for a snapshot might be reached, and OneFS could be unable to apply a
snapshot lock when necessary. For this reason, it is recommended that you do not create
snapshot locks.
Snapshot reserve
The snapshot reserve enables you to set aside a minimum percentage of the cluster
storage capacity specifically for snapshots. If specified, all other OneFS operations are
unable to access the percentage of cluster capacity that is reserved for snapshots.
Note
The snapshot reserve does not limit the amount of space that snapshots can consume on the cluster. Snapshots can consume a greater percentage of storage capacity than is specified by the snapshot reserve. It is recommended that you do not specify a snapshot reserve.
The following table shows which SnapshotIQ functionality is available depending on whether a SnapshotIQ license is active:

Functionality                     Inactive   Active
Create snapshots and schedules    No         Yes
Configure SnapshotIQ settings     No         Yes
View snapshot schedules           Yes        Yes
Delete snapshots                  Yes        Yes
Access snapshot data              Yes        Yes
View snapshots                    Yes        Yes
If a SnapshotIQ license becomes inactive, you will no longer be able to create new snapshots, all snapshot schedules will be disabled, and you will not be able to modify snapshots or snapshot settings. However, you will still be able to delete snapshots and access data contained in snapshots.
It is recommended that you create SnapRevert domains for directories while the directories are empty. Creating a domain for a directory that contains less data takes less time.
5. In the Directory Path field, specify the directory that you want to be contained in
snapshots that are generated according to this schedule.
6. From the Snapshot Frequency list, select how often you want to generate snapshots
according to the schedule.
Note
A snapshot schedule cannot span multiple days. For example, you cannot specify to
begin generating snapshots at 5:00 PM Monday and end at 5:00 AM Tuesday. To
continuously generate snapshots for a period greater than a day, you must create two
snapshot schedules. For example, to generate snapshots from 5:00 PM Monday to
5:00 AM Tuesday, create one schedule that generates snapshots from 5:00 PM to
11:59 PM on Monday, and another schedule that generates snapshots from 12:00 AM
to 5:00 AM on Tuesday.
7. Optional: To assign an alternative name to the most recent snapshot generated by the
schedule, specify a snapshot alias.
a. Next to Create an Alias, click Yes.
b. To modify the default snapshot alias name, in the Alias Name field, type an
alternative name for the snapshot.
8. Optional: To specify a length of time that snapshots generated according to the
schedule exist on the cluster before they are automatically deleted by OneFS, specify
an expiration period.
a. Next to Snapshot Expiration, click Snapshots expire.
b. Next to Snapshots expire, specify how long you want to retain the snapshots
generated according to the schedule.
9. Click Create.
Create a snapshot
You can create a snapshot of a directory.
Procedure
1. Click Data Protection > SnapshotIQ > Summary.
2. Click Capture a new snapshot.
3. Optional: In the Capture a Snapshot area, in the Snapshot Name field, type a name.
4. In the Directory Path field, specify the directory that you want the snapshot to contain.
5. Optional: To create an alternative name for the snapshot, specify a snapshot alias.
a. Next to Create an Alias, click Yes.
b. To modify the default snapshot alias name, in the Alias Name field, type an
alternative name for the snapshot.
6. Optional: To assign a time that OneFS will automatically delete the snapshot, specify
an expiration period.
a. Next to Snapshot Expiration, click Snapshot Expires on.
b. In the calendar, specify the day that you want the snapshot to be automatically
deleted.
7. Click Capture.
The following variables can be included in a snapshot naming pattern:

Variable       Description

%A             The day of the week that the snapshot was created on.

%a             The abbreviated day of the week. For example, if the snapshot is generated on a Sunday, %a is replaced with Sun.

%B             The name of the month that the snapshot was created in.

%b             The abbreviated name of the month. For example, if the snapshot is created in June, %b is replaced with Jun.

%C             The first two digits of the year. For example, if the snapshot is created in 2014, %C is replaced with 20.

%c             The time and day that the snapshot was created. This variable is equivalent to specifying %a %b %e %T %Y.

%d             The two-digit day of the month.

%e             The day of the month. Single-digit days are preceded by a blank space.

%F             The date that the snapshot was created. This variable is equivalent to specifying %Y-%m-%d.

%G             The year that the snapshot was created in, based on weeks that begin on Monday. This variable is equivalent to specifying %Y, unless the snapshot was created in a week that belongs mostly to the previous year.

%g             The last two digits of the year, based on weeks that begin on Monday. This variable is equivalent to specifying %y, unless the snapshot was created in a week that belongs mostly to the previous year.

%H             The hour. The hour is represented on the 24-hour clock. Single-digit hours are preceded by a zero. For example, if a snapshot is created at 1:45 AM, %H is replaced with 01.

%h             The abbreviated name of the month. This variable is equivalent to specifying %b.

%I             The hour represented on the 12-hour clock. Single-digit hours are preceded by a zero. For example, if a snapshot is created at 1:45 PM, %I is replaced with 01.

%j             The numeric day of the year.

%k             The hour represented on the 24-hour clock. Single-digit hours are preceded by a blank space.

%l             The hour represented on the 12-hour clock. Single-digit hours are preceded by a blank space. For example, if a snapshot is created at 1:45 AM, %l is replaced with 1.

%M             The two-digit minute.

%m             The two-digit month.

%p             AM or PM.

%{PolicyName}  The name of the replication policy that the snapshot was created for. This variable is valid only if you are specifying a snapshot naming pattern for a replication policy.

%R             The time. This variable is equivalent to specifying %H:%M.

%r             The time on the 12-hour clock. This variable is equivalent to specifying %I:%M:%S %p.

%S             The two-digit second.

%s             The second in POSIX time (the number of seconds since January 1, 1970, UTC).

%{SrcCluster}  The name of the source cluster of the replication policy that the snapshot was created for. This variable is valid only if you are specifying a snapshot naming pattern for a replication policy.

%T             The time. This variable is equivalent to specifying %H:%M:%S.

%U             The two-digit numerical week of the year. Numbers range from 00 to 53. The first day of the week is calculated as Sunday.

%u             The numerical day of the week. Numbers range from 1 to 7. The first day of the week is calculated as Monday. For example, if a snapshot is created on Sunday, %u is replaced with 7.

%V             The two-digit numerical week of the year that the snapshot was created in. Numbers range from 01 to 53. The first day of the week is calculated as Monday. If the week of January 1 is four or more days in length, then that week is counted as the first week of the year.

%v             The day that the snapshot was created. This variable is equivalent to specifying %e-%b-%Y.

%W             The two-digit numerical week of the year that the snapshot was created in. Numbers range from 00 to 53. The first day of the week is calculated as Monday.

%w             The numerical day of the week that the snapshot was created on. Numbers range from 0 to 6. The first day of the week is calculated as Sunday. For example, if the snapshot was created on Sunday, %w is replaced with 0.

%X             The time that the snapshot was created. This variable is equivalent to specifying %H:%M:%S.

%Y             The year that the snapshot was created in.

%y             The last two digits of the year that the snapshot was created in. For example, if the snapshot was created in 2014, %y is replaced with 14.

%Z             The time zone that the snapshot was created in.

%z             The offset from coordinated universal time (UTC) of the time zone that the snapshot was created in. If preceded by a plus sign, the time zone is east of UTC. If preceded by a minus sign, the time zone is west of UTC.

%+             The time and date that the snapshot was created. This variable is equivalent to specifying %a %b %e %X %Z %Y.

%%             Escapes a percent sign. For example, 100%% is replaced with 100%.
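For example, a command similar to the following example creates a snapshot schedule that applies a naming pattern; the schedule name, directory, and recurrence are illustrative, and the argument order may vary by OneFS version:
isi snapshot schedules create hourly /ifs/data \
HourlyBackup_%m-%d-%Y_%H:%M "Every day every hour" --duration 1M
In this example, each snapshot is named with the month, day, year, hour, and minute at which it was created, and expires after one month.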
Managing snapshots
You can delete and view snapshots. You can also modify the name, duration period, and
alias of an existing snapshot. However, you cannot modify the data contained in a
snapshot; the data contained in a snapshot is read-only.
If multiple snapshots contain the same directories, deleting older snapshots is more likely to free disk space than deleting newer snapshots.
Snapshots that are assigned expiration dates are automatically marked for deletion by
the snapshot daemon. If the daemon is disabled, snapshots will not be automatically
deleted by the system. It is recommended that you do not disable the snapshot daemon.
Delete snapshots
You can delete a snapshot if you no longer want to access the data contained in the
snapshot.
OneFS frees disk space occupied by deleted snapshots when the SnapshotDelete job is
run. Also, if you delete a snapshot that contains clones or cloned files, data in a shadow
store might no longer be referenced by files on the cluster; OneFS deletes unreferenced
data in a shadow store when the ShadowStoreDelete job is run. OneFS routinely runs both the SnapshotDelete and ShadowStoreDelete jobs. However, you can also manually run the jobs at any time.
Procedure
1. Click Data Protection > SnapshotIQ > Snapshots.
2. Specify the snapshots that you want to delete.
a. For each snapshot you want to delete, in the Saved File System Snapshots table,
in the row of a snapshot, select the check box.
b. From the Select an action list, select Delete.
c. In the confirmation dialog box, click Delete.
3. Optional: To increase the speed at which deleted snapshot data is freed on the
cluster, run the SnapshotDelete job.
a. Click Cluster Management > Job Operations > Job Types.
b. In the Job Types area, in the SnapshotDelete row, from the Actions column, select
Start Job.
c. Click Start.
4. Optional: To increase the speed at which deleted data shared between deduplicated
and cloned files is freed on the cluster, run the ShadowStoreDelete job.
Run the ShadowStoreDelete job only after you run the SnapshotDelete job.
a. Click Cluster Management > Job Operations > Job Types.
b. In the Job Types area, in the ShadowStoreDelete row, from the Actions column,
select Start Job.
c. Click Start.
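Alternatively, you can delete a snapshot from the command line with a command similar to the following example, where Snapshot2014Jun04 is a hypothetical snapshot name:
isi snapshot snapshots delete Snapshot2014Jun04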
View snapshots
You can view snapshots.
Procedure
1. Click Data Protection > SnapshotIQ > Snapshots.
The 50 most recently generated snapshots appear in the Saved File System
Snapshots table.
2. Optional: To view additional snapshots, at the bottom of the Saved File System
Snapshots table, click Show 50 more.
Snapshot information
You can view information about snapshots, including the total amount of space
consumed by all snapshots.
The following information is displayed in the Saved Snapshots area:
SnapshotIQ Status
Indicates whether a SnapshotIQ license has been activated on the cluster.
Total Number of Saved Snapshots
Indicates the total number of snapshots that exist on the cluster.
Total Number of Snapshots Pending Deletion
Indicates the total number of snapshots that were deleted on the cluster since the
last snapshot delete job was run. The space consumed by the deleted snapshots is
not freed until the snapshot delete job is run again.
Total Number of Snapshot Aliases
Indicates the total number of snapshot aliases that exist on the cluster.
Capacity Used by Saved Snapshots
Indicates the total amount of space consumed by all snapshots.
Revert a snapshot
You can revert a directory back to the state it was in when a snapshot was taken. Before
OneFS reverts a snapshot, OneFS generates a snapshot of the directory being reverted, so
that data stored in the directory is not lost. OneFS does not delete a snapshot after
reverting it.
Before you begin
l Create a SnapRevert domain for the directory that you want to revert.
Procedure
1. Click Cluster Management > Job Operations > Job Types.
2. In the Job Types area, in the SnapRevert row, from the Actions column, select Start
Job.
3. Optional: To specify a priority for the job, from the Priority list, select a priority.
Lower values indicate a higher priority. If you do not specify a priority, the job is
assigned the default snapshot revert priority.
4. Optional: To specify the amount of cluster resources the job is allowed to consume,
from the Impact policy list, select an impact policy.
If you do not specify a policy, the job is assigned the default snapshot revert policy.
5. In the Snapshot ID to revert field, type the name or ID of the snapshot that you want to
revert, and then click Start.
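Alternatively, you can start the job from the command line with a command similar to the following example, where 46 is a hypothetical snapshot ID:
isi job jobs start snaprevert --snapid 46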
You can access up to 64 snapshots of a directory through Windows Explorer, starting with
the most recent snapshot. To access more than 64 snapshots for a directory, access the
cluster through a UNIX command line.
Procedure
1. In Windows Explorer, navigate to the directory that you want to restore or the directory
that contains the file that you want to restore.
2. Right-click the folder, and then click Properties.
3. In the Properties window, click the Previous Versions tab.
4. Select the version of the folder that you want to restore or the version of the folder
that contains the version of the file that you want to restore.
5. Restore the version of the file or directory.
l To copy the selected directory to another location, click Copy, and then specify a location to copy the directory to.
l To restore a specific file, click Open, and then copy the file into the original directory, replacing the existing copy with the snapshot version.
3. Clone a file from the snapshot by running the cp command with the -c option.
For example, the following command clones test.txt from Snapshot2014Jun04:
cp -c /ifs/.snapshot/Snapshot2014Jun04/archive/test.txt \
/ifs/archive/test_clone.txt
If a snapshot alias references the live version of the file system, the Target ID is
-1.
2. Optional: View information about a specific snapshot by running the isi snapshot
aliases view command.
The following command displays information about latestWeekly:
isi snapshot aliases view latestWeekly
It is recommended that you do not create, delete, or modify snapshot locks unless you are instructed to do so by Isilon Technical Support.
Deleting a snapshot lock that was created by OneFS might result in data loss. If you
delete a snapshot lock that was created by OneFS, it is possible that the corresponding
snapshot might be deleted while it is still in use by OneFS. If OneFS cannot access a
snapshot that is necessary for an operation, the operation will malfunction and data loss
might result. Modifying the expiration date of a snapshot lock created by OneFS can also
result in data loss because the corresponding snapshot can be deleted prematurely.
It is recommended that you do not modify the expiration dates of snapshot locks.
Procedure
1. Open a secure shell (SSH) connection to any node in the cluster and log in.
2. Run the isi snapshot locks modify command.
The following command sets an expiration date two days from the present date for a
snapshot lock with an ID of 1 that is applied to a snapshot named
SnapshotApril2014:
isi snapshot locks modify SnapshotApril2014 1 --expires 2D
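If you do not know the ID of a snapshot lock, you can list the locks applied to a snapshot with a command similar to the following example:
isi snapshot locks list SnapshotApril2014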
For example, the following command deletes a snapshot lock that is applied to SnapshotApril2014 and has a lock ID of 1:
isi snapshot locks delete SnapshotApril2014 1
The system prompts you to confirm that you want to delete the snapshot lock.
3. Type yes and then press ENTER.
SnapshotIQ settings
SnapshotIQ settings determine how snapshots behave and can be accessed.
The following SnapshotIQ settings can be configured:
Snapshot Scheduling
Determines whether snapshots can be generated.
CHAPTER 11
Deduplication with SmartDedupe
Deduplication overview
The SmartDedupe software module enables you to save storage space on your cluster by
reducing redundant data. Deduplication maximizes the efficiency of your cluster by
decreasing the amount of storage required to store multiple files with similar blocks.
SmartDedupe deduplicates data by scanning an Isilon cluster for identical data blocks.
Each block is 8 KB. If SmartDedupe finds duplicate blocks, SmartDedupe moves a single
copy of the blocks to a hidden file called a shadow store. SmartDedupe then deletes the
duplicate blocks from the original files and replaces the blocks with pointers to the
shadow store.
Deduplication is applied at the directory level, targeting all files and directories
underneath one or more root directories. You can first assess a directory for
deduplication and determine the estimated amount of space you can expect to save. You
can then decide whether to deduplicate the directory. After you begin deduplicating a
directory, you can monitor how much space is saved by deduplication in real time.
SmartDedupe does not deduplicate files that are 32 KB and smaller, because doing so
would consume more cluster resources than the storage savings are worth. Each shadow
store can contain up to 255 blocks. Each block in a shadow store can be referenced
32000 times.
Deduplication jobs
Deduplication is performed by maintenance jobs referred to as deduplication jobs. You
can monitor and control deduplication jobs as you would any other maintenance job on
the cluster. Although the overall performance impact of deduplication is minimal, the
deduplication job consumes 256 MB of memory per node.
When a deduplication job is first run on a cluster, SmartDedupe samples blocks from
each file and creates index entries for those blocks. If the index entries of two blocks
match, SmartDedupe scans the blocks adjacent to the matching pair and then
deduplicates all duplicate blocks. After a deduplication job samples a file once, new
deduplication jobs will not sample the file again until the file is modified.
The first deduplication job you run might take significantly longer to complete than
subsequent deduplication jobs. The first deduplication job must scan all files under the
specified directories to generate the initial index. If subsequent deduplication jobs take a
long time to complete, this most likely indicates that a large amount of data is being
deduplicated. However, it can also indicate that clients are creating a large amount of
new data on the cluster. If a deduplication job is interrupted during the deduplication
process, the job will automatically restart the scanning process from where the job was
interrupted.
It is recommended that you run deduplication jobs when clients are not modifying data
on the cluster. If clients are continually modifying files on the cluster, the amount of
space saved by deduplication is minimal because the deduplicated blocks are constantly
removed from the shadow store. For most clusters, it is recommended that you start a
deduplication job every ten days.
The permissions required to modify deduplication settings are not the same as those
needed to run a deduplication job. Although a user must have the maintenance job
permission to run a deduplication job, the user must have the deduplication permission
to modify deduplication settings. By default, the deduplication job is configured to run at
a low priority.
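For example, a command similar to the following example starts a deduplication job manually from the command line; in OneFS 7.x releases, the job type is named Dedupe:
isi job jobs start Dedupe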
Deduplication considerations
Deduplication can significantly increase the efficiency at which you store data. However,
the effect of deduplication varies depending on the cluster.
You can reduce redundancy on a cluster by running SmartDedupe. Deduplication creates
links that can impact the speed at which you can read from and write to files. In
particular, sequentially reading chunks smaller than 512 KB of a deduplicated file can be
significantly slower than reading the same small, sequential chunks of a non-deduplicated file. This performance degradation applies only if you are reading non-cached data. For cached data, the performance for deduplicated files is potentially better than for non-deduplicated files. If you stream chunks larger than 512 KB, deduplication does not significantly impact the read performance of the file. If you intend to stream 8 KB or less of each file at a time, and you do not plan to stream the files concurrently, it is recommended that you do not deduplicate the files.
Deduplication is most effective when applied to static or archived files and directories.
The less files are modified, the less negative effect deduplication has on the cluster. For
example, virtual machines often contain several copies of identical files that are rarely
modified. Deduplicating a large number of virtual machines can greatly reduce consumed
storage space.
Shadow-store considerations
Shadow stores are hidden files that are referenced by cloned and deduplicated files. Files
that reference shadow stores behave differently than other files.
Managing deduplication
You can manage deduplication on a cluster by first assessing how much space you can
save by deduplicating individual directories. After you determine which directories are
worth deduplicating, you can specify which directories to deduplicate. You can then monitor the actual amount of disk space you are saving.
Deduplication information
You can view the amount of disk space saved by deduplication in the Deduplication
Savings area:
Space Savings
The total amount of physical disk space saved by deduplication, including
protection overhead and metadata. For example, if you have three identical files that
are all 5 GB, the estimated physical saving would be greater than 10 GB, because
deduplication saved space that would have been occupied by file metadata and
protection overhead.
Deduplicated data
The amount of space on the cluster occupied by directories that were deduplicated.
Other data
The amount of space on the cluster occupied by directories that were not
deduplicated.
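You can also view deduplication savings from the command line with the following command, assuming your OneFS version includes the isi dedupe namespace:
isi dedupe stats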
CHAPTER 12
Data replication with SyncIQ
To prevent permissions errors, make sure that ACL policy settings are the same across
source and target clusters.
You can create two types of replication policies: synchronization policies and copy
policies. A synchronization policy maintains an exact replica of the source directory on
the target cluster. If a file or sub-directory is deleted from the source directory, the file or
directory is deleted from the target cluster when the policy is run again.
You can use synchronization policies to fail over and fail back data between source and
target clusters. When a source cluster becomes unavailable, you can fail over data on a
target cluster and make the data available to clients. When the source cluster becomes
available again, you can fail back the data to the source cluster.
A copy policy maintains recent versions of the files that are stored on the source cluster.
However, files that are deleted on the source cluster are not deleted from the target
cluster. Failback is not supported for copy policies. Copy policies are most commonly
used for archival purposes.
Copy policies enable you to remove files from the source cluster without losing those files
on the target cluster. Deleting files on the source cluster improves performance on the
source cluster while maintaining the deleted files on the target cluster. This can be useful
if, for example, your source cluster is being used for production purposes and your target
cluster is being used only for archiving.
After creating a job for a replication policy, SyncIQ must wait until the job completes
before it can create another job for the policy. Any number of replication jobs can exist on
a cluster at a given time; however, only five replication jobs can run on a source cluster at
the same time. If more than five replication jobs exist on a cluster, the first five jobs run
while the others are queued to run. The number of replication jobs that a single target
cluster can support concurrently is dependent on the number of workers available on the
target cluster.
You can replicate any number of files and directories with a single replication job. You
can prevent a large replication job from overwhelming the system by limiting the amount
of cluster resources and network bandwidth that data synchronization is allowed to
consume. Because each node in a cluster is able to send and receive data, the speed at
which data is replicated increases for larger clusters.
Configuring a policy to start when changes are made to the source directory can be useful under the following conditions:
l You cannot accurately predict when modifications will be made to the data.
For policies that are configured to start whenever changes are made to the source
directory, SyncIQ checks the source directories every ten seconds. SyncIQ does not
account for excluded files or directories when detecting changes, so policies that exclude
files or directories from replication might be run unnecessarily. For example, assume that
newPolicy replicates /ifs/data/media but excludes /ifs/data/media/temp. If a
modification is made to /ifs/data/media/temp/file.txt, SyncIQ will run
newPolicy, but will not replicate /ifs/data/media/temp/file.txt.
If a policy is configured to start whenever changes are made to its source directory, and a
replication job fails, SyncIQ will wait one minute before attempting to run the policy
again. SyncIQ will increase this delay exponentially for each failure up to a maximum
delay of eight hours. You can override the delay by running the policy manually at any
time. After a job for the policy completes successfully, SyncIQ will resume checking the
source directory every ten seconds.
target cluster, the mark persists on the target cluster. When a replication policy is run,
SyncIQ checks the mark to ensure that data is being replicated to the correct location.
On the target cluster, you can manually break an association between a replication policy
and target directory. Breaking the association between a source and target cluster causes
the mark on the target cluster to be deleted. You might want to manually break a target
association if an association is obsolete. If you break the association of a policy, the
policy is disabled on the source cluster and you cannot run the policy. If you want to run
the disabled policy again, you must reset the replication policy.
Note
Breaking a policy association causes either a full or differential replication to occur the
next time you run the replication policy. During a full or differential replication, SyncIQ
creates a new association between the source and target clusters. Depending on the
amount of data being replicated, a full or differential replication can take a very long time
to complete.
You can modify the number of workers per node to increase the speed at which data is replicated to the
target cluster.
You can also reduce resource consumption through file-operation rules that limit the rate
at which replication policies are allowed to send files. However, it is recommended that
you only create file-operation rules if the files you intend to replicate are predictably
similar in size and not especially large.
Replication reports
After a replication job completes, SyncIQ generates a report that contains detailed
information about the job, including how long the job ran, how much data was
transferred, and what errors occurred.
If a replication report is interrupted, SyncIQ might create a subreport about the progress
of the job so far. If the job is then restarted, SyncIQ creates another subreport about the
progress of the job until the job either completes or is interrupted again. SyncIQ creates a
subreport each time the job is interrupted until the job completes successfully. If multiple
subreports are created for a job, SyncIQ combines the information from the subreports
into a single report.
SyncIQ routinely deletes replication reports. You can specify the maximum number of
replication reports that SyncIQ retains and the length of time that SyncIQ retains
replication reports. If the maximum number of replication reports is exceeded on a
cluster, SyncIQ deletes the oldest report each time a new report is created.
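For example, a command similar to the following example configures SyncIQ to retain a maximum of 2000 reports for up to one year; the flag names are assumptions based on the OneFS 7.x command-line interface:
isi sync settings modify --report-max-age 1Y --report-max-count 2000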
You cannot customize the content of a replication report.
Note
If you delete a replication policy, SyncIQ automatically deletes any reports that were
generated for that policy.
Replication snapshots
SyncIQ generates snapshots to facilitate replication, failover, and failback between Isilon
clusters. Snapshots generated by SyncIQ can also be used for archival purposes on the
target cluster.
SyncIQ generates source snapshots to ensure that replication jobs do not transfer
unmodified data. When a job is created for a replication policy, SyncIQ checks whether it
is the first job created for the policy. If it is not the first job created for the policy, SyncIQ
compares the snapshot generated for the earlier job with the snapshot generated for the
new job.
SyncIQ replicates only data that has changed since the last time a snapshot was
generated for the replication policy. When a replication job is completed, SyncIQ deletes
the previous source-cluster snapshot and retains the most recent snapshot until the next
job is run.
Data failover
Data failover is the process of preparing data on a secondary cluster to be modified by
clients. After you fail over to a secondary cluster, you can redirect clients to modify their
data on the secondary cluster.
Before failover is performed, you must create and run a replication policy on the primary
cluster. You initiate the failover process on the secondary cluster. Failover is performed
per replication policy; to migrate data that is spread across multiple replication policies,
you must initiate failover for each replication policy.
You can use any replication policy to fail over. However, if the action of the replication
policy is set to copy, any file that was deleted on the primary cluster will be present on
the secondary cluster. When the client connects to the secondary cluster, all files that
were deleted on the primary cluster will be available to the client.
If you initiate failover for a replication policy while an associated replication job is
running, the failover operation completes but the replication job fails. Because data
might be in an inconsistent state, SyncIQ uses the snapshot generated by the last
successful replication job to revert data on the secondary cluster to the last recovery
point.
If a disaster occurs on the primary cluster, any modifications to data that were made after
the last successful replication job started are not reflected on the secondary cluster.
When a client connects to the secondary cluster, their data appears as it was when the
last successful replication job was started.
Data failback
Data failback is the process of restoring clusters to the roles they occupied before a
failover operation. After data failback is complete, the primary cluster hosts clients and
replicates data to the secondary cluster for backup.
The first step in the failback process is updating the primary cluster with all of the
modifications that were made to the data on the secondary cluster. The next step in the
failback process is preparing the primary cluster to be accessed by clients. The final step
in the failback process is resuming data replication from the primary to the secondary
cluster. At the end of the failback process, you can redirect users to resume accessing
their data on the primary cluster.
You can fail back data with any replication policy that meets all of the following criteria:
l The policy was failed over.
l The policy is a synchronization policy.
l The policy does not exclude any files or directories from replication.
the last completed replication job started. The RPO is never greater than the time it takes
for two consecutive replication jobs to run and complete.
If a disaster occurs while a replication job is running, the data on the secondary cluster is
reverted to the state it was in when the last replication job completed. For example,
consider an environment in which a replication policy is scheduled to run every three
hours, and replication jobs take two hours to complete. If a disaster occurs an hour after
a replication job begins, the RPO is four hours, because it has been four hours since a
completed job began replicating data.
RTO is the maximum amount of time required to make backup data available to clients
after a disaster. The RTO is always less than or approximately equal to the RPO,
depending on the rate at which replication jobs are created for a given policy.
If replication jobs run continuously, meaning that another replication job is created for
the policy before the previous replication job completes, the RTO is approximately equal
to the RPO. When the secondary cluster is failed over, the data on the cluster is reset to
the state it was in when the last job completed; resetting the data takes an amount of
time proportional to the time it took users to modify the data.
If replication jobs run on an interval, meaning that there is a period of time after a
replication job completes before the next replication job for the policy starts, the
relationship between RTO and RPO depends on whether a replication job is running when
the disaster occurs. If a job is in progress when a disaster occurs, the RTO is roughly
equal to the RPO. However, if a job is not running when a disaster occurs, the RTO is
negligible because the secondary cluster was not modified since the last replication job
ran, and the failover process is almost instantaneous.
directories under the included directory are replicated to the target cluster; any
directories that are not contained in an included directory are excluded.
If you both include and exclude directories, any excluded directories must be contained
in one of the included directories; otherwise, the excluded-directory setting has no effect.
For example, consider a policy with the following settings:
l The root directory is /ifs/data
l The included directories are /ifs/data/media/music and /ifs/data/media/movies
l The excluded directories are /ifs/data/archive and /ifs/data/media/music/working
In this example, the setting that excludes the /ifs/data/archive directory has no
effect because the /ifs/data/archive directory is not under either of the included
directories. The /ifs/data/archive directory is not replicated regardless of whether
the directory is explicitly excluded. However, the setting that excludes the /ifs/data/
media/music/working directory does have an effect, because the directory would be
replicated if the setting was not specified.
In addition, if you exclude a directory that contains the source directory, the exclude-directory setting has no effect. For example, if the root directory of a policy is /ifs/
data, explicitly excluding the /ifs directory does not prevent /ifs/data from being
replicated.
Any directories that you explicitly include or exclude must be contained in or under the
specified root directory. For example, consider a policy in which the specified root
directory is /ifs/data. In this example, you could include both the /ifs/data/
media and the /ifs/data/users/ directories because they are under /ifs/data.
Excluding directories from a synchronization policy does not cause the directories to be
deleted on the target cluster. For example, consider a replication policy that
synchronizes /ifs/data on the source cluster to /ifs/data on the target cluster. If
the policy excludes /ifs/data/media from replication, and /ifs/data/media/
file exists on the target cluster, running the policy does not cause /ifs/data/
media/file to be deleted from the target cluster.
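For example, a command similar to the following example creates a synchronization policy that replicates /ifs/data to a target cluster but excludes /ifs/data/media; the policy name and target host are hypothetical, and the argument order is an assumption based on the OneFS 7.x command-line interface:
isi sync policies create mediaSync sync /ifs/data \
cluster2.company.com /ifs/data \
--source-exclude-directories /ifs/data/media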
File name
Includes or excludes files based on the file name. You can specify to include or
exclude full or partial names that contain specific text.
The following wildcard characters are accepted:

Table 15 Replication file matching wildcards

Wildcard  Description

*         Matches any string in place of the asterisk. For example, m* matches movies and m123.

[ ]       Matches any characters contained in the brackets, or a range of characters separated by a dash. For example, b[aei]t matches bat, bet, and bit, and 1[4-7]2 matches 142, 152, 162, and 172.

?         Matches any character in place of the question mark. For example, t?p matches tap, tip, and top.

Note
Alternatively, you can filter file names by using POSIX regular-expression (regex) text. Isilon clusters support IEEE Std 1003.2 (POSIX.2) regular expressions. For more information about POSIX regular expressions, see the BSD man pages.
Path
Includes or excludes files based on the file path. This option is available for copy
policies only.
You can specify to include or exclude full or partial paths that contain specified text.
You can also include the wildcard characters *, ?, and [ ].
Size
Includes or excludes files based on their size.
Type
Includes or excludes files based on one of the following file-system object types:
l Soft link
l Regular file
l Directory
3. Specify which nodes you want replication policies to connect to when a policy is run. For example, click Connect to only the nodes in the subnet and pool if the target cluster name specifies a SmartConnect zone.
Note
SyncIQ does not support dynamically allocated IP address pools. If a replication job
connects to a dynamically allocated IP address, SmartConnect might reassign the
address while a replication job is running, which would disconnect the job and cause
it to fail.
4. Click Submit.
5. Specify the type of replication policy:
l To copy all files from the source directory to the target directory, click Copy.
l To copy all files from the source directory to the target directory and delete any files on the target directory that are not in the source directory, click Synchronize.
6. In the Run job area, specify whether replication jobs will be run. For example, to run jobs according to a schedule:
a. Click On a schedule.
b. Specify a schedule.
If you configure a replication policy to run more than once a day, you cannot configure the interval to span across two calendar days. For example, you cannot configure a replication policy to run every hour starting at 7:00 PM and ending at 1:00 AM.
l To include a directory, in the Included Directories area, click Add a directory path.
l To exclude a directory, in the Excluded Directories area, click Add a directory path.
3. Optional: Prevent specific files from being replicated by specifying file matching
criteria.
a. In the File Matching Criteria area, select a filter type.
b. Select an operator.
c. Type a value.
Files that do not meet the specified criteria will not be replicated to the target cluster.
For example, if you specify File Type doesn't match .txt, SyncIQ will not
replicate any files with the .txt file extension. If you specify Created after
08/14/2013, SyncIQ will not replicate any files created before August 14th, 2013.
If you want to specify more than one file matching criterion, you can control how the
criteria relate to each other by clicking either Add an "Or" condition or Add an "And"
condition.
4. Specify which nodes you want the replication policy to connect to when the policy is
run.
Note
SyncIQ does not support dynamically allocated IP address pools. If a replication job
connects to a dynamically allocated IP address, SmartConnect might reassign the
address while a replication job is running, which would disconnect the job and cause
it to fail.
After you finish
The next step in the process of creating a replication policy is specifying the target
directory.
1. In the Target Host field, type one of the following values for the target cluster:
l The fully qualified domain name of any node in the target cluster.
l localhost, to replicate data to another directory on the local cluster.
Note
SyncIQ does not support dynamically allocated IP address pools. If a replication job connects to a dynamically allocated IP address, SmartConnect might reassign the address while a replication job is running, which would disconnect the job and cause it to fail.
2. In the Target Directory field, type the absolute path of the directory on the target
cluster that you want to replicate data to.
CAUTION
If you specify an existing directory on the target cluster, ensure that the directory is
not the target of another replication policy. If this is a synchronization policy, ensure
that the directory is empty. All files are deleted from the target of a synchronization
policy the first time the policy is run.
If the specified target directory does not already exist on the target cluster, the
directory is created the first time the job is run. It is recommended that you do not
specify the /ifs directory. If you specify the /ifs directory, the entire target cluster
is set to a read-only state, preventing you from storing any other data on the cluster.
If this is a copy policy, and files in the target directory share the same name as files in
the source directory, the target directory files are overwritten when the job is run.
3. If you want replication jobs to connect only to the nodes included in the SmartConnect
zone specified by the target cluster, click Connect only to the nodes within the target
cluster SmartConnect Zone.
After you finish
The next step in the process of creating a replication policy is specifying policy target
snapshot settings.
Procedure
1. To create archival snapshots on the target cluster, in the Target Snapshots area, click
Capture snapshots on the target cluster.
2. Optional: To modify the default alias of the last snapshot created according to the
replication policy, in the Snapshot Alias Name field, type a new alias.
You can specify the alias name as a snapshot naming pattern. For example, the
following naming pattern is valid:
%{PolicyName}-on-%{SrcCluster}-latest
3. Optional: To modify the snapshot naming pattern for snapshots created according to
the replication policy, in the Snapshot Naming Pattern field, type a naming pattern.
Each snapshot generated for this replication policy is assigned a name based on this
pattern.
For example, the following naming pattern is valid:
%{PolicyName}-from-%{SrcCluster}-at-%H:%M-on-%m-%d-%Y
Do not modify the default setting without consulting Isilon Technical Support.
2. Optional: From the Log Level list, select the level of logging you want SyncIQ to
perform for replication jobs.
The following log levels are valid, listed from least to most verbose:
l Click Error.
l Click Notice.
3. Optional: If you want SyncIQ to perform a checksum on each file data packet that is
affected by the replication policy, select the Validate File Integrity check box.
If you enable this option, and the checksum values for a file data packet do not
match, SyncIQ retransmits the affected packet.
4. Optional: To modify the length of time SyncIQ retains replication reports for the policy,
in the Keep Reports For area, specify a length of time.
After the specified expiration period has passed for a report, SyncIQ automatically
deletes the report.
Some units of time are displayed differently when you view a report than how they
were originally entered. Entering a number of days that is equal to a corresponding
value in weeks, months, or years results in the larger unit of time being displayed. For
example, if you enter a value of 7 days, 1 week appears for that report after it is
created. This change occurs because SyncIQ internally records report retention times
in seconds and then converts them into days, weeks, months, or years.
5. Optional: Specify whether to record information about files that are deleted by
replication jobs by selecting one of the following options:
Procedure
1. Click Cluster Management > Job Operations > Job Types.
2. In the Job Types area, in the DomainMark row, from the Actions column, select Start
Job.
3. In the Domain Root Path field, type the path of a source directory of a replication
policy.
4. From the Type of domain list, select SyncIQ.
5. Ensure that the Delete domain check box is cleared.
6. Click Start Job.
You can assess only replication policies that have never been run before.
Procedure
1. Click Data Protection > SyncIQ > Policies.
2. In the SyncIQ Policies table, in the row of a replication policy, from the Actions
column, select Assess Sync.
3. Click Data Protection > SyncIQ > Summary.
4. After the job completes, in the SyncIQ Recent Reports table, in the row of the
replication job, click View Details.
The report displays the total amount of data that would have been transferred in the
Total Data field.
Procedure
1. Click Data Protection > SyncIQ > Policies.
2. In the SyncIQ Policies table, in the Actions column for a job, select Start Job.
Policy Name
The name of the associated replication policy.
Started
The time the job started.
Elapsed
How much time has elapsed since the job started.
Transferred
The number of files that have been transferred, and the total size of all transferred
files.
Source Directory
The path of the source directory on the source cluster.
Target Host
The target directory on the target cluster.
Actions
Displays any job-related actions that you can perform.
Although you cannot fail over or fail back SmartLock directories, you can recover
SmartLock directories on a target cluster. After you recover SmartLock directories, you can
migrate them back to the source cluster.
3. On the secondary cluster, replicate data to the primary cluster by using the mirror
policies.
You can replicate data either by manually starting the mirror policies or by modifying
the mirror policies and specifying a schedule.
4. Prevent clients from accessing the secondary cluster and then run each mirror policy
again.
To minimize impact to clients, it is recommended that you wait until client access is
low before preventing client access to the cluster.
5. On the primary cluster, click Data Protection > SyncIQ > Local Targets.
6. In the SyncIQ Local Targets table, from the Actions column, select Allow Writes for
each mirror policy.
7. On the secondary cluster, click Data Protection > SyncIQ > Policies.
8. In the SyncIQ Policies table, from the Actions column, select Resync-prep for each
mirror policy.
l If the last replication job completed successfully and a replication job is not currently running, select Allow Writes.
l If a replication job is currently running, wait until the replication job completes, and then select Allow Writes.
l If the primary cluster became unavailable while a replication job was running, select Break Association.
3. If you clicked Break Association, restore any files that are left in an inconsistent state.
a. Delete all files that are not committed to a WORM state from the target directory.
b. Copy all files from the failover snapshot to the target directory.
Failover snapshots are named according to the following naming pattern:
SIQ-Failover-<policy-name>-<year>-<month>-<day>_<hour>-<minute>-<second>
l The source directory is the SmartLock directory that you are migrating.
l The target directory is an empty SmartLock directory. The source and target directories must be of the same SmartLock type. For example, if the target directory is a compliance directory, the source must also be a compliance directory.
2. Replicate data to the target cluster by running the policies you created.
You can replicate data either by manually starting the policies or by specifying a policy
schedule.
3. Optional: To ensure that SmartLock protection is enforced for all files, commit all files
in the SmartLock source directory to a WORM state.
Because autocommit information is not transferred to the target cluster, files that
were scheduled to be committed to a WORM state on the source cluster will not be
scheduled to be committed at the same time on the target cluster. To ensure that all
files are retained for the appropriate time period, you can commit all files in target
SmartLock directories to a WORM state.
For example, the following command automatically commits all files in /ifs/data/
smartlock to a WORM state after one minute:
isi smartlock modify --path /ifs/data/smartlock --autocommit 1m
This step is unnecessary if you have not configured an autocommit time period for the
SmartLock directory being replicated.
4. Prevent clients from accessing the source cluster and run the policy that you created.
To minimize impact to clients, it is recommended that you wait until client access is
low before preventing client access to the cluster.
5. On the target cluster, click Data Protection > SyncIQ > Local Targets.
6. In the SyncIQ Local Targets table, in the row of each replication policy, from the
Actions column, select Allow Writes.
7. Optional: If any SmartLock directory configuration settings, such as an autocommit
time period, were specified for the source directories of the replication policies, apply
those settings to the target directories.
8. Optional: Delete the copy of your SmartLock data on the source cluster.
If the SmartLock directories are compliance directories or enterprise directories with
the privileged delete functionality permanently disabled, you cannot recover the
space consumed by the source SmartLock directories until all files are released from a
WORM state. If you want to free the space before files are released from a WORM
state, contact Isilon Technical Support for information about reformatting your cluster.
If you modify any of the following settings of a replication policy, OneFS performs either a full or differential replication the next time the policy is run:
Source directory
File-criteria statement
Target cluster
This applies only if you target a different cluster. If you modify the IP or domain name of a target cluster, and then modify the replication policy on the source cluster to match the new IP or domain name, a full replication is not performed.
Target directory
Procedure
1. Click Data Protection > SyncIQ > Policies.
2. In the SyncIQ Policies table, in the row for a policy, click View/Edit.
3. In the View SyncIQ Policy Details dialog box, click Edit Policy.
4. Modify the settings of the replication policy, and then click Save Changes.
If you disable a replication policy while an associated replication job is running, the
running job is not interrupted. However, the policy will not create another job until the
policy is enabled.
Procedure
1. Click Data Protection > SyncIQ > Policies.
2. In the SyncIQ Policies table, in the row for a replication policy, select either Enable
Policy or Disable Policy.
If neither Enable Policy nor Disable Policy appears, verify that a replication job is not
running for the policy. If an associated replication job is not running, ensure that the
SyncIQ license is active on the cluster.
Target Directory
The full path of the target directory. Data is replicated to the target directory from the
source directory.
Restrict Target Nodes
Determines whether the policy can connect to all nodes on the target cluster or can
connect only to specific nodes.
Capture Snapshots
Determines whether archival snapshots are generated on the target cluster.
Snapshot Alias Name
Specifies an alias for the latest archival snapshot taken on the target cluster.
Snapshot Naming Pattern
Specifies how archival snapshots are named on the target cluster.
Snapshot Expiration
Specifies how long archival snapshots are retained on the target cluster before they
are automatically deleted by the system.
Worker Threads Per Node
Specifies the number of workers per node that are generated by OneFS to perform
each replication job for the policy.
Log Level
Specifies the amount of information that is recorded for replication jobs.
More verbose options include all information from less verbose options. The
following list describes the log levels from least to most verbose:
Error
Includes events related to specific types of failures.
Notice
Includes job and process-level activity, including job starts, stops, and worker
coordination information. This is the recommended log level.
Network Activity
Includes more job-level activity and work-item information, including specific
paths and snapshot names.
File Activity
Includes a separate event for each action taken on a file. Do not select this
option without first consulting Isilon Technical Support.
Replication logs are typically used for debugging purposes. If necessary, you can log
in to a node through the command-line interface and view the contents of
the /var/log/isi_migrate.log file on the node.
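For example, after opening a secure shell (SSH) connection to a node, you could view the most recent entries in the log (a minimal illustration; the line count is arbitrary):
tail -n 50 /var/log/isi_migrate.log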
Validate File Integrity
Determines whether OneFS performs a checksum on each file data packet that is
affected by a replication job. If a checksum value does not match, OneFS retransmits
the affected file data packet.
Keep Reports For
Specifies how long replication reports are kept before they are automatically deleted
by OneFS.
Log Deletions on Synchronization
Determines whether OneFS records when a synchronization job deletes files or
directories on the target cluster.
The following replication policy fields are available only through the OneFS command-line
interface.
Source Subnet
Specifies whether replication jobs connect to any nodes in the cluster or if jobs can
connect only to nodes in a specified subnet.
Source Pool
Specifies whether replication jobs connect to any nodes in the cluster or if jobs can
connect only to nodes in a specified pool.
Password Set
Specifies a password to access the target cluster.
Report Max Count
Specifies the maximum number of replication reports that are retained for this
policy.
Target Compare Initial Sync
Determines whether full or differential replications are performed for this policy. Full
or differential replications are performed the first time a policy is run and after a
policy is reset.
Source Snapshot Archive
Determines whether snapshots generated for the replication policy on the source
cluster are deleted when the next replication policy is run. Enabling archival source
snapshots does not require you to activate the SnapshotIQ license on the cluster.
Source Snapshot Pattern
If snapshots generated for the replication policy on the source cluster are retained,
renames snapshots according to the specified rename pattern.
Source Snapshot Expiration
If snapshots generated for the replication policy on the source cluster are retained,
specifies an expiration period for the snapshots.
Restrict Target Network
Determines whether replication jobs connect only to nodes in a given SmartConnect
zone. This setting applies only if the Target Host is specified as a SmartConnect
zone.
Target Detect Modifications
Determines whether SyncIQ checks the target directory for modifications before
replicating files. By default, SyncIQ always checks for modifications.
Note
Disabling this option could result in data loss. It is recommended that you consult
Isilon Technical Support before disabling this option.
Resolve
Determines whether you can manually resolve the policy if a replication job
encounters an error.
To cancel a specific job, in the row for a replication job, select Cancel Running Job.
To cancel all jobs targeting the local cluster, select the check box to the left of
Policy Name and then select Cancel Selection from the Select a bulk action list.
After a replication policy is reset, SyncIQ performs a full or differential replication the
next time the policy is run. Depending on the amount of data being replicated, a full or
differential replication can take a very long time to complete.
Procedure
1. Click Data Protection > SyncIQ > Local Targets.
2. In the SyncIQ Local Targets table, in the row for a replication policy, select Break
Association.
3. In the Confirm dialog box, click Yes.
Source
The path of the source directory on the source cluster.
Target
The IP address or fully qualified domain name of the target cluster.
Actions
Displays any report-related actions that you can perform.
Depending on the amount of data being synchronized or copied, full and differential
replications can take a very long time to complete.
Depending on the amount of data being replicated, a full or differential replication can
take a very long time to complete. Reset a replication policy only if you cannot fix the
issue that caused the replication error. If you fix the issue that caused the error, resolve
the policy instead of resetting the policy.
Procedure
1. Click Data Protection > SyncIQ > Policies.
2. In the SyncIQ Policies table, in the row for a policy, select Reset Sync State.
3. Run the policy by running the isi sync jobs start command.
For example, the following command runs newPolicy:
isi sync jobs start newPolicy
Managing changelists
You can create and view changelists that describe what data was modified by a
replication job. Changelists are most commonly accessed by applications through the
OneFS Platform API.
To create a changelist, you must enable changelists for a replication policy. If changelists
are enabled for a policy, SyncIQ does not automatically delete the repstate files
generated by the policy; if changelists are not enabled for a policy, SyncIQ automatically
deletes the repstate files after the corresponding replication jobs complete. SyncIQ
generates one repstate file for each replication job. Because a large number of repstate
files can consume a large amount of disk space, it is recommended that you do not
enable changelists for a policy unless it is necessary for your workflow.
If changelists are enabled for a policy, SyncIQ does not automatically delete source
cluster snapshots for the policy. To create a changelist, you must have access to two
consecutive snapshots and an associated repstate generated by a replication policy.
Create a changelist
You can create a changelist to view what data was modified by a replication job.
Before you begin
Through the OneFS command line, enable changelists for a replication policy, and then
run the policy at least twice. The following command enables changelists for newPolicy:
isi sync policies modify newPolicy --changelist true
Note
You can enable changelists only through the command-line interface (CLI).
Procedure
1. Optional: Record the IDs of the snapshots generated by the replication policy.
a. Click File System Management > SnapshotIQ > Snapshots.
b. In the row of the snapshots that were created for the replication policies that you
want to create a changelist for, click View Details, and record the IDs of the
snapshots.
The snapshots must have been generated sequentially for the same replication
policy. Changelist snapshots are generated according to the following snapshot
naming convention:
SIQ-Changelist-<policy-name>-<date>
If source-archival snapshots are enabled for the policy, the change-list snapshots
are named according to the naming convention specified by the policy.
2. Click Cluster Management > Job Operations > Job Types.
3. In the Job Types area, in the ChangelistCreate row, from the Actions column, select
Start Job.
4. In the Older Snapshot ID field, type the ID of the snapshot generated when the
replication policy started.
5. In the Newer Snapshot ID field, type the ID of the snapshot generated when the
replication policy ended.
6. Optional: If you want to recreate the changelist later, select Retain the replication
record after a changelist is created. If you do not select this option, the repstate file
used to generate the changelist is deleted after the changelist is created.
7. Click Start Job.
View a changelist
You can view a changelist that describes what data was modified by a replication job.
This procedure is available only through the command-line interface (CLI).
Procedure
1. View the IDs of changelists by running the following command:
isi_changelist_mod -l
Changelist IDs include the IDs of both snapshots used to create the changelist. If
OneFS is still in the process of creating a changelist, inprog is appended to the
changelist ID.
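For example, the listing might resemble the following illustrative output, in which 2_6 is a completed changelist and 8_10_inprog is a changelist that is still being created (the IDs shown are hypothetical):
2_6
8_10_inprog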
2. Optional: View all contents of a changelist by running the isi_changelist_mod
command with the -a option.
The following command displays the contents of a changelist named 2_6:
isi_changelist_mod -a 2_6
Changelist information
You can view the information contained in changelists.
type
If an item was modified, describes the type of item that was modified. The following
types of items might have been modified:
regular
A regular file was modified.
directory
A directory was modified.
symlink
A symbolic link was modified.
fifo
A first-in-first-out (FIFO) queue was modified.
socket
A Unix domain socket was modified.
char device
A character device was modified.
block device
A block device was modified.
unknown
An unknown type of file was modified.
If any type of item was removed, this field is set to (REMOVED).
size
The size of the item that was modified, in bytes. If an item was removed, this field is
set to 0.
path_size
The total size of the null-terminated UTF-8 string that contains the path, relative to
the root path, of the file or directory that was modified or removed, in bytes.
path_offset
The number of bytes between the start of the changelist entry structure and the path,
relative to the root path, of the file or directory that was modified or removed.
atime
The POSIX timestamp of when the item was last accessed.
atimensec
The number of nanoseconds past the atime that the item was last accessed.
ctime
The POSIX timestamp of when the item was last changed.
ctimensec
The number of nanoseconds past the ctime that the item was last changed.
mtime
The POSIX timestamp of when the item was last modified.
mtimensec
The number of nanoseconds past the mtime that the item was last modified.
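Because OneFS nodes run a FreeBSD-based operating system, you can convert a POSIX timestamp from a changelist entry into a human-readable date with the standard date utility; the timestamp below is hypothetical, and the nanosecond field can be added as <value> x 10^-9 seconds for full precision:
date -r 1429539600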
CHAPTER 13
Data layout with FlexProtect
FlexProtect overview............................................................................................280
File striping......................................................................................................... 280
Requested data protection.................................................................................. 280
FlexProtect data recovery.....................................................................................281
Requesting data protection................................................................................. 282
Requested protection settings.............................................................................282
Requested protection disk space usage.............................................................. 283
FlexProtect overview
An Isilon cluster is designed to continuously serve data, even when one or more
components simultaneously fail. OneFS ensures data availability by striping or mirroring
data across the cluster. If a cluster component fails, data stored on the failed component
is available on another component. After a component failure, lost data is restored on
healthy components by the FlexProtect proprietary system.
Data protection is specified at the file level, not the block level, enabling the system to
recover data quickly. Because all data, metadata, and parity information is distributed
across all nodes, the cluster does not require a dedicated parity node or drive. This
ensures that no single node limits the speed of the rebuild process.
File striping
OneFS uses the internal network to automatically allocate and stripe data across nodes
and disks in the cluster. OneFS protects data as the data is being written. No separate
action is necessary to stripe data.
OneFS breaks files into smaller logical chunks called stripes before writing the files to
disk; the size of each file chunk is referred to as the stripe unit size. Each OneFS block is
8 KB, and a stripe unit consists of 16 blocks, for a total of 128 KB per stripe unit. During a
write, OneFS breaks data into stripes and then logically places the data in a stripe unit.
As OneFS stripes data across the cluster, OneFS fills the stripe unit according to the
number of nodes and protection level.
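As a simplified illustration, a 1,024 KB file is divided into eight stripe units (1,024 KB / 128 KB = 8). With a requested protection of +1n, OneFS writes those stripe units across the nodes together with additional forward error correction (FEC) units, so that the portion stored on any single node can be reconstructed from the remaining data and FEC information.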
OneFS can continuously reallocate data and make storage space more usable and
efficient. As the cluster size increases, OneFS stores large files more efficiently.
Smartfail
OneFS protects data stored on failing nodes or drives through a process called
smartfailing.
During the smartfail process, OneFS places a device into quarantine. Data stored on
quarantined devices is read only. While a device is quarantined, OneFS reprotects the
data on the device by distributing the data to other devices. After all data migration is
complete, OneFS logically removes the device from the cluster, the cluster logically
changes its width to the new configuration, and the node or drive can be physically
replaced.
OneFS smartfails devices only as a last resort. Although you can manually smartfail
nodes or drives, it is recommended that you first consult Isilon Technical Support.
Occasionally a device might fail before OneFS detects a problem. If a drive fails without
being smartfailed, OneFS automatically starts rebuilding the data to available free space
on the cluster. However, because a node might recover from a failure, if a node fails,
OneFS does not start rebuilding data unless the node is logically removed from the
cluster.
Node failures
Because node loss is often a temporary issue, OneFS does not automatically start
reprotecting data when a node fails or goes offline. If a node reboots, the file system does
not need to be rebuilt because it remains intact during the temporary failure.
If you configure N+1 data protection on a cluster, and one node fails, all of the data is still
accessible from every other node in the cluster. If the node comes back online, the node
rejoins the cluster automatically without requiring a full rebuild.
To ensure that data remains protected, if you physically remove a node from the cluster,
you must also logically remove the node from the cluster. After you logically remove a
node, the node automatically reformats its own drives, and resets itself to the factory
default settings. The reset occurs only after OneFS has confirmed that all data has been
reprotected. You can logically remove a node using the smartfail process. It is important
that you smartfail nodes only when you want to permanently remove a node from the
cluster.
If you remove a failed node before adding a new node, data stored on the failed node
must be rebuilt in the free space in the cluster. After the new node is added, OneFS
distributes the data to the new node. It is more efficient to add a replacement node to the
cluster before failing the old node because OneFS can immediately use the replacement
node to rebuild the data stored on the failed node.
For 4U Isilon IQ X-Series and NL-Series nodes, and IQ 12000X/EX 12000 combination
platforms, the minimum cluster size of three nodes requires a minimum protection of N+2:1.
The following table describes the requested protection settings and the minimum cluster size that each setting requires.

Requested protection setting | Minimum number of nodes required | Definition
[+1n] | 3 | The cluster can recover from one drive or node failure without sustaining any data loss.
[+2d:1n] | 3 | The cluster can recover from two simultaneous drive failures or one node failure without sustaining any data loss.
[+2n] | 5 | The cluster can recover from two simultaneous drive or node failures without sustaining any data loss.
[+3d:1n] | 3 | The cluster can recover from three simultaneous drive failures or one node failure without sustaining any data loss.
[+3d:1n1d] | 3 | The cluster can recover from three simultaneous drive failures, or simultaneous failures of one node and one drive, without sustaining any data loss.
[+3n] | 7 | The cluster can recover from three simultaneous drive or node failures without sustaining any data loss.
[+4d:1n] | 3 | The cluster can recover from four simultaneous drive failures or one node failure without sustaining any data loss.
[+4d:2n] | 4 | The cluster can recover from four simultaneous drive failures or two node failures without sustaining any data loss.
[+4n] | 9 | The cluster can recover from four simultaneous drive or node failures without sustaining any data loss.
Nx (Data mirroring) | N; for example, 5x requires a minimum of five nodes. | The cluster can recover from N - 1 drive or node failures without sustaining data loss. For example, 5x protection means that the cluster can recover from four drive or node failures.
The following table shows the data-to-FEC ratio (data stripe units + protection stripe units) and the associated protection overhead, as a percentage of total capacity, for each requested protection setting at various cluster sizes. A dash indicates that the cluster does not contain enough nodes to support the setting.

Number of nodes | [+1n] | [+2d:1n] | [+2n] | [+3d:1n] | [+3d:1n1d] | [+3n] | [+4d:1n] | [+4d:2n] | [+4n]
3  | 2+1 (33%)  | 4+2 (33%)    | -          | 6+3 (33%)   | 3+3 (50%)   | -           | 8+4 (33%)   | -           | -
4  | 3+1 (25%)  | 6+2 (25%)    | -          | 9+3 (25%)   | 5+3 (38%)   | -           | 12+4 (25%)  | 4+4 (50%)   | -
5  | 4+1 (20%)  | 8+2 (20%)    | 3+2 (40%)  | 12+3 (20%)  | 7+3 (30%)   | -           | 16+4 (20%)  | 6+4 (40%)   | -
6  | 5+1 (17%)  | 10+2 (17%)   | 4+2 (33%)  | 15+3 (17%)  | 9+3 (25%)   | -           | 16+4 (20%)  | 8+4 (33%)   | -
7  | 6+1 (14%)  | 12+2 (14%)   | 5+2 (29%)  | 15+3 (17%)  | 11+3 (21%)  | 4+3 (43%)   | 16+4 (20%)  | 10+4 (29%)  | -
8  | 7+1 (13%)  | 14+2 (12.5%) | 6+2 (25%)  | 15+3 (17%)  | 13+3 (19%)  | 5+3 (38%)   | 16+4 (20%)  | 12+4 (25%)  | -
9  | 8+1 (11%)  | 16+2 (11%)   | 7+2 (22%)  | 15+3 (17%)  | 15+3 (17%)  | 6+3 (33%)   | 16+4 (20%)  | 14+4 (22%)  | 5+4 (44%)
10 | 9+1 (10%)  | 16+2 (11%)   | 8+2 (20%)  | 15+3 (17%)  | 15+3 (17%)  | 7+3 (30%)   | 16+4 (20%)  | 16+4 (20%)  | 6+4 (40%)
12 | 11+1 (8%)  | 16+2 (11%)   | 10+2 (17%) | 15+3 (17%)  | 15+3 (17%)  | 9+3 (25%)   | 16+4 (20%)  | 16+4 (20%)  | 8+4 (33%)
14 | 13+1 (7%)  | 16+2 (11%)   | 12+2 (14%) | 15+3 (17%)  | 15+3 (17%)  | 11+3 (21%)  | 16+4 (20%)  | 16+4 (20%)  | 10+4 (29%)
16 | 15+1 (6%)  | 16+2 (11%)   | 14+2 (13%) | 15+3 (17%)  | 15+3 (17%)  | 13+3 (19%)  | 16+4 (20%)  | 16+4 (20%)  | 12+4 (25%)
18 | 16+1 (6%)  | 16+2 (11%)   | 16+2 (11%) | 15+3 (17%)  | 15+3 (17%)  | 15+3 (17%)  | 16+4 (20%)  | 16+4 (20%)  | 14+4 (22%)
20 | 16+1 (6%)  | 16+2 (11%)   | 16+2 (11%) | 16+3 (16%)  | 16+3 (16%)  | 16+3 (16%)  | 16+4 (20%)  | 16+4 (20%)  | 16+4 (20%)
30 | 16+1 (6%)  | 16+2 (11%)   | 16+2 (11%) | 16+3 (16%)  | 16+3 (16%)  | 16+3 (16%)  | 16+4 (20%)  | 16+4 (20%)  | 16+4 (20%)
The parity overhead for mirrored data protection is not affected by the number of nodes in
the cluster. The following table describes the parity overhead for requested mirrored
protection.
Requested protection | Parity overhead
2x | 50%
3x | 67%
4x | 75%
5x | 80%
6x | 83%
7x | 86%
8x | 88%
CHAPTER 14
NDMP backup
If a tape or media changer device is connected to the cluster through multiple paths (for example, if you connect the device to a Fibre Channel switch and then connect the switch to two Fibre Channel ports), OneFS creates two entries for the device, one for each path.
Note
If you perform an NDMP two-way backup operation, you must assign static IP addresses
to the Backup Accelerator node. If you connect to the cluster through a data management
application (DMA), you must connect to the IP address of a Backup Accelerator node. If
you perform an NDMP three-way backup, you can connect to any node in the cluster.
DMA | Supported
Symantec NetBackup | Yes
EMC NetWorker | Yes
EMC Avamar | Yes
Commvault Simpana | Yes
Dell NetVault | Yes
ASG-Time Navigator | Yes
In a level 10 NDMP backup, only data changed since the most recent incremental
(level 1-9) backup or the last level 10 backup is copied. By repeating level 10
backups, you can be assured that the latest versions of files in your data set are
backed up without having to run a full backup.
Supported DMAs
NDMP backups are coordinated by a data management application (DMA) that runs on a
backup server.
OneFS supports the following DMAs:
Symantec NetBackup
EMC NetWorker
EMC Avamar
Dell NetVault
CommVault Simpana
ASG-Time Navigator
Note
All supported DMAs can connect to an Isilon cluster through IPv4. CommVault Simpana is
currently the only DMA that also supports connecting to an Isilon cluster through IPv6.
See the Isilon Third-Party Software and Hardware Compatibility Guide for the latest
information about supported DMAs.
OneFS supports the following tape media:
LTO-3
LTO-4
LTO-5
LTO-6
OneFS does not back up file system configuration data, such as file protection level
policies and quotas.
OneFS does not support multiple concurrent backups onto the same tape.
OneFS does not support restoring data from a file system other than OneFS. However,
you can migrate data via the NDMP protocol from a NetApp or EMC VNX storage
system to OneFS.
Backup Accelerator nodes cannot interact with more than 1024 device paths,
including the paths of tape and media changer devices. For example, if each device
has four paths, you can connect 256 devices to a Backup Accelerator node. If each
device has two paths, you can connect 512 devices.
OneFS does not support more than 64 concurrent NDMP sessions per Backup
Accelerator node.
Install the latest patches for OneFS and your data management application (DMA).
If you are backing up multiple directories that contain small files, set up a separate
schedule for each directory.
If you are performing three-way NDMP backups, run multiple NDMP sessions on
multiple nodes in your Isilon cluster.
Restore files through Direct Access Restore (DAR), especially if you restore files
frequently. However, it is recommended that you do not use DAR to restore a full
backup or a large number of files, as DAR is better suited to restoring smaller
numbers of files.
Restore files through Directory DAR (DDAR) if you restore large numbers of files
frequently.
Use the largest tape record size available for your version of OneFS. The largest tape
record size for OneFS versions 6.5.5 and later is 256 KB. The largest tape record size
for versions of OneFS earlier than 6.5.5 is 128 KB.
If possible, do not include or exclude files from backup. Including or excluding files
can affect backup performance, due to filtering overhead.
Limit the number of files in a directory. Distribute files across multiple directories
instead of including a large number of files in a single directory.
Networking recommendations
Connect NDMP sessions only through SmartConnect zones that are exclusively used
for NDMP backup.
Configure multiple policies when scheduling backup operations, with each policy
capturing a portion of the file system. Do not attempt to back up the entire file system
through a single policy.
Run a maximum of four concurrent NDMP streams per Backup Accelerator node.
This is recommended only if you are backing up a significant amount of data. Running
four concurrent streams might not be necessary for smaller backups.
Attach more Backup Accelerator nodes to larger clusters. The recommended number
of Backup Accelerator nodes is listed in the following table.
Table 17 Nodes per Backup Accelerator node (recommended numbers of X-Series, NL-Series, S-Series, and HD-Series nodes per Backup Accelerator node)
Attach more Backup Accelerator nodes if you are backing up to more tape devices.
The following table lists the recommended number of tape devices per backup
accelerator node:
Table 18 Tape devices per Backup Accelerator node (recommended numbers of LTO-3, LTO-4, LTO-5, and LTO-6 tape devices per Backup Accelerator node)
DMA-specific recommendations
Character | Description | Example
* | Takes the place of any string of characters. | archive* matches /ifs/data/archive1 and /ifs/data/archive42_a/media
[] | Takes the place of any single character from the set within the brackets. | /ifs/data/data_store_a, /ifs/data/data_store_c, /ifs/data/data_store_8
? | Takes the place of any single character. | user_? matches /ifs/data/user_1 and /ifs/data/user_2
\ | Escapes the following character; for example, includes a blank space. | user\ 1 matches /ifs/data/user 1
Unanchored patterns such as home or user1 target a string of text that might belong to
many files or directories. Anchored patterns target specific file pathnames, such as /ifs/
data/home. You can include or exclude either type of pattern.
For example, suppose you want to back up the /ifs/data/home directory, which
contains the following files and directories:
/ifs/data/home/user1/file.txt
/ifs/data/home/user2/user1/file.txt
/ifs/data/home/user3/other/file.txt
/ifs/data/home/user4/emptydirectory
If you simply include the /ifs/data/home directory, all files and directories, including
emptydirectory would be backed up.
If you specify both include and exclude patterns, any excluded files or directories under
the included directories would not be backed up. If the excluded directories are not found
in any of the included directories, the exclude specification would have no effect.
3. From the DMA vendor list, select the name of the DMA vendor to manage backup
operations.
If your DMA vendor is not included in the list, select generic. However, note that
any vendors not included on the list are not officially supported and might not
function as expected.
4. Click Add administrator to add an NDMP user account through which your DMA can
access the cluster.
a. In the Add Administrator dialog box, in the Name field, type a name for the
account.
b. In the Password and Confirm password fields, type a password for the account.
c. Click Submit.
Setting
Description
Name
Setting
Description
State
WWN
Product
The name of the device vendor and the model name or number of the device.
Serial Number
Paths
The name of the Backup Accelerator node that the device is attached to and
the port number or numbers to which the device is connected.
LUN
Port ID
The port ID of the device that binds the logical device to the physical device.
WWPN
The world wide port name (WWPN) of the port on the tape or media changer
device.
Setting
Description
Port
The name of the Backup Accelerator node, and the number of the port.
Topology
The type of Fibre Channel topology that the port is configured to support.
Options are:
Point to Point
A single backup device or Fibre Channel switch directly connected to
the port.
Loop
Multiple backup devices connected to a single port in a circular
formation.
Auto
Automatically detects the topology of the connected device. This is the
recommended setting, and is required if you are using a switched-fabric topology.
WWNN
The world wide node name (WWNN) of the port. This name is the same for
each port on a given node.
WWPN
The world wide port name (WWPN) of the port. This name is unique to the
port.
Rate
The rate at which data is sent through the port. The rate can be set to 1
Gb/s, 2 Gb/s, 4 Gb/s, 8 Gb/s, or Auto. 8 Gb/s is available for
A100 nodes only. If set to Auto, OneFS automatically negotiates with the
DMA to determine the rate. Auto is the recommended setting.
Item
Description
Session
Elapsed
Transferred
Throughput
The average throughput of the session over the past five minutes.
Client/Remote
Mover/Data
The current state of the data mover and the data server. The first
word describes the activity of the data mover. The second word
describes the activity of the data server.
The data mover and data server send data to and receive data from
each other during backup and restore operations. The data mover is
a component of the backup server that receives data during
backups and sends data during restore operations. The data server
is a component of OneFS that sends data during backups and
receives information during restore operations.
The following states might appear:
Active
The data mover or data server is currently sending or receiving
data.
Paused
The data mover is temporarily unable to receive data. While the
data mover is paused, the data server cannot send data to the
data mover. The data server cannot be paused.
Idle
The data mover or data server is not sending or receiving data.
Item
Description
Listen
The data mover or data server is waiting to connect to the data
server or data mover.
Operation
Backup (0-10)
Indicates that data is currently being backed up to a media
server. The number indicates the level of NDMP backup.
Restore
Indicates that data is currently being restored from a media
server.
Source/Destination
Device
Mode
Read/Write
OneFS is reading and writing data during a backup operation.
Read
OneFS is reading data during a restore operation.
Raw
The DMA has access to tape drives, but the drives do not
contain writable tape media.
Restartable backups are supported only for EMC NetWorker 8.1 and later.
2. To view detailed information about a specific backup context, run the isi ndmp
extensions contexts view command.
The following command displays detailed information about a backup context with an
ID of 792eeb8a-8784-11e2-aa70-0025904e91a4:
isi ndmp extensions contexts view 792eeb8a-8784-11e2-aa70-0025904e91a4
To configure a file list backup, you must create the file list and then identify it to your
DMA through environment variables, as described below.
The file list is an ASCII text file that lists the pathnames of files to be backed up. The
pathnames must be relative to the path specified in the FILESYSTEM environment
variable. Absolute file paths in the file list are not supported. The pathnames of all files
must be included, or they are not backed up. For example, if you include the pathname of
a subdirectory, only the subdirectory, not the files it contains, is backed up.
To specify the full path of the source directory to be backed up, you must specify the
FILESYSTEM environment variable in your DMA. For example:
FILESYSTEM=/ifs/data/projects
To specify the pathname of the file list, you must specify the environment variable,
BACKUP_FILE_LIST in your DMA. The file list must be accessible from the node
performing the backup. For example:
BACKUP_FILE_LIST=/ifs/data/proj_list.txt
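The file list itself contains one pathname per line, relative to the FILESYSTEM path and in sorted order. A minimal hypothetical example for the settings above might contain:
proj-a/report.txt
proj-b/notes/summary.txt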
As shown in the example, the pathnames are relative to the full path of the source
directory, which you specify in the FILESYSTEM environment variable. Absolute file paths
are not supported in the file list.
Also as shown, the directories and files should be in sorted order.
The pathnames of all files must be included in the file list, or they are not backed up. For
example, if you only include the pathname of a subdirectory, the subdirectory is backed
up, but not the files the subdirectory contains. The exception is ADS (alternate data
streams). All ADS associated with a file to be backed up are automatically backed up.
Parallel restore works for full and selective restore operations. If you specify DAR (direct
access restore), however, the operation reverts to serial processing.
The tape drive sharing function is supported with the following OneFS versions:
7.1.1
7.1.0.1 (and later)*
7.0.2.5
6.6.5.26
* The tape drive sharing function is not supported in the OneFS 7.0.1 release.
EMC NetWorker refers to the tape drive sharing capability as DDS (dynamic drive sharing).
Symantec NetBackup uses the term SSO (shared storage option). Consult your DMA
vendor documentation for configuration instructions.
3. Optional: To remove a default NDMP setting for a directory, run the isi ndmp
settings variables delete command:
For example, the following command removes the default file history format
for /ifs/data/media:
isi ndmp settings variables delete /ifs/data/media --name HIST
The following environment variables are supported. For each variable, the valid values, the default, and a description (where available) are listed.

BACKUP_MODE=
Valid values: TIMESTAMP, SNAPSHOT. Default: TIMESTAMP.

FILESYSTEM=
Valid values: <file-path>. Default: None.

LEVEL=
Valid values: <integer>. Default: 0.
0 - Performs a full NDMP backup.
1-9 - Performs an incremental backup at the specified level.
10 - Performs unlimited incremental backups.

UPDATE=
Valid values: Y, N. Default: Y.
Y - OneFS updates the dump dates file.
N - OneFS does not update the dump dates file.

HIST=
Valid values: <file-history-format>.
D - Specifies dir/node file history.
F - Specifies path-based file history.
Y - Specifies the default file history format determined by your NDMP backup settings.
N - Disables file history.

DIRECT=
Valid values: Y, N. Default: Y.
Y - Enables DAR and DDAR.
N - Disables DAR and DDAR.

FILES=
Valid values: <file-matching-pattern>. Default: None.

EXCLUDE=
Valid values: <file-matching-pattern>. Default: None.

RESTORE_HARDLINK_BY_TABLE=
Valid values: Y, N.

CHECKPOINT_INTERVAL_IN_BYTES=
Valid values: <size>. Default: 5 GB.
Enables restartable backups; if a backup fails, OneFS can restart the backup from where the
process failed. The <size> parameter is the space between each checkpoint.
Note that this variable can only be set from the DMA. For example, if you specify 2 GB, your
DMA would create a checkpoint each time 2 GB of data were backed up.
Restartable backups are supported only for EMC NetWorker 8.1 and later.

BACKUP_FILE_LIST=
Valid values: <file-path>. Default: None.

RESTORE_OPTIONS=
Valid values: 0, 1. Default: 0.
0 - Performs a parallel restore operation.
1 - Performs a serial restore operation.
CHAPTER 15
File retention with SmartLock
SmartLock overview
You can prevent users from modifying and deleting files on an EMC Isilon cluster with the
SmartLock software module. You must activate a SmartLock license on a cluster to
protect data with SmartLock.
With the SmartLock software module, you can create SmartLock directories and commit
files within those directories to a write once read many (WORM) state. You cannot erase
or re-write a file committed to a WORM state. After a file is removed from a WORM state,
you can delete the file. However, you can never modify a file that has been committed to
a WORM state, even after it is removed from a WORM state.
Compliance mode
SmartLock compliance mode enables you to protect your data in compliance with the
regulations defined by U.S. Securities and Exchange Commission rule 17a-4. You can
upgrade a cluster to compliance mode during the initial cluster configuration process,
before you activate the SmartLock license. To upgrade a cluster to SmartLock compliance
mode after the initial cluster configuration process, contact Isilon Technical Support.
If you upgrade a cluster to compliance mode, you will not be able to log in to that cluster
through the root user account. Instead, you can log in to the cluster through the
compliance administrator account that is configured either during initial cluster
configuration or when the cluster is upgraded to compliance mode. If you are logged in
through the compliance administrator account, you can perform administrative tasks
through the sudo command.
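For example, when logged in through the compliance administrator account, you could check general cluster status by prefixing the command with sudo (a minimal illustration):
sudo isi status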
SmartLock directories
In a SmartLock directory, you can commit a file to a WORM state manually or you can
configure SmartLock to automatically commit the file. You can create two types of
SmartLock directories: enterprise and compliance. However, you can create compliance
directories only if the cluster has been upgraded to SmartLock compliance mode. Before
you can create SmartLock directories, you must activate a SmartLock license on the
cluster.
If you commit a file to a WORM state in an enterprise directory, the file can never be
modified and cannot be deleted until the retention period passes. However, if you are
logged in through the root user account, you can delete the file before the retention
period passes through the privileged delete feature. The privileged delete feature is not
available for compliance directories. Enterprise directories reference the system clock to
facilitate time-dependent operations, including file retention.
Compliance directories enable you to protect your data in compliance with the
regulations defined by U.S. Securities and Exchange Commission rule 17a-4. If you
commit a file to a WORM state in a compliance directory, the file cannot be modified or
deleted before the specified retention period has expired. You cannot delete committed
files, even if you are logged in to the compliance administrator account. Compliance
directories reference the compliance clock to facilitate time-dependent operations,
including file retention.
You must set the compliance clock before you can create compliance directories. You can
set the compliance clock only once. After you set the compliance clock, you cannot
modify the compliance clock time. The compliance clock is controlled by the compliance
clock daemon. Because a user can disable the compliance clock daemon, it is possible
for a user to increase the retention period of WORM committed files in compliance mode.
However, it is not possible to decrease the retention period of a WORM committed file.
Do not configure SmartLock settings for a target SmartLock directory unless you are no
longer replicating data to the directory. Configuring an autocommit time period for a
target SmartLock directory can cause replication jobs to fail. If the target SmartLock
directory commits a file to a WORM state, and the file is modified on the source cluster,
the next replication job will fail because it cannot update the file.
Source directory type | Target directory type | Allowed
Non-SmartLock | Non-SmartLock | Yes
Non-SmartLock | SmartLock enterprise | Yes
Non-SmartLock | SmartLock compliance | No
SmartLock enterprise | Non-SmartLock | Yes
SmartLock enterprise | SmartLock enterprise | Yes
SmartLock enterprise | SmartLock compliance | No
SmartLock compliance | Non-SmartLock | No
SmartLock compliance | SmartLock enterprise | No
SmartLock compliance | SmartLock compliance | Yes
If you replicate SmartLock directories to another cluster with SyncIQ, the WORM state of
files is replicated. However, SmartLock directory configuration settings are not transferred
to the target directory.
For example, if you replicate a directory that contains a committed file that is set to expire
on March 4th, the file is still set to expire on March 4th on the target cluster. However, if
the directory on the source cluster is set to prevent files from being committed for more
than a year, the target directory is not automatically set to the same restriction.
If you back up data to an NDMP device, all SmartLock metadata relating to the retention
date and commit status is transferred to the NDMP device. If you restore data to a
SmartLock directory on the cluster, the metadata persists on the cluster. However, if the
directory that you restore to is not a SmartLock directory, the metadata is lost. You can
restore to a SmartLock directory only if the directory is empty.
SmartLock considerations
If a file is owned exclusively by the root user, and the file exists on a cluster that is in
SmartLock compliance mode, the file will be inaccessible, because the root user
account is disabled in compliance mode. For example, this can happen if a file is
assigned root ownership on a cluster that has not been upgraded to compliance
mode, and then the file is replicated to a cluster in compliance mode. This can also
occur if a file is assigned root ownership before a cluster is upgraded to SmartLock
compliance mode or if a root-owned file is restored on a compliance cluster after
being backed up.
It is recommended that you create files outside of SmartLock directories and then
transfer them into a SmartLock directory after you are finished working with the files.
If you are uploading files to a cluster, it is recommended that you upload the files to a
non-SmartLock directory, and then later transfer the files to a SmartLock directory. If a
file is committed to a WORM state while the file is being uploaded, the file will
become trapped in an inconsistent state.
Files can be committed to a WORM state while they are still open. If you specify an
autocommit time period for a directory, the autocommit time period is calculated
according to the length of time since the file was last modified, not when the file was
closed. If you delay writing to an open file for more than the autocommit time period,
the file will be automatically committed to a WORM state, and you will not be able to
write to the file.
In a Microsoft Windows environment, if you commit a file to a WORM state, you can
no longer modify the hidden or archive attributes of the file. Any attempt to modify
the hidden or archive attributes of a WORM committed file will generate an error. This
can prevent third-party applications from modifying the hidden or archive attributes.
Retention periods
A retention period is the length of time that a file remains in a WORM state before being
released from a WORM state. You can configure SmartLock directory settings that enforce
default, maximum, and minimum retention periods for the directory.
If you manually commit a file, you can optionally specify the date that the file is released
from a WORM state. You can configure a minimum and a maximum retention period for a
SmartLock directory to prevent files from being retained for too long or too short a time
period. It is recommended that you specify a minimum retention period for all SmartLock
directories.
For example, assume that you have a SmartLock directory with a minimum retention
period of two days. At 1:00 PM on Monday, you commit a file to a WORM state, and
specify the file to be released from a WORM state on Tuesday at 3:00 PM. The file will be
released from a WORM state two days later on Wednesday at 1:00 PM, because releasing
the file earlier would violate the minimum retention period.
You can also configure a default retention period that is assigned when you commit a file
without specifying a date to release the file from a WORM state.
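On the cluster itself, manually committing a file generally consists of setting the desired release date as the file's access time and then removing write permissions from the file. The path and date in the following sketch are hypothetical:
touch -a -t 201507011300 /ifs/data/SmartLock/directory1/file.txt
chmod a-w /ifs/data/SmartLock/directory1/file.txt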
Procedure
1. Open a secure shell (SSH) connection to any node in the cluster and log in.
2. Run the isi worm domains create command.
If you specify the path of an existing directory, the directory must be empty.
The following command creates a compliance directory with a default retention period
of four years, a minimum retention period of three years, and a maximum retention
period of five years:
sudo isi worm domains create /ifs/data/SmartLock/directory1 \
--compliance --default-retention 4Y --min-retention 3Y \
--max-retention 5Y --mkdir
You can modify SmartLock directory settings only 32 times per directory. It is
recommended that you set SmartLock configuration settings only once and do not modify
the settings after files are added to the SmartLock directory.
Procedure
1. Open a secure shell (SSH) connection to any node in the cluster and log in.
2. Modify SmartLock configuration settings by running the isi worm domains
modify command.
The following command sets the default retention period to one year:
isi worm domains modify /ifs/data/SmartLock/directory1 \
--default-retention 1Y
3. Optional: To view detailed information about a specific SmartLock directory, run the
isi worm domains view command.
The following command displays detailed information about /ifs/data/
SmartLock/directory2:
isi worm domains view /ifs/data/SmartLock/directory2
ID: 65537
Path: /ifs/data/SmartLock/directory2
Type: enterprise
LIN: 4295426060
Autocommit offset: 30m
Privileged delete: off
Default retention: 1Y
Min retention: 3M
Total modifies: 3/32 Max
Override date
The override retention date for the directory. Files committed to a WORM state are
not released from a WORM state until after the specified date, regardless of the
maximum retention period for the directory or whether a user specifies an earlier
date to release a file from a WORM state.
Privileged delete
Indicates whether files in the directory can be deleted through the privileged delete
functionality.
on
The root user can delete files committed to a WORM state by running the isi
worm files delete command.
off
WORM committed files cannot be deleted, even through the isi worm files
delete command.
disabled
WORM committed files cannot be deleted, even through the isi worm files
delete command. After this setting is applied, it cannot be modified.
Default retention period
The default retention period for the directory. If a user does not specify a date to
release a file from a WORM state, the default retention period is assigned.
Times are expressed in the format "<integer> <time>", where <time> is one of the
following values:
Y
Specifies years
M
Specifies months
W
Specifies weeks
D
Specifies days
H
Specifies hours
m
Specifies minutes
s
Specifies seconds
Forever indicates that WORM committed files are retained permanently by default.
Use Min indicates that the default retention period is equal to the minimum
retention date. Use Max indicates that the default retention period is equal to the
maximum retention date.
Total modifies
The total number of times that SmartLock settings have been modified for the
directory. You can modify SmartLock settings only 32 times per directory.
3. Specify the name of the file you want to set a retention period for by creating an
object.
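Step 3 creates a file object in PowerShell; a minimal sketch, assuming the cluster share is mapped to drive Z: and a hypothetical file name:
$file = Get-Item "Z:\directory1\file.txt"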
4. Specify the retention period by setting the last access time for the file.
The following command sets an expiration date of July 1, 2015 at 1:00 PM:
$file.LastAccessTime = Get-Date "2015/7/1 1:00 pm"
2. Override the retention period expiration date for all WORM committed files in a
SmartLock directory by running the isi worm domains modify command.
For example, the following command overrides the retention period expiration date
of /ifs/data/SmartLock/directory1 to June 1, 2014:
isi worm domains modify /ifs/data/SmartLock/directory1 \
--override-date 2014-06-01
3. Delete the WORM committed file by running the isi worm files delete command.
The following command deletes /ifs/data/SmartLock/directory1/file:
isi worm files delete /ifs/data/SmartLock/directory1/file
------------------------------------------------------------
65539 /ifs/data/SmartLock/directory1
WORM State: COMMITTED
Expires: 2015-06-01T00:00:00
CHAPTER 16
Protection domains
Copying a large number of files into a protection domain might take a very long time
because each file must be marked individually as belonging to the protection
domain.
You cannot move directories in or out of protection domains. However, you can move
a directory contained in a protection domain to another location within the same
protection domain.
Creating a protection domain for a directory that contains a large number of files will
take more time than creating a protection domain for a directory with fewer files.
Because of this, it is recommended that you create protection domains for directories
while the directories are empty, and then add files to the directory.
CHAPTER 17
Data-at-rest-encryption
Self-encrypting drives
Self-encrypting drives store data on an EMC Isilon cluster that is specially designed for
data-at-rest encryption.
Data-at-rest encryption on self-encrypting drives occurs when data that is stored on a
device is encrypted to prevent unauthorized data access. All data written to the storage
device is encrypted when it is stored, and all data read from the storage device is
decrypted when it is read. The stored data is encrypted with a 256-bit AES data-encryption
key and decrypted in the same manner. OneFS controls data access by
combining the drive authentication key with on-disk data-encryption keys.
Note
All nodes in a cluster must be of the self-encrypting drive type. Mixed nodes are not
supported.
When a drive is smartfailed and removed from a node, the encryption key on the drive
is removed. Because the encryption key for reading data from the drive must be the
same key that was used when the data was written, it is impossible to decrypt data
that was previously written to the drive. When you smartfail and then remove a drive,
it is cryptographically erased.
Note
When a self-encrypting drive loses power, the drive locks to prevent unauthorized
access. When power is restored, data is again accessible when the appropriate drive
authentication key is provided.
Before you begin the data-migration process, both clusters must be upgraded to the
same OneFS version.
During data migration, an error is generated that indicates you are running in mixed
mode, which is not supported and is not secure. The data migrated to the self-encrypted
drives is not secure until the smartfail process is completed for the non-encrypted drives.
The following table lists the drive states that OneFS reports and the interface in which each state appears.

State | Interface
HEALTHY | Command-line interface, web administration interface
SMARTFAIL or Smartfail or restripe in progress | Command-line interface, web administration interface
NOT AVAILABLE (error state) | Command-line interface, web administration interface. Note: In the web administration interface, this state includes the ERASE and SED_ERROR command-line interface states.
SUSPENDED | Command-line interface, web administration interface
NOT IN USE | Command-line interface, web administration interface
REPLACE | Command-line interface only
STALLED | Command-line interface only
NEW | Command-line interface only
USED | Command-line interface only
PREPARING | Command-line interface only
EMPTY | Command-line interface only
WRONG_TYPE | Command-line interface only
BOOT_DRIVE | Command-line interface only
SED_ERROR (error state) | Command-line interface, web administration interface. Note: In the web administration interface, this state is included in NOT AVAILABLE.
ERASE | Command-line interface only. Note: In the web administration interface, this state is included in NOT AVAILABLE.
INSECURE (error state) | Command-line interface only. Note: In the web administration interface, this state is labeled UNENCRYPTED SED.
UNENCRYPTED SED (error state) | Web administration interface only
If you run the isi dev command while the drive is being smartfailed, the system
displays output in which the drive state appears as SMARTFAIL, similar to the following
example:
Lnum 11   /dev/da1    [SMARTFAIL]   SN:Z296M8HK
Lnum 10   /dev/da2    [HEALTHY]     SN:Z296M8N5
Lnum 9    /dev/da3    [HEALTHY]     SN:Z296LBP4
Lnum 8    /dev/da4    [HEALTHY]     SN:Z296LCJW
Lnum 7    /dev/da5    [HEALTHY]     SN:Z296M8XB
Lnum 6    /dev/da6    [HEALTHY]     SN:Z295LXT7
Lnum 5    /dev/da7    [HEALTHY]     SN:Z296M8ZF
Lnum 4    /dev/da8    [HEALTHY]     SN:Z296M8SD
Lnum 3    /dev/da9    [HEALTHY]     SN:Z296M8QA
Lnum 2    /dev/da10   [HEALTHY]     SN:Z296M8Q7
Lnum 1    /dev/da11   [HEALTHY]     SN:Z296M8SP
Lnum 0    /dev/da12   [HEALTHY]     SN:Z296M8QZ
If you run the isi dev command after the smartfail completes successfully, the system
displays output similar to the following example, showing the drive state as REPLACE:
Node 1, [ATTN]
  Bay 1   000093172YE04   Lnum 11   /dev/da1    [REPLACE]   SN:Z296M8HK
  Bay 2   00009330EYE03   Lnum 10   /dev/da2    [HEALTHY]   SN:Z296M8N5
  Bay 3   00009330EYE03   Lnum 9    /dev/da3    [HEALTHY]   SN:Z296LBP4
  Bay 4   00009327BYE03   Lnum 8    /dev/da4    [HEALTHY]   SN:Z296LCJW
  Bay 5   00009330KYE03   Lnum 7    /dev/da5    [HEALTHY]   SN:Z296M8XB
  Bay 6   000093172YE03   Lnum 6    /dev/da6    [HEALTHY]   SN:Z295LXT7
  Bay 7   00009330KYE03   Lnum 5    /dev/da7    [HEALTHY]   SN:Z296M8ZF
  Bay 8   00009330EYE03   Lnum 4    /dev/da8    [HEALTHY]   SN:Z296M8SD
  Bay 9   00009330EYE03   Lnum 3    /dev/da9    [HEALTHY]   SN:Z296M8QA
  Bay 10  00009330EYE03   Lnum 2    /dev/da10   [HEALTHY]   SN:Z296M8Q7
  Bay 11  00009330EYE04   Lnum 1    /dev/da11   [HEALTHY]   SN:Z296M8SP
  Bay 12  00009330JYE03   Lnum 0    /dev/da12   [HEALTHY]   SN:Z296M8QZ
If you run the isi dev command while the drive in bay 3 is being smartfailed, the
system displays output similar to the following example:
Node 1, [ATTN]
  Bay 1   000093172YE04   Lnum 11   /dev/da1    [REPLACE]     SN:Z296M8HK
  Bay 2   00009330EYE03   Lnum 10   /dev/da2    [HEALTHY]     SN:Z296M8N5
  Bay 3   00009330EYE03   Lnum 9    N/A         [SMARTFAIL]   SN:Z296LBP4
  Bay 4   00009327BYE03   Lnum 8    /dev/da4    [HEALTHY]     SN:Z296LCJW
  Bay 5   00009330KYE03   Lnum 7    /dev/da5    [HEALTHY]     SN:Z296M8XB
  Bay 6   000093172YE03   Lnum 6    /dev/da6    [HEALTHY]     SN:Z295LXT7
  Bay 7   00009330KYE03   Lnum 5    /dev/da7    [HEALTHY]     SN:Z296M8ZF
  Bay 8   00009330EYE03   Lnum 4    /dev/da8    [HEALTHY]     SN:Z296M8SD
  Bay 9   00009330EYE03   Lnum 3    /dev/da9    [HEALTHY]     SN:Z296M8QA
  Bay 10  00009330EYE03   Lnum 2    /dev/da10   [HEALTHY]     SN:Z296M8Q7
  Bay 11  00009330EYE04   Lnum 1    /dev/da11   [HEALTHY]     SN:Z296M8SP
  Bay 12  00009330JYE03   Lnum 0    /dev/da12   [HEALTHY]     SN:Z296M8QZ
If the smartfail is unsuccessful, OneFS attempts to delete the drive password, because
the drive could not be crypto-erased. If you run the isi dev command after the
smartfail is unsuccessful, the system displays output similar to the following example,
showing the drive state as ERASE:
Node 1, [ATTN]
  Bay 1   000093172YE04   Lnum 11   /dev/da1    [REPLACE]   SN:Z296M8HK
  Bay 2   00009330EYE03   Lnum 10   /dev/da2    [HEALTHY]   SN:Z296M8N5
  Bay 3   00009330EYE03   Lnum 9    /dev/da3    [ERASE]     SN:Z296LBP4
  Bay 4   00009327BYE03   Lnum 8    /dev/da4    [HEALTHY]   SN:Z296LCJW
  Bay 5   00009330KYE03   Lnum 7    /dev/da5    [HEALTHY]   SN:Z296M8XB
  Bay 6   000093172YE03   Lnum 6    /dev/da6    [HEALTHY]   SN:Z295LXT7
  Bay 7   00009330KYE03   Lnum 5    /dev/da7    [HEALTHY]   SN:Z296M8ZF
  Bay 8   00009330EYE03   Lnum 4    /dev/da8    [HEALTHY]   SN:Z296M8SD
  Bay 9   00009330EYE03   Lnum 3    /dev/da9    [HEALTHY]   SN:Z296M8QA
  Bay 10  00009330EYE03   Lnum 2    /dev/da10   [HEALTHY]   SN:Z296M8Q7
  Bay 11  00009330EYE04   Lnum 1    /dev/da11   [HEALTHY]   SN:Z296M8SP
  Bay 12  00009330JYE03   Lnum 0    /dev/da12   [HEALTHY]   SN:Z296M8QZ
CHAPTER 18
SmartQuotas
SmartQuotas overview
The SmartQuotas module is an optional quota-management tool that monitors and
enforces administrator-defined storage limits. Using accounting and enforcement quota
limits, reporting capabilities, and automated notifications, SmartQuotas manages
storage use, monitors disk storage, and issues alerts when disk-storage limits are
exceeded.
Quotas help you manage storage usage according to criteria that you define. Quotas are
used as a method of tracking, and sometimes limiting, the amount of storage that a
user, group, or project consumes. Quotas are a useful way of ensuring that a user or
department does not infringe on the storage that is allocated to other users or
departments. In some quota implementations, writes beyond the defined space are
denied, and in other cases, a simple notification is sent.
The SmartQuotas module requires a separate license. For additional information about
the SmartQuotas module or to activate the module, contact your EMC Isilon sales
representative.
Quota types
OneFS uses the concept of quota types as the fundamental organizational unit of storage
quotas. Storage quotas comprise a set of resources and an accounting of each resource
type for that set. Storage quotas are also called storage domains.
Storage quota creation requires three identifiers:
The directory to monitor
Whether snapshots are tracked against the quota limit
Whether the quota includes data-protection overhead
Note
You should not create quotas of any type on the OneFS root (/ifs). A root-level quota
may significantly degrade performance.
In this example, the default-user type created a new specific-user type automatically
(user:admin) and added the new usage to it. Default-user does not have any usage
because it is used only to generate new quotas automatically. Default-user enforcement
is copied to a specific-user (user:admin), and the inherited quota is called a linked quota.
In this way, each user account gets its own usage accounting.
Defaults can overlap. For example, default-user@/ifs/dir-1 and default-user@/ifs/cs
both may be defined. If the default enforcement changes, OneFS storage quotas
propagate the changes to the linked quotas asynchronously. Because the update is
asynchronous, there is some delay before updates are in effect. If a default type, such as
every user or every group, is deleted, OneFS deletes all children that are marked as
inherited. As an option, you can delete the default without deleting the children, but it is
important to note that this action breaks inheritance on all inherited children.
Continuing with the example, add another file that is owned by the root user. Because the
root type exists, the new usage is added to it.
my-OneFS-1# touch /ifs/dir-1/anotherfile
my-OneFS-1# isi quota ls -v --path=/ifs/dir-1 --format=list
Type: default-user
AppliesTo: DEFAULT
Path: /ifs/dir-1
Snap: No
Thresholds
    Hard : -
    Soft : -
    Adv : -
    Grace : -
Usage
    Files : 0
    With Overhead : 0.00b
    W/O Overhead : 0.00b
Over: -
Enforced: No
Container: No
Linked: -
----------------------------------------------------------------------
Type: user
AppliesTo: root
Path: /ifs/dir-1
Snap: No
Thresholds
    Hard : -
    Soft : -
    Adv : -
    Grace : -
Usage
    Files : 2
    With Overhead : 3.50K
    W/O Overhead : 55.00b
Over: -
Enforced: No
Container: No
Linked: Yes
----------------------------------------------------------------------
Type: user
AppliesTo: admin
Path: /ifs/dir-1
Snap: No
Thresholds
    Hard : -
    Soft : -
    Adv : -
    Grace : -
Usage
    Files : 1
    With Overhead : 1.50K
    W/O Overhead : 0.00b
Over: -
Enforced: No
Container: No
Linked: Yes
Configuration changes for linked quotas must be made on the parent quota that the
linked quota is inheriting from. Changes to the parent quota are propagated to all
children. To override configuration from the parent quota, you must unlink the quota first.
Accounting limits
The accounting option tracks, but does not limit, disk-storage use. Using the
accounting option for a quota, you can monitor disk usage without enforcing a limit.
For example, you can do the following:
l Track the amount of disk space used by various users or groups to bill each user,
  group, or directory for only the disk space used.
l Review and analyze reports that help you identify storage usage patterns and
  define storage policies.
l Plan for capacity and other storage needs.
Enforcement limits
Enforcement limits include all of the functionality of the accounting option, plus the
ability to limit disk storage and send notifications. Using enforcement limits, you can
logically partition a cluster to control or restrict how much storage that a user, group,
or directory can use. For example, you can set hard- or soft-capacity limits to ensure
that adequate space is always available for key projects and critical applications and
to ensure that users of the cluster do not exceed their allotted storage capacity.
Optionally, you can deliver real-time email quota notifications to users, group
managers, or administrators when they are approaching or have exceeded a quota
limit.
Note
If a quota type uses the accounting-only option, enforcement limits cannot be used for
that quota.
The actions of an administrator logged in as root may push a domain over a quota
threshold. For example, changing the protection level or taking a snapshot has the
potential to exceed quota parameters. System actions such as repairs also may push a
quota domain over the limit.
The system provides three types of administrator-defined enforcement thresholds.

Hard
    Limits disk usage to a size that cannot be exceeded. If an operation, such as a
    file write, causes a quota target to exceed a hard quota, the following events
    occur: the operation fails, an alert is logged to the cluster, and a notification
    is issued to the specified recipients. Writes resume when the usage falls below
    the threshold.
Soft
    Allows a limit with a grace period that can be exceeded until the grace period
    expires. When a soft quota is exceeded, an alert is logged to the cluster and a
    notification is issued to specified recipients; however, data writes are permitted
    during the grace period. If the soft threshold is still exceeded when the grace
    period expires, data writes fail, and a hard-limit notification is issued to the
    recipients you have specified. Writes resume when the usage falls below the
    threshold.
Advisory
    An informational limit that can be exceeded. When an advisory threshold is
    exceeded, an alert is logged to the cluster and a notification is issued to
    specified recipients. Advisory thresholds do not prevent data writes.
Disk-usage calculations
For each quota that you configure, you can specify whether data-protection overhead is
included in future disk-usage calculations.
Most quota configurations do not need to include overhead calculations.
If you include data-protection overhead in usage calculations for a quota, future
disk-usage calculations for the quota include the total amount of space that is
required to store files and directories, in addition to any space that is required to
accommodate your data-protection settings, such as parity or mirroring. For example,
consider a user who is restricted by a 40 GB quota that includes data-protection
overhead in its disk-usage calculations. If your cluster is configured with a 2x
data-protection level (mirrored) and the user writes a 10 GB file to the cluster, that
file actually consumes 20 GB of space: 10 GB for the file and 10 GB for the
data-protection overhead. In this example, the user has reached 50 percent of the
40 GB quota by writing a 10 GB file to the cluster.
If you do not include data-protection overhead in usage calculations for a quota,
future disk-usage calculations for the quota include only the space that is required
to store files and directories. Space that is required for the data-protection setting
of the cluster is not included. Consider the same example user, who is now restricted
by a 40 GB quota that does not include data-protection overhead in its disk-usage
calculations. If your cluster is configured with a 2x data-protection level and the
user writes a 10 GB file to the cluster, that file consumes 20 GB of space but the
10 GB for the data-protection overhead is not counted in the quota calculation. In
this example, the user has reached 25 percent of the 40 GB quota by writing a 10 GB
file to the cluster. This method of disk-usage calculation is recommended for most
quota configurations.
Note
Cloned and deduplicated files are treated as ordinary files by quotas. If the quota
includes data protection overhead, the data protection overhead for shared data is not
included in the usage calculation.
You can configure quotas to include the space that is consumed by snapshots. A single
path can have two quotas applied to it: one without snapshot usage, which is the default,
and one with snapshot usage. If you include snapshots in the quota, more files are
included in the calculation than are in the current directory. The actual disk usage is the
sum of the current directory and any snapshots of that directory. You can see which
snapshots are included in the calculation by examining the .snapshot directory for the
quota path.
Note
Only snapshots created after the QuotaScan job finishes are included in the calculation.
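For example, listing the snapshot alias directory for a quota path shows which
snapshots contribute to its usage (a minimal sketch; /ifs/dir-1 is a hypothetical
quota path):
my-OneFS-1# ls /ifs/dir-1/.snapshot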
Quota notifications
Quota notifications are generated for enforcement quotas, providing users with
information when a quota violation occurs. Reminders are sent periodically while the
condition persists.
Each notification rule defines the condition that is to be enforced and the action that is to
be executed when the condition is true. An enforcement quota can define multiple
notification rules. When thresholds are exceeded, automatic email notifications can be
sent to specified users, or you can monitor notifications as system alerts or receive
emails for these events.
Notifications can be configured globally, to apply to all quota domains, or be configured
for specific quota domains.
Enforcement quotas support the following notification settings. A given quota can use
only one of these settings.
Limit notification setting               Description
Use Default Notification Rules           Uses the default notification rules that you
                                         configured for the specified threshold type.
Turn Off Notifications for this Quota    Disables all notifications for the quota.
Use Custom Notification Rules            Enables the creation of notification rules
                                         that apply only to this quota.
Quota reports
The OneFS SmartQuotas module provides reporting options that enable administrators to
manage cluster resources and analyze usage statistics.
Storage quota reports provide a summarized view of the past or present state of the
quota domains. After raw reporting data is collected by OneFS, you can produce data
summaries by using a set of filtering parameters and sort types. Storage-quota reports
include information about violators, grouped by threshold types. You can generate
reports from a historical data sample or from current data. In either case, the reports are
views of usage data at a given time. OneFS does not provide reports on data aggregated
over time, such as trending reports, but you can use raw data to analyze trends. There is
no configuration limit on the number of reports other than the space needed to store
them.
OneFS provides the following data-collection and reporting methods:
l Scheduled reports are generated and saved on a regular interval.
l Ad hoc reports are generated and saved at the request of the user.
l Live reports are generated for immediate and temporary viewing.
Creating quotas
You can create two types of storage quotas to monitor data: accounting quotas and
enforcement quotas. Storage quota limits and restrictions can apply to specific users,
groups, or directories.
The type of quota that you create depends on your goal.
l Accounting quotas monitor, but do not limit, disk storage.
l Enforcement quotas monitor and limit disk usage. You can create enforcement
  quotas that use any combination of hard limits, soft limits, and advisory limits.
Note
After you create a new quota, it begins to report data almost immediately, but the data is
not valid until the QuotaScan job completes. Before using quota data for analysis or other
purposes, verify that the QuotaScan job has finished.
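One way to verify this is to check the Job Engine from the command line; the following
is a sketch, assuming the isi job commands available in this release. If QuotaScan no
longer appears among the active jobs, the scan has finished.
my-OneFS-1# isi job jobs list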
l To include snapshot data in the accounting quota, select the Include Snapshot
  Data check box.
3. From the Quota Type list, select the target for this quota: a directory, user, or group.
4. Depending on the target that you selected, select the entity that you want to apply the
quota to. For example, if you selected User from the Quota Type list, you can target all
users or a specific user.
5. In the Directory path field, type the path and directory for the quota, or click Browse,
and then select a directory.
6. Optional: In the Usage Accounting area, click the Include Snapshot Data check box,
the Include Data-Protection Overhead check box, or both to include them in the
quota.
7. In the Usage Limits area, click Specify Usage Limits.
8. Click the check box next to the option for each type of limit that you want to enforce.
9. Type numerals in the fields and select from the lists the values that you want to use
for the quota.
10. In the Limit Notifications area, click the notification option that you want to
apply to the quota.
11. To generate an event notification, select the Create cluster event check box.
12. Optional: If you selected the option to use custom notification rules, click the
link to expand the custom notification type that applies to the usage-limit
selections.
13. Click Create Quota.
After you finish
After you create a quota, it begins to report data almost immediately but the data is not
valid until the QuotaScan job completes. Before using quota data for analysis or other
purposes, verify that the QuotaScan job has finished.
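A comparable quota can also be created from the command line. The following is a
minimal sketch, assuming the isi quota quotas namespace in this release; the path and
the 40 GB hard threshold are hypothetical:
my-OneFS-1# isi quota quotas create /ifs/dir-1 directory --hard-threshold=40G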
Managing quotas
You can modify the configured values of a storage quota, and you can enable or disable a
quota. You can also create quota limits and restrictions that apply to specific users,
groups, or directories.
Quota management in OneFS is simplified by the quota search feature, which helps you
to locate a quota or quotas by using filters. You can unlink quotas that are associated
with a parent quota, and configure custom notifications for quotas. You can also disable
a quota temporarily and then enable it when needed.
3. In the Quota Type list, select the quota type that you want to find.
4. If you selected User Quota or Group quota for a quota type, type a full or partial user
or group name in the User or Group field.
You can use the wildcard character (*) in the User or Group field.
l To search for only default users, select the Only show default users check box.
l To search for only default groups, select the Only show default groups check box.
l To search for only quotas that are in violation, select the Only show quotas for
  which usage limits are currently in violation check box.
To clear the result set and display all storage quotas, in the Quotas & Usage area, select
Show all quotas and usage for this report for Report Filters, and then click Update
Display.
Manage quotas
Quotas help you monitor and analyze the current or historical use of disk storage. You
can search for quotas, and you can view, modify, delete, and unlink a quota.
An initial QuotaScan job must run for the default or scheduled quotas, or the data
displayed may be incomplete.
Before you modify a quota, consider how the changes will affect the file system and end
users.
Note
l The options to edit or delete a quota appear only when the quota is not linked to
  a default quota.
l The option to unlink a quota is available only when the quota is linked to a
  default quota.
Procedure
1. Click File System Management > SmartQuotas > Quotas & Usage.
2. From the Quota Report options, select the type of quota report that you want to view
or manage.
l To monitor and analyze current disk storage use, click Show current quotas and
  usage (Live Report).
l To monitor and analyze historical disk storage use, click Show archived quota
  report to select from the list of archived scheduled and manually generated quota
  reports.
3. For Report Filters, select the filters to be used for this quota report.
l To view all information in the quota report, click Show all quotas and usage for
  this report.
l To filter the quota report, click Search for specific quotas within this report,
  and then select the filters that you want to apply.
Configuration changes for linked quotas must be made on the parent (default)
quota that the linked quota is inheriting from. Changes to the parent quota are
propagated to all children. If you want to override configuration from the parent
quota, you must first unlink the quota.
l To configure the number of live reports that you want to archive, type the number
  of reports in the Limit archive size field.
l To specify an archive directory that is different from the default, in the Archive
  Directory field, type the path or click Browse to select the path.
6. In the Manual Report Archiving area, you can configure the following size and
directory options:
l To configure the number of live reports that you want to archive, type the number
  of reports in the Limit archive size field.
l To specify an archive directory that is different from the default, in the Archive
  Directory field, type the path or click Browse to select the path.
7. In the Email Mapping Rules area, choose each mapping rule that you want to use by
selecting the check box in the Provider Type column.
8. In the Notification Rules area, define default notification rules for each rule type.
l To expand the list of limit notification rule types, click Default Notifications
  Settings.
l To display default settings for advisory-limit notifications, click Advisory Limit
  Notification Rules.
l To set the advisory-limit options that you want, click Event: Advisory Limit Value
  Exceeded and Event: While Advisory Limit Remains Exceeded.
l To display default settings for soft-limit notifications, click Soft Limit
  Notification Rules.
l To set the soft-limit options that you want, click Event: Soft Limit Value
  Exceeded, Event: While Soft Limit Remains Exceeded, Event: Soft Limit Grace Period
  Expired, and Event: Soft Limit Write Access Denied.
l To display the options for a hard-limit notification rule, click Hard Limit
  Notification Rules.
l To set the hard-limit options that you want, click Event: Hard Limit Write Access
  Denied and Event: While Hard Limit Remains Exceeded.
9. Click Save.
After you finish
After you create a new quota, it begins to report data almost immediately, but the data is
not valid until the QuotaScan job completes. Before using quota data for analysis or other
purposes, verify that the QuotaScan job has finished.
You must be logged in to the web administration interface to perform this task.
Procedure
1. Click File System Management > SmartQuotas > Settings.
2. Optional: In the Email Mapping area, click Create an email mapping rule.
3. From the Provider Type list, select the provider type for this notification rule.
4. From the Current Domain list, select the domain that you want to use for the mapping
rule.
5. In the Map-to-Domain field, type the name of the domain that you want to map email
notifications to.
Repeat this step if you want to map more than one domain.
6. Click Save Rule.
The following example illustrates a custom email template to notify recipients about an
exceeded quota.
Text-file contents with variables
The disk quota on directory <ISI_QUOTA_PATH> owned by
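A complete template might read as follows; the wording is only a sketch, and it uses
the quota variables that are described later in this chapter:
The disk quota on directory <ISI_QUOTA_PATH> owned by <ISI_QUOTA_OWNER>
on node <ISI_QUOTA_NODE> has reached its <ISI_QUOTA_TYPE> threshold of
<ISI_QUOTA_THRESHOLD>. Current usage is <ISI_QUOTA_USAGE>. Please
remove unneeded files to bring usage back below the limit.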
The following commands can be used from the archive directory to work with saved
quota report files:

Command             Description
ls -a *.xml         Lists all quota report files in the directory.
ls <filename>.xml   Displays a specific quota report file.
Advisory limit quota notification rules settings

Option                Description                                   Exceeded  Remains
                                                                              exceeded
Send email                                                          Yes       Yes
Notify owner                                                        Yes       Yes
Notify another                                                      Yes       Yes
Message template      Select from the following template types      Yes       Yes
                      for use in formatting email notifications:
                      Default, Custom
Create cluster event  Select to generate an event notification      Yes       Yes
                      for the quota when exceeded.
Delay                 Specify the length of time (hours, days,      Yes       No
                      weeks) to delay before generating a
                      notification.
Frequency             Specify the notification and alert            No        Yes
                      frequency: daily, weekly, monthly, yearly;
                      depending on selection, specify intervals,
                      day to send, time of day, multiple emails
                      per rule.
Soft limit quota notification rules settings

Option                Description                     Exceeded  Remains   Grace    Write
                                                                exceeded  period   access
                                                                          expired  denied
Send email                                            Yes       Yes       Yes      Yes
Notify owner                                          Yes       Yes       Yes      Yes
Notify another                                        Yes       Yes       Yes      Yes
Message template      Select from the following       Yes       Yes       Yes      Yes
                      template types for use in
                      formatting email
                      notifications: Default,
                      Custom
Create cluster event  Select to generate an event     Yes       Yes       Yes      Yes
                      notification for the quota
                      when exceeded.
Delay                 Specify the length of time      Yes       No        No       Yes
                      (hours, days, weeks) to delay
                      before generating a
                      notification.
Frequency             Specify the notification and    No        Yes       Yes      No
                      alert frequency: daily,
                      weekly, monthly, yearly;
                      depending on selection,
                      specify intervals, day to
                      send, time of day, multiple
                      emails per rule.
Hard limit quota notification rules settings

Option                Description                                 Write access  Exceeded
                                                                  denied
Send email                                                        Yes           Yes
Notify owner                                                      Yes           Yes
Notify another                                                    Yes           Yes
Message template      Select from the following template types    Yes           Yes
                      for use in formatting email notifications:
                      Default, Custom
Create cluster event  Select to generate an event notification    Yes           Yes
                      for the quota when exceeded.
Delay                 Specify the length of time (hours, days,    Yes           No
                      weeks) to delay before generating a
                      notification.
Frequency             Specify the notification and alert          No            Yes
                      frequency: daily, weekly, monthly, yearly;
                      depending on selection, specify intervals,
                      day to send, time of day, multiple emails
                      per rule.
Limit notification setting               Description
Use Default Notification Rules           Uses the default notification rules that you
                                         configured for the specified threshold type.
Turn Off Notifications for this Quota    Disables all notifications for the quota.
Use Custom Notification Rules            Enables the creation of notification rules
                                         that apply only to this quota.
Setting            Description
Scheduled          On. Reports run automatically according to the schedule that you
reporting          specify.
Report frequency   Specifies the interval for this report to run: daily, weekly,
                   monthly, or yearly. You can use the following options to further
                   refine the report schedule.
                   Generate report every. Specify the numeric value for the selected
                   report frequency; for example, every 2 months.
                   Generate reports on. Select the day or multiple days to generate
                   reports.
                   Select report day by. Specify date or day of the week to generate
                   the report.
                   Generate one report per specified day. Set the time of day to
                   generate this report.
                   Generate multiple reports per specified day. Set the intervals and
                   times of day to generate the report for that day.
Scheduled report   Determines the maximum number of scheduled reports that are
archiving          available for viewing on the SmartQuotas Reports page.
                   Limit archive size for scheduled reports to a specified number of
                   reports. Type the integer to specify the maximum number of reports
                   to keep.
                   Archive Directory. Browse to the directory where you want to store
                   quota reports for archiving.
Manual report      Determines the maximum number of manually generated (on-demand)
archiving          reports that are available for viewing on the SmartQuotas Reports
                   page.
                   Limit archive size for live reports to a specified number of
                   reports. Type the integer to specify the maximum number of reports
                   to keep.
                   Archive Directory. Browse to the directory where you want to store
                   quota reports for archiving.
Variable             Description                            Example
ISI_QUOTA_PATH       Path of quota domain                   /ifs/data
ISI_QUOTA_THRESHOLD  Threshold value                        20 GB
ISI_QUOTA_USAGE      Disk space in use                      10.5 GB
ISI_QUOTA_OWNER      Name of quota domain owner             jsmith
ISI_QUOTA_TYPE       Threshold type                         Advisory
ISI_QUOTA_GRACE      Grace period, in days                  5 days
ISI_QUOTA_NODE       Hostname of the node on which the      someHost-prod-wf-1
                     quota event occurred
CHAPTER 19
Storage Pools
The following SmartPools features are available with and without an active SmartPools
license:

Function              Without active SmartPools license   With active SmartPools license
SSD strategies        Yes                                 Yes
L3 cache              Yes                                 Yes
Tiers                 Yes                                 Yes
GNA                   No                                  Yes
Spillover management  No                                  Yes
Autoprovisioning
When you add a node to your cluster, OneFS automatically assigns the node to a node
pool. With node pools, OneFS can ensure optimal performance, load balancing, and
reliability of the file system. Autoprovisioning reduces the time required for the manual
management tasks associated with resource planning and configuration.
Nodes are not provisioned, meaning they are not associated with each other and are not
writable, until at least three nodes of an equivalence class are added to the cluster. If you
have added only two nodes of an equivalence class to your cluster, no data is stored on
those nodes until you add a third node of the same equivalence class.
Similarly, if a node goes down or is removed from the cluster so that fewer than three
equivalence-class nodes remain, the node pool becomes under-provisioned. However,
the two remaining nodes are still writable. If only one node remains, that node is not
writable, but remains readable.
OneFS offers a compatibility function, also referred to as node equivalency, that enables
certain node types to be provisioned to existing node pools, even when there are fewer
than three equivalence-class nodes. For example, an S210 node could be provisioned
and added to a node pool of S200 nodes. Similarly, an X410 node could be added to a
node pool of X400 nodes. The compatibility function ensures that you can add nodes one
at a time to your cluster and still have them be fully functional peers within a node pool.
Virtual hot spare
Virtual hot spare (VHS) settings enable you to reserve disk space that can be used to
rebuild data in the event of a drive failure. The larger of the two factors (minimum
number of virtual drives or percentage of total disk space), rather than their sum,
determines the space allocated for the virtual hot spare.
It is important to understand the following information when configuring VHS settings:
l If you configure both settings, the enforced minimum value satisfies both
  requirements.
l If you select the option to reduce the amount of available space, free-space
  calculations do not include the space reserved for the virtual hot spare. The
  reserved virtual hot spare free space is used for write operations unless you
  select the option to deny new data writes. If Reduce amount of available space is
  enabled while Deny new data writes is disabled, it is possible for the file system
  to report utilization as more than 100 percent.
Note
Virtual hot spare reservations affect spillover. If the virtual hot spare option Deny writes
is enabled but Reduce amount of available space is disabled, spillover occurs before the
file system reports 100% utilization.
Spillover
If you activate a SmartPools license, you can designate a storage pool to receive
spill data when the hardware specified by a file pool policy is not writable. If you
do not want data to spill over to a different location when the specified node pool or
tier is full or not writable, you can disable this feature.
Spillover management is available after you activate a SmartPools license. You can direct
write operations to a specified storage pool in the cluster when there is not enough space
to write a file according to the storage pool policy.
Note
Virtual hot spare reservations affect spillover. If the setting Deny writes is enabled but
Reduce amount of available space is disabled, spillover occurs before the file system
reports 100% utilization.
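The current spillover and virtual hot spare configuration can be reviewed from the
command line; a sketch, assuming the isi storagepool commands available in this
release:
my-OneFS-1# isi storagepool settings view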
Node pools
A node pool is a collection of three or more nodes. As you add nodes to an Isilon cluster,
OneFS automatically provisions them into node pools based on characteristics such as
series, drive size, RAM, and SSD-per-node ratio. Nodes with identical characteristics are
called equivalence-class nodes.
If you add fewer than three nodes of a node type, OneFS cannot autoprovision the nodes
to your cluster. In these cases, you can create compatibilities. Compatibilities enable
OneFS to provision nodes that are not equivalence-class to a compatible node pool.
After provisioning, each node in the OneFS cluster is a peer, and any node can handle a
data request. Each provisioned node increases the aggregate disk, cache, CPU, and
network capacity on the cluster.
You can move nodes from an automatically managed node pool into one that you define
manually. This capability is available only through the OneFS command-line interface. If
you attempt to remove nodes from a node pool such that the removal would leave fewer
than three nodes in the pool, the removal fails. When you remove a node from a manually
defined node pool, OneFS attempts to move the node into a node pool of the same
equivalence class, or into a compatible node pool.
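For example, a manually managed node pool might be created and later expanded as
follows; this is a sketch, assuming the isi storagepool nodepools commands available
in this release, and the pool name and node numbers are hypothetical:
my-OneFS-1# isi storagepool nodepools create perf_pool --lnns 1-3
my-OneFS-1# isi storagepool nodepools modify perf_pool --lnns 1-4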
Node compatibilities
OneFS requires that a node pool contain at least three nodes so that the operating
system can write data and perform the necessary load balancing and data protection
Spillover
363
Storage Pools
operations. You can enable certain nodes to be provisioned to an existing node pool by
defining a compatibility.
Note
To be provisioned into an existing S200 or X400 node pool, the newer S210 or X410
nodes must have compatible RAM amounts, as shown in the following table:

S200 RAM  S210 RAM  X400 RAM  X410 RAM
24 GB     32 GB     24 GB     32 GB
48 GB     64 GB     48 GB     64 GB
96 GB     128 GB    96 GB     128 GB
256 GB    256 GB    192 GB    256 GB
Note
After you have added three or more S210 or X410 nodes to your cluster, you should
consider removing the compatibilities that you have created. This step enables OneFS to
autoprovision new S210 or X410 node pools and take advantage of the performance
specifications of the newer node types.
Suggested protection
Based on the configuration of your Isilon cluster, OneFS automatically calculates the
amount of protection that is recommended to maintain EMC Isilon's stringent data
protection requirements.
OneFS includes a function to calculate the suggested protection for data to maintain a
theoretical mean-time to data loss (MTTDL) of 5000 years. Suggested protection provides
the optimal balance between data protection and storage efficiency on your cluster.
By configuring file pool policies, you can specify one of multiple requested protection
settings for a single file, for subsets of files called file pools, or for all files on the cluster.
It is recommended that you do not specify a setting below suggested protection. OneFS
periodically checks the protection level on the cluster, and alerts you if data falls below
the recommended protection.
Protection policies
OneFS provides a number of protection policies to choose from when protecting a file or
specifying a file pool policy.
The more nodes you have in your cluster, up to 20 nodes, the more efficiently OneFS can
store and protect data, and the higher levels of requested protection the operating
system can achieve. Depending on the configuration of your cluster and how much data
is stored, OneFS might not be able to achieve the level of protection that you request. For
example, if you have a three-node cluster that is approaching capacity, and you request
+2n protection, OneFS might not be able to deliver the requested protection.
The following table describes the available protection policies in OneFS.
Protection policy  Summary
+1n                Tolerate the failure of 1 drive or the failure of 1 node.
+2d:1n             Tolerate the failure of 2 drives or the failure of 1 node.
+2n                Tolerate the failure of 2 drives or the failure of 2 nodes.
+3d:1n             Tolerate the failure of 3 drives or the failure of 1 node.
+3d:1n1d           Tolerate the failure of 3 drives, or the failure of 1 node and
                   1 drive.
+3n                Tolerate the failure of 3 drives or the failure of 3 nodes.
+4d:1n             Tolerate the failure of 4 drives or the failure of 1 node.
+4d:2n             Tolerate the failure of 4 drives or the failure of 2 nodes.
+4n                Tolerate the failure of 4 drives or the failure of 4 nodes.
Mirrors:           Duplicates, or mirrors, data over the specified number of nodes.
2x                 For example, 2x results in two copies of each data block.
3x
4x                 Note
5x                 Mirrors can use more data than the other protection policies, but
6x                 might be an effective way to protect files that are written
7x                 non-sequentially or to provide faster access to important files.
8x
SSD strategies
OneFS clusters can contain nodes that include solid-state drives (SSD). OneFS
autoprovisions equivalence-class nodes with SSDs into one or more node pools. The SSD
strategy defined in the default file pool policy determines how SSDs are used within the
cluster, and can be set to increase performance across a wide range of workflows.
You can configure file pool policies to apply specific SSD strategies as needed. When you
select SSD options during the creation of a file pool policy, you can identify the files in
the OneFS cluster that require faster or slower performance. When the SmartPools job
runs, OneFS uses file pool policies to move this data to the appropriate storage pool and
drive type.
The following SSD strategy options, which you can set in a file pool policy, are
listed in order from slowest to fastest:
Avoid SSDs
Writes all associated file data and metadata to HDDs only.
CAUTION
Use this option to free SSD space only after consulting with Isilon Technical Support
personnel. Using this strategy can negatively affect performance.
Metadata read acceleration
Writes both file data and metadata to HDDs. This is the default setting. An extra
mirror of the file metadata is written to SSDs, if available. The SSD mirror is in
addition to the number of mirrors, if any, required to satisfy the requested
protection.
Metadata read/write acceleration
Writes file data to HDDs and metadata to SSDs, when available. This strategy
accelerates metadata writes in addition to reads but requires about four to five times
more SSD storage than the Metadata read acceleration setting. Enabling GNA does
not affect read/write acceleration.
Data on SSDs
Uses SSD node pools for both data and metadata, regardless of whether global
namespace acceleration is enabled. This SSD strategy does not result in the creation
of additional mirrors beyond the normal requested protection but requires
significantly increased storage requirements compared with the other SSD strategy
options.
Global namespace acceleration
Global namespace acceleration (GNA) enables data on node pools without SSDs to access
SSDs elsewhere in the cluster to store extra metadata mirrors, which accelerates
metadata read operations.
You can enable GNA only if 20% or more of the nodes in the cluster contain at least one
SSD and 1.5% or more of the total cluster storage is SSD-based. For best results, ensure
that at least 2.0% of the total cluster storage is SSD-based before enabling global
namespace acceleration. If the ratio of SSDs to non-SSDs on the cluster falls below the
1.5% threshold, GNA becomes inactive even if enabled. GNA is reactivated when the ratio
is corrected. When GNA is inactive, existing SSD mirrors are readable but newly written
metadata does not include the extra SSD mirror.
Note
If GNA is enabled for the cluster, file pool policies that direct data to node pools with L3
cache enabled should also set the SSD strategy to Avoid SSDs. Otherwise, additional
SSD mirrors would be created for data that is already accelerated by L3 cache. This is an
inefficient use of SSD storage space and is not recommended.
L3 cache overview
You can configure nodes with solid-state drives (SSDs) to increase cache memory and
speed up file system performance across larger working file sets.
OneFS caches file data and metadata at multiple levels. The following table describes the
types of file system cache available on an Isilon cluster.
Name        Type      Profile      Scope    Description
L1 cache    RAM       Volatile     Local    Also known as front-end cache, holds
                                   node     copies of file system metadata and data
                                            requested by the front-end network.
L2 cache    RAM       Volatile     Global   Also known as back-end cache, holds
                                            copies of file system metadata and data
                                            on the node that owns the data.
SmartCache  Variable  Nonvolatile  Local    Holds any pending changes to front-end
                                   node     files waiting to be written to storage.
                                            This type of cache protects write-back
                                            data through a combination of RAM and
                                            stable storage.
L3 cache    SSD       Nonvolatile  Global   Holds file data and metadata released
                                            from L2 cache, effectively increasing
                                            L2 cache capacity.
OneFS caches frequently accessed file data and metadata in available random access memory
(RAM). Caching enables OneFS to optimize data protection and file system performance.
When RAM cache reaches capacity, OneFS normally discards the oldest cached data and
processes new data requests by accessing the storage drives. This cycle is repeated each
time RAM cache fills up.
You can deploy SSDs as L3 cache to reduce the cache cycling issue and further improve
file system performance. L3 cache adds significantly to the available cache memory and
provides faster access to data than hard disk drives (HDD).
As L2 cache reaches capacity, OneFS evaluates data to be released and, depending on
your workflow, moves the data to L3 cache. In this way, much more of the most frequently
accessed data is held in cache, and overall file system performance is improved.
L3 cache overview
367
Storage Pools
For example, consider a cluster with 128GB of RAM. Typically the amount of RAM
available for cache fluctuates, depending on other active processes. If 50 percent of RAM
is available for cache, the cache size would be approximately 64GB. If this same cluster
had three nodes, each with two 200GB SSDs, the amount of L3 cache would be 1.2TB,
approximately 18 times the amount of available L2 cache.
L3 cache is enabled by default for new node pools. A node pool is a collection of nodes
that are all of the same equivalence class, or for which compatibilities have been created.
L3 cache applies only to the nodes where the SSDs reside. For the HD400 node, which is
primarily for archival purposes, L3 cache is on by default and cannot be turned off. On the
HD400, L3 cache is used only for metadata.
If you enable L3 cache on a node pool, OneFS manages all cache levels to provide
optimal data protection, availability, and performance. In addition, in case of a power
failure, the data on L3 cache is retained and still available after power is restored.
Note
Although some benefit from L3 cache is found in workflows with streaming and
concurrent file access, L3 cache provides the most benefit in workflows that involve
random file access.
Migration to L3 cache
L3 cache is enabled by default on new nodes. If you are upgrading your cluster from an
older release (pre-OneFS 7.1.1), you must enable L3 cache manually on node pools that
have SSDs. When you enable L3 cache, OneFS activates a process that migrates SSDs
from storage disks to cache. File data currently on SSDs is moved elsewhere in the
cluster.
You can enable L3 cache as the default for all new node pools or manually for a specific
node pool, either through the command line or from the web administration interface.
You can enable L3 cache only on node pools whose nodes have SSDs.
Depending on the amount of data stored in your SSDs, the migration process can take
some time. OneFS displays a message informing you that the migration is about to begin:
WARNING: Changes to L3 cache configuration can have a long completion
time. If this is a concern, please contact EMC Isilon Support for
more information.
You must confirm whether OneFS should proceed with the migration. After you do, OneFS
handles the migration intelligently as a background process. You can continue to
administer your cluster during the migration.
If you choose to disable L3 cache on a node pool, the migration process is very fast.
368
Storage Pools
Required privileges
You need to have the SmartPools administrative privilege (or higher) to enable or disable
L3 cache.
You can enable or disable L3 cache from the command line or web administration
interface.
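For example, L3 cache might be enabled on a specific node pool from the command line;
a sketch, assuming the --l3 option available in this release, with a hypothetical pool
name:
my-OneFS-1# isi storagepool nodepools modify s200_pool --l3 true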
Tiers
A tier is a user-defined collection of node pools that you can specify as a storage pool for
files. A node pool can belong to only one tier.
You can create tiers to assign your data to any of the node pools in the tier. For example,
you can assign a collection of node pools to a tier specifically created to store data that
requires high availability and fast access. In a three-tier system, this classification may
be Tier 1. You can classify data that is used less frequently or that is accessed by fewer
users as Tier-2 data. Tier 3 usually comprises data that is seldom used and can be
archived for historical or regulatory purposes.
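For example, a tier might be created and a node pool assigned to it from the command
line; a sketch, assuming the isi storagepool commands available in this release, with
hypothetical tier and pool names:
my-OneFS-1# isi storagepool tiers create tier-1
my-OneFS-1# isi storagepool nodepools modify s200_pool --tier tier-1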
File pools
File pools are sets of files that you define to apply policy-based control of the storage
characteristics of your data.
The initial installation of OneFS places all files in the cluster into a single file pool, which
is subject to the default file pool policy. SmartPools enables you to define additional file
pools, and create policies that move files in these pools to specific node pools and tiers.
File pool policies match specific file characteristics (such as file size, type, date of last
access or a combination of these and other factors), and define specific storage
operations for files that match them. The following examples demonstrate a few ways you
can configure file pool policies:
l You can create a file pool policy for a specific file extension that requires high
  availability.
l You can configure a file pool policy to store that type of data in a storage pool
  that provides the fastest reads or read/writes.
l You can create another file pool policy to evaluate the last accessed date,
  allowing you to store older files in a storage pool best suited for archiving for
  historical or regulatory purposes.
The unlicensed OneFS SmartPools technology allows you to configure the default file pool
policy for managing the node pools that are created when the cluster is autoprovisioned.
The default file pool contains all files and is stored in any node pool. Default file pool
operations are defined by settings of the default file pool policy.
You cannot reorder or remove the default file pool policy. The settings in the default file
pool policy apply to all files that are not covered by another file pool policy. For example,
data that is not covered by a file pool policy can be moved to a tier that you identify as a
default for this purpose.
All file pool policy operations are executed when the SmartPools job runs. When new
files are created, OneFS chooses storage settings for them immediately, based on the
file pool policies that were in effect when the last SmartPools job ran. The system
may apply new storage settings and move these files again when the next SmartPools job
runs, based on a matching file pool policy.
3. Click Edit.
The Edit Node Pools Details page appears.
4. Enter a new name for the node pool, or specify a new requested protection level from
the list, or do both.
A node pool name can contain alphanumeric characters and underscores but cannot
begin with a number.
5. Click Save Changes in the Edit Node Pools Details page.
6. Click Close in the View Node Pools Details page.
The S210 and X410 nodes must have compatible RAM and the same drive configurations
as their S200 and X400 counterparts to be provisioned into those node pools.
Procedure
1. Click File System > Storage Pools > SmartPools.
The SmartPools tab displays two lists: Tiers & Node Pools and Compatibilities.
2. Click Create a Compatibility.
The Create a Compatibility dialog box displays drop-down lists of compatibility
options.
3. In the Make list, select either S200 or X400, as appropriate.
4. In the compatible with list, select either S210 or X410, as appropriate.
5. Click Create a Compatibility.
A Confirm Changes to this Cluster dialog box appears with two check boxes that you
must select before proceeding. The check boxes describe the result of the operation.
6. Select all check boxes, then click Confirm.
Results
The compatibility is created and described in the Compatibilities list. The result of the
compatibility appears in the Tiers & Node Pools list.
371
Storage Pools
Note
Newer node types typically have better performance specifications than older node types,
so merging them with older node types could reduce performance. Also, when two node
pools are merged, OneFS restripes the data to take advantage of the larger node pool,
which could take considerable time, depending on the size of your data set.
Procedure
1. Click File System > Storage Pools > SmartPools.
The SmartPools tab displays two lists: Tiers & Node Pools and Compatibilities.
2. Click Create a Compatibility.
The Create a Compatibility dialog box displays drop-down lists of compatibility
options.
3. In the Make list, select either S200 or X400, as appropriate.
4. In the compatible with list, select either S210 or X410, as appropriate.
5. Click Create a Compatibility.
A Confirm Changes to this Cluster dialog box appears with two check boxes that you
must select before proceeding. The check boxes describe the result of the operation.
6. Click Confirm.
If the two node pools being merged have different settings, for example L3 cache,
OneFS prompts you to resolve the differences before merging the node pools.
Results
The compatibility appears in the Compatibilities list. In the Tiers & Node Pools list, the
two former node pools are joined as one.
Delete a compatibility
You can delete a compatibility. Any nodes that are part of a node pool because of this
compatibility are removed from the node pool.
CAUTION
The Delete a Compatibility dialog box appears, describing the compatibility and what
will happen when you delete it.
3. Click Delete Compatibility.
A Confirm Changes to this Cluster dialog box appears with a list of four check boxes
that you must select before proceeding.
4. Select all check boxes, and click Confirm.
Results
The compatibility is deleted, and the new state of the affected nodes appears in the Tiers
& Node Pools list.
On HD400 node pools, SSDs are only used for L3 cache, which is turned on by default
and cannot be turned off. All other node pools with SSDs for L3 cache can have their
SSDs migrated back to storage drives.
Procedure
1. Click File System > Storage Pools > SmartPools.
2. In the Tiers & Node Pools area of the SmartPools tab, select View/Edit next to the
target node pool.
The View Node Pool Details dialog box appears, showing the current settings of the
node pool.
3. Click Edit.
The Edit Node Pool Details dialog box appears.
4. Clear the Enable L3 cache check box.
The setting is grayed out for node pools without SSDs, or for which the setting cannot
be changed.
5. Click Save Changes.
The Confirm Change to L3 Cache Setting message box appears.
6. Click Continue.
The migration process to disable L3 cache begins and can take a while, depending on
the number and size of the SSDs in the node pool. When the migration process is
complete, the View Node Pool Details dialog box appears.
7. Click Close.
Managing tiers
You can move node pools into tiers to optimize file and storage management.
Managing tiers requires ISI_PRIV_SMARTPOOLS or higher administrative privileges.
374
Storage Pools
Create a tier
You can create a tier that contains one or more node pools. You can use the tier to
store specific categories of files.
Procedure
1. Click File System > Storage Pools > SmartPools.
The SmartPools tab appears with two sections: Tiers & Node Pools and
Compatibilities.
2. In the Tiers & Node Pools section, click Create a Tier.
3. In the Create a Tier page that appears, enter a name for the tier.
4. For each node pool that you want to add to the tier, select a node pool from the
Available Node Pools list, and click Add.
The node pool is moved into the Selected Node Pools for this Tier list.
5. Click Create Tier.
The Create a Tier page closes, and the new tier is added to the Tiers & Node Pools
area. The node pools that you added are shown below the tier name.
Edit a tier
You can modify the name and change the node pools that are assigned to a tier.
A tier name can contain alphanumeric characters and underscores but cannot begin with
a number.
Procedure
1. Click File System > Storage Pools > SmartPools.
The SmartPools tab displays two groups: Tiers & Node Pools and Compatibilities.
2. In the Tiers & Node Pools area, next to the tier you want to edit, click View/Edit.
3. In the View Tier Details dialog box, click Edit Tier.
4. In the Edit Tier Details dialog box, modify the following settings as needed:
Option                Description
Tier Name             To change the name of the tier, select and type over the
                      existing name.
Node Pool Selection   To change which node pools are assigned to the tier, select or
                      clear the node pools in the list.
5. When you have finished editing tier settings, click Save Changes.
6. In the View Tier Details dialog box, click Close.
Delete a tier
You can delete a tier that has no assigned node pools.
Before you begin
If you want to delete a tier that does have assigned node pools, you must first remove the
node pools from the tier.
Procedure
1. Click File System > Storage Pools > SmartPools.
The SmartPools tab displays two lists: Tiers & Node Pools and Compatibilities.
2. In the Tiers & Node Pools list, next to the tier that you want to delete, click More >
Delete Tier.
A message box asks you to confirm or cancel the operation.
3. Click Delete Tier to confirm the operation.
Results
The tier is removed from the Tiers & Node Pools list.
Creating file pool policies
You can configure file pool policies to identify logical groups of files called file
pools, and you can specify storage operations for these files.
For example, to free up disk space on your performance tier (S-series node pools), you
could create a file pool policy to match all files greater than 25 MB in size, which
have not been accessed or modified for more than a month, and move them to your
archive tier (NL-series node pools).
You can configure and prioritize multiple file pool policies to optimize file storage for your
particular work flows and cluster configuration. When the SmartPools job runs, by default
once a day, it applies file pool policies in priority order. When a file pool matches the
criteria defined in a policy, the actions in that policy are applied, and lower-priority
custom policies are ignored for the file pool.
After the list of custom file pool policies is traversed, if any of the actions are not applied
to a file, the actions in the default file pool policy are applied. In this way, the default file
pool policy ensures that all actions apply to every file.
Note
You can reorder the file pool policy list at any time, but the default file pool policy is
always last in the list of file pool policies.
OneFS also provides customizable template policies that you can copy to make your own
policies. These templates, however, are only available from the OneFS web
administration interface.
376
Storage Pools
If existing file pool policies direct data to a specific storage pool, do not configure other
file pool policies with anywhere for the Data storage target option. Because the
specified storage pool is included when you use anywhere, target specific storage
pools to avoid unexpected results.
Procedure
1. Click File System > Storage Pools > File Pool Policies.
2. Click Create a File Pool Policy.
3. In the Create a File Pool Policy dialog box, enter a policy name and, optionally, a
description.
4. Specify the files to be managed by the file pool policy.
To define the file pool, you can specify file matching criteria by combining IF, AND, and
OR conditions. You can define these conditions with a number of file attributes, such
as name, path, type, size, and timestamp information.
5. Specify SmartPools actions to be applied to the selected file pool.
You can specify storage and I/O optimization settings to be applied.
6. Click Create Policy.
Results
The file pool policy is created and applied when the next scheduled SmartPools system
job runs. By default, this job runs once a day, but you also have the option to start the job
immediately.
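To apply the new policy without waiting for the schedule, the SmartPools job can be
started manually; a sketch, assuming the isi job commands available in this release:
my-OneFS-1# isi job jobs start SmartPools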
OneFS supports UNIX shell-style (glob) pattern matching for file name attributes and
paths.
The following table lists the file attributes that you can use to define a file pool policy.
File attribute     Specifies
Name               Includes or excludes files based on the file name. You can specify
                   whether to include or exclude full or partial names that contain
                   specific text; wildcard characters are allowed.
Path               Includes or excludes files based on the file path; wildcard
                   characters are allowed.
File type          Includes or excludes files based on one of the following
                   file-system object types:
                   l File
                   l Directory
                   l Other
Size               Includes or excludes files based on their size.
Created            Includes or excludes files based on when the file was created.
Metadata changed   Includes or excludes files based on when the file metadata was
                   last modified.
Accessed           Includes or excludes files based on when the file was last
                   accessed.

The following wildcard characters are valid in file name and path criteria:

Wildcard   Description
*          Matches any string in place of the asterisk. For example, m* matches
           movies and m123.
[a-z]      Matches any characters contained in the brackets, or a range of
           characters separated by a dash. For example, b[aei]t matches bat, bet,
           and bit, and 1[4-7]2 matches 142, 152, 162, and 172.
           You can exclude characters within brackets by following the first
           bracket with an exclamation mark. For example, b[!ie] matches bat but
           not bit or bet.
?          Matches any character in place of the question mark. For example, t?p
           matches tap, tip, and top.
SmartPools settings
SmartPools settings include directory protection, global namespace acceleration, L3
cache, virtual hot spare, spillover, requested protection management, and I/O
optimization management.
Setting: Increase directory protection to a higher level than its contents
Notes: As an example, consider a cluster that has a +2 default file pool protection
setting and no additional file pool policies. OneFS directories are always mirrored,
so they are stored at 3x, which is the mirrored equivalent of the +2 default. This
configuration can sustain a failure of two nodes before data loss or inaccessibility.
If this setting is enabled, all directories are protected at 4x. If the cluster
experiences three node failures, although individual files may be inaccessible, the
directory tree is available and provides access to files that are still accessible. In
addition, if another file pool policy protects some files at a higher level, these too
are accessible in the event of a three-node failure.

Setting: Enable global namespace acceleration
Description: Specifies whether to allow per-file metadata to use SSDs in the cluster.
This setting is available only if 20% or more of the nodes in the cluster contain
SSDs and at least 1.5% of the total cluster storage is SSD-based.

Setting: Use SSDs as L3 Cache by default for new node pools
Description: Specifies whether SSDs in node pools that are added to the cluster are
used as L3 cache rather than as storage.

Setting: Virtual Hot Spare
Description: Reserves a minimum amount of space in the node pool that can be used for
data repair in the event of a drive failure. To reserve disk space for use as a
virtual hot spare, select from the available options for the minimum number of virtual
drives and the minimum percentage of total disk space.
Notes: If you configure both the minimum number of virtual drives and a minimum
percentage of total disk space, the enforced minimum value satisfies both
requirements.

Setting: Enable global spillover
Description: Specifies how to handle write operations to a node pool that is not
writable.

Setting: Spillover Data Target
Description: Specifies the storage pool to target when a storage pool is not writable.

Setting: Manage I/O optimization settings
Description: When enabled, applies the I/O optimization settings of the default file
pool policy.
If existing file pool policies direct data to a specific storage pool, do not set
other file pool policies to anywhere for the Data storage target option. Target a
specific storage pool instead.
Procedure
1. Click File System > Storage Pools > File Pool Policies.
2. In the File Pool Policies tab, next to Default Policy in the list, click View/Edit.
The View Default Policy Details dialog box is displayed.
3. Click Edit Policy.
The Edit Default Policy Details dialog box is displayed.
4. In the Apply SmartPools Actions to Selected Files section, choose the storage settings
that you want to apply as the default for Storage Target, Snapshot Storage Target,
and Requested Protection Level.
5. Click Save Changes, and then click Close.
Results
The next time the SmartPools job runs, the settings that you selected are applied to any
file that is not covered by another file pool policy.
Setting: Avoid SSDs
Description: Write all associated file data and metadata to HDDs only.
CAUTION: Use this option to free SSD space only after consulting with Isilon Technical
Support personnel. Using this strategy can negatively affect performance.

Setting: Data on SSDs
Description: Use SSDs for both data and metadata. Regardless of whether global
namespace acceleration is enabled, any SSD blocks reside on the storage target if
there is room.

Setting: Snapshot storage target
Description: Specifies the storage pool that you want to target for snapshot storage.

Setting: Requested protection
Description: Specifies the requested protection level for files that match the
default file pool policy.
4. In the Apply SmartPools Actions to Selected Files section, under I/O Optimization
Settings, choose the settings that you want to apply as the default for Write
Performance and Data Access Pattern.
5. Click Save Changes, and then click Close.
Results
The next time the SmartPools job runs, the settings that you selected are applied to any
file that is not covered by another file pool policy.
Setting: SmartCache
Description: Enables or disables SmartCache.

Setting: Data access pattern
Description: Defines the optimization setting for accessing data: concurrent,
streaming, or random.
If existing file pool policies direct data to a specific storage pool, do not configure other
file pool policies with anywhere for the Data storage target option. Because the
specified storage pool is included when you use anywhere, target specific storage
pools to avoid unintentional file storage locations.
Procedure
1. Click File System > Storage Pools > File Pool Policies.
2. In the File Pool Policies list, next to the policy you want to modify, click View/Edit.
The View File Pool Policy Details dialog box is displayed.
3. Click Edit Policy.
The Edit File Pool Policy Details dialog box is displayed.
4. Modify the policy settings, and then click Save Changes.
5. Click Close in the View File Pool Policy Details dialog box.
384
Storage Pools
Results
Changes to the file pool policy are applied when the next SmartPools job runs. You can
also start the SmartPools job manually to execute the policy immediately.
Delete a file pool policy
You can delete any file pool policy except the default file pool policy.
Procedure
1. Click File System > Storage Pools > File Pool Policies.
The File Pool Policies tab displays two lists: File Pool Policies and Policy Templates.
2. In the File Pool Policies list, next to the policy that you want to delete, click More >
Delete Policy.
3. In the Confirm Delete dialog box, click Delete.
Results
The file pool policy is removed from the File Pool Policies list.
CHAPTER 20
System jobs
To initiate any Job Engine tasks, you must have the role of SystemAdmin in the OneFS
system.
Job name          Description                                 Exclusion  Impact   Operation
                                                              set        policy

AutoBalance       Balances free space in the cluster.         Restripe   Low      Auto
AutoBalanceLin    Balances free space in the cluster; used    Restripe   Low      Auto
                  when file system metadata is stored on
                  SSDs.
AVScan            Performs an antivirus scan on all files.    None       Low      Manual
Collect           Reclaims free space that previously could   Mark       Low      Auto
                  not be freed because the node or drive
                  was unavailable.
Dedupe*           Scans a directory for redundant data        None       Low      Manual
                  blocks and deduplicates all redundant
                  data stored in the directory. Available
                  only if you activate a SmartDedupe
                  license.
DedupeAssessment  Scans a directory for redundant data        None       Low      Manual
                  blocks and reports an estimate of the
                  amount of space that could be saved by
                  deduplication.
DomainMark        Associates a path, and the contents of      None       Low      Manual
                  that path, with a domain.
FlexProtect       Scans the file system after a device        Restripe   Medium   Auto
                  failure to ensure that all files remain
                  protected.
                  Note: While there is a device failure on
                  a cluster, only the FlexProtect (or
                  FlexProtectLin) job is allowed to run.
FlexProtectLin    Scans the file system after a device        Restripe   Medium   Auto
                  failure to ensure that all files remain
                  protected; used when file system metadata
                  is stored on SSDs.
FSAnalyze         Gathers information about the file          None       Low      Scheduled
                  system.
IntegrityScan     Verifies file system integrity.             Mark       Medium   Manual
MediaScan         Locates and clears media-level errors       Restripe   Low      Scheduled
                  from disks.
MultiScan         Performs the work of the AutoBalance and    Restripe,  Low      Auto
                  Collect jobs simultaneously.                Mark
PermissionRepair  Corrects permissions of files and           None       Low      Manual
                  directories.
QuotaScan*        Updates quota accounting for domains        None       Low      Auto
                  created on an existing file tree.
                  Available only if you activate a
                  SmartQuotas license.
SetProtectPlus    Applies the default file policy.            Restripe   Low      Manual
                  Available only if SmartPools is
                  unlicensed.
ShadowStoreDelete Frees space that is associated with         None       Low      Scheduled
                  unreferenced shadow stores.
SmartPools*       Enforces SmartPools file policies.          Restripe   Low      Scheduled
                  Available only if you activate a
                  SmartPools license.
SnapRevert        Reverts an entire snapshot back to head.    None       Low      Manual
SnapshotDelete    Frees disk space that is associated with    None       Medium   Auto
                  deleted snapshots.
TreeDelete        Deletes a path in /ifs.                     None       Medium   Manual
Job operation
OneFS includes system maintenance jobs that run to ensure that your Isilon cluster
performs at peak health. Through the Job Engine, OneFS runs a subset of these jobs
automatically, as needed, to ensure file and data integrity, check for and mitigate drive
and node failures, and optimize free space. For other jobs, for example, Dedupe, you can
use Job Engine to start them manually or schedule them to run automatically at regular
intervals.
The Job Engine runs system maintenance jobs in the background and prevents jobs
within the same classification (exclusion set) from running simultaneously. Two exclusion
sets are enforced: restripe and mark.
Restripe job types are:
l AutoBalance
l AutoBalanceLin
l FlexProtect
l FlexProtectLin
l MediaScan
l MultiScan
l SetProtectPlus
l SmartPools

Mark job types are:
l Collect
l IntegrityScan
l MultiScan
Note that MultiScan is a member of both the restripe and mark exclusion sets. You
cannot change the exclusion set parameter for a job type.
The Job Engine is also sensitive to job priority, and can run up to three jobs, of any
priority, simultaneously. Job priority is denoted as 1-10, with 1 being the highest and 10
being the lowest. The system uses job priority when a conflict among running or queued
jobs arises. For example, if you manually start a job that has a higher priority than three
other jobs that are already running, Job Engine pauses the lowest-priority active job, runs
the new job, then restarts the older job at the point at which it was paused. Similarly, if
you start a job within the restripe exclusion set, and another restripe job is already
running, the system uses priority to determine which job should run (or remain running)
and which job should be paused (or remain paused).
Other job parameters determine whether jobs are enabled, their performance impact, and
schedule. As system administrator, you can accept the job defaults or adjust these
parameters (except for exclusion set) based on your requirements.
When a job starts, the Job Engine distributes job segments (phases and tasks) across
the nodes of your cluster. One node acts as job coordinator and continually works with
the other nodes to load-balance the work. In this way, no one node is overburdened, and
system resources remain available for other administrator and system I/O activities
not originating from the Job Engine.
After completing a task, each node reports task status to the job coordinator. The node
acting as job coordinator saves this task status information to a checkpoint file.
Consequently, in the case of a power outage, or when paused, a job can always be
restarted from the point at which it was interrupted. This is important because some jobs
can take hours to run and can use considerable system resources.
Impact policy  Allowed to run                               Resource consumption
LOW            Any time of day.                             Low
MEDIUM         Any time of day.                             Medium
HIGH           Any time of day.                             High
OFF_HOURS      Outside of business hours. Business hours    Low
               are defined as 9AM to 5PM, Monday through
               Friday. OFF_HOURS is paused during business
               hours.
If you want to specify an impact policy other than the defaults, you can create a
custom policy with new settings.
Jobs with a low impact policy have the least impact on available CPU and disk I/O
resources. Jobs with a high impact policy have a significantly higher impact. In all cases,
however, the Job Engine uses CPU and disk throttling algorithms to ensure that tasks that
you initiate manually, and other I/O tasks not related to the Job Engine, receive a higher
priority.
Job priorities
Job priorities determine which job takes precedence when more than three jobs of
different exclusion sets attempt to run simultaneously. The Job Engine assigns a priority
value between 1 and 10 to every job, with 1 being the most important and 10 being the
least important.
The maximum number of jobs that can run simultaneously is three. If a fourth job with a
higher priority is started, either manually or through a system event, the Job Engine
pauses one of the lower-priority jobs that is currently running. The Job Engine places the
paused job into a priority queue, and automatically resumes the paused job when one of
the other jobs is completed.
If two jobs of the same priority level are scheduled to run simultaneously, and two other
higher priority jobs are already running, the job that is placed into the queue first is run
first.
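For example, you might start a job manually with an explicit priority and impact policy. The following is a sketch that assumes the --priority and --policy options of the OneFS 7.2 isi job CLI; confirm the option names on your release:
isi job jobs start MultiScan --priority=2 --policy=MEDIUM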
The Job Reports table displays a chronological list of the last ten job events that have
occurred on the cluster. Event information includes the time the event occurred, the
job responsible for the event, and event results.
2. Filter reports by job type by selecting the job from the Filter by Job Type drop-down list
and clicking Reset.
3. Click on View Details next to a job name to view recent events for only that job.
Recent events for the job appear in the View Job Report Details window, and include
information such as start time, duration, and whether or not the job was successful.
Start a job
By default, only some system maintenance jobs are scheduled to run automatically.
However, you can start any of the jobs manually at any time.
Procedure
1. Click Cluster Management > Job Operations > Job Summary.
2. In the Active Jobs window, select the job you want to start and click More.
3. Click Start Running Job.
Pause a job
You can pause a job temporarily to free up system resources.
Procedure
1. Click Cluster Management > Job Operations > Job Summary.
2. In the Active Jobs table, click More for the job that you want to pause.
3. Click Pause Running Job in the menu that appears.
The job remains paused until you resume it.
Resume a job
You can resume a paused job.
Procedure
1. Click Cluster Management > Job Operations > Job Summary.
2. In the Active Jobs table, click More for the job that you want to resume.
3. Click Resume Running Job in the menu that appears.
Results
The job continues from the phase or task at which it was paused.
Cancel a job
To free up system resources, you can permanently discontinue a running, paused, or
waiting job.
Procedure
1. Click Cluster Management > Job Operations > Job Summary.
2. In the Active Jobs table, click More for the job that you want to cancel.
3. Click Cancel Running Job in the menu that appears.
Update a job
You can change the priority and impact policy of a running, waiting, or paused job.
When you update a job, only the current instance of the job runs with the updated
settings. The next instance of the job returns to the default settings for that job.
You can modify the description and the impact schedule of a custom impact policy. For
example, to modify the policy description:
a. In the Description field, type a new overview for the impact policy.
b. Click Submit.
The modified impact policy is saved and listed in alphabetical order in the Impact
Policies table.
If the job is not on the first page of the Job Reports screen, click the right-arrow icon to
page through the list until you locate your job.
3. Click View Details.
The View Job Report Details screen appears, listing job statistics such as elapsed
time, CPU and memory usage, and total I/O operations.
4. When you are finished viewing the report, click Close.
CHAPTER 21
Networking
Networking overview
After you determine the topology of your network, you can set up and manage your
internal and external networks.
There are two types of networks associated with an EMC Isilon cluster:
Internal
Nodes communicate with each other using a high-speed, low-latency InfiniBand
network. You can optionally configure a second InfiniBand network as a failover for
redundancy.
External
Clients connect to the cluster through the external network with Ethernet. The Isilon
cluster supports standard network communication protocols, including NFS, SMB,
HTTP, and FTP. The cluster includes various external Ethernet connections, providing
flexibility for a wide variety of network configurations. External network speeds vary
by product.
During the initial cluster configuration, OneFS performs the following actions:
Creates a default external network subnet called subnet0, with the specified
netmask, gateway, and SmartConnect service address.
Creates a default IP address pool called pool0 with the specified IP address range,
the SmartConnect zone name, and the external interface of the first node in the
cluster as the only member.
Creates a default network provisioning rule called rule0, which automatically assigns
the first external interface for all newly added nodes to pool0.
Adds pool0 to subnet0 and configures pool0 to use the virtual IP of subnet0 as its
SmartConnect service address.
Sets the global, outbound DNS settings to the domain name server list and DNS
search list, if provided.
Once the initial external network has been established, you can configure the following
information about your external network:
Netmask
IP address range
Gateway
You can make modifications to the external network through the web administration
interface and the command-line interface.
IP address pools
You can partition EMC Isilon cluster nodes and external network interfaces into logical IP
address pools. IP address pools are also utilized when configuring SmartConnect zones
and IP failover support for protocols such as NFS. Multiple pools for a single subnet are
available only if you activate a SmartConnect Advanced license.
IP address pools:
The IP address pool of a subnet consists of one or more IP address ranges and a set of
cluster interfaces. All IP address ranges in a pool must be unique.
A default IP address pool is configured during the initial cluster setup through the
command-line configuration wizard. You can modify the default IP address pool at any
time. You can also add, remove, or modify additional IP address pools.
If you add external network subnets to your cluster through the subnet wizard, you must
specify the IP address pools that belong to the subnet.
IP address pools are allocated to external network interfaces either dynamically or
statically. The static allocation method assigns one IP address per pool interface. The IP
addresses remain assigned, regardless of that interface's status, but the method does
not guarantee that all IP addresses are assigned. The dynamic allocation method
distributes all pool IP addresses, and the IP address can be moved depending on the
interface's status and connection policy settings.
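To review how addresses are currently allocated across pools and interfaces, you can list pool details from the command-line interface, using the same command that appears later in this chapter:
isi networks list pools -v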
IPv6 support
You can configure dual stack support for IPv6.
With dual-stack support in OneFS, you can configure both IPv4 and IPv6 addresses.
However, configuring an EMC Isilon cluster to use IPv6 exclusively is not supported. When
you set up the cluster, the initial subnet must consist of IPv4 addresses.
The following table describes important distinctions between IPv4 and IPv6.
IPv4                  IPv6
32-bit addresses      128-bit addresses
Subnet mask           Prefix length
SmartConnect module
SmartConnect is a module that specifies how the DNS server on the EMC Isilon cluster
handles connection requests from clients and the methods used to assign IP addresses
to network interfaces.
Settings and policies configured for SmartConnect are applied per IP address pool. You
can configure basic and advanced SmartConnect settings.
SmartConnect Basic
SmartConnect Basic is included with OneFS as a standard feature and does not require a
license.
SmartConnect Basic supports the following settings:
You may only assign one IP address pool per external network subnet.
SmartConnect Advanced
SmartConnect Advanced extends the settings available from SmartConnect Basic. It
requires an active license.
SmartConnect Advanced supports the following settings:
SmartConnect Advanced allows you to specify the following IP address pool configuration
options:
You can define an IP address failover policy for the IP address pool.
You can define an IP address rebalance policy for the IP address pool.
Connection balancing
The connection balancing policy determines how the DNS server handles client
connections to the EMC Isilon cluster.
You can specify one of the following balancing methods:
Round robin
Selects the next available node on a rotating basis. This is the default method.
Without a SmartConnect license for advanced settings, this is the only method
available for load balancing.
Connection count
Determines the number of open TCP connections on each available node and selects
the node with the fewest client connections.
Network throughput
Determines the average throughput on each available node and selects the node
with the lowest network interface load.
CPU usage
Determines the average CPU utilization on each available node and selects the node
with the lightest processor usage.
IP address allocation
The IP address allocation policy specifies how the IP addresses in the pool are
assigned to available network interfaces.
You can specify whether to use static or dynamic allocation.
Static
Assigns one IP address to each network interface added to the IP address pool, but
does not guarantee that all IP addresses are assigned.
Once assigned, the network interface keeps the IP address indefinitely, even if the
network interface becomes unavailable. To release the IP address, remove the
network interface from the pool or remove it from the cluster.
Without a license for SmartConnect Advanced, static is the only method available for
IP address allocation.
Dynamic
Assigns IP addresses to each network interface added to the IP address pool until all
IP addresses are assigned. This guarantees a response when clients connect to any
IP address in the pool.
If a network interface becomes unavailable, its IP addresses are automatically
moved to other available network interfaces in the pool as determined by the IP
address failover policy.
This method is only available with a license for SmartConnect Advanced.
The following table lists the suggested IP address allocation method for each file
sharing protocol:

File sharing protocol                                  Suggested allocation method
SMB, NFSv4, HTTP, FTP, sFTP, FTPS, HDFS, SyncIQ        Static
NFSv2, NFSv3                                           Dynamic
IP address failover
The IP address failover policy specifies how to handle the IP addresses of network
interfaces that become unavailable.
To define an IP address failover policy, you must have a license for SmartConnect
Advanced, and the IP address allocation policy must be set to dynamic. Dynamic IP
address allocation ensures that all of the IP addresses in the pool are assigned to
available network interfaces.
When a network interface becomes unavailable, the IP addresses that were assigned to it
are redistributed to available network interfaces according to the IP address failover
policy. Subsequent client connections are directed to the new network interfaces.
You can select one of the following connection balancing methods to determine how
the IP address failover policy selects which network interface receives a redistributed IP
address:
Round robin
Connection count
Network throughput
CPU usage
IP address rebalancing
The IP address rebalance policy specifies when to redistribute IP addresses if one or more
previously unavailable network interfaces becomes available again.
To define an IP address rebalance policy, you must have a license for SmartConnect
Advanced, and the IP address allocation policy must be set to dynamic. Dynamic IP
address allocation ensures that all of the IP addresses in the pool are assigned to
available network interfaces.
You can set rebalancing to occur manually or automatically:
Manual
Does not redistribute IP addresses until you manually issue a rebalance command
through the command-line interface.
Upon rebalancing, IP addresses will be redistributed according to the connection
balancing method specified by the IP address failover policy defined for the IP
address pool.
Automatic
Automatically redistributes IP addresses according to the connection balancing
method specified by the IP address failover policy defined for the IP address pool.
Automatic rebalance may also be triggered by changes to cluster nodes, network
interfaces, or the configuration of the external network.
Note
Rebalancing can disrupt client connections. Ensure the client workflow on the IP
address pool is appropriate for automatic rebalancing.
SmartConnect requires that you add a new name server (NS) record to the existing
authoritative DNS zone that contains the cluster, and you must provide the fully qualified
domain name (FQDN) of the SmartConnect zone.
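Once the delegation is in place, you can confirm that SmartConnect is answering queries and rotating addresses by querying the zone name repeatedly with a standard DNS tool. The zone name below is hypothetical:
dig +short cluster.example.com
dig +short cluster.example.com
Under the round robin connection policy, repeated queries should return the IP addresses of different nodes in the pool.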
NIC aggregation
Network interface card (NIC) aggregation, also known as link aggregation, is optional, and
enables you to combine the bandwidth of a node's physical network interface cards into
a single logical connection. NIC aggregation provides improved network throughput.
Note
Some NICs may allow aggregation of ports only on the same network card.
For LACP and FEC aggregation modes, the switch must support IEEE 802.3ad link
aggregation. Since the trunks on the network switch must also be configured, the
node must be connected with the correct ports on the switch.
Routing options
By default, outgoing client traffic on the EMC Isilon cluster is destination-based; traffic is
routed to a particular gateway based on where the traffic is going. OneFS supports
source-based routing and static routes; these options allow for more granular control of
the direction of outgoing client traffic.
Source-based routing
Source-based routing selects which gateway to direct outgoing client traffic through
based on the source IP address in each packet header.
In the following example, you enable source-based routing on an Isilon cluster that is
connected to SubnetA and SubnetB. Each subnet is configured with a SmartConnect zone
and a gateway, also labeled A and B. When a client on SubnetA makes a request to
SmartConnect ZoneB, the response originates from ZoneB. This results in a ZoneB
address as the source IP in the packet header, and the response is routed through
GatewayB. Without source-based routing, the default route is destination-based, so the
response is routed through GatewayA.
In another example, a client on SubnetC, which is not connected to the Isilon cluster,
makes a request to SmartConnect ZoneA and ZoneB. The response from ZoneA is routed
through GatewayA, and the response from ZoneB is routed through GatewayB. In other
words, the traffic is split between gateways. Without source-based routing, both
responses are routed through the same gateway.
When enabled, source-based routing automatically scans your network configuration to
create client traffic rules. If you make modifications to your network configuration, such
as changing the IP address of a gateway server, source-based routing adjusts the rules.
Source-based routing is applied across the entire cluster and only supports the IPv4
protocol.
Enabling or disabling source-based routing goes into effect immediately. Packets in
transit continue on their original courses, and subsequent traffic is routed based on the
status change. Transactions composed of multiple packets might be disrupted or delayed
if the status of source-based routing changes during transmission.
Source-based routing can conflict with static routes. If a routing conflict occurs, source-based routing rules are prioritized over the static route.
You might enable source-based routing if you have a large network with a complex
topology. For example, if your network is a multi-tenant environment with several
gateways, traffic is more efficiently distributed with source-based routing.
Static routing
A static route directs outgoing client traffic to a specified gateway based on the IP
address the client is connected through.
You configure static routes by IP address pool, and each route applies to all network
interfaces that are members of the IP address pool. Static routes only support the IPv4
protocol.
You might configure static routing in order to connect to networks that are unavailable
through the default routes or if you have a small network that only requires one or two
routes.
Note
If you have upgraded from a version earlier than OneFS 7.0, existing static routes that
were added through rc scripts will no longer work and must be re-created by running the
isi networks modify pool command with the --add-static-routes option.
VLANs
Virtual LAN (VLAN) tagging is an optional setting that enables an EMC Isilon cluster to
participate in multiple virtual networks.
You can partition a physical network into multiple broadcast domains, or virtual local
area networks (VLANs). You can enable a cluster to participate in a VLAN, which allows
multiple cluster subnet support without multiple network switches; one physical switch
enables multiple virtual subnets.
VLAN tagging inserts an ID into packet headers. The switch refers to the ID to identify from
which VLAN the packet originated and to which network interface a packet should be
sent.
Modify the IP address ranges of the internal network and the int-b/failover network
You can configure the int-b/failover network to provide backup in the event of an int-a
network failure. Configuration involves specifying a valid netmask and IP address range
for the failover network.
3. In the IP Ranges area, you can add, delete, or migrate your IP address ranges.
Ideally, the new range is contiguous with the previous one. For example, if your
current IP address range is 192.168.160.60 - 192.168.160.162, the new range should
start with 192.168.160.163.
4. Click Submit.
5. Restart the cluster, if needed.
If the IP address changes are within the internal network netmask, you do not need
to restart the cluster.
If the IP address changes reduce the current range, you must restart the cluster.
If you migrate the IP address ranges, you must restart the cluster.
Note
For the changes in netmask value to take effect, you must reboot the cluster.
Procedure
1. Click Cluster Configuration > Network Configuration.
2. In the Internal Network Settings area, select the network that you want to configure
the netmask for.
It is recommended that the netmask values you specify for int-a and int-b/failover are
the same. If you modify the netmask value of one, modify the other.
3. In the Netmask field, type a netmask value.
You cannot modify the netmask value if the change invalidates any node addresses.
4. Click Submit.
A dialog box prompts you to reboot the cluster.
5. Specify when you want to reboot the cluster.
To immediately reboot the cluster, click Yes. When the cluster finishes rebooting,
the login page appears.
To return to the Edit Internal Network page without changing the settings or
rebooting the cluster, click No.
It is recommended that the netmask values you specify for int-a and int-b/failover are
the same.
9. In the Settings area, for State, click Enable to enable the int-b and failover networks.
10. Click Submit.
The Confirm Cluster Reboot dialog box appears.
11. Restart the cluster by clicking Yes.
Adding a subnet
You can add and configure an external subnet.
Adding a subnet to the external network encompasses these tasks:
Procedure
1. Configuring subnet settings.
2. Adding an IP address to a new subnet.
3. Optional: Configuring SmartConnect settings for a new subnet.
4. Selecting interface members for a new subnet.
5. Specify the IP address format for the subnet and configure an associated netmask or
prefix length setting:
For an IPv4 subnet, click IPv4 in the IP Family list. In the Netmask field, type a
dotted decimal octet (x.x.x.x) that represents the subnet mask.
For an IPv6 subnet, click IPv6 in the IP family list. In the Prefix length field, type an
integer (ranging from 1 to 128) that represents the network prefix length.
6. In the MTU list, type or select the size of the maximum transmission units the cluster
uses in network communication. Any numerical value is allowed, but must be
compatible with your network and the configuration of all devices in the network path.
Common settings are 1500 (standard frames) and 9000 (jumbo frames).
Although OneFS supports both 1500 MTU and 9000 MTU, using a larger frame size for
network traffic permits more efficient communication on the external network
between clients and cluster nodes. For example, if a subnet is connected through a 10
GbE interface and NIC aggregation is configured for IP address pools in the subnet, it
is recommended you set the MTU to 9000. To benefit from using jumbo frames, all
devices in the network path must be configured to use jumbo frames.
7. In the Gateway address field, type the IP address of the gateway server device through
which the cluster communicates with systems outside of the subnet.
8. In the Gateway priority field, type an integer for the priority of the subnet gateway for
nodes assigned to more than one subnet.
You can configure only one default gateway per node, but each subnet can be
assigned a gateway. When a node belongs to more than one subnet, this option
enables you to define the preferred default gateway. A value of 1 represents the
highest priority, and 10 represents the lowest priority.
9. If you plan to use SmartConnect for connection balancing, in the SmartConnect
service IP field, type the IP address that will receive all incoming DNS requests for
each IP address pool according to the client connection policy. You must have at least
one subnet configured with a SmartConnect service IP in order to use connection
balancing.
10. Optional: In the Advanced section, you can enable VLAN tagging if you want to enable
the cluster to participate in virtual networks.
If your cluster is running SmartConnect Basic for connection balancing, you can configure
only one IP address pool per subnet. If you activate a SmartConnect Advanced license,
you can configure unlimited IP address pools per subnet.
Procedure
1. In the Step 2 of 4 – IP Address Pool Settings dialog box, type a unique Name for the
IP address pool. The name can be up to 32 alphanumeric characters long and can
include underscores or hyphens, but no spaces or other punctuation.
2. Type a Description for the IP address pool. The description can contain up to 128
characters.
3. In the Access zone list, click to select an access zone for the pool. OneFS includes a
default system access zone.
4. In the IP range (low-high) area, click New.
OneFS adds an IP address range with default Low IP and High IP values.
5. Click to select the default Low IP value. Replace the default value with the starting IP
address of the subnet's IP address pool.
6. Click to select the default High IP value. Replace the default value with the ending IP
address of the subnet's IP address pool.
7. Optional: Add IP address ranges to the IP address pool by repeating steps 3 through 6
as needed.
8. Click Next.
The Step 3 of 4 – SmartConnect Settings dialog box appears.
After you finish
The next step in the process of adding a new subnet is configuring SmartConnect
settings, which is optional. If you do not wish to configure SmartConnect settings, the
next step is adding network interface members to the new subnet.
SmartConnect requires that you add a new name server (NS) record to the existing
authoritative DNS zone that contains the cluster and that you delegate the SmartConnect
zone as a fully qualified domain name (FQDN).
Procedure
1. In the Step 3 of 4 – SmartConnect Settings dialog box, type a Zone name for the
SmartConnect zone that this IP address pool represents. The zone name must be
unique among the pools served by the SmartConnect service subnet specified in Step
3 below.
2. In the Connection policy list, select the type of connection balancing policy set by the
IP address pool of this subnet. The connection balancing policy determines how
SmartConnect distributes incoming DNS requests across the members of an IP
address pool.
Option               Description
Round Robin          Selects the next available node on a rotating basis. This is the default policy.
Connection Count     Determines the number of open TCP connections on each available node and selects the node with the fewest client connections.
Network Throughput   Sets the overall average throughput volume on each available node to optimize the cluster usage.
CPU Usage            Determines the average CPU utilization on each available node and selects the node with the lightest processor usage.
3. In the SmartConnect service subnet list, select the name of the external network
subnet whose SmartConnect service will answer DNS requests on behalf of the IP
address pool. A pool can have only one SmartConnect service answering DNS
requests. If this option is left blank, the IP address pool the subnet belongs to is
excluded when SmartConnect answers incoming DNS requests for the cluster.
Note
If you have activated a SmartConnect Advanced license, complete the following steps
for the options in the SmartConnect Advanced section of this wizard page.
4. In the IP allocation method list, select the method by which IP addresses are assigned
to the member interfaces for this IP address pool:
Option    Description
Static    Select this IP allocation method to assign one IP address to each member
          interface of the IP address pool. Once assigned, the network interface keeps
          the IP address indefinitely, but this method does not guarantee that all IP
          addresses in the pool are assigned.
Dynamic   Select this IP allocation method to ensure that all IP addresses in the IP
          address pool are assigned to member interfaces, which allows clients to
          connect to any IP addresses in the pool and be guaranteed a response. If
          a node or an interface becomes unavailable, their IP addresses are
          automatically moved to other available member interfaces in the pool.
If you select the dynamic IP allocation method, you can specify the SmartConnect
Rebalance policy and the IP failover policy in the next two steps.
5. Select the type of SmartConnect Rebalance policy to redistribute IP addresses. IP
address redistribution occurs when node interface members in an IP address pool
become available. These options can only be selected if the IP allocation method is
set to Dynamic.
Option                         Description
Automatic Failback (default)   Automatically redistributes IP addresses. The automatic
                               rebalance is triggered by a change to one of the following
                               items: cluster membership, network interfaces, or the
                               configuration of the external network.
Manual Failback                Does not redistribute IP addresses until you manually issue
                               a rebalance command through the command-line interface.

6. Select the type of IP failover policy, which determines how SmartConnect selects the
network interface that receives a redistributed IP address:

Option               Description
Round Robin          Selects the next available node on a rotating basis. This is the default policy.
Connection Count     Determines the number of open TCP connections on each available node and selects the node with the fewest client connections.
Network Throughput   Sets the overall average throughput volume on each available node to optimize the cluster usage.
CPU Usage            Determines the average CPU utilization on each available node and selects the node with the lightest processor usage.
7. Click Next to store the changes that you made to this wizard page.
The Step 4 of 4 – IP Address Pool members dialog box appears.
After you finish
The next step in the process of adding a new subnet is adding network interface
members.
Modifying an external network subnet that is in use can disable access to the cluster and
the web administration interface. OneFS displays a warning if deleting a subnet will
terminate communication between the cluster and the web administration interface.
Procedure
1. Click Cluster Management > Network Configuration.
2. In the External Network Settings area, click the name of the subnet you want to
modify.
3. In the Settings area, click Edit.
4. Modify the Basic subnet settings as needed.
Option                        Description
Description                   A descriptive comment about the subnet.
Netmask                       The subnet mask for the network interface. This field appears
                              only for IPv4 subnets.
MTU                           The size of the maximum transmission units the cluster uses in
                              network communication.
Gateway address               The IP address of the gateway server through which the cluster
                              communicates with systems outside of the subnet.
Gateway priority              The priority of the subnet's gateway for nodes that are assigned
                              to more than one subnet. Only one default gateway can be
                              configured on each Isilon node, but each subnet can have its
                              own gateway. If a node belongs to more than one subnet, this
                              option enables you to define the preferred default gateway. A
                              value of 1 is the highest priority, with 10 being the lowest
                              priority.
SmartConnect service IP       The IP address that receives incoming DNS requests for the
                              subnet's IP address pools.

5. Modify the Advanced subnet settings as needed.

Option                        Description
VLAN tagging                  Enables the cluster to participate in virtual networks.
VLAN ID                       The ID of the VLAN in which the cluster participates.
Hardware load balancing IPs   You can enter the IP address for a hardware load balancing switch
                              that uses Direct Server Return (DSR).
6. Click Submit.
Procedure
1. Click Cluster Management > Network Configuration.
2. In the External Network Settings area, click the name of the subnet you want to
delete.
The Edit Subnet page appears for the subnet you specified.
3. Click Delete subnet.
A confirmation dialog box appears.
4. Click Yes to delete the subnet.
If the subnet you are deleting is used to communicate with the web administration
interface, the confirmation message will contain an additional warning.
3. To verify that the static route was created, run the following command: isi networks
ls pools -v.
Configure IP failover
You can configure IP failover to reassign an IP address from an unavailable node to a
functional node, which enables clients to continue communicating with the cluster, even
after a node becomes unavailable.
Procedure
1. Click Cluster Management > Network Configuration.
2. In the External Network Settings area, click the name of the subnet for which you
want to set up IP failover.
3. Expand the area of the pool you want to modify and click Edit in the SmartConnect
Settings area.
4. Optional: In the Zone name field, enter a name for the zone, using no more than 128
characters.
5. In the Connection policy list, select the type of connection balancing policy for the IP
address pool:

Option               Description
Round Robin          Selects the next available node on a rotating basis. This is the default policy.
Connection Count     Determines the number of open TCP connections on each available node and selects the node with the fewest client connections.
Network Throughput   Uses the overall average throughput volume on each available node to optimize the cluster usage.
CPU Usage            Determines the average CPU utilization on each available node and selects the node with the lightest processor usage.
6. If you purchased a license for SmartConnect Advanced, you will also have access to
the following lists:
IP allocation method
This setting determines how IP addresses are assigned to clients. Select either
Dynamic or Static.
Rebalance Policy
This setting defines the client redirection policy for when a node becomes
unavailable. The IP allocation list must be set to Dynamic in order for rebalance
policy options to be selected.
IP failover policy
This setting defines the client redirection policy when an IP address becomes
unavailable.
If a node contains both 1 GbE and 10 GbE Ethernet interfaces, both types of interfaces
can be aggregated, but only with interfaces of the same type. NIC aggregation cannot be
used with mixed interface types.
An external interface for a node cannot be used by an IP address pool both in an
aggregated configuration and as an individual interface. You must remove the individual
interface for a node from the Interfaces in current pool table before configuring an
aggregated NIC. Otherwise, the web administration interface displays an error message
when you click Submit.
Note
Consult your network switch documentation for supported NIC aggregation modes.
OneFS supports the following NIC aggregation modes:
Link Aggregation Control Protocol (LACP)
Legacy Fast EtherChannel (FEC) mode
EtherChannel (FEC)
Active / Passive Failover
Round-Robin Tx
6. Click Submit.
LNI numbering corresponds to the physical positioning of the NIC ports as found on
the back of the node. LNI mappings are numbered from left to right.
Aggregated LNIs are listed in the order in which they are aggregated at the time they
are created.
LNI        NIC     Aggregated LNI
ext-1      em0     lagg0; fec0
ext-2      em1
ext-1      em2     lagg0; fec0
ext-2      em3     lagg1; fec1
ext-3      em0     lagg2; fec2
ext-4      em1
ext-1      em0     lagg0; fec0
ext-2      em1     lagg1; fec1
10gige-1   cxgb0
10gige-2   cxgb1
4. Select the name of the aggregated NIC for the node that you want to remove in the
Interfaces in current pool table, and then click the left arrow button to move the name
into the Available interfaces table.
5. Select one or more individual interfaces for the node in the Available interfaces table,
and then click the right arrow button to move the interfaces into the Interfaces in
current pool table.
6. When you have completed modifying the node interface settings, click Submit.
Note
DNS resolver options are described in the /etc/resolv.conf file. Setting DNS
resolver options is not recommended. Most clusters do not need DNS resolver options
and setting them may change how OneFS performs DNS lookups.
6. Click Submit.
Round robin is the only connection policy available if you have not activated a
SmartConnect Advanced license.
6. In the Connection policy list, select the type of connection balancing policy for the IP
address pool:

Option               Description
Round Robin          Selects the next available node on a rotating basis. This is the default policy.
Connection Count     Determines the number of open TCP connections on each available node and selects the node with the fewest client connections.
Network Throughput   Sets the overall average throughput volume on each available node to optimize the cluster usage.
CPU Usage            Determines the average CPU utilization on each available node and selects the node with the lightest processor usage.
7. In the SmartConnect service subnet list, select the name of the external network
subnet whose SmartConnect service answers DNS requests on behalf of the IP
address pool. A pool can have only one SmartConnect service answering DNS
requests. If this option is left blank, the IP address pool that the SmartConnect service
belongs to is excluded when SmartConnect answers incoming DNS requests for the
cluster.
If you have activated a SmartConnect Advanced license, complete the following steps
in the SmartConnect Advanced area.
8. In the IP allocation method list, select the method by which IP addresses are assigned
to the member interfaces for this IP address pool.
3. In the Name field, type a unique name for the provisioning rule. The rule name can be
a maximum of 32 characters and can include spaces or other punctuation.
4. Optional: In the Description field, type a descriptive comment about the provisioning
rule.
5. In the If node type is list, select the type of node to which you want to apply the rule:
Option
Description
Any
Apply the provisioning rule to all types of Isilon nodes that join
the cluster.
Storage-i
Accelerator-i
Storage
Accelerator
6. In the interface list, select the interface that the provisioning rule assigns to the new
node:
ext-1
ext-2
ext-3
ext-4
ext-agg
ext-agg-2
ext-agg-3
ext-agg-4
10gige-1
10gige-2
7. In the Subnet list, select the subnet that should be used by the new node.
8. In the Pool list, select the IP address pool of the subnet that should be used by the
new node.
9. Click Submit.
Source-based routing is enabled or disabled on the entire EMC Isilon cluster and
supports only the IPv4 protocol.
Enabling and disabling source-based routing is supported only through the
command-line interface.
Procedure
1. Enable source-based routing by running the following command:
isi networks sbr enable
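To disable source-based routing later, the corresponding command presumably follows the same namespace; verify it with the CLI help before relying on it:
isi networks sbr disable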
Configure a static route on a per-pool basis. Static routes support only the IPv4 protocol.
You can only add or remove a static route through the command-line interface.
Procedure
1. Optional: Identify the name of the IP address pool you want to modify for static routes
by running the following command:
isi networks list pools
2. Configure static routes on a pool by running the isi networks modify pool
command.
Specify the route in classless inter-domain routing (CIDR) notation format.
The following command adds a static route to pool5 and assigns the route to all
network interfaces that are members of the pool:
isi networks modify pool subnet10:pool5 --add-static-routes=192.168.205.128/24-192.168.205.2
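Removing a route follows the same pattern. The following sketch assumes a corresponding --remove-static-routes option; confirm the option name on your release:
isi networks modify pool subnet10:pool5 --remove-static-routes=192.168.205.128/24-192.168.205.2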
CHAPTER 22
Hadoop
Hadoop overview
Hadoop is an open-source platform that runs analytics on large sets of data across a
distributed file system.
In a Hadoop implementation on an EMC Isilon cluster, OneFS acts as the distributed file
system and HDFS is supported as a native protocol. Clients from a Hadoop cluster
connect to the Isilon cluster through the HDFS protocol to manage and process data.
Hadoop support on the cluster requires you to activate an HDFS license. To obtain a
license, contact your EMC Isilon sales representative.
Hadoop architecture
Hadoop consists of a compute layer and a storage layer.
In a typical Hadoop implementation, both layers exist on the same cluster.
How Hadoop is implemented on OneFS
The compute and storage layers are on separate clusters instead of the same cluster.
Instead of storing data within a Hadoop distributed file system, the storage layer
functionality is fulfilled by OneFS on an EMC Isilon cluster. Nodes on the Isilon cluster
function as both a NameNode and a DataNode.
The compute layer is established on a Hadoop compute cluster that is separate from
the Isilon cluster. MapReduce and its components are installed on the Hadoop
compute cluster only.
In addition to HDFS, clients from the Hadoop compute cluster can connect to the
Isilon cluster over any protocol that OneFS supports such as NFS, SMB, FTP, and
HTTP.
Hadoop compute clients can connect to any node on the Isilon cluster that functions
as a NameNode instead of being routed by a single NameNode.
The following table lists the Hadoop distributions and versions that are supported:

Hadoop distribution          Versions supported
Cloudera                     3 (Updates 2–5); 4.2; 5.0; 5.1
Cloudera Manager             4.0; 5.2
Greenplum GPHD               1.1; 1.2
HAWQ                         1.1.0.1
Hortonworks Data Platform    1.1.1–1.3.3 (non-GUI); 2.1
Pivotal HD                   1.0.1; 2.0
Apache Hadoop                0.20.203; 0.20.205; 1.0.0–1.0.3; 1.2.1; 2.0.x; 2.2–2.4
WebHDFS
OneFS supports access to HDFS data through WebHDFS client applications.
WebHDFS is a RESTful programming interface based on HTTP operations such as GET,
PUT, POST, and DELETE that is available for creating client applications. WebHDFS client
applications allow you to access HDFS data and perform HDFS operations through HTTP
and HTTPS.
WebHDFS is supported by OneFS on a per-access zone basis and is enabled by default.
WebHDFS supports simple authentication and Kerberos authentication. If the HDFS
authentication method for an access zone is set to All, OneFS uses simple
authentication by default.
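Because WebHDFS is a RESTful interface, you can exercise it with any HTTP client. The following sketch assumes simple authentication, a hypothetical cluster name and user name, and port 8082, which is assumed to be the OneFS WebHDFS port; adjust the host, port, and user for your environment:
curl -i "http://cluster.example.com:8082/webhdfs/v1/?op=LISTSTATUS&user.name=hadoop-user1"
The LISTSTATUS operation returns a JSON directory listing of the HDFS root directory configured for the access zone.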
Secure impersonation
Secure impersonation enables you to create proxy users that can impersonate other
users to run Hadoop jobs.
You might configure secure impersonation if you use applications, such as Apache Oozie,
to automatically schedule, manage, and run Hadoop jobs. For example, you can create an
Oozie proxy user that securely impersonates a user called HadoopAdmin, which allows
the Oozie user to request that Hadoop jobs be performed by the HadoopAdmin user.
You configure proxy users for secure impersonation on a per-zone basis, and users or
groups of users that you assign as members to the proxy user must be from the same
access zone. A member can be one or more of the following identity types:
If the proxy user does not present valid credentials or if a proxy user member does not
exist on the cluster, access is denied. The proxy user can only access files and subdirectories located in the HDFS root directory of the access zone. It is recommended that
you limit the members that the proxy user can impersonate to users that have access only
to the data the proxy user needs.
Ambari agent
The Ambari client/server framework is a third-party tool that enables you to configure,
manage, and monitor a Hadoop cluster through a browser-based interface. The OneFS
Ambari agent allows you to monitor the status of HDFS services on the EMC Isilon cluster
through the Ambari interface.
The Ambari agent is configured per access zone; you can configure the OneFS Ambari
agent in any access zone that contains HDFS data. To start an Ambari agent in an access
zone, you must specify the address of the external Ambari server and the address of a
NameNode that acts as the point of contact for the access zone.
The external Ambari server receives communications from the OneFS Ambari agent. Once
the Ambari agent assigned to the access zone registers with the Ambari server, the agent
provides a heartbeat status at regular intervals. The OneFS Ambari agent does not
provide metrics or alerts to the Ambari server. The external Ambari server must be
specified by a resolvable hostname, FQDN, or IP address and must be assigned to an
access zone.
The NameNode is the designated point of contact in an access zone that Hadoop services
managed through the Ambari interface should connect through. For example, if you
manage services such as YARN or Oozie through the Ambari interface, the services will
connect to the access zone through the specified NameNode. The Ambari agent
communicates the location of the designated NameNode to the Ambari server, and to the
Ambari interface, the NameNode represents the access zone. If you change the
designated NameNode address, the Ambari agent will inform the Ambari server. The
NameNode must be a resolvable SmartConnect zone name or an IP address from the IP
address pool associated with the access zone.
Note
The specified NameNode value maps to the NameNode, secondary NameNode, and
DataNode components on the Ambari interface.
The OneFS Ambari agent is based on the Apache Ambari framework and is compatible
with Ambari server versions 1.5.1.110 and 1.6.0.
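Because the Ambari agent is configured per access zone, the settings are typically applied with the zone-scoped CLI. The following is a sketch only; the --hdfs-ambari-server and --hdfs-ambari-namenode option names are assumptions modeled on the other zone-scoped HDFS options shown in this chapter, and the host names are hypothetical, so verify the options with isi zone zones modify --help:
isi zone zones modify zone3 --hdfs-ambari-server=ambari.example.com --hdfs-ambari-namenode=hdfs.zone3.example.com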
Each SmartConnect zone represents a specific pool of IP addresses. When you associate
a SmartConnect zone with an access zone, OneFS only allows Hadoop clients connecting
through the IP addresses in the SmartConnect zone to reach the HDFS data in the access
zone. A root HDFS directory is specified for each access zone. This configuration isolates
data within access zones and allows you to restrict client access to the data.
A SmartConnect zone evenly distributes NameNode requests from Hadoop compute
clients across the access zone. When a Hadoop compute client makes an initial DNS
request to connect to the SmartConnect zone, the Hadoop client is routed to an Isilon
node that serves as a NameNode. Subsequent requests from the Hadoop compute client
go to the same node. When a second Hadoop client makes a DNS request for the
SmartConnect zone, SmartConnect balances the traffic and routes the client connection
to a different node than that used by the previous Hadoop compute client.
If you create a SmartConnect zone, you must add a new name server (NS) record as a
delegated domain to the authoritative DNS zone that contains the Isilon cluster. On the
Hadoop compute cluster, you must add the name of the DNS entry of the SmartConnect
zone to the core-site.xml file so that your Hadoop compute clients connect to a
NameNode with the DNS name of the zone.
SmartConnect is discussed in further detail in the Networking section of this guide.
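As a quick end-to-end check from a Hadoop compute client, you can list the HDFS root directory through the SmartConnect zone name. The zone name below is hypothetical, and 8020 is the customary HDFS RPC port; adjust both for your environment:
hadoop fs -ls hdfs://hdfs.example.com:8020/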
Implementing Hadoop on an Isilon cluster involves the following tasks:
Activate a license for HDFS. When a license is activated, the HDFS service is enabled
by default.
Create directories on the cluster that will be set as HDFS root directories.
Create a SmartConnect zone for balancing connections from Hadoop compute clients.
Create local Hadoop users in access zones that do not have directory services such
as Active Directory or LDAP.
Set the HDFS root directory in each access zone that supports HDFS connections.
Set an authentication method in each access zone that supports HDFS connections.
Procedure
1. Open a secure shell (SSH) connection to any node in the cluster and then log in.
2. Run the isi hdfs settings modify command.
The following example command sets the block size to 1 GB:
isi hdfs settings modify --default-block-size=1G
You must specify the block size in bytes. Suffixes K, M, and G are allowed.
The following example command sets the checksum type to crc32:
isi hdfs settings modify --default-checksum-type=crc32
The following example command sets the number of server threads to 32:
isi hdfs settings modify --server-threads=32
Block size
The HDFS block size setting on the EMC Isilon cluster determines how the HDFS
service returns data upon read requests from Hadoop compute clients.
You can modify the HDFS block size on the cluster to increase the block size
from the default of 64 MB up to 128 MB. Increasing the block size enables the
Isilon cluster nodes to read and write HDFS data in larger blocks and optimize
performance for most use cases.
The Hadoop cluster maintains a different block size that determines how a
Hadoop compute client writes a block of file data to the Isilon cluster. The
optimal block size depends on your data, how you process your data, and other
factors. You can configure the block size on the Hadoop cluster in the dfs.block.size property of the hdfs-site.xml configuration file.
Checksum type
The HDFS service sends the checksum type to Hadoop compute clients, but it
does not send any checksum data, regardless of the checksum type. The
default checksum type is set to None. If your Hadoop distribution requires a
checksum type other than None, you can set the checksum type to CRC32 or
CRC32C.
Service threads
The HDFS service generates multiple threads to handle HDFS traffic on EMC Isilon
nodes.
By default, the service thread value is set to auto, which calculates the thread
count by multiplying the number of cores on a node by eight and adding a
minimum threshold of thirteen. It generates a maximum of 96 threads on a
node.
To support a large system of Hadoop compute clients, you might need to
increase the number of threads. If you are distributing HDFS traffic across all of
the nodes in an Isilon cluster through a SmartConnect zone, the total number of
HDFS service threads should equal at least half of the total number of maps and
reduces on the Hadoop compute cluster. The maximum thread count is 256 per
node.
Logging level
The level of logging performed by the HDFS service, for example:
Error: general errors
Notice: conditions that are not errors, but might require special handling

The default values for these settings are: Block size, 64M; Checksum type, none;
Logging level, crit; Service threads, auto.
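To confirm the current values of these settings, a view counterpart to the modify command shown earlier can presumably be used; the subcommand name is an assumption to verify on your release:
isi hdfs settings view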
The HDFS authentication method that is set for an access zone determines the WebHDFS
support available in that zone:

Authentication method   WebHDFS support
Simple only             Simple authentication
Kerberos only           Kerberos authentication
All                     Simple authentication by default
Procedure
1. Open a secure shell (SSH) connection to any node in the cluster and log in.
2. Optional: To identify the name of the access zone that you want to modify for HDFS,
run the following command:
isi zone zones list
3. Set the HDFS authentication method for the access zone by running the isi zone
zones modify <zone> command, where <zone> is the name of the zone.
The following command specifies that Hadoop compute clients connecting to zone3
must be identified through the simple authentication method:
isi zone zones modify zone3 --hdfs-authentication=simple_only
The following example command specifies that Hadoop compute clients connecting to
zone3 must be identified through the Kerberos authentication method:
isi zone zones modify zone3 --hdfs-authentication=kerberos_only
If directory services are not available in an access zone, you must create a local
Hadoop user that maps to a user on a Hadoop compute client for that access zone. If
directory services are available, a local user account is not required.
Procedure
1. Open a secure shell (SSH) connection to any node in the cluster.
2. Run the isi auth users create command.
The following example command creates a user named hadoop-user1 assigned to a
local authentication provider within the zone3 access zone:
isi auth users create --name=hadoop-user1 --provider=local --zone=zone3
3. Configure the HDFS root directory for this access zone by running the isi zone
zones modify <zone> command, where <zone> is the name of the zone.
The following command specifies that Hadoop compute clients connecting to zone3
are given access to the /ifs/hadoop/ directory:
isi zone zones modify zone3 --hdfs-root-directory=/ifs/hadoop
3. Enable or disable WebHDFS in the access zone by running the isi zone zones
modify command.
The following command enables WebHDFS in zone3:
isi zone zones modify zone3 --webhdfs-enabled=yes
The following command designates hadoop-user23 in zone1 as a new proxy user and
adds the group hadoop-users to the list of members that the proxy user can
impersonate:
isi hdfs proxyusers create hadoop-user23 --zone=zone1 --add-group=hadoop-users
The following command designates hadoop-user23 in zone1 as a new proxy user and
adds UID 2155 to the list of members that the proxy user can impersonate:
isi hdfs proxyusers create hadoop-user23 --zone=zone1 --add-uid=2155
Name: krb_users
ID: SID:S-1-22-2-1003
----------------------
Type: wellknown
Name: LOCAL
ID: SID:S-1-2-0
3. To view the configuration details for a specific proxy user, run the isi hdfs
proxyusers view command.
The following command displays the configuration details for the hadoop-user23
proxy user in zone1:
isi hdfs proxyusers view hadoop-user23 --zone=zone1
Procedure
1. Open a secure shell (SSH) connection to any node in the cluster and log in.
2. Create a virtual HDFS rack by running the isi hdfs racks create command.
A rack name begins with a forward slash; for example, /hdfs-rack2.
The following command creates a rack named /hdfs-rack2:
isi hdfs racks create /hdfs-rack2
3. Modify the virtual HDFS rack by running the isi hdfs racks modify command.
A rack name begins with a forward slash; for example, /hdfs-rack2.
The following example command renames a rack named /hdfs-rack2 to /hdfs-rack5:
isi hdfs racks modify /hdfs-rack2 --new-name=/hdfs-rack5
In addition to adding a new range to the list of existing ranges, you can modify the
client IP address ranges by replacing the current ranges, deleting a specific range or
deleting all ranges.
The following example command replaces any existing IP pools with subnet1:pool1
and subnet2:pool2 on the rack named /hdfs-rack2:
isi hdfs racks modify /hdfs-rack2 --ip-pools=subnet1:pool1,subnet2:pool2
In addition to replacing the list of existing pools with new pools, you can modify the IP
pools by adding pools to the list of current pools, deleting a specific pool, or deleting
all pools.
3. Delete a virtual HDFS rack by running the isi hdfs racks delete command.
A rack name begins with a forward slash; for example, /hdfs-rack2.
The following command deletes the virtual HDFS rack named /hdfs-rack2:
isi hdfs racks delete /hdfs-rack2
The following example command displays setting details for all virtual HDFS racks
configured on the cluster:
isi hdfs racks list -v
3. To view the setting details for a specific virtual HDFS rack, run the isi hdfs racks
view command:
Each rack name begins with a forward slash; for example, /hdfs-rack2.
The following example command displays setting details for the virtual HDFS rack
named /hdfs-rack2:
isi hdfs racks view /hdfs-rack2
CHAPTER 23
Antivirus
Antivirus overview
You can scan the files you store on an Isilon cluster for computer viruses and other
security threats by integrating with third-party scanning services through the Internet
Content Adaptation Protocol (ICAP). OneFS sends files through ICAP to a server running
third-party antivirus scanning software. These servers are referred to as ICAP servers.
ICAP servers scan files for viruses.
After an ICAP server scans a file, it informs OneFS of whether the file is a threat. If a threat
is detected, OneFS informs system administrators by creating an event, displaying near
real-time summary information, and documenting the threat in an antivirus scan report.
You can configure OneFS to request that ICAP servers attempt to repair infected files. You
can also configure OneFS to protect users against potentially dangerous files by
truncating or quarantining infected files.
Before OneFS sends a file to be scanned, it ensures that the scan is not redundant. If a
file has already been scanned and has not been modified, OneFS will not send the file to
be scanned unless the virus database on the ICAP server has been updated since the last
scan.
Note
Antivirus scanning is available only if all nodes in the cluster are connected to the
external network.
On-access scanning
You can configure OneFS to send files to be scanned before they are opened, after they
are closed, or both. Sending files to be scanned after they are closed is faster but less
secure. Sending files to be scanned before they are opened is slower but more secure.
If OneFS is configured to ensure that files are scanned after they are closed, when a user
creates or modifies a file on the cluster, OneFS queues the file to be scanned. OneFS then
sends the file to an ICAP server to be scanned when convenient. In this configuration,
users can always access files without any delay. However, it is possible that after a user
modifies or creates a file, a second user might access the file before the file is scanned. If
a virus was introduced to the file from the first user, the second user will be able to
access the infected file. Also, if an ICAP server is unable to scan a file, the file will still be
accessible to users.
If OneFS ensures that files are scanned before they are opened, when a user attempts to
download a file from the cluster, OneFS first sends the file to an ICAP server to be
scanned. The file is not sent to the user until the scan is complete. Scanning files before
they are opened is more secure than scanning files after they are closed, because users
can access only scanned files. However, scanning files before they are opened requires
users to wait for files to be scanned. You can also configure OneFS to deny access to files
that cannot be scanned by an ICAP server, which can increase the delay. For example, if
no ICAP servers are available, users will not be able to access any files until the ICAP
servers become available again.
If you configure OneFS to ensure that files are scanned before they are opened, it is
recommended that you also configure OneFS to ensure that files are scanned after they
are closed. Scanning files as they are both opened and closed will not necessarily
improve security, but it will usually improve data availability when compared to scanning
files only when they are opened. If a user wants to access a file, the file may have already
been scanned after the file was last modified, and will not need to be scanned again if
the ICAP server database has not been updated since the last scan.
The name and IP address of the user that triggered the scan.
This information is not included in reports triggered by antivirus scan policies.
ICAP servers
The number of ICAP servers that are required to support an Isilon cluster depends on how
virus scanning is configured, the amount of data a cluster processes, and the processing
power of the ICAP servers.
If you intend to scan files exclusively through antivirus scan policies, it is recommended
that you have a minimum of two ICAP servers per cluster. If you intend to scan files on
access, it is recommended that you have at least one ICAP server for each node in the
cluster.
If you configure more than one ICAP server for a cluster, it is important to ensure that the
processing power of each ICAP server is relatively equal. OneFS distributes files to the
ICAP servers on a rotating basis, regardless of the processing power of the ICAP servers. If
one server is significantly more powerful than another, OneFS does not send more files to
the more powerful server.
Supported ICAP antivirus servers include McAfee VirusScan Enterprise 8.7 and later
with VirusScan Enterprise for Storage 1.0 and later.
You can configure OneFS and ICAP servers to react in one of the following ways when
threats are detected:
Repair or quarantine
Attempts to repair infected files. If an ICAP server fails to repair a file, OneFS
quarantines the file. If the ICAP server repairs the file successfully, OneFS sends the
file to the user. Repair or quarantine can be useful if you want to protect users from
accessing infected files while retaining all data on a cluster.
Repair or truncate
Attempts to repair infected files. If an ICAP server fails to repair a file, OneFS
truncates the file. If the ICAP server repairs the file successfully, OneFS sends the file
to the user. Repair or truncate can be useful if you do not care about retaining all
data on your cluster, and you want to free storage space. However, data in infected
files will be lost.
Alert only
Only generates an event for each infected file. It is recommended that you do not
apply this setting.
Repair only
Attempts to repair infected files. Afterwards, OneFS sends the files to the user,
whether or not the ICAP server repaired the files successfully. It is recommended that
you do not apply this setting. If you only attempt to repair files, users will still be
able to access infected files that cannot be repaired.
Quarantine
Quarantines all infected files. It is recommended that you do not apply this setting. If
you quarantine files without attempting to repair them, you might deny access to
infected files that could have been repaired.
Truncate
Truncates all infected files. It is recommended that you do not apply this setting. If
you truncate files without attempting to repair them, you might delete data
unnecessarily.
Click Only scan files smaller than the maximum file size and specify a maximum
file size.
3. In the Filename restrictions area, specify whether to exclude files from being scanned
based on file names and extensions.
Click Scan all files except those with the following extensions or filenames.
4. Optional: If you chose to exclude files based on file names and extensions, specify
which files will be selected.
a. In the Extensions area, click Edit list, and specify extensions.
b. In the Filenames area, click Edit list, and specify filenames.
You can specify the following wild cards:
Wildcard   Description
*          Matches any string in place of the asterisk.
[ ]        Matches any characters contained in the brackets, or a range of characters
           separated by a dash.
5. Click Submit.
To require that all files be scanned before they are opened by a user, select Scan
files when they are opened, and then specify whether you want to allow access to
files that cannot be scanned.
To scan files after they are closed, select Scan files when they are closed.
3. In the Directories to be scanned area, specify the directories that you want to apply
on-access settings to.
If no directories are specified, on-access scanning settings are applied to all files. If you specify one or more directories, only files within those directories are scanned as they are accessed.
4. Click Submit.
6. In the Run policy area, specify whether you want to run the policy according to a
schedule or manually.
Scheduled policies can also be run manually at any time.
Option     Description
Manually   Click Manually.
Scheduled  a. Click Scheduled.
           b. In the Interval area, specify on what days you want the policy to run.
           c. In the Frequency area, specify how often you want the policy to run on the specified days.
7. Click Submit.
Scan a file
You can manually scan an individual file for viruses. This procedure is available only
through the command-line interface (CLI).
Procedure
1. Open a secure shell (SSH) connection to any node in the cluster and log in.
2. Run the isi avscan manual command.
For example, the following command scans /ifs/data/virus_file:
isi avscan manual /ifs/data/virus_file
Rescan a file
You can rescan a file for viruses if, for example, you believe the file is no longer a threat.
Procedure
1. Click Data Protection > Antivirus > Detected Threats.
2. In the Detected Threats table, in the row of a file, click Rescan.
View threats
You can view files that have been identified as threats by an ICAP server.
Procedure
1. Click Data Protection > Antivirus > Detected Threats.
2. In the Detected Threats table, view potentially infected files.
CHAPTER 24
VMware integration
VAAI
OneFS uses VMware vSphere API for Array Integration (VAAI) to support offloading
specific virtual machine storage and management operations from VMware ESXi
hypervisors to an Isilon cluster.
VAAI support enables you to accelerate the process of creating virtual machines and
virtual disks. For OneFS to interact with your vSphere environment through VAAI, your
VMware environment must include ESXi 5.0 or later hypervisors.
If you enable VAAI capabilities for an Isilon cluster, when you clone a virtual machine
residing on the cluster through VMware, OneFS clones the files related to that virtual
machine.
To enable OneFS to use VMware vSphere API for Array Integration (VAAI), you must install
the VAAI NAS plug-in for Isilon on the ESXi server. For more information on the VAAI NAS
plug-in for Isilon, see the VAAI NAS plug-in for Isilon Release Notes.
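As a rough sketch, NAS plug-ins are typically installed on an ESXi host with the esxcli software vib command; the bundle path and file name below are placeholders, so consult the VAAI NAS plug-in for Isilon Release Notes for the actual package and installation steps. For example:
esxcli software vib install -d /tmp/vaai-nas-isilon-plugin.zip
After the installation completes, reboot the host and confirm that the plug-in appears in the output of the following command:
esxcli software vib list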
VASA
OneFS communicates with VMware vSphere through VMware vSphere API for Storage
Awareness (VASA).
VASA support enables you to view information about Isilon clusters through vSphere,
including Isilon-specific alarms in vCenter. VASA support also enables you to integrate
with VMware profile driven storage by providing storage capabilities for Isilon clusters in
vCenter. For OneFS to communicate with vSphere through VASA, your VMware
environment must include ESXi 5.0 or later hypervisors.
Alarm name: Thin-provisioned LUN capacity exceeded
Description: There is not enough available space on the cluster to allocate space for writing data to thinly provisioned LUNs. If this condition persists, you will not be able to write to the virtual machine on this cluster. To resolve this issue, you must free storage space on the cluster.
If the cluster is composed of i-Series, X-Series, or S-Series nodes, but the cluster does not contain SSDs, the cluster is recognized as a capacity cluster.
Capacity
The Isilon cluster is composed of Isilon X-Series nodes that do not contain SSDs. The cluster is configured for a balance between performance and capacity.
Hybrid
The Isilon cluster is composed of nodes associated with two or more storage capabilities. For example, if the cluster contains both Isilon S-Series and NL-Series nodes, the storage capability of the cluster is displayed as Hybrid.
Enable VASA
Before an Isilon cluster can communicate with VMware vSphere through VMware vSphere API for Storage Awareness (VASA), you must enable the VASA daemon on the cluster.
Procedure
1. Open a secure shell (SSH) connection to any node in the cluster and log in.
2. Enable VASA by running the following command:
isi services isi_vasa_d enable
Record the location of where you saved the certificate. You will need this file path
when adding the vendor provider in vCenter.
3. Disable or enable the VASA daemon by running one of the following commands:
isi services isi_vasa_d disable
isi services isi_vasa_d enable
CHAPTER 25
File System Explorer
The File System Explorer displays up to 1000 files in a directory. If more than 1000 files
exist within a directory, the files are displayed without additional information, such as file
size and last modified date.
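For directories that exceed this limit, you can still view complete file details from the CLI; for example, the following command (with an illustrative path) lists file sizes and modification dates over an SSH connection:
ls -l /ifs/data/large_directory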
The contents of the selected directory are displayed in the right pane. You can view
the contents of another directory by clicking the directory in the Directories pane.
Create a directory
You can create a directory under /ifs through the File System Explorer.
Procedure
1. Navigate to File System Management > File System Explorer.
2. In the Directories pane, specify where you want to create the directory.
3. Click Add Directory.
4. In the New Directory Properties dialog box, in the Directory name field, type a name
for the directory.
5. From the User list, select the owner of the directory.
6. From the Group list, select the group for the directory.
7. From the Permissions table, specify the basic permissions for the directory.
8. Click Submit.
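As an alternative sketch, you can also create a directory under /ifs from the CLI over an SSH connection using standard UNIX commands; the path, owner, group, and permissions below are examples only:
mkdir /ifs/data/new_directory
chown jsmith:marketing /ifs/data/new_directory
chmod 755 /ifs/data/new_directory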