PowerScale OneFS CLI Administration Guide

Configure, manage, and monitor ...Dell Technologies PowerScale clusters through the ...OneFS command-line interface. ...OneFS commands extend the standard FreeBSD command set.

PowerScale, EMC, Isilon, OneFS, cluster, web UI, GUI, access control, ACL, Active Directory, ADS, authentication, home directory, identity management, Kerberos, LDAP, NIS, privileges, RBAC, roles, backup and recovery, backup accelerator node, data replication, direct access restore, DMA, incremental forever backup, multi-stream backup, multi-stream restore, backup context, NDMP environment variables, NDMP two-way backup, NDMP three-way backup, snapshot-based incremental backup, token-based incremental backu

Dell Inc. or its subsidiaries.

User Guide for DELL EMC models including: PowerScale OneFS CLI, PowerScale OneFS, PowerScale, OneFS, CLI

PDF Viewing Options

Not Your Device? Search For Manuals or Datasheets below:


File Info : application/pdf, 478 Pages, 3.01MB

Document DEVICE REPORTPowerScale-OneFS-9-2-0-0-CLI-Admin-guide
PowerScale OneFS
CLI Administration Guide
9.2.0.0
May 2021

Notes, cautions, and warnings
NOTE: A NOTE indicates important information that helps you make better use of your product. CAUTION: A CAUTION indicates either potential damage to hardware or loss of data and tells you how to avoid the problem. WARNING: A WARNING indicates a potential for property damage, personal injury, or death.
© 2016 - 2021 Dell Inc. or its subsidiaries. All rights reserved. Dell, EMC, and other trademarks are trademarks of Dell Inc. or its subsidiaries. Other trademarks may be trademarks of their respective owners.

Contents

Chapter 1: Introduction to this guide........................................................................................... 24 About this guide.................................................................................................................................................................24 Scale-out NAS overview..................................................................................................................................................24 Getting help........................................................................................................................................................................ 24
Chapter 2: PowerScale scale-out NAS......................................................................................... 25 OneFS storage architecture........................................................................................................................................... 25 Node components............................................................................................................................................................. 25 Internal and external networks...................................................................................................................................... 26 PowerScale cluster........................................................................................................................................................... 27 Cluster administration.................................................................................................................................................27 Quorum...........................................................................................................................................................................27 Splitting and merging..................................................................................................................................................28 Storage pools................................................................................................................................................................28 The OneFS operating system.........................................................................................................................................28 Mixed data-access protocol environments........................................................................................................... 28 Identity management and access control..............................................................................................................29 Structure of the file system............................................................................................................................................29 Data layout.................................................................................................................................................................... 30 Writing files................................................................................................................................................................... 30 Reading files................................................................................................................................................................. 30 Metadata layout...........................................................................................................................................................30 Locks and concurrency............................................................................................................................................... 31 Striping............................................................................................................................................................................31 Data protection overview.................................................................................................................................................31 N+M data protection.................................................................................................................................................. 32 Data mirroring...............................................................................................................................................................32 The file system journal................................................................................................................................................33 Virtual hot spare (VHS)............................................................................................................................................. 33 Balancing protection with storage space.............................................................................................................. 33 Data compression..............................................................................................................................................................33 Software modules............................................................................................................................................................. 33
Chapter 3: Introduction to the OneFS command-line interface.................................................... 35 OneFS command-line interface overview................................................................................................................... 35 Syntax diagrams................................................................................................................................................................ 35 Universal options............................................................................................................................................................... 36 Command-line interface privileges................................................................................................................................ 37 SmartLock compliance command permissions...........................................................................................................37 OneFS time values............................................................................................................................................................ 39
Chapter 4: General cluster administration................................................................................... 40 General cluster administration overview..................................................................................................................... 40 User interfaces.................................................................................................................................................................. 40

Contents

3

Connecting to the cluster................................................................................................................................................ 41 Log in to the web administration interface............................................................................................................ 41 Open an SSH connection to a cluster..................................................................................................................... 41
Licensing..............................................................................................................................................................................42 Software licenses........................................................................................................................................................ 42 Hardware tiers..............................................................................................................................................................42 License status.............................................................................................................................................................. 42 Adding and removing licenses.................................................................................................................................. 43 Activating trial licenses.............................................................................................................................................. 45
Certificates......................................................................................................................................................................... 45 Certificate management............................................................................................................................................ 45 Replacing or renewing the TLS certificate............................................................................................................46 List available certificates........................................................................................................................................... 49 View certificate settings............................................................................................................................................49 Verify a TLS certificate update................................................................................................................................50 TLS certificate data example................................................................................................................................... 50
Cluster identity.................................................................................................................................................................. 50 Set the cluster name .................................................................................................................................................50
Cluster contact information............................................................................................................................................ 51 Cluster date and time........................................................................................................................................................51
Set the cluster date and time................................................................................................................................... 51 Specify an NTP time server......................................................................................................................................52 SMTP email settings.........................................................................................................................................................52 Configure SMTP email settings .............................................................................................................................. 52 View SMTP email settings........................................................................................................................................ 53 Configuring the cluster join mode................................................................................................................................. 53 Specify the cluster join mode ..................................................................................................................................53 File system settings.......................................................................................................................................................... 54 Specify the cluster character encoding.................................................................................................................54 Enable or disable access time tracking ................................................................................................................. 54 Data compression settings and monitoring................................................................................................................ 55 Data compression terminology.................................................................................................................................55 Enable or disable data compression........................................................................................................................55 View compression statistics..................................................................................................................................... 56 Events and alerts...............................................................................................................................................................58 Events overview.......................................................................................................................................................... 58 Alerts overview............................................................................................................................................................ 58 Channels overview...................................................................................................................................................... 58 Event groups overview.............................................................................................................................................. 59 Viewing and modifying event groups......................................................................................................................59 View an event............................................................................................................................................................... 61 Managing alerts............................................................................................................................................................62 Managing channels......................................................................................................................................................63 Maintenance and testing........................................................................................................................................... 64 Managing event thresholds.......................................................................................................................................66 Security hardening............................................................................................................................................................ 68 STIG hardening profile............................................................................................................................................... 68 Apply a security hardening profile........................................................................................................................... 68 Revert a security hardening profile.........................................................................................................................69 View the security hardening status.........................................................................................................................70

4

Contents

Cluster configuration backup and restore...................................................................................................................70 Cluster configuration backup and restore............................................................................................................. 70
Cluster monitoring.............................................................................................................................................................73 Monitor the cluster......................................................................................................................................................73 View node status......................................................................................................................................................... 73
Monitoring cluster hardware...........................................................................................................................................73 View node hardware status.......................................................................................................................................73 Chassis and drive states............................................................................................................................................ 74 Check battery status.................................................................................................................................................. 75 SNMP monitoring........................................................................................................................................................ 75
Cluster maintenance......................................................................................................................................................... 78 Replacing node components..................................................................................................................................... 78 Upgrading node components....................................................................................................................................79 Automatic Replacement Recognition (ARR) for drives......................................................................................79 Managing drive firmware...........................................................................................................................................80 Managing cluster nodes............................................................................................................................................. 83 Upgrading OneFS........................................................................................................................................................ 84
SRS Summary.................................................................................................................................................................... 85 Obtain signed OneFS license file for evaluation clusters...................................................................................85 Configuring and Enabling SRS Overview...............................................................................................................86 Enable and configure Secure Remote Services support .................................................................................. 86 Diagnostic commands and scripts...........................................................................................................................88 Disable SRS support.................................................................................................................................................... 91 View SRS configuration settings..............................................................................................................................91 PowerScale SRS Managed File Transfer support................................................................................................92
Chapter 5: Access zones............................................................................................................. 95 Data Security overview .................................................................................................................................................. 95 Base directory guidelines.................................................................................................................................................95 Access zones best practices.......................................................................................................................................... 96 Access zones on a SyncIQ secondary cluster............................................................................................................96 Access zone limits............................................................................................................................................................. 97 Quality of service...............................................................................................................................................................97 Zone-based Role-based Access Control (zRBAC)....................................................................................................97 Non-System access zone privileges....................................................................................................................... 98 Built-in roles in non-System zones..........................................................................................................................98 Zone-specific authentication providers....................................................................................................................... 99 Managing access zones...................................................................................................................................................99 Create an access zone...............................................................................................................................................99 Assign an overlapping base directory................................................................................................................... 100 Manage authentication providers in an access zone........................................................................................ 100 Associate an IP address pool with an access zone........................................................................................... 100 Modify an access zone.............................................................................................................................................100 Delete an access zone...............................................................................................................................................101 View a list of access zones...................................................................................................................................... 101 Create one or more access zones.......................................................................................................................... 101 Create local users in an access zone.................................................................................................................... 102 Access files through the RESTful Access to Namespace (RAN) in non-System zones.......................... 102

Contents

5

Chapter 6: Authentication..........................................................................................................104 Authentication overview................................................................................................................................................ 104 Authentication provider features.................................................................................................................................104 Security Identifier (SID) history overview................................................................................................................. 105 Supported authentication providers........................................................................................................................... 105 Active Directory............................................................................................................................................................... 105 LDAP................................................................................................................................................................................... 106 NIS.......................................................................................................................................................................................106 Kerberos authentication.................................................................................................................................................106 Keytabs and SPNs overview................................................................................................................................... 107 MIT Kerberos protocol support.............................................................................................................................. 107 File provider.......................................................................................................................................................................107 Local provider................................................................................................................................................................... 108 Multi-factor Authentication (MFA).............................................................................................................................108 Multi-instance active directory.................................................................................................................................... 108 LDAP public keys............................................................................................................................................................. 108 Managing Active Directory providers......................................................................................................................... 109 Configure an Active Directory provider................................................................................................................109 Modify an Active Directory provider..................................................................................................................... 109 Delete an Active Directory provider......................................................................................................................109 Managing LDAP providers..............................................................................................................................................110 Configure an LDAP provider.................................................................................................................................... 110 Modify an LDAP provider......................................................................................................................................... 110 Delete an LDAP provider.......................................................................................................................................... 110 Configure the LDAP provider to use TLS connections...................................................................................... 111 Managing NIS providers.................................................................................................................................................. 111 Configure an NIS provider.........................................................................................................................................111 Modify an NIS provider.............................................................................................................................................. 111 Delete an NIS provider.............................................................................................................................................. 112 Managing MIT Kerberos authentication......................................................................................................................112 Managing MIT Kerberos realms.............................................................................................................................. 112 Managing MIT Kerberos providers......................................................................................................................... 113 Managing MIT Kerberos domains........................................................................................................................... 115 Managing SPNs and keys......................................................................................................................................... 116 Managing file providers...................................................................................................................................................118 Configure a file provider........................................................................................................................................... 118 Generate a password file.......................................................................................................................................... 118 Modify a file provider.................................................................................................................................................118 Delete a file provider..................................................................................................................................................119 Password file format..................................................................................................................................................119 Group file format........................................................................................................................................................ 119 Netgroup file format................................................................................................................................................. 120 Managing local users and groups.................................................................................................................................120 View a list of users and groups by provider........................................................................................................ 120 Create a local user......................................................................................................................................................121 Create a local group................................................................................................................................................... 121 Naming rules for local users and groups............................................................................................................... 121 Configure or modify a local password policy........................................................................................................121 Local password policy settings............................................................................................................................... 122

6

Contents

Modify a local user.....................................................................................................................................................123 Modify a local group..................................................................................................................................................123 Delete a local user......................................................................................................................................................123 Delete a local group...................................................................................................................................................123 SSH Authentication and Configuration...................................................................................................................... 123 Pre-requisites for Multi-factor Authentication (MFA)..................................................................................... 124 SSH configuration using password........................................................................................................................124 SSH Configuration using public keys.................................................................................................................... 124
Chapter 7: Administrative roles and privileges........................................................................... 126 Role-based access...........................................................................................................................................................126 Roles................................................................................................................................................................................... 126 Custom roles............................................................................................................................................................... 126 Built-in roles.................................................................................................................................................................127 Privileges............................................................................................................................................................................130 Supported OneFS privileges....................................................................................................................................130 Data backup and restore privileges....................................................................................................................... 133 Command-line interface privileges........................................................................................................................ 133 Managing roles................................................................................................................................................................. 137 View roles.....................................................................................................................................................................137 View privileges............................................................................................................................................................ 137 Create and modify a custom role...........................................................................................................................138 Delete a custom role................................................................................................................................................. 138 Add a user to built-in roles.......................................................................................................................................138 Create a new role and add a user.......................................................................................................................... 140
Chapter 8: Identity management................................................................................................ 142 Identity management overview.................................................................................................................................... 142 Identity types.................................................................................................................................................................... 142 Access tokens...................................................................................................................................................................143 Access token generation................................................................................................................................................143 ID mapping................................................................................................................................................................... 144 User mapping.............................................................................................................................................................. 145 On-disk identity.......................................................................................................................................................... 146 Managing ID mappings....................................................................................................................................................147 Create an identity mapping......................................................................................................................................147 Modify an identity mapping..................................................................................................................................... 147 Delete an identity mapping...................................................................................................................................... 148 View an identity mapping......................................................................................................................................... 148 Flush the identity mapping cache.......................................................................................................................... 148 View a user token...................................................................................................................................................... 149 Configure identity mapping settings..................................................................................................................... 149 View identity mapping settings.............................................................................................................................. 149 Managing user identities................................................................................................................................................ 150 View user identity...................................................................................................................................................... 150 Create a user-mapping rule......................................................................................................................................151 Merge Windows and UNIX tokens.........................................................................................................................152 Retrieve the primary group from LDAP................................................................................................................152 Mapping rule options.................................................................................................................................................153

Contents

7

Mapping rule operators............................................................................................................................................ 153
Chapter 9: Home directories...................................................................................................... 155 Home directories overview........................................................................................................................................... 155 Home directory permissions..........................................................................................................................................155 Authenticating SMB users.............................................................................................................................................155 Home directory creation through SMB...................................................................................................................... 156 Create home directories with expansion variables............................................................................................ 156 Create home directories with the --inheritable-path-acl option.................................................................... 157 Create special home directories with the SMB share %U variable............................................................... 158 Home directory creation through SSH and FTP...................................................................................................... 158 Set the SSH or FTP login shell .............................................................................................................................. 158 Set SSH/FTP home directory permissions......................................................................................................... 159 Set SSH/FTP home directory creation options................................................................................................. 159 Provision home directories with dot files.............................................................................................................160 Home directory creation in a mixed environment.....................................................................................................161 Interactions between ACLs and mode bits................................................................................................................ 161 Default home directory settings in authentication providers................................................................................ 161 Supported expansion variables.....................................................................................................................................162 Domain variables in home directory provisioning..................................................................................................... 163
Chapter 10: Data access control................................................................................................. 164 Data access control overview...................................................................................................................................... 164 ACLs....................................................................................................................................................................................164 UNIX permissions.............................................................................................................................................................165 Mixed-permission environments.................................................................................................................................. 165 NFS access of Windows-created files..................................................................................................................165 SMB access of UNIX-created files........................................................................................................................165 Managing access permissions...................................................................................................................................... 166 View expected user permissions............................................................................................................................ 166 Configure access management settings.............................................................................................................. 167 Modify ACL policy settings......................................................................................................................................167 Run the PermissionsRepair job............................................................................................................................... 167
Chapter 11: File sharing.............................................................................................................. 169 File sharing overview...................................................................................................................................................... 169 Mixed protocol environments................................................................................................................................. 169 Write caching with SmartCache............................................................................................................................ 170 SMB security..................................................................................................................................................................... 171 SMB shares in access zones....................................................................................................................................171 SMB Multichannel...................................................................................................................................................... 172 SMB share management through MMC...............................................................................................................173 SMBv3 encryption..................................................................................................................................................... 173 SMB server-side copy...............................................................................................................................................174 SMB continuous availability.....................................................................................................................................175 SMB file filtering.........................................................................................................................................................175 Symbolic links and SMB clients.............................................................................................................................. 176 Anonymous access to SMB shares........................................................................................................................ 177 Managing SMB settings............................................................................................................................................177

8

Contents

Managing SMB shares.............................................................................................................................................. 179 NFS security..................................................................................................................................................................... 183
NFS exports................................................................................................................................................................ 184 NFS aliases.................................................................................................................................................................. 184 NFS log files................................................................................................................................................................ 185 Managing the NFS service...................................................................................................................................... 185 Managing NFS exports............................................................................................................................................. 186 Managing NFS aliases...............................................................................................................................................188 FTP......................................................................................................................................................................................190 View FTP settings..................................................................................................................................................... 190 Enable FTP file sharing.............................................................................................................................................190 Configure FTP file sharing........................................................................................................................................191 HTTP and HTTPS security............................................................................................................................................. 191 Enable and configure HTTP..................................................................................................................................... 191 Enable HTTPS through the Apache service........................................................................................................192 Disable HTTPS through the Apache service....................................................................................................... 192
Chapter 12: File filtering............................................................................................................ 193 File filtering in an access zone......................................................................................................................................193 Enable and configure file filtering in an access zone.............................................................................................. 193 Disable file filtering in an access zone........................................................................................................................ 193 View file filtering settings.............................................................................................................................................. 194
Chapter 13: Auditing and logging............................................................................................... 195 Auditing overview............................................................................................................................................................ 195 Syslog................................................................................................................................................................................. 195 Syslog forwarding...................................................................................................................................................... 196 Protocol audit events..................................................................................................................................................... 196 Supported audit tools..................................................................................................................................................... 196 Delivering protocol audit events to multiple CEE servers......................................................................................197 Supported event types................................................................................................................................................... 197 Sample audit log...............................................................................................................................................................198 Audit log purging..............................................................................................................................................................198 Managing audit settings.................................................................................................................................................199 Enable protocol access auditing.............................................................................................................................199 Forward protocol access events to syslog .........................................................................................................199 Enable system configuration auditing.................................................................................................................. 200 Set the audit hostname........................................................................................................................................... 200 Configure protocol audited zones.........................................................................................................................200 Forward system configuration changes to syslog............................................................................................ 200 Configure protocol event filters............................................................................................................................. 201 Integrating with the Common Event Enabler........................................................................................................... 201 Install CEE for Windows...........................................................................................................................................201 Configure CEE for Windows...................................................................................................................................202 Configure CEE servers to deliver protocol audit events................................................................................. 202 Tracking the delivery of protocol audit events........................................................................................................203 View the time stamps of delivery of events to the CEE server and syslog................................................203 Display a global view of delivery of protocol audit events to the CEE server and syslog.......................203 Move the log position of the CEE forwarder..................................................................................................... 204

Contents

9

View the rate of delivery of protocol audit events to the CEE server........................................................ 204 Automatic deletion....................................................................................................................................................204 Manual deletion......................................................................................................................................................... 206
Chapter 14: Snapshots...............................................................................................................207 Snapshots overview....................................................................................................................................................... 207 Data protection with SnapshotIQ............................................................................................................................... 207 Snapshot disk-space usage.......................................................................................................................................... 208 Snapshot schedules........................................................................................................................................................208 Snapshot aliases..............................................................................................................................................................208 File and directory restoration.......................................................................................................................................208 Best practices for creating snapshots.......................................................................................................................209 Best practices for creating snapshot schedules..................................................................................................... 209 File clones.......................................................................................................................................................................... 210 Shadow-store considerations................................................................................................................................. 210 Snapshot locks.................................................................................................................................................................. 211 Snapshot reserve..............................................................................................................................................................211 SnapshotIQ license functionality...................................................................................................................................211 Creating snapshots with SnapshotIQ.......................................................................................................................... 211 Create a SnapRevert domain.................................................................................................................................. 212 Create a snapshot schedule.................................................................................................................................... 212 Create a snapshot......................................................................................................................................................212 Snapshot naming patterns.......................................................................................................................................212 Managing snapshots ...................................................................................................................................................... 215 Reducing snapshot disk-space usage................................................................................................................... 215 Delete a snapshot...................................................................................................................................................... 215 Modify snapshot attributes..................................................................................................................................... 215 Modify a snapshot alias ...........................................................................................................................................216 View snapshots.......................................................................................................................................................... 216 Snapshot information................................................................................................................................................ 217 Restoring snapshot data................................................................................................................................................ 217 Revert a snapshot .....................................................................................................................................................217 Restore a file or directory using Windows Explorer...........................................................................................217 Restore a file or directory through a UNIX command line............................................................................... 218 Clone a file from a snapshot................................................................................................................................... 218 Managing snapshot schedules......................................................................................................................................219 Modify a snapshot schedule .................................................................................................................................. 219 Delete a snapshot schedule ................................................................................................................................... 219 View snapshot schedules ........................................................................................................................................219 Managing snapshot aliases........................................................................................................................................... 220 Configure a snapshot alias for a snapshot schedule........................................................................................ 220 Assign a snapshot alias to a snapshot..................................................................................................................220 Reassign a snapshot alias to the live file system...............................................................................................220 View snapshot aliases.............................................................................................................................................. 220 Snapshot alias information.......................................................................................................................................221 Managing with snapshot locks..................................................................................................................................... 221 Create a snapshot lock............................................................................................................................................. 221 Modify a snapshot lock expiration date................................................................................................................221 Delete a snapshot lock.............................................................................................................................................222 Snapshot lock information...................................................................................................................................... 222

10

Contents

Configure SnapshotIQ settings .................................................................................................................................. 222 SnapshotIQ settings ................................................................................................................................................ 223
Set the snapshot reserve..............................................................................................................................................223 Managing changelists.....................................................................................................................................................224
Create a changelist................................................................................................................................................... 224 Delete a changelist....................................................................................................................................................224 View a changelist...................................................................................................................................................... 224 Changelist information............................................................................................................................................. 224
Chapter 15: Deduplication with SmartDedupe............................................................................ 226 Deduplication overview................................................................................................................................................. 226 Deduplication jobs...........................................................................................................................................................226 Data replication and backup with deduplication...................................................................................................... 227 Snapshots with deduplication...................................................................................................................................... 227 Deduplication considerations........................................................................................................................................227 Shadow-store considerations...................................................................................................................................... 228 SmartDedupe license functionality............................................................................................................................. 228 Managing deduplication.................................................................................................................................................228 Assess deduplication space savings .................................................................................................................... 228 Specify deduplication settings ..............................................................................................................................229 View deduplication space savings ........................................................................................................................229 View a deduplication report ...................................................................................................................................229 Deduplication job report information....................................................................................................................229 Deduplication information....................................................................................................................................... 230
Chapter 16: Inline Data Deduplication.........................................................................................231 Inline Data Deduplication overview..............................................................................................................................231 Inline deduplication interoperability............................................................................................................................ 233 Considerations for using inline deduplication........................................................................................................... 233 Enable inline deduplication............................................................................................................................................ 233 Verify inline deduplication is enabled..........................................................................................................................234 View inline deduplication reports.................................................................................................................................234 Disable or pause inline deduplication..........................................................................................................................235 Remove deduplication....................................................................................................................................................235 Troubleshoot index allocation issues.......................................................................................................................... 235
Chapter 17: Data replication with SyncIQ................................................................................... 237 SyncIQ data replication overview................................................................................................................................237 Replication policies and jobs......................................................................................................................................... 237 SmartLock considerations for SyncIQ................................................................................................................. 238 Automated replication policies............................................................................................................................... 238 Source and target cluster association................................................................................................................. 239 Configuring SyncIQ source and target clusters with NAT..............................................................................240 Full and differential replication...............................................................................................................................240 Controlling replication job resource consumption..............................................................................................241 Replication policy priority......................................................................................................................................... 241 Replication reports.................................................................................................................................................... 241 Replication snapshots.................................................................................................................................................... 242 Source cluster snapshots........................................................................................................................................ 242

Contents

11

Target cluster snapshots.........................................................................................................................................242 Data failover and failback with SyncIQ......................................................................................................................242
Data failover............................................................................................................................................................... 243 Data failback...............................................................................................................................................................243 SmartLock compliance mode failover and failback........................................................................................... 243 SmartLock replication limitations.......................................................................................................................... 244 Recovery times and objectives for SyncIQ...............................................................................................................245 RPO Alerts.................................................................................................................................................................. 245 Replication policy priority.............................................................................................................................................. 245 SyncIQ license functionality......................................................................................................................................... 245 Replication for nodes with multiple interfaces........................................................................................................ 246 Restrict SyncIQ source nodes..................................................................................................................................... 246 Creating replication policies......................................................................................................................................... 246 Excluding directories in replication....................................................................................................................... 246 Excluding files in replication....................................................................................................................................247 File criteria options....................................................................................................................................................247 Configure default replication policy settings .....................................................................................................248 Create a replication policy.......................................................................................................................................249 Create a SyncIQ domain..........................................................................................................................................249 Assess a replication policy ..................................................................................................................................... 249 Managing replication to remote clusters.................................................................................................................. 250 Start a replication job...............................................................................................................................................250 Pause a replication job ............................................................................................................................................250 Resume a replication job ........................................................................................................................................ 250 Cancel a replication job ........................................................................................................................................... 251 View active replication jobs ....................................................................................................................................251 Restrict SyncIQ to use the interfaces in the IP address pool ........................................................................251 Modify the Restrict Target Network value......................................................................................................... 251 Replication job information .....................................................................................................................................251 Initiating data failover and failback with SyncIQ..................................................................................................... 252 Fail over data to a secondary cluster ..................................................................................................................252 Revert a failover operation..................................................................................................................................... 252 Fail back data to a primary cluster ...................................................................................................................... 253 Run the ComplianceStoreDelete job in a Smartlock compliance mode domain.........................................254 Performing disaster recovery for older SmartLock directories........................................................................... 254 Recover SmartLock compliance directories on a target cluster .................................................................. 254 Migrate SmartLock compliance directories .......................................................................................................255 Managing replication policies....................................................................................................................................... 255 Modify a replication policy .....................................................................................................................................255 Delete a replication policy ...................................................................................................................................... 256 Enable or disable a replication policy ...................................................................................................................256 View replication policies ......................................................................................................................................... 256 Replication policy information ............................................................................................................................... 257 Managing replication to the local cluster.................................................................................................................. 258 Cancel replication to the local cluster .................................................................................................................258 Break local target association ...............................................................................................................................258 View replication policies targeting the local cluster......................................................................................... 258 Remote replication policy information ................................................................................................................ 259 Managing replication performance rules................................................................................................................... 259 Create a network traffic rule ................................................................................................................................ 259

12

Contents

Create a file operations rule .................................................................................................................................. 259 Modify a performance rule .................................................................................................................................... 259 Delete a performance rule ..................................................................................................................................... 260 Enable or disable a performance rule ..................................................................................................................260 View performance rules ......................................................................................................................................... 260 Managing replication reports........................................................................................................................................ 261 Configure default replication report settings .....................................................................................................261 Delete replication reports........................................................................................................................................ 261 View replication reports .......................................................................................................................................... 261 Replication report information............................................................................................................................... 262 Managing failed replication jobs.................................................................................................................................. 263 Resolve a replication policy ................................................................................................................................... 263 Reset a replication policy ....................................................................................................................................... 263 Perform a full or differential replication...............................................................................................................263 Restrict SyncIQ to use the interfaces in the IP address pool .............................................................................264 Modify the Restrict Target Network value...............................................................................................................264
Chapter 18: Data Encryption with SyncIQ.................................................................................. 265 SyncIQ data encryption overview...............................................................................................................................265 SyncIQ traffic encryption..............................................................................................................................................265 Configure certificates.............................................................................................................................................. 266 Create encrypted SyncIQ policies......................................................................................................................... 267 Per-policy throttling overview..................................................................................................................................... 267 Create a bandwidth rule.......................................................................................................................................... 267 Troubleshooting SyncIQ encryption........................................................................................................................... 267
Chapter 19: Data Compression...................................................................................................268 Data compression........................................................................................................................................................... 268 Data compression settings and monitoring.............................................................................................................. 268 Enable or disable data compression........................................................................................................................... 268 View compression statistics......................................................................................................................................... 269
Chapter 20: Data layout with FlexProtect.................................................................................. 273 FlexProtect overview..................................................................................................................................................... 273 File striping........................................................................................................................................................................273 Requested data protection........................................................................................................................................... 273 FlexProtect data recovery............................................................................................................................................ 274 Smartfail...................................................................................................................................................................... 274 Node failures...............................................................................................................................................................274 Requesting data protection.......................................................................................................................................... 275 Requested protection settings.................................................................................................................................... 275 Requested protection disk space usage....................................................................................................................276
Chapter 21: Large file size support............................................................................................ 278 Large file support............................................................................................................................................................ 278 Feature enablement requirements.............................................................................................................................. 278 Restrictions after enabling large file support........................................................................................................... 278 Enable large file support................................................................................................................................................279 Check SyncIQ and cluster disk space compatibility............................................................................................... 279

Contents

13

Chapter 22: Administering NDMP.............................................................................................. 281 NDMP backup and recovery overview....................................................................................................................... 281 NDMP two-way backup................................................................................................................................................ 282 NDMP three-way backup............................................................................................................................................. 282 Support for NDMP sessions on Generation 6 hardware....................................................................................... 282 Setting preferred IPs for NDMP three-way operations........................................................................................ 282 NDMP multi-stream backup and recovery............................................................................................................... 283 Snapshot-based incremental backups....................................................................................................................... 283 NDMP backup and restore of SmartLink files......................................................................................................... 283 NDMP protocol support................................................................................................................................................ 284 Supported DMAs.............................................................................................................................................................285 NDMP hardware support.............................................................................................................................................. 285 NDMP backup limitations..............................................................................................................................................285 NDMP performance recommendations..................................................................................................................... 286 Excluding files and directories from NDMP backups............................................................................................. 287 Configuring basic NDMP backup settings................................................................................................................ 288 Configure and enable NDMP backup................................................................................................................... 288 Disable NDMP backup ............................................................................................................................................ 288 NDMP backup settings ...........................................................................................................................................288 View NDMP backup settings ................................................................................................................................ 288 Managing NDMP user accounts..................................................................................................................................289 Create an NDMP user account ............................................................................................................................ 289 Modify the password of an NDMP user account .............................................................................................289 Delete an NDMP user account ............................................................................................................................. 289 View NDMP user accounts ....................................................................................................................................289 Managing NDMP backup devices............................................................................................................................... 289 Detect NDMP backup devices ............................................................................................................................. 290 Modify an NDMP backup device entry name ................................................................................................... 290 Delete a device entry for a disconnected NDMP backup device..................................................................290 View NDMP backup devices ................................................................................................................................. 290 Managing NDMP Fibre Channel ports....................................................................................................................... 290 Modify NDMP backup port settings ................................................................................................................... 290 Enable or disable an NDMP backup port..............................................................................................................291 View NDMP backup ports ...................................................................................................................................... 291 NDMP backup port settings .................................................................................................................................. 291 Managing NDMP preferred IP settings...................................................................................................................... 291 Create an NDMP preferred IP setting.................................................................................................................. 291 Modify an NDMP preferred IP setting................................................................................................................. 292 List NDMP preferred IP settings...........................................................................................................................292 View NDMP preferred IP settings........................................................................................................................ 292 Delete NDMP preferred IP settings......................................................................................................................292 Managing NDMP sessions............................................................................................................................................ 293 End an NDMP session .............................................................................................................................................293 View NDMP sessions .............................................................................................................................................. 293 NDMP session information .................................................................................................................................... 293 Managing NDMP restartable backups....................................................................................................................... 294 Configure NDMP restartable backups for NetWorker.....................................................................................295 View NDMP restartable backup contexts...........................................................................................................295 Delete an NDMP restartable backup context.................................................................................................... 295

14

Contents

Configure NDMP restartable backup settings...................................................................................................295 View NDMP restartable backup settings............................................................................................................ 296 NDMP restore operations............................................................................................................................................. 296 NDMP parallel restore operation...........................................................................................................................296 NDMP serial restore operation.............................................................................................................................. 296 Specify a NDMP serial restore operation............................................................................................................296 Managing default NDMP variables............................................................................................................................. 296 Specify the default NDMP variable settings for a path...................................................................................297 Modify the default NDMP variable settings for a path....................................................................................297 View the default NDMP settings for a path....................................................................................................... 297 NDMP environment variables.................................................................................................................................297 Setting environment variables for backup and restore operations................................................................301 Managing snapshot based incremental backups.....................................................................................................302 Enable snapshot-based incremental backups for a directory........................................................................ 302 Delete snapshots for snapshot-based incremental backups.......................................................................... 302 View snapshots for snapshot-based incremental backups.............................................................................302 Managing cluster performance for NDMP sessions...............................................................................................302 Enable NDMP Redirector to manage cluster performance............................................................................ 302 Managing CPU usage for NDMP sessions................................................................................................................303 Enable NDMP Throttler........................................................................................................................................... 303 View NDMP backup logs ..............................................................................................................................................303
Chapter 23: File retention with SmartLock................................................................................ 304 SmartLock overview...................................................................................................................................................... 304 Compliance mode............................................................................................................................................................304 Enterprise mode.............................................................................................................................................................. 305 SmartLock directories................................................................................................................................................... 305 Replication and backup with SmartLock...................................................................................................................305 SmartLock license functionality.................................................................................................................................. 306 SmartLock considerations............................................................................................................................................ 306 Delete WORM domain and directories...................................................................................................................... 306 Set the compliance clock.............................................................................................................................................. 307 View the compliance clock........................................................................................................................................... 307 Creating a SmartLock directory.................................................................................................................................. 307 Retention periods...................................................................................................................................................... 307 Autocommit time periods........................................................................................................................................ 308 Create an enterprise directory for a non-empty directory............................................................................. 308 Create a SmartLock directory................................................................................................................................308 Managing SmartLock directories................................................................................................................................ 309 Modify a SmartLock directory............................................................................................................................... 309 Exclude a SmartLock directory..............................................................................................................................309 Delete a SmartLock directory................................................................................................................................ 309 View SmartLock directory settings....................................................................................................................... 310 SmartLock directory configuration settings........................................................................................................310 Managing files in SmartLock directories.................................................................................................................... 312 Set a retention period through a UNIX command line...................................................................................... 312 Set a retention period through Windows Powershell....................................................................................... 312 Commit a file to a WORM state through a UNIX command line.................................................................... 313 Commit a file to a WORM state through Windows Explorer.......................................................................... 313 Override the retention period for all files in a SmartLock directory..............................................................313

Contents

15

Delete a file committed to a WORM state ......................................................................................................... 314 View WORM status of a file....................................................................................................................................314
Chapter 24: Data Removal with Instant Secure Erase (ISE)........................................................315 Instant Secure Erase.......................................................................................................................................................315 ISE during drive smartfail...............................................................................................................................................315 Enable Instant Secure Erase (ISE).............................................................................................................................. 316 View current ISE configuration.................................................................................................................................... 316 Disable Instant Secure Erase (ISE)..............................................................................................................................317
Chapter 25: Protection domains.................................................................................................318 Protection domains overview....................................................................................................................................... 318 Protection domain considerations............................................................................................................................... 318 Create a protection domain ......................................................................................................................................... 318 Delete a protection domain ..........................................................................................................................................319
Chapter 26: Data-at-rest-encryption......................................................................................... 320 Data-at-rest encryption overview.............................................................................................................................. 320 Self-encrypting drives................................................................................................................................................... 320 Data security on self-encrypting drives.................................................................................................................... 320 Data migrations and upgrades to a cluster with self-encrypting drives.............................................................321 Enabling external key management.............................................................................................................................321 Migrate nodes and SEDs to external key management..........................................................................................321 Chassis and drive states................................................................................................................................................322 Smartfailed drive REPLACE state...............................................................................................................................323 Smartfailed drive ERASE state.................................................................................................................................... 324
Chapter 27: SmartQuotas.......................................................................................................... 325 SmartQuotas overview.................................................................................................................................................. 325 Quota types......................................................................................................................................................................325 Default quota type..........................................................................................................................................................326 Usage accounting and limits.........................................................................................................................................328 Disk-usage calculations................................................................................................................................................. 329 Quota notifications......................................................................................................................................................... 330 Quota notification rules................................................................................................................................................. 330 Quota reports................................................................................................................................................................... 331 Creating quotas................................................................................................................................................................331 Create an accounting quota....................................................................................................................................331 Create an enforcement quota................................................................................................................................332 Managing quotas.............................................................................................................................................................332 Search for quotas......................................................................................................................................................332 Manage quotas.......................................................................................................................................................... 332 Export a quota configuration file...........................................................................................................................333 Import a quota configuration file........................................................................................................................... 333 Managing quota notifications.................................................................................................................................334 Email quota notification messages........................................................................................................................335 Managing quota reports.......................................................................................................................................... 337 Basic quota settings................................................................................................................................................. 338 Advisory limit quota notification rules settings..................................................................................................339

16

Contents

Soft limit quota notification rules settings..........................................................................................................340 Hard limit quota notification rules settings..........................................................................................................341 Limit notification settings........................................................................................................................................342 Quota report settings...............................................................................................................................................342
Chapter 28: Storage Pools.........................................................................................................344 Storage pools overview................................................................................................................................................. 344 Storage pool functions.................................................................................................................................................. 345 Autoprovisioning..............................................................................................................................................................346 Node pools........................................................................................................................................................................346 Compatibilities............................................................................................................................................................346 Compatibility restrictions.........................................................................................................................................347 Manual node pools.................................................................................................................................................... 347 Virtual hot spare..............................................................................................................................................................348 Spillover............................................................................................................................................................................. 348 Suggested protection.................................................................................................................................................... 348 Protection policies.......................................................................................................................................................... 348 SSD strategies................................................................................................................................................................. 349 Other SSD mirror settings............................................................................................................................................ 350 Global namespace acceleration................................................................................................................................... 350 L3 cache overview..........................................................................................................................................................350 Migration to L3 cache...............................................................................................................................................351 L3 cache on archive-class node pools................................................................................................................. 352 Tiers....................................................................................................................................................................................352 File pool policies.............................................................................................................................................................. 352 FilePolicy job...............................................................................................................................................................353 Managing node pools through the command-line interface.................................................................................353 Delete an SSD compatibility................................................................................................................................... 353 Create a node pool manually.................................................................................................................................. 354 Add a node to a manually managed node pool...................................................................................................354 Add a new node type to an existing node pool..................................................................................................354 Remove a node type from a node pool................................................................................................................355 Change the name or protection policy of a node pool.....................................................................................355 Remove a node from a manually managed node pool......................................................................................355 View node pool settings.......................................................................................................................................... 356 Modify default storage pool settings................................................................................................................... 356 SmartPools settings................................................................................................................................................. 356 Managing L3 cache from the command-line interface..........................................................................................359 Set L3 cache as the default for new node pools.............................................................................................. 359 Enable L3 cache on a specific node pool ........................................................................................................... 359 Restore SSDs to storage drives for a node pool...............................................................................................360 Managing tiers................................................................................................................................................................. 360 Create a tier............................................................................................................................................................... 360 Add or move node pools in a tier...........................................................................................................................360 Rename a tier............................................................................................................................................................. 360 Delete a tier.................................................................................................................................................................361 Creating file pool policies............................................................................................................................................... 361 Create a file pool policy............................................................................................................................................ 361 Valid wildcard characters........................................................................................................................................ 362 Default file pool requested protection settings................................................................................................. 362

Contents

17

Default file pool I/O optimization settings..........................................................................................................363 Managing file pool policies............................................................................................................................................ 364
Modify a file pool policy........................................................................................................................................... 364 Configure default file pool policy settings...........................................................................................................365 Prioritize a file pool policy....................................................................................................................................... 365 Delete a file pool policy............................................................................................................................................366 Monitoring storage pools.............................................................................................................................................. 366 Monitor storage pools..............................................................................................................................................366 View the health of storage pools.......................................................................................................................... 366 View results of a SmartPools job...........................................................................................................................367
Chapter 29: Pool-based tree reporting in FSAnalyze (FSA)........................................................368 FSAnalyze (FSA).............................................................................................................................................................368 Pool-based tree reporting in FSAnalyze (FSA)....................................................................................................... 368 Enable pool-based tree reporting in FSA.................................................................................................................. 368 Disable pool-based tree reporting in FSA................................................................................................................. 369
Chapter 30: System jobs........................................................................................................... 370 System jobs overview.................................................................................................................................................... 370 System jobs library...........................................................................................................................................................371 Job operation................................................................................................................................................................... 374 Job performance impact............................................................................................................................................... 375 Job priorities.....................................................................................................................................................................376 Managing system jobs....................................................................................................................................................376 Start a job....................................................................................................................................................................377 Pause a job.................................................................................................................................................................. 377 Modify a job................................................................................................................................................................ 377 Resume a job.............................................................................................................................................................. 378 Cancel a job................................................................................................................................................................ 378 View active jobs.........................................................................................................................................................378 View job history......................................................................................................................................................... 378 Modify job type settings................................................................................................................................................379 Managing impact policies.............................................................................................................................................. 379 Create an impact policy........................................................................................................................................... 379 View impact policy settings.................................................................................................................................... 380 Modify an impact policy...........................................................................................................................................380 Delete an impact policy............................................................................................................................................ 381 Viewing job reports and statistics............................................................................................................................... 381 View statistics for a job in progress...................................................................................................................... 381 View a report for a completed job........................................................................................................................ 382
Chapter 31: S3 support.............................................................................................................. 383 S3 .......................................................................................................................................................................................383 S3 concepts................................................................................................................................................................383 Server Configuration...................................................................................................................................................... 383 Global S3 settings..................................................................................................................................................... 384 S3 zone settings........................................................................................................................................................385 Certificates................................................................................................................................................................. 385 Bucket handling ..............................................................................................................................................................386

18

Contents

Managing buckets.....................................................................................................................................................386 Object handling ...............................................................................................................................................................387
Object key................................................................................................................................................................... 388 Object Metadata....................................................................................................................................................... 388 Multipart upload........................................................................................................................................................ 388 Etag.............................................................................................................................................................................. 388 PUT object..................................................................................................................................................................389 Cross protocol locking............................................................................................................................................. 389 Authentication................................................................................................................................................................. 389 Access keys................................................................................................................................................................ 389 Access control........................................................................................................................................................... 389 Anonymous authentication..................................................................................................................................... 392 Access key management...............................................................................................................................................392 Managing keys........................................................................................................................................................... 393
Chapter 32: Small Files Storage Efficiency for archive workloads..............................................395 Overview ..........................................................................................................................................................................395 Requirements................................................................................................................................................................... 396 Upgrades and rollbacks................................................................................................................................................. 396 Interoperability.................................................................................................................................................................396 Managing Small Files Storage Efficiency...................................................................................................................397 Implementation overview ....................................................................................................................................... 397 Enable Small Files Storage Efficiency.................................................................................................................. 398 View and configure global settings.......................................................................................................................398 Specify selection criteria for files to pack ......................................................................................................... 399 Disable packing.......................................................................................................................................................... 399 Reporting features ........................................................................................................................................................ 400 Estimate possible storage savings........................................................................................................................ 400 View packing and unpacking activity by SmartPools jobs............................................................................... 401 Monitor storage efficiency with FSAnalyze ...................................................................................................... 402 View ShadowStore information............................................................................................................................. 403 Monitor storage efficiency on a small data set..................................................................................................403 File system structure..................................................................................................................................................... 403 Viewing file attributes.............................................................................................................................................. 404 Defragmenter overview.................................................................................................................................................404 Managing the defragmenter........................................................................................................................................ 405 Enable the defragmenter........................................................................................................................................ 405 Configure the defragmenter ................................................................................................................................. 405 Run the defragmenter ............................................................................................................................................ 407 View estimated storage savings before defragmenting ................................................................................. 407 CLI commands for Small Files Storage Efficiency.................................................................................................. 408 isi_sfse_assess.......................................................................................................................................................... 408 isi_gconfig -t defrag-config.................................................................................................................................... 412 isi_packing................................................................................................................................................................... 414 isi_sstore .....................................................................................................................................................................416 isi_sstore defrag ....................................................................................................................................................... 419 isi_storage_efficiency.............................................................................................................................................. 422 Troubleshooting Small Files Storage Efficiency ..................................................................................................... 423 Log files....................................................................................................................................................................... 423 Fragmentation issues............................................................................................................................................... 423

Contents

19

Chapter 33: Networking............................................................................................................ 425 Networking overview..................................................................................................................................................... 425 About the internal network...........................................................................................................................................425 Internal IP address ranges.......................................................................................................................................426 Internal network failover......................................................................................................................................... 426 About the external network......................................................................................................................................... 426 Groupnets................................................................................................................................................................... 426 Subnets........................................................................................................................................................................ 427 IP address pools.........................................................................................................................................................428 SmartConnect module............................................................................................................................................. 428 Node provisioning rules............................................................................................................................................ 431 Routing options..........................................................................................................................................................432 Managing internal network settings........................................................................................................................... 432 Add or remove an internal IP address range...................................................................................................... 432 Modify an internal network netmask.................................................................................................................... 433 Configure and enable internal network failover................................................................................................. 433 Disable internal network failover........................................................................................................................... 434 Managing groupnets.......................................................................................................................................................434 Create a groupnet.....................................................................................................................................................434 Modify a groupnet.....................................................................................................................................................434 Delete a groupnet..................................................................................................................................................... 435 View groupnets..........................................................................................................................................................435 Enabling Router Advertisement.............................................................................................................................436 Managing external network subnets.......................................................................................................................... 436 Create a subnet......................................................................................................................................................... 436 Modify a subnet.........................................................................................................................................................436 Delete a subnet.......................................................................................................................................................... 437 View subnets.............................................................................................................................................................. 437 Enable or disable VLAN tagging............................................................................................................................ 438 Add or remove a DSR address............................................................................................................................... 438 Managing Multi-SSIP..................................................................................................................................................... 439 Configure a SmartConnect service IP address..................................................................................................439 Configure Multi-SSIP............................................................................................................................................... 439 Add, clear, or remove a SmartConnect Multi-SSIP IP address range..........................................................440 Managing IP address pools........................................................................................................................................... 440 Create an IP address pool....................................................................................................................................... 440 Modify an IP address pool........................................................................................................................................441 Delete an IP address pool........................................................................................................................................ 441 View IP address pools............................................................................................................................................... 441 Add or remove an IP address range..................................................................................................................... 442 Configure IP address allocation............................................................................................................................. 442 Managing SmartConnect Settings..............................................................................................................................443 Configure a SmartConnect DNS zone................................................................................................................. 443 Specify a SmartConnect service subnet............................................................................................................. 443 Suspend or resume a node..................................................................................................................................... 444 Configure a connection balancing policy............................................................................................................. 444 Configure an IP failover policy............................................................................................................................... 445 Managing connection rebalancing.............................................................................................................................. 445 Configure an IP rebalance policy........................................................................................................................... 445

20

Contents

Manually rebalance IP addresses...........................................................................................................................446 Managing network interface members...................................................................................................................... 446
Add or remove a network interface......................................................................................................................446 Specify a link aggregation mode............................................................................................................................447 View network interfaces......................................................................................................................................... 448 Managing node provisioning rules...............................................................................................................................448 Create a node provisioning rule............................................................................................................................. 448 Modify a node provisioning rule.............................................................................................................................449 Delete a node provisioning rule..............................................................................................................................449 View node provisioning rules.................................................................................................................................. 449 Managing routing options............................................................................................................................................. 450 Enable or disable source-based routing...............................................................................................................450 Add or remove a static route.................................................................................................................................. 451 Managing DNS cache settings..................................................................................................................................... 451 DNS cache settings...................................................................................................................................................451
Chapter 34: NFS3oRDMA.......................................................................................................... 453 RDMA support for NFSoRDMA...................................................................................................................................453 Enable RDMA feature for NFSv3 protocol............................................................................................................... 453 Disable RDMA feature for NFSv3 protocol.............................................................................................................. 454 View RDMA flag on network interface cards...........................................................................................................454 Create an IP address pool with RDMA support.......................................................................................................454 Modify an IP address pool with RDMA support...................................................................................................... 455
Chapter 35: Partitioned Performance Monitoring...................................................................... 456 Partitioned Performance Monitoring......................................................................................................................... 456 Workload monitoring...................................................................................................................................................... 456 Create a standard dataset............................................................................................................................................ 457 View dataset list........................................................................................................................................................ 457 View details of dataset............................................................................................................................................ 457 Modify details of dataset........................................................................................................................................ 458 Delete dataset .......................................................................................................................................................... 458 Create a dataset with filters........................................................................................................................................ 458 Pin a workload................................................................................................................................................................. 458 Apply filter(s) to a dataset........................................................................................................................................... 459 View statistics................................................................................................................................................................. 459 Additional information.....................................................................................................................................................461
Chapter 36: IPMI....................................................................................................................... 462 IPMI overview.................................................................................................................................................................. 462 Enable IPMI.......................................................................................................................................................................462 Enable IPMI power control and Serial over LAN..................................................................................................... 463 Configure IPMI username and password.................................................................................................................. 463
Chapter 37: Antivirus................................................................................................................ 464 Antivirus overview.......................................................................................................................................................... 464 On-access scanning....................................................................................................................................................... 464 ICAP Antivirus policy scanning.................................................................................................................................... 465 Individual file scanning using ICAP..............................................................................................................................465

Contents

21

WORM files and antivirus............................................................................................................................................. 465 Antivirus scan reports....................................................................................................................................................465 ICAP servers.................................................................................................................................................................... 466 CAVA servers...................................................................................................................................................................466 ICAP threat responses...................................................................................................................................................466 CAVA threat responses................................................................................................................................................. 467 Configuring global antivirus settings.......................................................................................................................... 467
Include specific files in antivirus scans ............................................................................................................... 467 Configure on-access scanning settings ..............................................................................................................467 Configure antivirus threat response settings ....................................................................................................468 Configure antivirus report retention settings.....................................................................................................468 Enable or disable antivirus scanning..................................................................................................................... 468 Managing ICAP servers................................................................................................................................................. 468 Add and connect to an ICAP server ....................................................................................................................468 Temporarily disconnect from an ICAP server ................................................................................................... 468 Reconnect to an ICAP server ............................................................................................................................... 469 Remove an ICAP server ......................................................................................................................................... 469 Managing CAVA servers............................................................................................................................................... 469 Add and connect to a CAVA server .................................................................................................................... 469 Modify CAVA connection settings........................................................................................................................469 List or view CAVA servers ..................................................................................................................................... 470 Disable connection to a CAVA server ................................................................................................................. 470 Delete a CAVA server configuration.....................................................................................................................470 Create an IP pool in a CAVA server...................................................................................................................... 470 Create a dedicated Access Zone on a CAVA server......................................................................................... 471 Create an Active Directory authentication provider for the AvVendor access zone................................ 471 Update the role in the access zone...................................................................................................................... 472 Scan CloudPool files in a CAVA server................................................................................................................ 472 View CloudPool scan timeout.................................................................................................................................473 Manage CAVA filters................................................................................................................................................ 473 Manage CAVA jobs .................................................................................................................................................. 474 Create an ICAP antivirus policy .................................................................................................................................. 474 Managing ICAP antivirus policies................................................................................................................................ 475 Modify an ICAP antivirus policy ............................................................................................................................475 Delete an ICAP antivirus policy .............................................................................................................................475 Enable or disable an ICAP antivirus policy ......................................................................................................... 475 View ICAP antivirus policies .................................................................................................................................. 475 Managing antivirus scans.............................................................................................................................................. 475 Scan a file....................................................................................................................................................................476 Manually run an ICAP antivirus policy.................................................................................................................. 476 Stop a running antivirus scan................................................................................................................................. 476 Managing antivirus threats........................................................................................................................................... 476 Manually quarantine a file ...................................................................................................................................... 476 Rescan a file............................................................................................................................................................... 476 Remove a file from quarantine ..............................................................................................................................476 Manually truncate a file............................................................................................................................................477 View threats .............................................................................................................................................................. 477 Antivirus threat information....................................................................................................................................477 Managing antivirus reports........................................................................................................................................... 477 View antivirus reports ............................................................................................................................................. 477

22

Contents

View antivirus events............................................................................................................................................... 478

Contents

23

1

Introduction to this guide

This section contains the following topics.
Topics:
· About this guide · Scale-out NAS overview · Getting help

About this guide
This guide describes how the PowerScale OneFS command-line interface provides access to cluster configuration, management, and monitoring functionality.
OneFS commands extend the standard UNIX command set. For an alphabetical list and description of all OneFS commands, see the OneFS CLI Command Reference.
Your suggestions help us to improve the accuracy, organization, and overall quality of the documentation. Send your feedback to http://bit.ly/isilon-docfeedback. If you cannot provide feedback through the URL, send an email message to [email protected].

Scale-out NAS overview
The scale-out NAS storage platform combines modular hardware with unified software to harness unstructured data. Powered by the OneFS operating system, a cluster to deliver a scalable pool of storage with a global namespace.
The unified software platform provides centralized web-based and command-line administration to manage the following features:
 A cluster that runs a distributed file system  Scale-out nodes that add capacity and performance  Storage options that manage files and tiering  Flexible data protection and high availability  Software modules that control costs and optimize resources.

Getting help

This topic contains resources for getting answers to questions about PowerScale products.

Dell Technologies support

 https://www.dell.com/support/incidents-online/en-us/contactus/product/ isilon-onefs

Telephone support

 United States: 1-800-SVC-4EMC (1-800-782-4362)  Canada: 1-800-543-4782  Worldwide: 1-508-497-7901  Local phone numbers for a specific country or region are available at https://
www.dell.com/support/incidents-online/en-us/contactus/product/isilon-onefs.

PowerScale OneFS Documentation Info  https://www.dell.com/support/kbdoc/en-us/000152189/powerscale-onefs-info-

Hubs

hubs

Dell Community Board for self-help

 https://www.dell.com/community

24

Introduction to this guide

2

PowerScale scale-out NAS

This section contains the following topics:
Topics:
· OneFS storage architecture · Node components · Internal and external networks · PowerScale cluster · The OneFS operating system · Structure of the file system · Data protection overview · Data compression · Software modules

OneFS storage architecture
PowerScale takes a scale-out approach to storage by creating a cluster of nodes that runs a distributed file system. OneFS combines the three layers of storage architecture--file system, volume manager, and data protection--into a scale-out NAS cluster.
Each node adds resources to the cluster. Because each node contains globally coherent RAM, as a cluster becomes larger, it becomes faster. Meanwhile, the file system expands dynamically and redistributes content, which eliminates the work of partitioning disks and creating volumes.
Nodes work as peers to spread data across the cluster. Striping--the process of segmenting and distributing data--protects data and enables users connecting to any node to take advantage of the entire cluster's performance.
OneFS uses distributed software to scale data across commodity hardware. Master devices do not control the cluster, and secondary devices do not invoke dependencies. Each node helps to control data requests, boost performance, and expand cluster capacity.

Node components

As a rack-mountable appliance, a pre-Generation 6 storage node includes the following components in a 2U or 4U rackmountable chassis with an LCD front panel: CPUs, RAM, NVRAM, network interfaces, InfiniBand adapters, disk controllers, and storage media. A PowerScale cluster is made up of three or more nodes, up to 144. The 4U chassis is always used for Generation 6. There are four nodes in one 4U chassis in Generation 6, therefore a quarter chassis makes up one node.
When you add a node to a pre-Generation 6 cluster, you increase the aggregate disk, cache, CPU, RAM, and network capacity. OneFS groups RAM into a single coherent cache so that a data request on a node benefits from data that is cached anywhere. NVRAM is grouped to write data with high throughput and to protect write operations from power failures. As the cluster expands, spindles and CPU combine to increase throughput, capacity, and input-output operations per second (IOPS). The minimum cluster for Generation 6 is four nodes and Generation 6 does not use NVRAM. Journals are stored in RAM and M.2 flash is used for a backup in case of node failure.
The PowerScale F200 and F600 nodes are 1U models that require a minimum cluster size of three nodes. Clusters can be expanded to a maximum of 252 nodes in single node increments.
There are several types of nodes, all of which can be added to a cluster to balance capacity and performance with throughput or IOPS:

Node

Use Case

PowerScale F200 (The F200 is supported with OneFS 9.0.0.0 All-flash solution, software inline data compression, and data

and later releases only)

deduplication

PowerScale scale-out NAS

25

Node

Use Case

PowerScale F600 (The F600 is supported with OneFS 9.0.0.0 All-flash solution, software inline data compression, and data

and later releases only)

deduplication

IsilonGen-6 Hardware F800 and F810 (The F810 is supported with OneFS 8.1.3 and with OneFS 8.2.1 and later releases only)

All-flash solution, hardware data compression and data deduplication on the F810

Gen-6 Hardware H-Series

 H600, performance spinning solution  H500, performance capacity  H400, capacity performance  H5600, large capacity in a performance node, data
compression: requires PowerScale 8.2.2 and later releases for in-line compression and in-line deduplication support)

Gen-6 Hardware A-Series

 A200, active archive  A2000, deep archive

S-Series

IOPS-intensive applications

X-Series

High-concurrency and throughput-driven workflows

NL-Series

Near-primary accessibility, with near-tape value

HD-Series

Maximum capacity

The following Dell Technologies PowerScale nodes improve performance:

Node A-Series Performance Accelerator A-Series Backup Accelerator

Function
Independent scaling for high performance
High-speed and scalable backup-and-restore solution for tape drives over Fibre Channel connections

Internal and external networks
A cluster includes two networks: an internal network to exchange data between nodes and an external network to handle client connections.
Nodes exchange data through the internal network with a proprietary, unicast protocol over InfiniBand or Ethernet, depending on the node model. Each node includes redundant InfiniBand or Ethernet ports for a second internal network in case the first port fails. Ethernet is the only supported external network.
Supported network configurations are as follows:
 Generation 5 nodes support only InfiniBand for the internal network.  Generation 6 nodes support 10 GB and 40 GB Ethernet, with 25 GB Ethernet as a later add-on option.  PowerScale F200 supports 10 GB and 40 GB Ethernet and InfiniBand.  PowerScale F600 supports 10, 25, 40, and 100 GB Ethernet, and10, and 40 GB InfiniBand. PowerScale , and Generation 6 nodes support InfiniBand and Ethernet for the internal network. You can mix Generation 5, Generation 6, and PowerScale F200 and F600 nodes in the same cluster. PowerScale F200 and F600 nodes support only an Ethernet internal network.
NOTE: Ethernet is recommended for the internal network. However PowerScale F200 and F600 nodes do support InfiniBand options for existing InfiniBand clusters.
Clients reach the cluster using Ethernet. Since every node includes Ethernet ports, the cluster bandwidth scales with performance and capacity as nodes are added.
CAUTION: Only Isilon or PowerScale nodes should be connected to the internal network, depending on the node model. Information that is exchanged on the back-end network is not encrypted. Connecting anything other than Isilon or PowerScale nodes to the internal network creates a security risk.

26

PowerScale scale-out NAS

PowerScale cluster
A PowerScale cluster consists of three or more hardware nodes, up to 252. Each node runs the PowerScale OneFS operating system, the distributed file-system software that unites the nodes into a cluster. The storage capacity of a cluster ranges from a minimum of 11 TB raw with three PowerScale F200 nodes to more than 50 PB.
Cluster administration
OneFS centralizes cluster management through a web administration interface and a command-line interface. Both interfaces provide methods to activate licenses, check the status of nodes, configure the cluster, upgrade the system, generate alerts, view client connections, track performance, and change various settings.
In addition, OneFS simplifies administration by automating maintenance with a Job Engine. You can schedule jobs that scan for viruses, inspect disks for errors, reclaim disk space, and check the integrity of the file system. The engine manages the jobs to minimize impact on the cluster's performance.
With SNMP versions 2c and 3, you can remotely monitor hardware components, CPU usage, switches, and network interfaces. Dell Technologies PowerScale supplies management information bases (MIBs) and traps for the OneFS operating system.
OneFS also includes an application programming interface (API) that is divided into two functional areas: One area enables cluster configuration, management, and monitoring functionality, and the other area enables operations on files and directories on the cluster. You can send requests to the OneFS API through a Representational State Transfer (REST) interface, which is accessed through resource URIs and standard HTTP methods. The API integrates with OneFS role-based access control (RBAC) to increase security. See the PowerScale API Reference.
Quorum
A PowerScale cluster must have a quorum to work correctly. A quorum prevents data conflicts--for example, conflicting versions of the same file--in case two groups of nodes become unsynchronized. If a cluster loses its quorum for read and write requests, you cannot access the OneFS file system.
For a quorum, more than half the nodes must be available over the internal network. A seven-node cluster, for example, requires a four-node quorum. A 10-node cluster requires a six-node quorum. If a node is unreachable over the internal network, OneFS separates the node from the cluster, an action referred to as splitting. After a cluster is split, cluster operations continue as long as enough nodes remain connected to have a quorum.
In a split cluster, the nodes that remain in the cluster are referred to as the majority group. Nodes that are split from the cluster are referred to as the minority group.
When split nodes can reconnect with the cluster and re-synchronize with the other nodes, the nodes rejoin the cluster's majority group, an action referred to as merging.
A OneFS cluster contains two quorum properties:
 read quorum (efs.gmp.has_quorum)  write quorum (efs.gmp.has_super_block_quorum) By connecting to a node with SSH and running the sysctl command-line tool as root, you can view the status of both types of quorum. Here is an example for a cluster that has a quorum for both read and write operations, as the command output indicates with a 1, for true:
sysctl efs.gmp.has_quorum efs.gmp.has_quorum: 1
sysctl efs.gmp.has_super_block_quorum efs.gmp.has_super_block_quorum: 1
The degraded states of nodes--such as smartfail, read-only, offline--effect quorum in different ways. A node in a smartfail or read-only state affects only write quorum. A node in an offline state, however, affects both read and write quorum. In a cluster, the combination of nodes in different degraded states determines whether read requests, write requests, or both work.
A cluster can lose write quorum but keep read quorum. Consider a four-node cluster in which nodes 1 and 2 are working normally. Node 3 is in a read-only state, and node 4 is in a smartfail state. In such a case, read requests to the cluster succeed. Write requests, however, receive an input-output error because the states of nodes 3 and 4 break the write quorum.
A cluster can also lose both its read and write quorum. If nodes 3 and 4 in a four-node cluster are in an offline state, both write requests and read requests receive an input-output error, and you cannot access the file system. When OneFS can reconnect

PowerScale scale-out NAS

27

with the nodes, OneFS merges them back into the cluster. Unlike a RAID system, a PowerScale node can rejoin the cluster without being rebuilt and reconfigured.
Splitting and merging
Splitting and merging optimize the use of nodes without your intervention. OneFS monitors every node in a cluster. If a node is unreachable over the internal network, OneFS separates the node from the cluster, an action referred to as splitting. When the cluster can reconnect to the node, OneFS adds the node back into the cluster, an action referred to as merging. When a node is split from a cluster, it will continue to capture event information locally. You can connect to a split node with SSH and run the isi event events list command to view the local event log for the node. The local event log can help you troubleshoot the connection issue that resulted in the split. When the split node rejoins the cluster, local events gathered during the split are deleted. You can still view events generated by a split node in the node's event log file located at /var/log/isi_celog_events.log. If a cluster splits during a write operation, OneFS might need to reallocate blocks for the file on the side with the quorum, which leads allocated blocks on the side without a quorum to become orphans. When the split nodes reconnect with the cluster, the OneFS Collect system job reclaims the orphaned blocks. Meanwhile, as nodes split and merge with the cluster, the OneFS AutoBalance job redistributes data evenly among the nodes in the cluster, optimizing protection and conserving space.
Storage pools
Storage pools segment nodes and files into logical divisions to simplify the management and storage of data. A storage pool comprises node pools and tiers. Node pools group equivalent nodes to protect data and ensure reliability. Tiers combine node pools to optimize storage by need, such as a frequently used high-speed tier or a rarely accessed archive. The SmartPools module groups nodes and files into pools. If you do not activate a SmartPools license, the module provisions node pools and creates one file pool. If you activate the SmartPools license, you receive more features. You can, for example, create multiple file pools and govern them with policies. The policies move files, directories, and file pools among node pools or tiers. You can also define how OneFS handles write operations when a node pool or tier is full. SmartPools reserves a virtual hot spare to reprotect data if a drive fails regardless of whether the SmartPools license is activated.
The OneFS operating system
A distributed operating system based on FreeBSD, OneFS presents a PowerScale cluster's file system as a single share or export with a central point of administration. The OneFS operating system does the following:  Supports common data-access protocols, such as SMB and NFS  Connects to multiple identity management systems, such as Active Directory and LDAP  Authenticates users and groups  Controls access to directories and files
Mixed data-access protocol environments
With the OneFS operating system, you can access data with multiple file-sharing and transfer protocols. As a result, Microsoft Windows, UNIX, Linux, and MacOS X clients can share the same directories and files.
NOTE: On new installations of OneFS 9.0.0.0 and later, all protocols are disabled by default. You must enable any protocols that you plan to use. Use the following command to enable a protocol using the CLI, where <protocol> is one of smb, nfs, hdfs, ftp, http, https, or s3.
isi services <protocol> enable
OneFS supports the following protocols:

28

PowerScale scale-out NAS

SMB

The Server Message Block (SMB) protocol enables Windows users to access the cluster. OneFS works with SMB 1, SMB 2, and SMB 2.1, as well as SMB 3.0 for Multichannel only. With SMB 2.1,OneFS supports client opportunity locks (oplocks) and large (1 MB) MTU sizes.

NFS

The Network File System (NFS) protocol enables UNIX, Linux, and Mac OS X systems to remotely mount any subdirectory, including subdirectories created by Windows users. OneFS works with NFS versions 3 and 4.

HDFS

The Hadoop Distributed File System (HDFS) protocol enables a cluster to work with Apache Hadoop, a framework for data-intensive distributed applications. HDFS integration requires you to activate a separate license.

FTP

FTP allows systems with an FTP client to connect to the cluster and exchange files.

HTTP and HTTPS HTTP and its secure variant, HTTPS, give systems browser-based access to resources. OneFS includes limited support for WebDAV.

S3

The S3-on-OneFS technology enables using the Amazon Web Services Simple Storage Service (AWS S3)

protocol to store data in the form of objects on top of the OneFS file system storage.

Identity management and access control

OneFS works with multiple identity management systems to authenticate users and control access to files. OneFS also features access zones that allow users from different directory services to access different resources based on their IP address. Meanwhile, role-based access control (RBAC) segments administrative access by role.
OneFS authenticates users with the following identity management systems:
 Microsoft Active Directory (AD)  Lightweight Directory Access Protocol (LDAP)  Network Information Service (NIS)  Local users and local groups  A file provider for accounts in /etc/spwd.db and /etc/group files
Use the file provider to add an authoritative third-party source of user and group information.

You can manage users with different identity management systems; OneFS maps the accounts so that Windows and UNIX identities can co-exist. A Windows user account managed in Active Directory, for example, is mapped to a corresponding UNIX account in NIS or LDAP.
To control access, a PowerScale cluster works with both the access control lists (ACLs) of Windows systems and the POSIX mode bits of UNIX systems. When OneFS must transform file permissions from ACLs to mode bits or from mode bits to ACLs, OneFS merges the permissions to maintain consistent security settings.
OneFS presents protocol-specific views of permissions so that NFS exports display mode bits and SMB shares show ACLs. You can, however, manage not only mode bits but also ACLs with standard UNIX tools, such as the chmod and chown commands. ACL policies also enable you to configure how OneFS manages permissions for networks that mix Windows and UNIX systems.

Access zones

OneFS includes an access zones feature. Access zones allow users from different authentication providers, such as two untrusted Active Directory domains, to access different OneFS resources based on an incoming IP address. An access zone can contain multiple authentication providers and SMB namespaces.

RBAC for administration

OneFS includes role-based access control for administration. In place of a root or administrator account, RBAC lets you manage administrative access by role. A role limits privileges to an area of administration. For example, you can create separate administrator roles for security, auditing, storage, and backup.

Structure of the file system
OneFS presents all the nodes in a cluster as a global namespace. In the file system, directories are inode number links. An inode contains file metadata and an inode number, which identifies location of a file. OneFS dynamically allocates inodes, and there is no limit on the number of inodes. To distribute data among nodes, OneFS sends messages with a globally routable block address through the internal network of a cluster. The block address identifies the node and the drive storing the block of data.

PowerScale scale-out NAS

29

NOTE: The design of your data storage structure should be planned carefully. A well-designed directory optimizes cluster performance and cluster administration.
Data layout
OneFS evenly distributes data among a cluster's nodes with layout algorithms that maximize storage efficiency and performance. The system continuously reallocates data to conserve space. OneFS breaks data down into smaller sections called blocks, and then the system places the blocks in a stripe unit. By referencing either file data or erasure codes, a stripe unit helps safeguard a file from a hardware failure. The size of a stripe unit depends on the file size, the number of nodes, and the protection setting. After OneFS divides the data into stripe units, OneFS allocates, or stripes, the stripe units across nodes in the cluster. When a client connects to a node, the client's read and write operations take place on multiple nodes. For example, when a client connects to a node and requests a file, the node retrieves the data from multiple nodes and rebuilds the file. You can optimize how OneFS lays out data to match your dominant access pattern--concurrent, streaming, or random.
Writing files
On a node, the input-output operations of the OneFS software stack split into two functional layers: A top layer, or initiator, and a bottom layer, or participant. In read and write operations, the initiator and the participant play different roles. When a client writes a file to a node, the initiator on the node manages the layout of the file on the cluster. First, the initiator divides the file into blocks of 8 KB each. Second, the initiator places the blocks in one or more stripe units. At 128 KB, a stripe unit consists of 16 blocks. Third, the initiator spreads the stripe units across the cluster until they span a width of the cluster, creating a stripe. The width of the stripe depends on the number of nodes and the protection setting. After dividing a file into stripe units, the initiator writes the data first to non-volatile random-access memory (NVRAM) and then to disk. NVRAM retains the information when the power is off. During the write transaction, NVRAM guards against failed nodes with journaling. If a node fails mid-transaction, the transaction restarts without the failed node. When the node returns, it replays the journal from NVRAM to finish the transaction. The node also runs the AutoBalance job to check the file's on-disk striping. Meanwhile, uncommitted writes waiting in the cache are protected with mirroring. As a result, OneFS eliminates multiple points of failure.
Reading files
In a read operation, a node acts as a manager to gather data from the other nodes and present it to the requesting client. Because a PowerScale cluster's coherent cache spans all the nodes, OneFS can store different data in each node's RAM. A node using the internal network can retrieve file data from another node's cache faster than from its own local disk. If a read operation requests data that is cached on any node, OneFS pulls the cached data to serve it quickly. For files with an access pattern of concurrent or streaming, OneFS pre-fetches in-demand data into a managing node's local cache to further improve sequential-read performance.
Metadata layout
OneFS protects metadata by spreading it across nodes and drives. Metadata--which includes information about where a file is stored, how it is protected, and who can access it--is stored in inodes and protected with locks in a B+ tree, a standard structure for organizing data blocks in a file system to provide instant lookups. OneFS replicates file metadata across the cluster so that there is no single point of failure. Working together as peers, all the nodes help manage metadata access and locking. If a node detects an error in metadata, the node looks up the metadata in an alternate location and then corrects the error.

30

PowerScale scale-out NAS

Locks and concurrency
OneFS includes a distributed lock manager that orchestrates locks on data across all the nodes in a cluster.
The lock manager grants locks for the file system, byte ranges, and protocols, including SMB share-mode locks and NFS advisory locks. OneFS also supports SMB opportunistic locks.
Because OneFS distributes the lock manager across all the nodes, any node can act as a lock coordinator. When a thread from a node requests a lock, the lock manager's hashing algorithm typically assigns the coordinator role to a different node. The coordinator allocates a shared lock or an exclusive lock, depending on the type of request. A shared lock allows users to share a file simultaneously, typically for read operations. An exclusive lock allows only one user to access a file, typically for write operations.

Striping
In a process known as striping, OneFS segments files into units of data and then distributes the units across nodes in a cluster. Striping protects your data and improves cluster performance.
To distribute a file, OneFS reduces it to blocks of data, arranges the blocks into stripe units, and then allocates the stripe units to nodes over the internal network.
At the same time, OneFS distributes erasure codes that protect the file. The erasure codes encode the file's data in a distributed set of symbols, adding space-efficient redundancy. With only a part of the symbol set, OneFS can recover the original file data.
Taken together, the data and its redundancy form a protection group for a region of file data. OneFS places the protection groups on different drives on different nodes--creating data stripes.
Because OneFS stripes data across nodes that work together as peers, a user connecting to any node can take advantage of the entire cluster's performance.
By default, OneFS optimizes striping for concurrent access. If your dominant access pattern is streaming--that is, lower concurrency, higher single-stream workloads, such as with video--you can change how OneFS lays out data to increase sequential-read performance. To better handle streaming access, OneFS stripes data across more drives. Streaming is most effective on clusters or subpools serving large files.

Data protection overview

A PowerScale cluster is designed to serve data even when components fail. By default, OneFS protects data with erasure codes, enabling you to retrieve files when a node or disk fails. As an alternative to erasure codes, you can protect data with two to eight mirrors.
When you create a cluster with five or more nodes, erasure codes deliver as much as 80 percent efficiency. On larger clusters, erasure codes provide as much as four levels of redundancy.
OneFS includes the following features to help protect the integrity, availability, and confidentiality of data:

Feature Anti-virus Clones NDMP backup and restore
Protection domains

Description
OneFS can send files to servers running the Internet Content Adaptation Protocol (ICAP) or Common AntiVirus Agent (CAVA) to scan for viruses and other threats.
OneFS enables you to create clones that share blocks with other files to save space.
OneFS can back up data to tape and other devices through the Network Data Management Protocol. Although OneFS supports both three-way and two-way backup, two-way backup requires a PowerScale Backup Accelerator Node.
You can apply protection domains to files and directories to prevent changes.

The following software modules help protect data, but you must activate a separate license to use them:

PowerScale scale-out NAS

31

Licensed Feature SyncIQ
SnapshotIQ SmartLock
CloudPools

Description
SyncIQ replicates data on another PowerScale cluster and automates failover and failback operations between clusters. If a cluster becomes unusable, you can fail over to another PowerScale cluster .
You can protect data with a snapshot--a logical copy of data that is stored on a cluster.
The SmartLock tool prevents users from modifying and deleting files. You can commit files to a write-once, read-many state: The file can never be modified and cannot be deleted until after a set retention period. SmartLock can help you comply with Securities and Exchange Commission Rule 17a-4.
CloudPools extends the capabilities of OneFS by moving data to lower-cost cloud storage. You can move older or seldomused data to cloud storage and free up space on your cluster. CloudPools supports several cloud storage providers.

N+M data protection
OneFS uses data redundancy across the entire cluster to prevent data loss resulting from drive or node failures. Protection is built into the file system structure and can be applied down to the level of individual files.
Protection in OneFS is modeled on the Reed-Solomon algorithm, which uses forward error correction (FEC). Using FEC, OneFS allocates data in 128KB chunks. For each N data chunk, OneFS writes M protection, or parity, chunks. Each N+M chunk, referred to as a protection group, is written on an independent disk in an independent node. This process is referred to as data striping. By striping data across the entire cluster, OneFS is able to recover files in cases where drives or nodes fail.
In OneFS, the concepts of protection policy and protection level are different. The protection policy is the protection setting that you specify for storage pools on your cluster. The protection level is the actual protection that OneFS achieves for data, based on the protection policy and the actual number of writable nodes.
For example, if you have a three-node cluster, and you specify a protection policy of [+2d:1n], OneFS is able to tolerate the failure of two drives or one node without data loss. However, on that same three-node cluster, if you specify a protection policy of [+4d:2n], OneFS cannot achieve a protection level that would allow for four drive failures or two node failures. This is because N+M must be less than or equal to the number of nodes in the cluster.
By default, OneFS calculates and sets a recommended protection policy based on your cluster configuration. The recommended protection policy achieves the optimal balance between data integrity and storage efficiency.
You can set a protection policy that is higher than the cluster can support. In a four-node cluster, for example, you can set the protection policy at [5x]. However, OneFS would protect the data at 4x until you add a fifth node to the cluster, after which OneFS would automatically re-protect the data at 5x.
Data mirroring
You can protect on-disk data with mirroring, which copies data to multiple locations. OneFS supports two to eight mirrors. You can use mirroring instead of erasure codes, or you can combine erasure codes with mirroring.
Mirroring, however, consumes more space than erasure codes. Mirroring data three times, for example, duplicates the data three times, which requires more space than erasure codes. As a result, mirroring suits transactions that require high performance.
You can also mix erasure codes with mirroring. During a write operation, OneFS divides data into redundant protection groups. For files protected by erasure codes, a protection group consists of data blocks and their erasure codes. For mirrored files, a protection group contains all the mirrors of a set of blocks. OneFS can switch the type of protection group as it writes a file to disk. By changing the protection group dynamically, OneFS can continue writing data despite a node failure that prevents the cluster from applying erasure codes. After the node is restored, OneFS automatically converts the mirrored protection groups to erasure codes.

32

PowerScale scale-out NAS

The file system journal
A journal, which records file-system changes in a battery-backed NVRAM card, recovers the file system after failures, such as a power loss. When a node restarts, the journal replays file transactions to restore the file system.
Virtual hot spare (VHS)
When a drive fails, OneFS uses space reserved in a subpool instead of a hot spare drive. The reserved space is known as a virtual hot spare. In contrast to a spare drive, a virtual hot spare automatically resolves drive failures and continues writing data. If a drive fails, OneFS migrates data to the virtual hot spare to reprotect it. You can reserve as many as four disk drives as a virtual hot spare.
Balancing protection with storage space
You can set protection levels to balance protection requirements with storage space. Higher protection levels typically consume more space than lower levels because you lose an amount of disk space to storing erasure codes. The overhead for the erasure codes depends on the protection level, the file size, and the number of nodes in the cluster. Since OneFS stripes both data and erasure codes across nodes, the overhead declines as you add nodes.

Data compression

OneFS supports inline data compression on Isilon F810 and H5600 nodes, and on PowerScale F200 and F600 nodes.
The F810 node contains a Network Interface Card (NIC) that compresses and decompresses data.
Hardware compression and decompression are performed in parallel across the 40Gb Ethernet interfaces of supported nodes as clients read and write data to the cluster. This distributed interface model allows compression to scale linearly across the node pool as supported nodes are added to a cluster.
You can enable inline data compression on a cluster that:  Contains F810, H5600, F200, or F600 node pools  Offers a 40Gb Ethernet back-end network
The following table lists the nodes and OneFS release combinations that support inline data compression.

Nodes F810 H5600 F200 and F600

Required OneFS releases 8.1.3 or 8.2.1 and later 8.2.2 and later 9.0.0.0 and later

Mixed Clusters
In a mixed cluster environment, data is stored in a compressed form on F810, H5600, F200, and F600 node pools. Data that is written or tiered to storage pools of other node types is uncompressed when it moves between pools.

Software modules

You can access advanced features by activating licenses for Dell Technologies PowerScale software modules.

SmartLock HDFS

SmartLock protects critical data from malicious, accidental, or premature alteration or deletion to help you comply with SEC 17a-4 regulations. You can automatically commit data to a tamper-proof state and then retain it with a compliance clock.
OneFS works with the Hadoop Distributed File System protocol to help clients running Apache Hadoop, a framework for data-intensive distributed applications, analyze big data.

PowerScale scale-out NAS

33

SyncIQ automated failover and failback Security hardening SnapshotIQ SmartDedupe SmartPools
CloudPools
SmartConnect Advanced
InsightIQ SmartQuotas S3

SyncIQ replicates data on another PowerScale cluster and automates failover and failback between clusters. If a cluster becomes unusable, you can fail over to another PowerScale cluster. Failback restores the original source data after the primary cluster becomes available again.
Security hardening is the process of configuring your system to reduce or eliminate as many security risks as possible. You can apply a hardening policy that secures the configuration of OneFS, according to policy guidelines.
SnapshotIQ protects data with a snapshot--a logical copy of data that is stored on a cluster. A snapshot can be restored to its top-level directory.
You can reduce redundancy on a cluster by running SmartDedupe. Deduplication creates links that can impact the speed at which you can read from and write to files.
SmartPools enables you to create multiple file pools governed by file-pool policies. The policies move files and directories among node pools or tiers. You can also define how OneFS handles write operations when a node pool or tier is full.
Built on the SmartPools policy framework, CloudPools enables you to archive data to cloud storage, effectively defining the cloud as another tier of storage. CloudPools supports Dell Technologies PowerScale, Dell Technologies ECS Appliance, Amazon S3, Amazon C2S, Alibaba Cloud, and Microsoft Azure as cloud storage providers.
If you activate a SmartConnect Advanced license, you can balance policies to evenly distribute CPU usage, client connections, or throughput. You can also define IP address pools to support multiple DNS zones in a subnet. SmartConnect also supports IP failover, also known as NFS failover. It is recommended that you define a static pool that encompasses all nodes for management purposes. Dynamic IP addresses are configured only on nodes with quorum to ensure client connectivity. Defining a static pool for all nodes avoids administration difficulties for out of quorum nodes that will not have dynamic IP addresses configured for SSH connections.
The InsightIQ virtual appliance monitors and analyzes the performance of your PowerScale cluster to help you optimize storage resources and forecast capacity.
The SmartQuotas module tracks disk usage with reports and enforces storage limits with alerts.
OneFS support for the Amazon Web Services Simple Storage Service (AWS S3) protocol enables using the Amazon Web Services Simple Storage Service (AWS S3) protocol to store data in the form of objects on top of the OneFS file system storage. Using S3-OneFS enables reading data from, and writing data to, the PowerScale platform. The data resides under a single namespace. The AWS S3 protocol becomes a primary resident of the OneFS protocol stack, along with NFS, SMB, and HDFS, allowing multiprotocol access to objects and files. The S3 protocol supports bucket and object creation, retrieving, updating, and deletion. Object retrievals and updates are atomic. Bucket properties can be updated. Objects are accessible using NFS and SMB as normal files, providing cross-protocol support. To use S3, administrators generate access IDs and secret keys to authenticated users for access.

34

PowerScale scale-out NAS

3

Introduction to the OneFS command-line interface

This section contains the following topics:
Topics:
· OneFS command-line interface overview · Syntax diagrams · Universal options · Command-line interface privileges · SmartLock compliance command permissions · OneFS time values

OneFS command-line interface overview
The OneFS command-line interface extends the standard UNIX command set to include commands that enable you to manage a PowerScale cluster outside of the web administration interface or LCD panel. You can access the command-line interface by opening a secure shell (SSH) connection to any node in the cluster.
You can run isi commands to configure, monitor, and manage PowerScale clusters and the individual nodes in a cluster.
This publication provides conceptual and task information about CLI commands. For an alphabetical listing of all CLI commands, see the OneFS CLI Command Reference.

Syntax diagrams

The format of each command is described in a syntax diagram. The following conventions apply for syntax diagrams:

Element [ ] < > { }
| ...

Description
Square brackets indicate an optional element. If you omit the contents of the square brackets when specifying a command, the command still runs successfully.
Angle brackets indicate a placeholder value. You must replace the contents of the angle brackets with a valid value, otherwise the command fails.
Braces indicate a group of elements. If the contents of the braces are separated by a vertical bar, the contents are mutually exclusive. If the contents of the braces are not separated by a bar, the contents must be specified together.
Vertical bars separate mutually exclusive elements within the braces.
Ellipses indicate that the preceding element can be repeated more than once. If ellipses follow a brace or bracket, the contents of the braces or brackets can be repeated more than once.

Introduction to the OneFS command-line interface

35

Each isi command is broken into three parts: command, required options, and optional options. Required options are positional, meaning that you must specify them in the order that they appear in the syntax diagram. However, you can specify a required option in an alternative order by preceding the text displayed in angle brackets with a double dash. For example, consider isi snapshot snapshots create.
isi snapshot snapshots create <name> <path> [--expires <timestamp>] [--alias <string>] [--verbose]
If the <name> and <path> options are prefixed with double dashes, the options can be moved around in the command. For example, the following command is valid, where onefs_root is your OneFSinstallation's root directory:
isi snapshot snapshots create --verbose --path /onefs_root/data --alias newSnap_alias --name newSnap
Shortened versions of commands are accepted as long as the command is unambiguous and does not apply to multiple commands. For example, isi snap snap c newSnap /onefs_root/data is equivalent to isi snapshot snapshots create newSnap /onefs_root/data because the root of each word belongs to one command exclusively. If a word belongs to more than one command, the command fails. For example, isi sn snap c newSnap /onefs_root/ data is not equivalent to isi snapshot snapshots create newSnap /onefs_root/data because the root of isi sn could belong to either isi snapshot or isi snmp. If you begin typing a word and then press TAB, the rest of the word automatically appears as long as the word is unambiguous and applies to only one command. For example, isi snap completes to isi snapshot because that is the only valid possibility. However, isi sn does not complete, because it is the root of both isi snapshot and isi snmp.
Universal options
Some options are valid for all commands.
Syntax
isi [--timeout <integer>] [--debug] <command> [--help]
--timeout <integer> Specifies the number of seconds before the command times out.
--debug Displays all calls to the PowerScale OneFS Platform API. If a traceback occurs, displays traceback in addition to error message.
--help Displays a basic description of the command and all valid options for the command.
Examples
The following command causes the isi sync policies list command to timeout after 30 seconds:
isi --timeout 30 sync policies list
The following command displays help output for isi sync policies list:
isi sync policies list --help

36

Introduction to the OneFS command-line interface

Command-line interface privileges
You can perform most tasks granted by a privilege through the command-line interface (CLI). Some OneFS commands require root access.
SmartLock compliance command permissions
If a cluster is running in SmartLock compliance mode, root access is disabled on the cluster. If a command requires root access, you can run the command only through the sudo program.
In compliance mode, you can run all isi commands that are followed by a space through sudo. For example, you can run isi sync policies create through sudo. You can also run the following isi_ commands through sudo; these commands are internal and are typically run only by PowerScale Technical Support:
 isi_auth_expert  isi_bootdisk_finish  isi_bootdisk_provider_dev  isi_bootdisk_status  isi_bootdisk_unlock  isi_checkjournal  isi_clean_idmap  isi_client_stats  isi_cpr  isi_cto_update  isi_disk_firmware_reboot  isi_dmi_info  isi_dmilog  isi_dongle_sync  isi_drivenum  isi_dsp_install  isi_dumpjournal  isi_eth_mixer_d  isi_evaluate_provision_drive  isi_fcb_vpd_tool  isi_flexnet_info  isi_flush  isi_for_array  isi_fputil  isi_gather_info  isi_gather_auth_info  isi_gather_cluster_info  isi_gconfig  isi_get_itrace  isi_get_profile  isi_hangdump  isi_hw_check  isi_hw_status  isi_ib_bug_info  isi_ib_fw  isi_ib_info  isi_ilog  isi_imdd_status  isi_inventory_tool  isi_ipmicmc  isi_job_d  isi_kill_busy

Introduction to the OneFS command-line interface

37

 isi_km_diag  isi_lid_d  isi_linmap_mod  isi_logstore  isi_lsiexputil  isi_make_abr  isi_mcp  isi_mps_fw_status  isi_netlogger  isi_nodes  isi_ntp_config  isi_patch_d  isi_phone_home  isi_promptsupport  isi_radish  isi_rbm_ping  isi_repstate_mod  isi_restill  isi_rnvutil  isi_sasphymon  isi_save_itrace  isi_savecore  isi_sed  isi_send_abr  isi_smbios  isi_stats_tool  isi_transform_tool  isi_ufp  isi_umount_ifs  isi_update_cto  isi_update_serialno  isi_vitutil
Besides isi commands, you can run the following UNIX commands through sudo:
 date  gcore  ifconfig  kill  killall  nfsstat  ntpdate  nvmecontrol  pciconf  pkill  ps  pwd_mkdb  renice  shutdown  sysctl  tcpdump  top

38

Introduction to the OneFS command-line interface

OneFS time values

OneFS uses different values for time depending on the application.
You can specify time periods, such as a month, for multiple OneFS applications. However, because some time values have more than one meaning, OneFS defines time values based on the application. The following table describes the time values for OneFS applications:

Module SnapshotIQ SmartLock SyncIQ

Month 30 days 31 days 30 days

Year 365 days (does not account for leap year) 365 days (does not account for leap year) 365 days (does not account for leap year)

Introduction to the OneFS command-line interface

39

4

General cluster administration

This section contains the following topics:
Topics:
· General cluster administration overview · User interfaces · Connecting to the cluster · Licensing · Certificates · Cluster identity · Cluster contact information · Cluster date and time · SMTP email settings · Configuring the cluster join mode · File system settings · Data compression settings and monitoring · Events and alerts · Security hardening · Cluster configuration backup and restore · Cluster monitoring · Monitoring cluster hardware · Cluster maintenance · SRS Summary

General cluster administration overview
You can manage general OneFS settings and module licenses for your PowerScale cluster.
General cluster administration covers several areas. You can:
 Manage general settings such as cluster name, date and time, and email.  Monitor the cluster status and performance, including hardware components.  Configure how to handle events and notifications.  Perform cluster maintenance such as adding, removing, and restarting nodes.  Configure security hardening and compliance settings for the cluster. You can accomplish most management tasks using either the web administration or command-line interface. However, there are some tasks that you can manage only in one or the other.

User interfaces

OneFS provides several interfaces for managing PowerScale clusters.

Interface OneFS web administration interface

Description

Comment

The browser-based OneFS web administration interface provides secure access with OneFS-supported browsers. Use this interface to view robust graphical monitoring displays and to perform cluster-management tasks.

The OneFS web administration interface uses port 8080 as its default port.

40

General cluster administration

Interface OneFS command-line interface OneFS API
Node front panel

Description

Comment

Run OneFS isi commands in the command-line interface to configure, monitor, and manage the cluster. Access to the command-line interface is through a secure shell (SSH) connection to any node in the cluster.

TheOneFS command-line interface provides an extended standard UNIX command set for managing the cluster.

The OneFS application programming interface (API) is divided into two functional areas: one area enables cluster configuration, management, and monitoring functionality, and the other area enables operations on files and directories on the cluster. You can send requests to the OneFS API through a Representational State Transfer (REST) interface, which is accessed through resource URIs and standard HTTP methods.

You should have a solid understanding of HTTP/1.1 and experience writing HTTP-based client software before you implement client-based software through the OneFS API.

With the exception of accelerator nodes, the front panel of each node contains an LCD screen with five buttons that you can use to monitor node and cluster details.

Node status, events, cluster details, capacity, IP and MAC addresses, throughput, and drive status are available through the node front panel.

Connecting to the cluster
PowerScale cluster access is provided through the web administration interface or through SSH. You can use a serial connection to perform cluster administration tasks through the command-line interface. You can also access the cluster through the node front panel to accomplish a subset of cluster management tasks. For information about connecting to the node front panel, see the installation documentation for your node.

Log in to the web administration interface

You can monitor and manage your PowerScale cluster from the browser-based web administration interface.
1. In a browser window, enter the URL for your cluster in the address field. In the following examples, replace <yourNodeIPaddress> with the first IP address you provided when you configured ext-1.

IPv4

https://<yourNodeIPaddress>:8080

IPv6

https://[<yourNodeIPaddress>]:8080

If your security certificates have not been configured, the system displays a message. Resolve any certificate configurations, then continue to the website.
2. Log in to OneFS by typing your OneFS credentials in the Username and Password fields. After you log in to the web administration interface, there is a 4-hour login timeout.

Open an SSH connection to a cluster
You can use any SSH client such as OpenSSH or PuTTY to connect to a PowerScale cluster. You must have valid OneFS credentials to log in to a cluster after the connection is open. 1. Open a secure shell (SSH) connection to any node in the cluster, using the IP address of the node and port number 22. 2. Log in with your OneFS credentials.
At the OneFS system prompt, you can use isi commands to monitor and manage your cluster.

General cluster administration

41

Licensing
All PowerScale software and hardware must be licensed through Dell Technologies Software Licensing Central (SLC).
A license file contains a record of your active software licenses and your cluster hardware. One copy of the license file is stored in the SLC repository, and another copy of the license file is stored on your cluster. The license file on your cluster and the license file in the SLC repository must match. The license file contains a record of the following:  OneFS license  Optional software module licenses  Hardware information

Software licenses
Your OneFS license and optional software module licenses are contained in the license file on your cluster. Your license file must match your license record in the Dell Technologies Software Licensing Central (SLC) repository.
Ensure that the license file on your cluster, and your license file in the SLC repository, match your upgraded version of OneFS.
Advanced cluster features are available when you activate licenses for the following OneFS software modules:
 CloudPools  Security hardening  HDFS  PowerScale Swift  SmartConnect Advanced  SmartDedupe  SmartLock  SmartPools  SmartQuotas  SnapshotIQ  SyncIQ
For more information about optional software modules, contact your Dell Technologies sales representative.

Hardware tiers
Your license file contains information about the PowerScale hardware that is installed in your cluster. Your license file lists nodes by tiers. Nodes are placed into a tier according to their compute performance level, capacity, and drive type.
NOTE: Your license file contains line items for every node in your cluster. However, pre-Generation 6 hardware is not included in the OneFS licensing model.

License status

The status of a OneFS license indicates whether the license file on your cluster reflects your current version of OneFS. The status of a OneFS module license indicates whether the functionality provided by a module is available on the cluster.
Licenses exist in one of the following states:

Status Unsigned
Inactive

Description
The license has not been updated in Dell Technologies Software Licensing Central (SLC). You must generate and submit an activation file to update your license file with your new version of OneFS.
The license has not been activated on the cluster. You cannot access the features provided by the corresponding module.

42

General cluster administration

Status Evaluation
Activated Expired

Description
The license has been temporarily activated on the cluster. You can access the features provided by the corresponding module for 90 days.
The license has been activated on the cluster. You can access the features provided by the corresponding module.
The license has expired on the cluster. After the license expires, you must generate and submit an activation file to update your license file.

View license information
You can view information about the current license status for OneFS, hardware, and optional PowerScale software modules. Run the following command:
isi license list

Adding and removing licenses
You can allow OneFS to update your license file automatically, or you can update your license file manually. The automated process to update a license file requires that SRS and in-product activation are enabled. The manual process to update a license file requires generating an activation file, submitting the activation file to Dell Technologies Software Licensing Central (SLC), then uploading an updated license file to your cluster. Your license file should be updated when you:  Upgrade for the first time to OneFS 8.1 or later.  Require the activation of an optional software module.  Add new hardware.  Upgrade existing hardware.
Enable OneFS to update your license file automatically
You can enable OneFS to keep your license file updated automatically using SRS. To allow OneFS to automatically communicate with SRS and update your license file: 1. Enable SRS. 2. Enable in-product activation using the following command:
isi license activation start
NOTE: When you first enable SRS, in-product activation is enabled by default.
Update a license activation file manually
To update the license file, generate a license activation file and submit it to Software Licensing Central (SLC). 1. Run the isi license generate command to create an activation file, or to add or remove licenses from your activation
file. The command includes the --file parameter to designate the location to save your activation file.

General cluster administration

43

The following command adds a OneFS license and saves the activation file, named <cluster-name>_activation.xml to the/ifs directory on your cluster:
isi license generate --include OneFS --file /ifs/<cluster-name>_activation.xml
The following command adds OneFS and SyncIQ licenses, removes your Cloudpools license, and saves the new activation file:
isi license generate --include OneFS --include SyncIQ --exclude Cloudpools --file /ifs/<cluster-name>_activation.xml
2. Save a copy of the activation file to your local machine. The next procedure describes how to submit the file to Dell Technologies SLC.
Submit a license activation file to SLC
After you generate an activation file in OneFS, submit the activation file to Dell Technologies Software Licensing Central (SLC) to receive a signed license file for your cluster. Before you submit your activation file to SLC, you must generate the activation file through OneFS and save the file to your local machine. 1. From your local, internet-connected system, go to Dell Technologies Software Licensing Central (SLC). 2. Log into the system using your Dell Technologies credentials. 3. Click ACTIVATE at the top of the page.
A menu appears with two options: Activate and Activate by File. 4. Click Activate by File
The Upload Activation File page appears. 5. Confirm that your company name is listed next to Company.
If your company name is not displayed, click Select a Company and search with your company name and ID. 6. Click Upload. 7. Locate the activation file on your local machine and click Open.
This is an XML file. The file was originally generated with the name activation.xml but you may have saved it with a different name. 8. Click the Start the Activation Process button. The Apply License Authorization Code (LAC) page appears. 9. In the Missing Product & Quantities Summary table, confirm that there is a green check in the column on the far right. If any row is missing a green check in that column, you can search for a different LAC by clicking the Search button and selecting a different available LAC. 10. Click the Next: Review button. 11. Click the Activate button. When the signed license file is available, SLC will send it to you as an attachment to an email.
NOTE: Your signed license file may not be available immediately.
The signed license file is an XML file with a name in the following format:
ISLN_nnn_date.xml
For example, ISLN_15002_13-Feb-2019.xml 12. Download the signed license file to your local machine, in a directory where you can locate it for the next procedure. For
example, save it in /ifs on your local machine.

44

General cluster administration

Upload the signed license file
After you receive a signed license file from Dell Technologies Software Licensing Central (SLC), upload the file to your cluster. Run the isi license add command.
# isi license add --path <file-path-on-your-local-machine>
For example, the following command uploads the signed license file that you had previously copied into a directory named /ifs on your local machine:
isi license add --path /ifs/ISLN_15002_13-Feb-2019.xml
Activating trial licenses
You can activate a trial license that allows you to evaluate an optional software module for 90 days.
Activate a trial license
You can activate a trial license to evaluate a OneFS software module. Run the isi license add command. The following command activates a trial license for the Cloudpools and SyncIQ modules:
isi license add --evaluation Cloudpools --evaluation SyncIQ
Certificates
All OneFS API communication, which includes communication through the web administration interface, is over Transport Layer Security (TLS). You can renew the TLS certificate for the OneFS web administration interface or replace it with a third-party TLS certificate. To replace or renew a TLS certificate, you must be logged in as root.
NOTE: OneFS defaults to the best supported version of TLS against the client request.
Certificate management
You can manage TLS certificates using the OneFS command line interface. The isi certificate settings view command enables viewing all of the certificate-related configuration options. For example:
# isi certificate settings view Certificate Monitor Enabled: Yes Certificate Pre Expiration Threshold: 4W2D Default HTTPS Certificate ID: default Subject: C=US, ST=Washington, L=Bellingham, O="Sample Systems, Inc.", OU=Sample Systems, CN=Sample Systems, [email protected] Status: valid
The configuration options Certificate monitor enabled and Certificate Pre Expiration Threshold control a nightly cron job that monitors the expiration of every managed certificate and raises a CELOG alert when a certificate is set to expire within the configured threshold. The default expiration is 30 days. The ID: default option indicates that this certificate is the default TLS certificate.

General cluster administration

45

The isi certificate server list command lists the available certificates. For example:

# isi certificate server list

ID

Name Status Expires

-------------------------------------------

a50e6da default valid 2021-12-25T11:01:55

c392083 mycert valid 2020-04-19T09:40:29

-------------------------------------------

Total: 2

You can view the settings for a particular certificate using the isi certificate server view <certificate_name> command, where <certificate_name> is the name of the certificate for which to view settings. For example, suppose that you want to view the settings for a certificate named mycert:

# isi certificate server view mycert ID: c39208312f11f9d85a383f1fb4338e3eac92258c066d01931684a4c2bf343f71
Name: mycert Description:
Subject: C=US, ST=Washington, CN=*.local.samplesys.com, [email protected]
Issuer: C=US, ST=Washington, L=Bellingham, CN=AdminCritter Root, [email protected]
Status: valid Not Before: 2019-04-10T09:40:29
Not After: 2020-04-19T09:40:29 Fingerprints
Type: SHA1 Value: 58:e7:8e:1f:1a:bb:5f:15:94:88:6b:91:be:e1:4f:47:76:ac:df:90
Type: SHA256 Value: c3:92:08:31:2f:11:f9:d8:5a:38:3f:1f:b4:33:8e:3e:ac:92:25:8c:06:6d:01:93:16:84:a4:c2:bf:34 :3f:71 DNS Names: *.local.samplesys.com

Note the DNS Names listed when you view the certificate information. OneFS attempts to map any configured SmartConnect names or aliases to one of the certificates available to the system. If no match is found, OneFS uses the default certificate.
You can change the certificate settings using the isi certificate settings modify command. For example, to change the default HTTPS certificate to the mycert certificate:

# isi certificate settings modify --default-https-certificate=mycert

To verify that the default certificate has been changed:

# isi certificate settings view Certificate Monitor Enabled: Yes Certificate Pre Expiration Threshold: 4W2D Default HTTPS Certificate ID: mycert Subject: C=US, ST=Washington, CN=*.local.samplesys.com, [email protected]
Status: valid

Replacing or renewing the TLS certificate
The Transport Layer Security (TLS) certificate is used to access the cluster through a browser. The cluster initially contains a self-signed certificate for this purpose. You can continue to use the existing self-signed certificate, or you can replace it with a third-party certificate authority (CA)-issued certificate.
If you continue to use the self-signed certificate, you must replace it when it expires, with either:
 A third-party (public or private) CA-issued certificate  Another self-signed certificate that is generated on the cluster

46

General cluster administration

Replace the TLS certificate with a third-party CA-issued certificate
This procedure describes how to replace the existing TLS certificate with a third-party (public or private) certificate authority (CA)-issued TLS certificate. When you request a TLS certificate from a certificate authority, you must provide information about your organization. It is recommended that you determine this information before you begin the process. See the TLS certificate data example section of this chapter for details and examples of the required information.
NOTE: This procedure requires you to restart the isi_webui service, which restarts the web administration interface. It is recommended that you perform these steps during a scheduled maintenance window. 1. Open a secure shell (SSH) connection to any node in the cluster and log in as root. 2. List the current certificate:
isi certificate server list
Output similar to the following is displayed, where <old cert ID> is the ID number of the current certificate:
ID Name Default <old cert ID> Isilon Systems True Total: 1
3. Import the new certificate. This assumes the certificate is located in the /ifs/local directory.
isi certificate server import --certificatepath=/ifs/local/cert.pem --certificate-keypath=/ifs/local/key.pem --description='My new server certificate' ­default
4. Confirm the new certificate was imported:
isi certificate server list
Output similar to the following is displayed, where <old cert ID> is the ID number of the current certificate:
ID Name Default <new cert ID> True <old cert ID> Isilon Systems False Total: 2
5. Remove the old certificate:
isi certificate server delete --id=<old cert ID>
6. Confirm the deletion:
Are you sure you want to delete this certificate (ID: <old cert ID>)?? (yes/[no]): yes
7. Confirm the new certificate is the only one present:
isi certificate server list ID Name Default <new cert ID> True Total: 1
NOTE: If you have multiple certificates, set the default one after you replace the old one:
isi certificate settings modify --default-https-certificate=<certificate id>

General cluster administration

47

Renew the self-signed TLS certificate
This procedure describes how to replace an expired self-signed TLS certificate by generating a new certificate that is based on the existing (stock) server key. When you generate a self-signed certificate, you must provide information about your organization. It is a good idea to determine this information in advance, before you begin the process. See the TLS certificate data example section of this chapter for details and examples of the required information.
NOTE: This procedure requires you to restart the isi_webui service, which restarts the web administration interface. Therefore, it is recommended that you perform these steps during a scheduled maintenance window. 1. Open a secure shell (SSH) connection to any node in the cluster and log in as root. 2. Create a backup directory by running the following command:
mkdir /ifs/data/backup/
3. Set the permissions on the backup directory to 700:
chmod 700 /ifs/data/backup
4. Make backup copies of the existing server.crt and server.key files by using the cp command to copy them to the backup directory that you just created. NOTE: If files with the same names exist in the backup directory, either overwrite the existing files, or, to save the old backups, rename the new files with a timestamp or other identifier.
5. Create a working directory to hold the files while you complete this procedure:
mkdir /ifs/local/
6. Set the permissions on the working directory to 700:
chmod 700 /ifs/local
7. Change to the working directory:
cd /ifs/local/
8. At the command prompt, use the cp command to copy the existing certificate to the working directory that you just created, then run the following command to create a certificate that will expire in 2 years (730 days). Increase or decrease the value for -days to generate a certificate with a different expiration date.
openssl req -new -days 730 -nodes -x509 -key \ server.key -out server.crt
NOTE: the -x509 value is a certificate format.
9. When prompted, type the information to be incorporated into the certificate request. When you finish entering the information, a renewal certificate is created, based on the existing (stock) server key. The renewal certificate is named server.crt and it appears in the /ifs/local directory.
10. Optional: To verify the attributes in the TLS certificate, run the following command:
isi certificate server view server.crt

48

General cluster administration

11. Run the following commands to install the certificate and key and restart the isi_webui service:
isi services -a isi_webui disable
chmod 640 server.key
# isi certificate server import --name=server ­certificate-path=/ifs/server.crt -certificate-key-path=/ifs/server.key
isi services -a isi_webui enable
If the private key is password encrypted, you can use the isi certificate server import command's -certificate-key-password <string> parameter to specify the password.
12. Run the command isi certificate server list to verify that the installation succeeded. Optionally re-run the isi certificate server view server.crt command to confirm the certificate settings.
13. Delete the temporary working files from the /ifs/local directory: rm /ifs/local/<common-name>.csr \ /ifs/local/<common-name>.key /ifs/local/<common-name>.crt
14. (Optional) Delete the backup files from the /ifs/data/backup directory:
rm /ifs/data/backup/server.crt.bak \ /ifs/data/backup/server.key.bak
List available certificates
You can list the available Secure Sockets Layer certificates. You must be logged in to the command line interface as an administrator user. Run the following command: isi certificate server list A list of available certificates appears.
View certificate settings
You can view certificate-related settings. Settings include whether the certificate is the default certificate and the expiration threshold. Check your certificate settings before making any changes. Log in to the cluster as a user with the root or administrator role. Run the following command: isi certificate settings view OneFS displays the certificate settings. Following are the certificate settings for the fictional company Internet Widgets Pty. Ltd of Sampletown, Washington, US. The
ID: default
setting indicates that this is the default certificate.
Certificate Monitor Enabled: Yes Certificate Pre Expiration Threshold: 4W2D
Default HTTPS Certificate ID: default Subject: C=US, ST=Washington, L=Sampletown, O="Internet Widgets, Pty. Ltd.", OU=IW Systems, CN=Internet Widgets, [email protected] Status: valid

General cluster administration

49

Verify a TLS certificate update
You can verify the details stored in a TLS certificate. Run the following command to view the attributes in an TLS certificate:
isi certificate settings view <certificate-name>

TLS certificate data example
TLS certificate renewal or replacement requires you to provide data such as a fully qualified domain name and a contact email address.
When you renew or replace a TLS certificate, you are asked to provide data in the format that is shown in the following example:
You are about to be asked to enter information that will be incorporated into your certificate request. What you are about to enter is what is called a Distinguished Name or a DN. There are quite a few fields but you can leave some blank For some fields there will be a default value, If you enter '.', the field will be left blank. ----Country Name (2 letter code) [AU]:US State or Province Name (full name) [Some-State]:Washington Locality Name (eg, city) []:Seattle Organization Name (eg, company) [Internet Widgits Pty Ltd]:Company Organizational Unit Name (eg, section) []:System Administration Common Name (e.g. server FQDN or YOUR name) []:localhost.example.org Email Address []:[email protected]
In addition, if you are requesting a third-party CA-issued certificate, you should include additional attributes that are shown in the following example:
Please enter the following 'extra' attributes to be sent with your certificate request A challenge password []:password An optional company name []:Another Name

Cluster identity

You can specify identity attributes for a PowerScale cluster.

Cluster name
Cluster description Login message

The cluster name appears on the login page, and it makes the cluster and its nodes more easily recognizable on your network. Each node in the cluster is identified by the cluster name plus the node number. For example, the first node in a cluster that is named Images may be named Images-1.
The cluster description appears below the cluster name on the login page. The cluster description is useful if your environment has multiple clusters.
The login message appears as a separate box on the login page of the OneFS web administration interface, or as a line of text under the cluster name in the OneFS command-line interface. The login message can convey cluster information, login instructions, or warnings that a user should know before logging into the cluster. Set this information in the Cluster Identity page of the OneFS web administration interface.

Set the cluster name
You can specify a name, description, and login message for your PowerScale cluster. Cluster names must begin with a letter and can contain only numbers, letters, and hyphens. The cluster name is added to the node number to identify each node in the cluster. For example, the first node in a cluster named Images may be named Images-1.

50

General cluster administration

1. Open the isi config command prompt by running the following command:
isi config
2. Run the name command. The following command sets the name of the cluster to NewName:
name NewName
3. Save your changes by running the following command:
commit
Cluster contact information
PowerScale Technical Support personnel and event notification recipients will communicate with the specified contacts. You can specify the following contact information for your PowerScale cluster:  Company name and location  Primary and secondary contact names  Phone number and email address for each contact
Cluster date and time
The Network Time Protocol (NTP) service is configurable manually, so you can ensure that all nodes in a cluster are synchronized to the same time source. The NTP method automatically synchronizes cluster date and time settings through an NTP server. Alternatively, you can set the date and time reported by the cluster by manually configuring the service. Windows domains provide a mechanism to synchronize members of the domain to a master clock running on the domain controllers, so OneFS adjusts the cluster time to that of Active Directory with a service. If there are no external NTP servers that are configured, OneFS uses the Windows domain controller as the NTP time server. When the cluster and domain time that is become out of sync by more than 4 minutes, OneFS generates an event notification.
NOTE: If the cluster and Active Directory that is become out of sync by more than 5 minutes, authentication does not work.
Set the cluster date and time
You can set the date, time, and time zone for the PowerScale cluster. 1. Run the isi config command.
The command-line prompt changes to indicate that you are in the isi config subsystem. 2. Specify the current date and time by running the date command.
The following command sets the cluster time to 9:47 AM on July 22, 2015:
date 2015/07/22 09:47:00
3. To verify your time zone setting, run the timezone command. The current time zone setting displays. For example:
The current time zone is: Pacific Time Zone
4. To view a list of valid time zones, run the help timezone command. The following options display:
Greenwich Mean Time Eastern Time Zone Central Time Zone Mountain Time Zone Pacific Time Zone

General cluster administration

51

Arizona Alaska Hawaii Japan Advanced
5. To change the time zone, enter the timezone command followed by one of the displayed options. The following command changes the time zone to Hawaii:
timezone Hawaii
A message confirming the new time zone setting displays. If your desired time zone did not display when you ran the help timezone command, enter timezone Advanced. After a warning screen, you will proceed to a list of regions. When you select a region, a list of specific time zones for that region appears. Select the desired time zone (you may need to scroll), then enter OK or Cancel until you return to the isi config prompt. 6. Run the commit command to save your changes and exit isi config.
Specify an NTP time server
You can specify one or more Network Time Protocol (NTP) servers to synchronize the system time on the PowerScale cluster. The cluster periodically contacts the NTP servers and sets the date and time based on the information it receives. Run the isi_ntp_config command, specifying add server, followed by the host name, IPv4, or IPv6 address for the desired NTP server. The following command specifies ntp.time.server1.com:
isi_ntp_config add server ntp.time.server1.com
SMTP email settings
If your network environment requires the use of an SMTP server or if you want to route PowerScale cluster event notifications with SMTP through a port, you can configure SMTP email settings. SMTP settings include the SMTP relay address and port number that email is routed through. You can specify an origination email and subject line for all event notification email messages sent from the cluster. If your SMTP server is configured to support authentication, you can specify a username and password. You can also specify whether to apply encryption to the connection.
Configure SMTP email settings
You can send event notifications through the SMTP mail server. You can also enable SMTP authentication if your SMTP server is configured to use it. You can configure SMTP email settings if your network environment requires the use of an SMTP server or if you want to route PowerScale cluster event notifications with SMTP through a port. Run the isi email command. The following example configures SMTP email settings:
isi email settings modify --mail-relay 10.7.180.45 \ --mail-sender [email protected] \ --mail-subject "PowerScale cluster event" --use-smtp-auth yes \ --smtp-auth-username SMTPuser --smtp-auth-passwd Password123 \ --use-encryption yes

52

General cluster administration

View SMTP email settings
You can view SMTP email settings.
Run the following command:
isi email settings view
The system displays information similar to the following example:
Mail Relay: SMTP Port: 25
Mail Sender: Mail Subject: Use SMTP Auth: No SMTP Auth Username: Use Encryption: No
Batch Mode: none User Template: -

Configuring the cluster join mode

The cluster join mode specifies how a node is added to the PowerScale cluster and whether authentication is required. OneFS supports manual and secure join modes for adding nodes to the cluster.

Mode Manual
Secure

Description
Allows you to manually add a node to the cluster without requiring authorization.
Requires authorization of every node added to the cluster. The node must be added through the web administration interface or through the isi devices -a add -d <unconfigured_node_serial_no> command in the command-line interface.
NOTE: If you specify a secure join mode, you cannot join a node to the cluster through serial console wizard option [2] Join an existing cluster.

Specify the cluster join mode
You can specify a join mode that determines how nodes are added to a PowerScale cluster. 1. Open the isi config command prompt by running the following command:
isi config
2. Run the joinmode command. The following command prevents nodes from joining the cluster unless the join is initiated by the cluster: joinmode secure
3. Save your changes by running the following command: commit

General cluster administration

53

File system settings
You can configure global file system settings on a PowerScale cluster for access time tracking and character encoding. You can enable or disable access time tracking, which monitors the time of access on each file. If necessary, you can also change the default character encoding on the cluster.
Specify the cluster character encoding
You can modify the character encoding set for a PowerScale cluster after installation. Only OneFS-supported character sets are available for selection. UTF-8 is the default character set for OneFS nodes.
NOTE: If the cluster character encoding is not set to UTF-8, SMB share names are case-sensitive.
You must restart the cluster to apply character encoding changes. CAUTION: Character encoding is typically established during installation of the cluster. Modifying the character encoding setting after installation may render files unreadable if done incorrectly. Modify settings only if necessary after consultation with PowerScale Technical Support.
1. Run the isi config command. The command-line prompt changes to indicate that you are in the isi config subsystem.
2. Modify the character encoding by running the encoding command. The following command sets the encoding for the cluster to ISO-8859-1:
encoding ISO-8859-1 3. Run the commit command to save your changes and exit the isi config subsystem. 4. Restart the cluster to apply character encoding modifications.
Enable or disable access time tracking
You can enable access time tracking to support features that require it. By default, a PowerScale cluster does not track the timestamp when files are accessed. You can enable this feature to support OneFS features that use it. For example, access-time tracking must be enabled to configure SyncIQ policy criteria that match files based on when they were last accessed.
NOTE: Enabling access-time tracking may affect cluster performance.
1. Enable or disable access time tracking by setting the atime_enabled system control.  To enable access time tracking, run the following command:
sysctl efs.bam.atime_enabled=1  To disable access time tracking, run the following command:
sysctl efs.bam.atime_enabled=0 2. To specify how often to update the last-accessed time, set the atime_grace_period system control.
Specify the amount of time as a number of milliseconds. The following command configures OneFS to update the last-accessed time every two weeks:
sysctl efs.bam.atime_grace_period=1209600000

54

General cluster administration

Data compression settings and monitoring
From the OneFS command line, you can enable and disable inline data compression on an Isilon or PowerScale cluster. You can also view statistics that are related to compression activity and efficiency across the cluster. Data compression is available only with node pools of Isilon F810 and H5600 nodes and PowerScale F200 and F600 nodes.

Data compression terminology

The following list contains definitions for OneFS terminology related to data compression.

Logical data
Dedupe saved Compression saved Preprotected physical Protection overhead Protected physical Dedupe ratio
Compression ratio Data reduction ratio Efficiency ratio

Also known as effective data, this is a data size that excludes protection overhead and data efficiency savings from compression and deduplication. The amount of capacity savings related to deduplication. The amount of capacity savings related to in-line compression.
Also known as usable data, this is a data size that excludes protection overhead, but includes data efficiency savings from compression and deduplication. The size of the erasure coding used to protect data.
Also known as raw data, this is a data size that includes protection overhead, and takes into account the data efficiency savings from compression and deduplication. The estimated ratio of deduplication, where the ratio will be displayed as 1.0:1 if there is no deduplication on the cluster. The ratio of logical data to preprotected physical data, it's the usable efficiency ratio from compression. The ratio is calculated by dividing logical data by preprotected physical data and is expressed as x:1. The usable efficiency ration from compression and deduplication. This ratio will be the same as the compression ratio if there is no deduplication occurring on the cluster. The ratio of logical data to protected physical data. This is the overall raw efficiency ratio which is calculated by dividing logical data by protected physical data and is expressed as x:1.

Enable or disable data compression
You can turn data compression on or off from the OneFS command line. This procedure is available only through the OneFS command-line interface (CLI). The following are the settings for data compression configuration:  Enabled: yes  Enabled: no The default setting is Enabled: yes.
NOTE: This compression setting only applies to data stored on Isilon F810 and H5600 node pools, and on PowerScale F200 and F600 node pools. Data written to any other node types ignore this setting and are not compressed. If a cluster does not contain a supported node pool, this setting is ignored.
NOTE: When you enable compression, OneFS does not go back and compress the data that was written while compression was disabled. 1. To view the current compression setting, run the following command: isi compression settings view The system displays output similar to the following example:
Enabled: Yes
2. If compression is enabled and you want to disable it, run the following command:

General cluster administration

55

isi compression settings modify --enabled=False 3. If compression is disabled and you want to enable it, run the following command:
isi compression settings modify --enabled=True 4. After you adjust settings, confirm that the setting is correct. Run the following command:
isi compression settings view

View compression statistics

You can view reports about data compression that include current and historic compression ratios, as well as logical and physical data block totals.
This procedure is available only through the OneFS command-line interface (CLI).
1. To view a report that contains recent writes and total cluster data reduction, run the following command: isi statistics data-reduction The system displays output similar to the following example:

Recent Writes Cluster Data Reduction

(5 mins)

--------------------- ------------- ----------------------

Logical data

1.07M

49.57M

Zero-removal saved

0

-

Deduplication saved

0

0

Compression saved

0

0

Preprotected physical

1.07M

49.57M

Protection overhead

2.14M

99.14M

Protected physical

3.21M

160.61M

Zero removal ratio

1.00 : 1

-

Deduplication ratio

1.00 : 1

1.00 : 1

Compression ratio

1.00 : 1

1.00 : 1

Data reduction ratio

1.00 : 1

1.00 : 1

Efficiency ratio

0.33 : 1

0.31 : 1

--------------------- ------------- ----------------------

The Recent Writes column displays statistics for the previous five minutes. The Cluster Data Reduction column displays statistics for overall data efficiency across the entire cluster.
2. To view a report that contains statistics about compression ratios from the last five minutes, the percent of data that is not compressible, the total logical and physical data blocks that were processed, and writes where compression was not attempted, run the following command:
isi compression stats view
The system displays output similar to the following example:
stats for 300 seconds at: 2021-02-25 19:20:36 (1614280836) compression ratio for compressed writes: 0.00 : 1 compression ratio for all writes: 1.00 : 1 incompressible data percent: 0.00% total logical blocks: 135 total physical blocks: 135 writes for which compression was not attempted: 100.00%

 If the incompressible data percentage is high, the data being written to the cluster might be a type that has already been compressed.
 If the number of writes for which compression was not attempted is high, you might be working with a cluster with multiple node types. If so, OneFS might be directing writes to a node pool that does not support data compression.
3. To view a report that contains the statistics that the isi compression stats view command provides, but also shows statistics from previous five minute intervals, run the following command:
isi compression stats list
The system displays output similar to the following example:

56

General cluster administration

General cluster administration

57

Events and alerts
OneFS continuously monitors the health and performance of your cluster and generates events when situations occur that might require your attention. Events can be related to file system integrity, network connections, jobs, hardware, and other vital operations and components of your cluster. After events are captured, they are analyzed by OneFS. Events with similar root causes are organized into event groups. An event group is a single point of management for numerous events related to a particular situation. You can determine which event groups you want to monitor, ignore, or resolve. An alert is the message that reports on a change that has occurred in an event group. For some events, you can set the thresholds at which to raise alerts. You can control how alerts related to an event group are distributed. Alerts are distributed through channels. You can create and configure a channel to send alerts to a specific audience, control the content the channel distributes, and limit frequency of the alerts.
Events overview
Events are individual occurrences or conditions related to the data workflow, maintenance operations, and hardware components of your cluster. Throughout OneFS there are processes that are constantly monitoring and collecting information on cluster operations. When the status of a component or operation changes, the change is captured as an event and placed into a priority queue at the kernel level. Every event has two ID numbers that help to establish the context of the event:  The event type ID identifies the type of event that has occurred.  The event instance ID is a unique number that is specific to a particular occurrence of an event type. When an event is
submitted to the kernel queue, an event instance ID is assigned. You can reference the instance ID to determine the exact time that an event occurred. You can view individual events. However, you manage events and alerts at the event group level.
Alerts overview
An alert is a message that describes a change that has occurred in an event group. At any point in time, you can view event groups to track situations occurring on your cluster. You can also create alerts to proactively notify you when there is a change in an event group. For example, you can generate an alert when a new event is added to an event group, when an event group is resolved, or when the severity of an event group changes. You can adjust the thresholds at which certain events raise alerts. For example, by default, OneFS generates an alert when a disk pool is 95% full. You can adjust that threshold to a lower percentage. You can configure your cluster to generate alerts only for specific event groups, conditions, severity, or during limited time periods. Alerts are delivered through channels. You can configure a channel to determine who will receive the alert and when.
Channels overview
Channels are pathways by which event groups send alerts. When an alert is generated, the channel that is associated with the alert determines how the alert is distributed and who receives the alert. You can configure a channel to deliver alerts with one of the following mechanisms: SMTP, SNMP, or Connect Home. You can also specify the required routing and labeling information for the delivery mechanism.

58

General cluster administration

Event groups overview
Event groups are collections of individual events that are related symptoms of a single situation on your cluster. Event groups provide a single point of management for multiple event instances that are generated in response to a situation on your cluster. For example, if a chassis fan fails in a node, OneFS might capture multiple events related both to the failed fan itself, and to exceeded temperature thresholds within the node. All events related to the fan will be represented in a single event group. Because there is a single point of contact, you do not need to manage numerous individual events. You can handle the situation as a single, coherent issue. All management of events is performed at the event group level. You can mark an event group as resolved or ignored. You can also configure how and when alerts are distributed for an event group.
Viewing and modifying event groups
You can view event and modify the status of event groups.
View an event group
Use the isi event groups list command to view event groups. 1. Optional: To identify the group ID of the event group that you want to view, run the following command:
isi event groups list
2. To view the details of a specific event group, run the isi event groups view command and specify the event group ID. The following example command displays the details for an event group with the event group ID of 65686:
isi event groups view 65686
The system displays output similar to the following example:
ID: 65686 Started: 08/15 02:12 Causes Long: Node 2 offline Last Event: 2015-08-15T03:01:17
Ignore: No Ignore Time: Never
Resolved: Yes Ended: 08/15 02:46
Events: 6 Severity: critical
Stop alerts for a single event group
You can stop alerts for a single event group by running a script for OneFS 8.0.0.4 and earlier. This procedure provides an example for specifically excluding event 400160001. 1. Optional: Create a script with the file name, eventgroups.py:
#!/usr/bin/python import json
efile="/etc/celog/events.json" egfile="/etc/celog/eventgroups.json"
with open(efile, "r") as ef: e = json.loads(ef.read())
with open(egfile, "r") as egf: eg = json.loads(egf.read())
out = []

General cluster administration

59

symptoms = [] for eg,v in eg.iteritems():
if eg != v["name"]: print "warning.. name not same %s" % eg
out += [eg] for s in v["symptoms"]:
symptoms += [s]
for e,v in e.iteritems(): if e not in symptoms: out += [e]
print ",".join(out)
2. To stop alerts for a single event group type 400160001 (SW_AUDIT_CEE_UNREACHABLE):
a. Configure email alerts for all event groups except 400160001:
isi event channels create mychannel smtp --address [email protected] --smtp_host smtp.isilon.com isi event alert create myalert NEW mychannel --eventgroup `./eventgroups.py | sed 's/,400160001//'
b. View the alert configuration:
csmithhart-rt4x-1# isi event alert view myalert Name: myalert
Eventgroup: synciq, storage_transport, hwmon, windows_auth, overheating, physmem, reboot, powersupply, reboot-fail, avscan, filesystem, windows_idmap, external_network, toocold, windows_networking, 400050001, 400050002, 400050003, 400050004, 400110001, 900150001, 500010005, 500010004, 900130014, 500010001, 500010003, 500010002, 400140002, 900010012, 800010009, 400040012, 400140001, 600010003, 900130005, 600010001, 400060004, 900100009, 900100008, 200010002, 200010003, 600010004, 200010001, 200010006, 900100002, 900100001, 200010005, 900060033, 900060032, 900130011, 900130010, 400130001, 900100005, 900130015, 400040020, 910100007, 400210002, 900060030, 900010013, 600010005, 900060037, 910100006, 900060035, 400150003, 400150002, 400150001, 900060034, 900140001, 400150005, 400150004, 1100000008, 900100011, 900080033, 900100013, 900100016, 900100017, 900100018, 1100000001, 1100000002, 1100000003, 1100000004, 1100000005, 1100000006, 1100000007, 900130008, 900130009, 700100001, 900130004, 900100007, 900130006, 900130007, 100010009, 900040055, 400210001, 100010001, 900090023, 100010003, 100010002, 900020033, 400070004, 900100006, 400070005, 900110004, 400030002, 400120001, 400030001, 400130002, 100010018, 800010006, 100010012, 100010013, 100010010, 100010011, 600010002, 100010017, 100010014, 100010015, 900100004, 900140005, 200020003, 920100002, 400100004, 400100005, 400100002, 400100003, 400100001, 900100003, 400080001, 900010008, 920100005, 900090046, 100010029, 100010028, 100010027, 100010026, 100010025, 100010024, 100010023, 900100010, 700020003, 900060038, 700030005, 900060031, 900060036, 900100021, 900100020, 900100015, 900110005, 800010008, 100010030, 100010031, 900010003, 900010002, 900010005, 900010004, 900010007, 900010006, 900010009, 400100011, 700010005, 920100009, 400090001, 400090002, 400090003, 900060027, 900100026, 900040033, 100010045, 100010044, 900010010, 900010011, 100010041, 100010040, 100010043, 100010042, 900120005, 900120004, 900100012, 900100014, 400020002, 400020001, 900060029, 700040001, 900100022, 900100027, 700050001, 920100008, 900100019, 920100001, 920100000, 920100003, 920100004, 920100007, 920100006
Category: Sev: *
Channel: mychannel Condition: NEW

60

General cluster administration

c. Verify that alert 400160001 does not send:
/usr/bin/isi_celog/celog_send_events.py -o 400160001
d. Verify that other alerts send:
/usr/bin/isi_celog/celog_send_events.py -o 400050001
Change the status of an event group
You can ignore or resolve an event group. 1. Optional: To identify the group ID of the event group that you want modify, run the following command:
isi event groups list
2. To change the status of an event group, run the isi event groups modify command. To change the status of all event groups at once, run the isi event groups bulk command. The following example command modifies an event group with the event group ID of 7 to a status of ignored:
isi event groups modify 7 --ignored true
The following example command changes the status of all event groups to resolved:
isi event groups bulk --resolved true
View an event
You can view the details of a specific event. 1. Optional: To identify the instance ID of the event that you want to view, run the following command:
isi event events list
2. To view the details of a specific event, run the isi event events view command and specify the event instance ID. The following example command displays the details for an event with the instance ID of 3.121:
isi event events view 3.121
The system displays output similar to the following example:
ID: 3.121 Eventgroup ID: 7
Event Type: 200020001 Message: Gigabit Ethernet link ext-1 (vmx1) running below capacity Devid: 3 Lnn: 3 Time: 2015-08-04T16:02:10
Severity: warning Value: 1.0

General cluster administration

61

Managing alerts
You can view, create, modify, or delete alerts to determine the information you deliver about event groups.
View an alert
You can view the details of a specific alert. 1. Optional: To identify the alert ID of the alert that you want to view, run the following command:
isi event alerts list
2. To view the details of a specific alert, run the isi event alerts view command and specify the name of the alert. The following example command displays the details for an event with the name NewExternal:
isi event alerts view NewExternal
The name of the alert is case-sensitive. The system displays output similar to the following example:
Name: NewExternal Eventgroup: 3
Category: 200000000, 700000000, 900000000 Channel: RemoteSupport
Condition: NEW
Create a new alert
You can create new alerts to provide specific updates on event groups. Run the isi event alerts create command. The following command creates an alert named Hardware, sets the alert condition to NEW_EVENTS, and sets the channel that will broadcast the event as RemoteSupport:
isi event alerts create Hardware NEW-EVENTS --channel RemoteSupport
The following command creates an alert named ExternalNetwork, sets the alert condition to NEW, sets the source event group to the event group with the ID number 3, sets the channel that will broadcast the event as RemoteSupport, sets the severity level to critical, and sets the maximum alert limit to 10:
isi event alerts create ExternalNetwork NEW --eventgroup 3 --channel RemoteSupport -severity critical --limit 10
Modify an alert
You can modify an alert that you created. 1. Optional: To identify the name of the alert that you want to modify, run the following command:
isi event alerts list
2. Modify an alert by running the isi event alerts modify command. The following example command modifies the alert named ExternalNetwork by changing the name of the alert to ExtNetwork, adding the event group with an event group ID number of 131091, and filtering so that alerts will only be sent for event groups with a severity value of critical:
isi event alerts modify ExternalNetwork --name ExtNetwork --add-eventgroup 131091 -severity critical

62

General cluster administration

Delete an alert
You can delete alerts that you created. 1. Optional: To identify the name of the alert that you want to delete, run the following command:
isi event alerts list
2. Delete an alert by running the isi event alerts delete command. The following example command deletes the alert named ExtNetwork:
isi event alerts delete ExtNetwork
The name of the alert is case-sensitive. 3. Type yes to confirm deletion.
Managing channels
You can view, create, modify, or delete channels to determine how you deliver information about event groups.
View a channel
You can view the details of a specific channel. 1. Optional: To identify the name of the channel that you want to view, run the following command:
isi event channels list
2. To view the details of a channel, run the isi event channels view command and specify the name of the channel. The following example command displays the details for a channel with the name Support:
isi event channels view Support
The name of the channel is case-sensitive. The system displays output similar to the following example:
ID: 3 Name: Support Type: smtp
Enabled: Yes Excluded Nodes: 2
Address: [email protected] Send As: [email protected] Subject: Support Request SMTP Host: SMTP Port: 25 SMTP Use Auth: No SMTP Username: SMTP Password: SMTP Security: -
Batch: NONE Enabled: Yes Allowed Nodes: 1
Create a channel
You can create and configure new channels to send out alert information. You can configure a channel to deliver alerts with one of the following:  SMTP  SNMP

General cluster administration

63

 Connect Home Run the isi event channels create command to create an events channel. The auth-protocol can be SHA or MD5, and the priv-protocol can be AES or DES. The command must include the following information:
$ isi event channels create snmpchannel snmp --host <snmptrap daemon ip address> $ isi event channels modify snmpchannel --use-snmp-trap True --snmp-use-v3 True $ isi event channels modify snmpchannel --snmp-auth-protocol SHA --snmp-auth-password mypassword1 $ isi event channels modify snmpchannel --snmp-priv-protocol AES --snmp-priv-password mypassword2 $ isi event channels modify snmpchannel --snmp-engine-id 0x80002F5C809104C90F67E95B4C $ isi event channels modify snmpchannel --snmp-security-level authPriv --snmp-securityname v3user
Modify a channel
You can modify a channel that you created. 1. Optional: To identify the name of the channel that you want to modify, run the following command:
isi event channels list
2. Modify a channel by running the isi event channels modify command. The following example command modifies the channel named Support by changing the send-from email address to [email protected]:
isi event channels modify Support --send-as [email protected]
The following example command modifies the channel named Support by changing the SMTP username to admin, and the SMTP password to p@ssword:
isi event channels modify Support --smtp-username admin --smtp-password p@ssword
Delete a channel
You can delete channels that you created. You will not be able to delete a channel that is currently in use by an alert. Remove a channel from an alert by running the isi event alerts modify command. 1. Optional: To identify the name of the channel that you want to delete, run the following command:
isi event channels list
2. Delete a channel by running the isi event channels delete command. The following example command deletes the alert named Support:
isi event channels delete Support
The name of the channel is case-sensitive. 3. Type yes to confirm deletion.
Maintenance and testing
You can modify event settings to specify retention and storage limits for event data, schedule maintenance history windows, and send test events.

64

General cluster administration

Event data retention and storage limits
You can modify settings to determine how event data is handled on your cluster. By default, data related to resolved event groups is retained indefinitely. You can set a retention limit to make the system automatically delete resolved event group data after a certain number of days. You can also limit the amount of memory that event data can occupy on your cluster. By default, the limit is 1 megabyte of memory for every 1 terabyte of total memory on the cluster. You can adjust this limit to be between 1 and 100 megabytes of memory. For smaller clusters, the minimum amount of memory that will be set aside is 1 gigabyte. When your cluster reaches a storage limit, the system will begin deleting the oldest event group data to accommodate new data.
View event storage settings
You can view your storage and maintenance settings. To view , run the isi event settings view command. The system displays output similar to the following example:
Retention Days: 90 Storage Limit: 1
Maintenance Start: 2015-08-05T08:00:00 Maintenance Duration: 4H
Heartbeat Interval: daily
Modify event storage settings
You can modify your storage and maintenance settings. Modify your settings by running the isi event settings modify command. The following example command changes the number of days that resolved event groups are saved to 120:
isi event settings modify --retention-days 120
The following example command changes the storage limit for event data to 5 MB for every 1 TB of total cluster storage:
isi event settings modify --storage-limit 5
Maintenance windows
You can schedule a maintenance window by setting a maintenance start time and duration. During a scheduled maintenance window, the system will continue to log events, but no alerts will be generated. Scheduling a maintenance window will keep channels from being flooded by benign alerts associated with cluster maintenance procedures. Active event groups will automatically resume generating alerts when the scheduled maintenance period ends.
Schedule a maintenance window
You can schedule a maintenance window to discontinue alerts while you are performing maintenance on your cluster. Schedule a maintenance window by running the isi event settings modify command. The following example command schedules a maintenance window that begins on September 1, 2015 at 11:00pm and lasts for two days:
isi event settings modify --maintenance-start 2015-09-01T23:00:00 --maintenance-duration 2D

General cluster administration

65

Test events and alerts
Test events called heartbeat events are automatically generated. You can also manually generate test alerts. In order to confirm that the system is operating correctly, test events are automatically sent every day, one event from each node in your cluster. These are referred to as heartbeat events and are reported to an event group named Heartbeat Event. To test the configuration of channels, you can manually send a test alert through the system.
Create a test alert
You can manually generate a test alert. Manually generate a test alert by running the isi event test create command. The following example command creates a test alert with the message Test message:
isi event test create "Test message"

Modify the heartbeat event
You can change the frequency that a heartbeat event is generated. This procedure is available only through the command-line interface. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Modify the heartbeat event interval by running the isi event settings modify command.
The following example command modifies the heartbeat event so that it is sent on a weekly basis:
isi event settings modify --heartbeat-interval weekly

Managing event thresholds
You can list, modify, reset, and view alert thresholds for events that use percentage-based statistics to generate alerts.

List events with configurable thresholds

You can view a list of the events that have configurable thresholds. You must have the ISI_PRIV_EVENT privilege to run this command. Run the isi event thresholds list command. A list of configurable event IDs and their corresponding names appears, similar to the following sample output.

ID

ID Name

-------------------------------

100010001 SYS_DISK_VARFULL

100010002 SYS_DISK_VARCRASHFULL

100010003 SYS_DISK_ROOTFULL

100010015 SYS_DISK_POOLFULL

100010018 SYS_DISK_SSDFULL

600010005 SNAP_RESERVE_FULL

800010006 FILESYS_FDUSAGE

-------------------------------

Modify thresholds for a specific event
You can modify the event thresholds for a specific event. You modify the threshold for a specific reporting level as a percentage: 0 to 100. Reporting levels are info, warn, critical, and emergency.

66

General cluster administration

You must have the ISI_PRIV_EVENT privilege and the event ID. 1. Optional: To identify the event ID for which to modify thresholds, run the following command:
isi event thresholds list
2. Modify an event threshold by running the isi event thresholds modify command. Specify the event ID and the reporting level you want to modify. The following example command modifies the SYS_DISK_VARFULL warning threshold to 65%. isi event thresholds 100010001 --warn 65
Reset a specified event's thresholds to the default values
You can revert the event thresholds for a specific event to the default values. You can reset the threshold for specific reporting levels, or for all reporting levels using the -all flag. Reporting levels are info, warn, critical, and emergency. You must have the ISI_PRIV_EVENT privilege and the event ID. 1. Optional: To identify the event ID for which to reset threshold values, run the following command:
isi event thresholds list
2. Reset an event threshold by running the isi event thresholds reset command. Specify the event ID and the reporting level you want to reset. For example, the following command resets the critical reporting level for the SYS_DISK_VARFULL threshold to its default value. isi event thresholds reset 100010001 --critical
View threshold details for a specific event
You can view the existing event thresholds for a specific event. You must have the ISI_PRIV_EVENT privilege. 1. Optional: To identify the event ID for which to view threshold details, run the following command:
isi event thresholds list
2. View threshold details for a specific event by running the following command: isi event thresholds view <event_ID>
The thresholds for the event appear. The following example command shows the threshold details for the SYS_DISK_VARFULL event.
isi event thresholds view 100010001 ID: 100010001
ID Name: SYS_DISK_VARFULL Description: Percentage at which /var partition is near capacity
Defaults: info (75%), warn (85%), crit (90%) Thresholds: info (50%), warn (65%), crit (80%)

General cluster administration

67

Security hardening
Security hardening is the process of configuring a system to reduce or eliminate as many security risks to support the Security Technical Implementation Guides (STIGs). When you apply a hardening profile on a PowerScale cluster, OneFS reads the security profile file and applies the configuration that is defined in the profile to the cluster. If required, OneFS identifies configuration issues that prevent hardening on the nodes. For example, the file permissions on a particular directory might not be set to the expected value, or the required directories might be missing. When an issue is found, you can choose to allow OneFS to resolve the issue, or you can defer resolution and fix the issue manually.
NOTE: The intention of the hardening profile is to support STIGs defined by the Defense Information Systems Agency (DISA) and applicable to OneFS. The hardening profile only supports a subset of requirements that are defined by DISA in STIGs. The hardening profile is meant to be primarily used in Federal accounts.
If you determine that the hardening configuration is not right for your system, OneFS allows you to revert the security hardening profile. Reverting a hardening profile returns OneFS to the configuration before hardening. You must have an active security hardening license and be logged in to the PowerScale cluster as the root user to apply hardening to OneFS. To obtain a license, contact your PowerScale sales representative.
STIG hardening profile
The OneFS STIG hardening profile contains a subset of the configuration requirements set by the Department of Defense. The STIG hardening profile is designed for PowerScale clusters that support Federal Government accounts. A PowerScale cluster that is installed with a STIG profile relies on the surrounding ecosystem being secure. After you apply the OneFS STIG hardening profile, the OneFS configuration is modified to make the PowerScale cluster more secure and to support controls that are defined by the DISA STIGs. Some examples of the many system changes are as follows:  After you log in through SSH or the web interface, the system displays a message that you are accessing a U.S. Government
Information System and displays the terms and conditions of using the system.  On each node, SSH and the web interface listen only on the node's external IP address.  Password complexity requirements increase for local user accounts. Passwords must be at least 14 characters and contain at
least one of each of the following character types: numeric, uppercase, lowercase, symbol.  Root SSH is disabled. You can log in as root only through the web interface or through a serial console session.  The system umask is changed to 077 by default.  Restricted access to device config and system files.  Basic authentication is disabled by default.
Apply a security hardening profile
You can apply the OneFS STIG hardening profile to the PowerScale cluster. Security hardening requires root privileges and can be performed only through the command-line interface. Once hardening has been successfully applied to the cluster, root SSH is not allowed on a hardened cluster. To log in as the root user on a hardened cluster, you must connect through the web interface or a serial console session. You must have an active security hardening license to apply a hardening profile to OneFS. To obtain a license, contact your PowerScale sales representative. 1. Open a secure shell (SSH) connection to any node in the cluster and log in as root. 2. Run the isi hardening apply command.
The following command directs OneFS to apply the hardening profile to the PowerScale cluster.
isi hardening apply --profile=STIG
NOTE: STIG is a tag, not a file.
OneFS checks whether the system contains any configuration issues that must be resolved before hardening can be applied.  If OneFS does not encounter any issues, the hardening profile is applied.  If OneFS encounters issues, the system displays output similar to the following example:

68

General cluster administration

Found the following Issue(s) on the cluster: Issue #1 (PowerScale Control_id:isi_GEN001200_01) Node: test-cluster-2 1: /etc/syslog.conf: Actual permission 0664; Expected permission 0654
Issue #2 (PowerScale Control_id:isi_GEN001200_02) Node: test-cluster-3 1: /usr/bin/passwd: Actual permission 4555; Expected permission 0555 2: /usr/bin/yppasswd: Actual permission 4555; Expected permission 0555 Node: test-cluster-2 1: /usr/bin/passwd: Actual permission 4555; Expected permission 0555 2: /usr/bin/yppasswd: Actual permission 4555; Expected permission 0555
Total: 2 issue(s) Do you want to resolve the issue(s)?[Y/N]:
3. Resolve any configuration issues. At the prompt Do you want to resolve the issue(s)?[Y/N], choose one of the following actions:  To allow OneFS to resolve all issues, type Y. OneFS fixes the issues and then applies the hardening profile.  To defer resolution and fix all of the found issues manually, type N. After you have fixed all of the deferred issues, run the isi hardening apply command again.
NOTE: If OneFS encounters an issue that is considered catastrophic, the system prompts you to resolve the issue manually. OneFS cannot resolve a catastrophic issue.
Revert a security hardening profile
You can revert a hardening profile that has been applied to the PowerScale cluster.
Reverting security hardening requires root privileges and can be performed only through the command-line interface. To log in as the root user on a hardened cluster, you must connect through a serial console session. Root SSH is not allowed on a hardened cluster.
You must have an active security hardening license to revert a hardening profile on OneFS. To obtain a license, contact your PowerScale sales representative.
1. Open a serial console session on any node in the cluster and log in as root. 2. Run the isi hardening revert command.
OneFS checks whether the system is in an expected state.  If OneFS does not encounter any issues, the hardening profile is reverted.  If OneFS encounters any issues, the system displays output similar to the following example:
Found the following Issue(s) on the cluster: Issue #1 (PowerScale Control_id:isi_GEN001200_01) Node: test-cluster-2 1: /etc/syslog.conf: Actual permission 0664; Expected permission 0654
Issue #2 (PowerScale Control_id:isi_GEN001200_02) Node: test-cluster-3 1: /usr/bin/passwd: Actual permission 4555; Expected permission 0555 2: /usr/bin/yppasswd: Actual permission 4555; Expected permission 0555 Node: test-cluster-2 1: /usr/bin/passwd: Actual permission 4555; Expected permission 0555 2: /usr/bin/yppasswd: Actual permission 4555; Expected permission 0555
Total: 2 issue(s) Do you want to resolve the issue(s)?[Y/N]:
3. Resolve any configuration issues. At the prompt Do you want to resolve the issue(s)?[Y/N], choose one of the following actions:  To allow OneFS to resolve all issues, type Y. OneFS sets the affected configurations to the expected state and then reverts the hardening profile.  To defer resolution and fix all of the found issues manually, type N. OneFS halts the revert process until all of the issues are fixed. After you have fixed all of the deferred issues, run the isi hardening revert command again.

General cluster administration

69

NOTE: If OneFS encounters an issue that is considered catastrophic, the system will prompt you to resolve the issue manually. OneFS cannot resolve a catastrophic issue.
View the security hardening status
You can view the security hardening status of the PowerScale cluster and each cluster node. A cluster is not considered hardened until all of its nodes are hardened. During the hardening process, if OneFS encounters issues that must be resolved manually, or if you defer issues to resolve them manually, the nodes on which the issues occur are not hardened until the issues are resolved and the hardening profile is applied successfully. If you need help resolving these issues, contact PowerScale Technical Support. Viewing the security hardening status of the cluster requires root privileges and can be performed only through the commandline interface. To log in as the root user on a hardened cluster, you must connect through a serial console session. Root SSH is not allowed on a hardened cluster. You do not need a security hardening license to view the hardening status of the cluster. 1. Open a console session on any node in the cluster and log in as root. 2. Run the isi hardening status command to view the status of security hardening on the PowerScale cluster and each
of the nodes. The system displays output similar to the following example:
Cluster Name: test-cluster Hardening Status: Not Hardened Profile: STIG Node status: test-cluster-1: Disabled test-cluster-2: Enabled test-cluster-3: Enabled
Cluster configuration backup and restore
This section contains the following topics:
Cluster configuration backup and restore
The import cluster configuration feature supports restore and automatic cluster setup scenarios, and the export cluster configuration supports backup and auditing scenarios. The configuration backup and restore feature provides two major functions:  Full OneFS cluster-wide configuration backup  Full OneFS cluster-wide configuration restore The import or export features enables you to create a definition of what a cluster should be (from backup or template) and apply that definition to a cluster (for restoration purpose) where the backup is generated or to other clusters with the same hardware configuration. The definition to the cluster is indicated with specific variables, such as cluster name, node names, and IP addresses. The variables are modified on a per cluster basis. Currently OneFS supports the backup and restore of http, quota, snapshot, nfs, smb, s3, ndmp configurations. You can create, view, and list export or import tasks.
Create export task
You can create an export task. Before performing this task, you must:  Add licenses like quota, snapshot  Start services like NFS, SMB that are disabled by default.

70

General cluster administration

The default value to create a export task is "all" and is optional. All components that are currently supported are http, quota, snapshot, nfs, smb, s3, and ndmp. 1. Run the isi cluster config exports create command.
The following message appears:
Are you sure you want to export cluster configuration? (yes/[no]):
2. Enter "yes". A new export task similar to the following is created:
Created export task 'Tjolley-ga9dy9j-20210218065606'

View list of export tasks

You can view a list of configuration export tasks.
Run the isi cluster config exports list command. A list of export tasks similar to the following appears:

ID Status Done Failed Pending Message

Path

-----------------------------------------------------------------------------------------

------------------------------------------------------------

Tjolley-ga9dy9j-20210218065606 Failed ['http', 'quota', 'snapshot', 'nfs', 'smb', 's3',

'ndmp'] ['quota', 'snapshot'] []

Components: ['quota', 'snapshot'] are not

licensed. /ifs/data/Isilon_Support/config_mgr/backup/Tjolley-ga9dy9j-20210218065606

-----------------------------------------------------------------------------------------

------------------------------------------------------------

Total: 1

View export task
You can view the details of an export task.
Run the isi cluster config exports view <id> command. The following example command lets you view the details of the export task:
isi cluster config exports view Tjolley-ga9dy9j-20210218065606 ID: Tjolley-ga9dy9j-20210218065606
Status: Failed Done: ['http', 'quota', 'snapshot', 'nfs', 'smb', 's3', 'ndmp']
Failed: ['quota', 'snapshot'] Pending: [] Message: Components: ['quota', 'snapshot'] are not licensed.
Path: /ifs/data/Isilon_Support/config_mgr/backup/Tjolley-ga9dy9j-20210218065606

Create import task
You can create a new import task. Before performing this task, you must:  Add licenses like quota, snapshot  Start services like NFS, SMB that are disabled by default  Create directory for NFS exports, snapshot, quota  Create an S3 user 1. Run the isi cluster config imports create <export-id> command.
Run the following example command to create an import task.
isi cluster config imports create Tjolley-ga9dy9j-20210218065606

General cluster administration

71

The following message appears:
Are you sure you want to import cluster configuration? (yes/[no]):
2. Enter "yes". The following message appears indicating that the new import task is created.
This may take a few seconds, please wait a moment Created import task 'Tjolley-ga9dy9j-20210218083543'

View list of import tasks

You can view a list of configuration import tasks.
Run the isi cluster config exports list command. A list of import tasks similar to the following appears:

ID

Export ID

Status Done

Failed

Pending

Message

Path

-----------------------------------------------------------------------------------------

-------------------------------------------------------------------

Tjolley-ga9dy9j-20210218083543 Tjolley-ga9dy9j-20210218065606 Failed ['http', 'quota',

'snapshot', 'nfs', 'smb', 's3', 'ndmp'] ['quota', 'snapshot'] []

Components:

['quota', 'snapshot'] are not licensed. /ifs/data/Isilon_Support/config_mgr/restore/

Tjolley-ga9dy9j-20210218083543

-----------------------------------------------------------------------------------------

-------------------------------------------------------------------

Total: 1

View import task
You can view the details of an import task.
Run the isi cluster config imports view <id> command. The following example command lets you view the details of the import task:
isi cluster config imports view Tjolley-ga9dy9j-20210218083543 ID: Tjolley-ga9dy9j-20210218083543 Export ID: Tjolley-ga9dy9j-20210218065606 Status: Failed Done: ['http', 'quota', 'snapshot', 'nfs', 'smb', 's3', 'ndmp'] Failed: ['quota', 'snapshot'] Pending: [] Message: Components: ['quota', 'snapshot'] are not licensed. Path: /ifs/data/Isilon_Support/config_mgr/restore/Tjolley-ga9dy9j-202102180

Stop export or import tasks
You can stop the export or import tasks that are running. 1. Run the isi_config_stop command.
The following message appears:
Stop all export/import tasks? (y/n [n])
2. Enter "y" to stop all the tasks that are running. The tasks are stopped and the following message appears:
Done

72

General cluster administration

Cluster monitoring
You can view health and status information for the PowerScale cluster and monitor cluster and node performance. Run the isi status command to review the following information:  Cluster, node, and drive health  Storage data such as size and amount used  Data reduction ratio  IP addresses  Throughput  Critical events  Job status Additional commands are available to review performance information for the following areas:  General cluster statistics  Statistics by protocol or by clients connected to the cluster  Performance data by drive  Historical performance data Advanced performance monitoring and analytics are available through the InsightIQ module, which requires you to activate a separate license. For more information about optional software modules, contact your Dell Technologies account representative.
Monitor the cluster
You can monitor the health and performance of a cluster with charts and tables. Run the following command:
isi status
View node status
You can view the status of a node. Optional: Run the isi status command: The following command displays information about a node with a logical node number (LNN) of 1:
isi status -n 1
Monitoring cluster hardware
You can manually check the status of hardware on the PowerScale cluster as well as enable SNMP to remotely monitor components.
View node hardware status
You can view the hardware status of a node. 1. Click Dashboard > Cluster Overview > Cluster Status. 2. Optional: In the Status area, click the ID number for a node. 3. In the Chassis and drive status area, click Platform.

General cluster administration

73

Chassis and drive states

You can view chassis and drive state details.
In a cluster, the combination of nodes in different degraded states determines whether read requests, write requests, or both work. A cluster can lose write quorum but keep read quorum. OneFS provides details about the status of chassis and drives in your cluster. The following table describes all the possible states that you may encounter in your cluster.

State HEALTHY L3 SMARTFAIL or Smartfail or restripe in progress NOT AVAILABLE
SUSPENDED
NOT IN USE REPLACE STALLED
NEW USED
PREPARING EMPTY

Description

Interface

Error state

All drives in the node are functioning correctly.

Command-line interface, web administration interface

A solid state drive (SSD) was deployed as level 3 Command-line interface (L3) cache to increase the size of cache memory and improve throughput speeds.

The drive is in the process of being removed safely from the file system, either because of an I/O error or by user request. Nodes or drives in a smartfail or read-only state affect only write quorum.

Command-line interface, web administration interface

A drive is unavailable for a variety of reasons.

Command-line

X

You can click the bay to view detailed information interface, web

about this condition.

administration interface

NOTE: In the web administration interface,

this state includes the ERASE and

SED_ERROR command-line interface states.

This state indicates that drive activity is temporarily suspended and the drive is not in use. The state is manually initiated and does not occur during normal cluster activity.

Command-line interface, web administration interface

A node in an offline state affects both read and write quorum.

Command-line interface, web administration interface

The drive was smartfailed successfully and is ready to be replaced.

Command-line interface only

The drive is stalled and undergoing stall evaluation. Stall evaluation is the process of checking drives that are slow or having other issues. Depending on the outcome of the evaluation, the drive may return to service or be smartfailed. This is a transient state.

Command-line interface only

The drive is new and blank. This is the state that a drive is in when you run the isi dev command with the -a add option.

Command-line interface only

The drive was added and contained a PowerScaleGUID but the drive is not from this node. This drive likely will be formatted into the cluster.

Command-line interface only

The drive is undergoing a format operation. The drive state changes to HEALTHY when the format is successful.

Command-line interface only

No drive is in this bay.

Command-line interface only

74

General cluster administration

State WRONG_TYPE BOOT_DRIVE SED_ERROR ERASE
INSECURE
UNENCRYPTED

Description

Interface

Error state

The drive type is wrong for this node. For example, a non-SED drive in a SED node, SAS instead of the expected SATA drive type.

Command-line interface only

Unique to the A100 drive, which has boot drives Command-line interface

in its bays.

only

The drive cannot be acknowledged by the OneFS Command-line

X

system.

interface, web

NOTE: In the web administration interface, administration interface

this state is included in Not available.

The drive is ready for removal but needs your attention because the data has not been erased. You can erase the drive manually to guarantee that data is removed.
NOTE: In the web administration interface,
this state is included in Not available.

Command-line interface only

Data on the self-encrypted drive is accessible

Command-line interface X

by unauthorized personnel. Self-encrypting drives only

should never be used for non-encrypted data

purposes.

NOTE: In the web administration interface,

this state is labeled Unencrypted SED.

Data on the self-encrypted drive is accessible

Web administration

X

by unauthorized personnel. Self-encrypting drives interface only

should never be used for non-encrypted data

purposes.

NOTE: In the command-line interface, this

state is labeled INSECURE.

Check battery status

You can monitor the status of NVRAM batteries and charging systems. This task may only be performed at the OneFS command-line interface on node hardware that supports the command.
1. Open an SSH connection to any node in the cluster.
2. Run the isi batterystatus list command to view the status of all NVRAM batteries and charging systems on the node.
The system displays output similar to the following example:

Lnn Status1 Status2 Result1 Result2

----------------------------------------

1

Good

Good

-

-

2

Good

Good

-

-

3

Good

Good

-

-

----------------------------------------

SNMP monitoring
You can use SNMP to remotely monitor the PowerScale cluster hardware components, such as fans, hardware sensors, power supplies, and disks. Use the default Linux SNMP tools or a GUI-based SNMP tool of your choice for this purpose.
SNMP is enabled or disabled cluster wide: nodes are not configured individually. You can monitor cluster information from any node in the cluster. Generated SNMP traps correspond to CELOG events. You can also send SNMP notifications using the command isi event channels create snmpchannel snmp --use-snmp-trap false.

General cluster administration

75

You can configure an event notification rule that specifies the network station where you want to send SNMP traps for specific events. When the specific event occurs, the cluster sends the trap to that server. OneFS supports SNMP version 2c (default), and SNMP version 3 in read-only mode.
OneFS does not support SNMP version 1. Although the command isi snmp settings modify includes the option --snmp-v1-v2-access, OneFS monitors only through SNMP version 2c.
You can configure settings for SNMP version 3 alone or for both SNMP version 2c and version 3.

NOTE: All SNMP v3 security levels are configurable: noAuthNoPriv, authNoPriv, authPriv.

Elements in an SNMP hierarchy are arranged in a tree structure, similar to a directory tree. As with directories, identifiers move from general to specific as the string progresses from left to right. Unlike a file hierarchy, however, each element is not only named, but also numbered.
For example, the SNMP entity iso.org.dod.internet.private.enterprises.powerscale.cluster.clusterStatus.clusterName.0 maps to .1.3.6.1.4.1.12124.1.1.1.0. The element 12124 refers to the OneFS SNMP namespace. Anything further to the right of that number is related to OneFS-specific monitoring.
Management Information Base (MIB) documents define human-readable names for managed objects and specify their datatypes and other properties. You can download MIBs that are created for SNMP-monitoring of a PowerScale cluster from the OneFS web administration interface or manage them using the command-line interface (CLI). MIBs are stored in /usr/share/snmp/ mibs/ on a OneFS node. The OneFS ISILON-MIBs serve two purposes:
 Augment the information available in standard MIBs.  Provide OneFS-specific information that is unavailable in standard MIBs.
ISILON-MIB is a registered enterprise MIB. PowerScale clusters have two separate MIBs:

ISILON-MIB
ISILON-TRAPMIB

Defines a group of SNMP agents that respond to queries from a network monitoring system (NMS) called OneFS Statistics Snapshot agents. These agents snapshot the state of the OneFS file system at the time that it receives a request and reports this information back to the NMS.
Generates SNMP traps to send to an SNMP monitoring station when the circumstances occur that are defined in the trap protocol data units (PDUs).

The OneFS MIB files map the OneFS-specific object IDs with descriptions. Download or copy MIB files to a directory where your SNMP tool can find them, such as /usr/share/snmp/mibs/.
To enable Net-SNMP tools to read the MIBs to provide automatic name-to-OID mapping, add -m All to the command, as in the following example:

snmpwalk -v2c I$ilonpublic -m All <node IP> isilon

During SNMP configuration, it is recommended that you change the mapping to something similar to the following: isi snmp settings modify -c <newcommunitystring>

If the MIB files are not in the default Net-SNMP MIB directory, specify the full path, as in the following example. All three lines are a single command.

snmpwalk -m /usr/local/share/snmp/mibs/ISILON-MIB.txt:/usr \ /share/snmp/mibs/ISILON-TRAP-MIB.txt:/usr/share/snmp/mibs \ /ONEFS-TRAP-MIB.txt -v2c -C c -c public isilon

NOTE: The previous examples are run from the snmpwalk command on a cluster. Your SNMP version may require different arguments.

Managing SNMP settings
You can use SNMP to monitor cluster hardware and system information. You can configure settings through either the web administration interface or the command-line interface. The default SNMP v3 username (general) and password can be changed to anything from the CLI or the WebUI. The username is only required when SNMP v3 is enabled and making SNMP v3 queries.

76

General cluster administration

Configure a network monitoring system (NMS) to query each node directly through a static IPv4 address. If a node is configured for IPv6, you can communicate with SNMP over IPv6. The SNMP proxy is enabled by default, and the SNMP implementation on each node is configured automatically to proxy for all other nodes in the cluster except itself. This proxy configuration allows the PowerScale Management Information Base (MIB) and standard MIBs to be exposed seamlessly by using context strings for supported SNMP versions. This approach allows you to query a node through another node by appending _node_<node number> to the community string of the query. For example, snmpwalk -m /usr/share/snmp/mibs/ISILON-MIB.txt -v 2c -c 'I$ilonpublic_node_1' localhost <nodename>.
Configure SNMP settings
You can configure SNMP monitoring settings. NOTE: When SNMP v3 is used, OneFS requires the SNMP-specific security level of AuthNoPriv as the default value when querying the PowerScale cluster. The security level AuthPriv is not supported.
 The following isi snmp settings modify command enables SNMP v3 access:
isi snmp settings modify --snmp-v3-access=yes
 The following isi snmp settings modify command configures the security level, the authentication password and protocol, and the privacy password and protocol:
isi snmp settings modify --help ... [--snmp-v3-access ] [{--snmp-v3-read-only-user | -u} ] [--snmp-v3-auth-protocol (SHA | MD5)] [--snmp-v3-priv-protocol (AES | DES)] [--snmp-v3-security-level (noAuthNoPriv | authNoPriv | authPriv)] [{--snmp-v3-password | -p} ] [--snmp-v3-priv-password ] [--set-snmp-v3-password] [--set-snmp-v3-priv-password]
Configure the cluster for SNMP monitoring
You can configure your PowerScale cluster to remotely monitor hardware components using SNMP. 1. Enable SNMP monitoring.
Run the following command to enable SNMP v2c and to set the SNMP community name. The default community name is I$ilonpublic. Ensure that you set the default community name when enabling SNMP v2c. OneFS supports SNMP v2c and later.
isi snmp settings modify --snmp-v1-v2c-access yes -c <community_name>
Run the following command to enable SNMP v3 and set the SNMPv3 private password:
isi snmp settings modify --snmp-v3-access yes --snmp-v3-priv-password <your_private_password>
2. Start the SNMP service.
isi snmp settings modify --service yes
3. Download the MIB file you want to use (base or trap). Follow the download process that is specific to your browser.
4. Copy the MIB files to a directory where your SNMP tool can find them, such as /usr/share/snmp/mibs/. To have Net-SNMP tools read the MIBs to provide automatic name-to-OID mapping, add -m All to the command, as in the following example:
snmpwalk -v2c -c public -m All <node IP> isilon
5. Configure SNMP v3 settings.

General cluster administration

77

a. Change the default SNMPv3 security name and password for the read-only user. The default read-only user is general. The default password is password. It is recommended that you set the community name from the default. Ensure that you change both the SNMPv3 authentication and privacy passwords to improve security. The password must contain at least eight characters and no spaces.
isi snmp settings modify -u <readonly-user> --snmp-v3-password <readonly-password>
b. Set a system contact and a cluster description for reporting purposes. For example, to set the system contact to SysAdmin and cluster description to Production:
isi snmp settings modify --system-contact SysAdmin --system-location Production
View SNMP settings
You can review SNMP monitoring settings.  Run the following command:
isi snmp settings view
This is an example of the output generated by the command:
$ isi snmp settings view System Location: unset System Contact: [email protected]
SNMP V1 V2C Access: Yes Read Only Community: I$ilonpublic
SNMP V3 Access: No SNMP V3 Read Only User: general
SNMP V3 Auth Protocol: MD5 SNMP V3 Priv Protocol: AES SNMP V3 Security Level: authNoPriv
SNMP Service Enabled: Yes
Cluster maintenance
Trained service personnel can replace or upgrade components in PowerScale nodes. Dell EMC PowerScale Technical Support can assist you with replacing node components or upgrading components to increase performance.
Replacing node components
If a node component fails, PowerScale Technical Support will work with you to quickly replace the component and return the node to a healthy status. Trained service personnel can replace the following field replaceable units (FRUs):  battery  boot flash drive  SATA/SAS Drive  memory (DIMM)  fan  front panel  intrusion switch  network interface card (NIC)  InfiniBand card  NVRAM card  SAS controller

78

General cluster administration

 power supply If you configure your cluster to send alerts to PowerScale, PowerScale Technical Support will contact you if a component needs to be replaced. If you do not configure your cluster to send alerts to PowerScale, you must initiate a service request.
Upgrading node components
You can upgrade node components to gain additional capacity or performance. Trained service personnel can upgrade the following components in the field:  drive  memory (DIMM)  network interface card (NIC) If you want to upgrade components in your nodes, contact PowerScale Technical Support.
Automatic Replacement Recognition (ARR) for drives
When a drive is replaced in a node, OneFS automatically formats and adds the drive to the cluster. If you are replacing a drive in a node, either to upgrade the drive or to replace a failed drive, you do not need to take additional actions to add the drive to the cluster. OneFS will automatically format the drive and add it. ARR will also automatically update the firmware on the new drive to match the current drive support package installed on the cluster. Drive firmware will not be updated for the entire cluster, only for the new drive. If you prefer to format and add drives manually, you can disable ARR.
View Automatic Replacement Recognition (ARR) status
You can confirm whether ARR is enabled on your cluster. 1. To confirm whether ARR is enabled on your cluster, run the following command:
isi devices config view --node-lnn all
The system displays configuration information for each node by Logical Node Number (LNN). As part of the configuration display, you will see the ARR status for each node:
Automatic Replacement Recognition: Enabled : True
2. To view the ARR status of a specific node, run the isi devices config view command and specify the LNN of the node you want to view. If you don't specify a node LNN, the configuration information for the local node you are connecting through will display. The following example command displays the ARR status for the node with the LNN of 2:
isi devices config view --node-lnn 2
Enable or Disable Automatic Replacement Recognition (ARR)
You can enable or disable ARR for your entire cluster, or just for specific nodes. By default, ARR is enabled on all nodes. 1. To disable ARR for your entire cluster, run the following command:
isi devices config modify --automatic-replacement-recognition no

General cluster administration

79

2. To enable ARR for your entire cluster, run the following command:
isi devices config modify --automatic-replacement-recognition yes
3. To disable ARR for a specific node, run the isi devices config modify command with the ARR parameter and specify the LNN of the node. If you don't specify a node LNN, the command will be applied to the entire cluster. The following example command disables ARR for the node with the LNN of 2:
isi devices config modify --automatic-replacement-recognition no --node-lnn 2
NOTE: We recommend that you keep your ARR settings consistent across all nodes. Changing ARR settings on specific nodes can lead to confusion during drive maintenance.
4. To enable ARR for a specific node, run the isi devices config modify command with the ARR parameter and specify the LNN of the node. If you don't specify a node LNN, the command will be applied to the entire cluster. The following example command enables ARR for the node with the LNN of 2:
isi devices config modify --automatic-replacement-recognition yes --node-lnn 2
Managing drive firmware
If the firmware of any drive in a cluster becomes obsolete, the cluster performance or hardware reliability might be affected. To ensure overall data integrity, update the drive firmware to the latest revision by installing the drive support package. Determine whether the drive firmware on your cluster is the latest revision by viewing the status of the drive firmware.
NOTE: It is recommended that you contact PowerScale Technical Support before updating the drive firmware.
Drive firmware update overview
You can update the drive firmware through drive support packages. Download and install the drive support package from Online Support depending on the OneFS version running on your cluster and the type of drives on the nodes.
Drive Support Package
For clusters running OneFS 7.1.1 and later, install a drive support package to update the drive firmware. You do not have to reboot the affected nodes to complete the firmware update. A drive support package provides the following additional capabilities:  Updates the following drive configuration information:
 List of supported drives  Drive firmware metadata  SSD wear monitoring data  SAS and SATA settings and attributes  Automatically updates the drive firmware for new and replacement drives to the latest revision before those drives are formatted and used in a cluster. This update is applicable only for clusters running OneFS 7.2 and later.
NOTE: Firmware of drives in use cannot be updated automatically.

80

General cluster administration

Install a drive support package
The following instructions are for performing a non-disruptive firmware update (NDFU) with a drive support package (DSP).
CAUTION: Please refer to the table in the System requirements section above to confirm that you are performing the correct procedure for your node type and OneFS version.
1. Go to the Dell EMC Support page that lists all the available versions of the drive support package. 2. Click the latest version of the drive support package and download the file.
NOTE: If you are unable to download the package, contact PowerScale Technical Support for assistance.
3. Open a secure shell (SSH) connection to any node in the cluster and log in. 4. Create or check for the availability of the directory structure /ifs/data/Isilon_Support/dsp. 5. Copy the downloaded file to the dsp directory through SCP, FTP, SMB, NFS, or any other supported data-access protocols. 6. Unpack the file by running the following command:
tar -zxvf Drive_Support_<version>.tgz 7. Install the package by running the following command:
isi_dsp_install Drive_Support_<version>.tar NOTE:  You must run the isi_dsp_install command to install the drive support package. Do not use the isi pkg command.  Running isi_dsp_install will install the drive support package on the entire cluster.  The installation process takes care of installing all the necessary files from the drive support package followed by the uninstallation of the package. You do not need to delete the package after its installation or prior to installing a later version.

Drive firmware status information

You can view information about the status of the drive firmware through the OneFS command-line interface. The following example shows the output of the isi devices drive firmware list command:

your-cluster-1# isi devices drive firmware list

Lnn Location Firmware Desired Model

------------------------------------------------------

2

Bay 1

A204

-

HGST HUSMM1680ASS200

2

Bay 2

A204

-

HGST HUSMM1680ASS200

2

Bay 3

MFAOABW0 MFAOAC50 HGST HUS724040ALA640

2

Bay 4

MFAOABW0 MFAOAC50 HGST HUS724040ALA640

2

Bay 5

MFAOABW0 MFAOAC50 HGST HUS724040ALA640

2

Bay 6

MFAOABW0 MFAOAC50 HGST HUS724040ALA640

2

Bay 7

MFAOABW0 MFAOAC50 HGST HUS724040ALA640

2

Bay 8

MFAOABW0 MFAOAC50 HGST HUS724040ALA640

2

Bay 9

MFAOABW0 MFAOAC50 HGST HUS724040ALA640

2

Bay 10

MFAOABW0 MFAOAC50 HGST HUS724040ALA640

2

Bay 11

MFAOABW0 MFAOAC50 HGST HUS724040ALA640

2

Bay 12

MFAOABW0 MFAOAC50 HGST HUS724040ALA640

------------------------------------------------------

Total: 12

Where: LNN Location Firmware Desired

Displays the LNN for the node that contains the drive. Displays the bay number where the drive is installed. Displays the version number of the firmware currently running on the drive. If the drive firmware should be upgraded, displays the version number of the drive firmware that the firmware should be updated to.

General cluster administration

81

Model

Displays the model number of the drive.

NOTE: The isi devices drive firmware list command displays firmware information for the drives in the local node only. You can display drive firmware information for the entire cluster, not just the local cluster, by running the following command:

isi devices drive firmware list --node-lnn all

Update the drive firmware
You can update the drive firmware to the latest revision; updating the drive firmware ensures overall data integrity. This procedure explains how to update the drive firmware on nodes that have bootflash drives after you have installed the latest drive support package. For a list of nodes with bootflash drives, see the System requirements section of the PowerScale Drive Support Package Release Notes.
NOTE: Power cycling drives during a firmware update might return unexpected results. Do not restart or power off nodes when the drive firmware is being updated in a cluster.
1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Run the following command to update the drive firmware for your entire cluster:
isi devices drive firmware update start all --node-lnn all To update the drive firmware for a specific node only, run the following command: isi devices drive firmware update start all --node-lnn <node-number>
CAUTION: Wait for one node to finish updating before you initiate an update on the next node. To confirm that a node has finished updating, run the following command:
isi devices drive firmware update list
A drive that is still updating displays a status of FWUPDATE.
Updating the drive firmware of a single drive takes approximately 15 seconds, depending on the drive model. OneFS updates drives sequentially.

Verify a drive firmware update
After you update the drive firmware in a node, confirm that the firmware is updated properly and that the affected drives are operating correctly. 1. Ensure that no drive firmware updates are currently in progress by running the following command:
isi devices drive firmware update list
If a drive is currently being updated, [FW_UPDATE] appears in the status column. 2. Verify that all drives have been updated by running the following command:
isi devices drive firmware list --node-lnn all
If all drives have been updated, the Desired FW column is empty. 3. Verify that all affected drives are operating in a healthy state by running the following command:
isi devices drive list --node-lnn all
If a drive is operating in a healthy state, [HEALTHY] appears in the status column.

82

General cluster administration

Automatic update of drive firmware
For clusters running OneFS 7.2 or later, install the latest drive support package on a node to automatically update the firmware for a new or replacement drive. The information within the drive support package determines whether the firmware of a drive must be updated before the drive is formatted and used. If an update is available, the drive is automatically updated with the latest firmware.
NOTE: New and replacement drives added to a cluster are formatted regardless of the status of their firmware revision. You can identify a firmware update failure by viewing the firmware status for the drives on a specific node. In case of a failure, run the isi devices command with the fwupdate action on the node to update the firmware manually. For example, run the following command to manually update the firmware on node 1:
isi devices -a fwupdate -d 1
Managing cluster nodes
You can add and remove nodes from a cluster. You can also shut down or restart the entire cluster.
Add a node to a cluster
You can add a new node to an existing PowerScale cluster. Before you add a node to a cluster, verify that an internal IP address is available. Add IP addresses as necessary before you add a new node. If a new node is running a different version of OneFS than a cluster, the system changes the node version of OneFS to match the cluster.
NOTE: For specific information about version compatibility between OneFS and PowerScale hardware, refer to the PowerScale Supportability and Compatibility Guide. 1. To identify the serial number of the node to be added, run the following command:
isi devices node list
2. To join the node to the cluster, run the following command: isi devices node add <serial-number>
For example, the following command joins a node to the cluster with a serial number of 43252:
isi devices node add 43252
Remove a node from the cluster
You can remove a node from a cluster. When you remove a node, the system smartfails the node to ensure that data on the node is transferred to other nodes in the cluster. Removing a storage node from a cluster deletes the data from that node. Before the system deletes the data, the FlexProtect job safely redistributes data across the nodes remaining in the cluster. Run the isi devices command. The following command removes a node with a logical node number (LNN) of 2 from the cluster:
isi devices node smartfail -node-lnn 2

General cluster administration

83

Modify the LNN of a node
You can modify the logical node number (LNN) of a node. This procedure is available only through the command-line interface (CLI). The nodes within your cluster can be renamed to any name/integer between 1 and 252. By changing the name of your node, you are resetting the LNN.
NOTE: Although you can specify any integer as an LNN, we recommend that you do not specify an integer greater than 252. Specifying LNNs above 252 can result in significant performance degradation. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Open the isi config command prompt by running the following command:
isi config
3. Run the lnnset command. The following command switches the LNN of a node from 12 to 73:
lnnset 12 73
4. Enter commit . You might need to reconnect to your SSH session before the new node name is automatically changed.
Restart or shut down the cluster
You can restart or shut down the PowerScale cluster. 1. Run the isi config command.
The command-line prompt changes to indicate that you are in the isi config subsystem 2. Restart or shutdown nodes on the cluster.
 To restart a single node or all nodes on the cluster, run the reboot command. The following command restarts a single node by specifying the LNN (logical node number):
reboot 7
 To shut down a single node or all nodes on the cluster, run the shutdown command. The following command shuts down all nodes on the cluster:
shutdown all
Upgrading OneFS
Before upgrading the OneFS cluster, it is recommended to perform preupgrade checks to confirm that your OneFS cluster is ready for an upgrade. These checks scan the current OneFS cluster and operating system for issues and compare the current OneFS version with the new version to ensure that the cluster meets certain criteria, such as configuration compatibility (SMB, LDAP, SmartPools), disk availability, and the absence of critical cluster events. Some preupgrade checks are optional, but it is recommended to perform all preupgrade checks before upgrading. If upgrading puts the cluster at risk, OneFS warns you, provides information about the risks, and prompts you to confirm whether to continue the upgrade. There are three options available for upgrading the OneFS cluster: parallel upgrades, rolling upgrades, or simultaneous upgrades. For more information about how to plan, prepare, and perform an upgrade on your OneFS cluster, see the PowerScale OneFS Upgrade Planning and Process Guide.

84

General cluster administration

Parallel upgrades
A parallel upgrade installs the new operating system on a subset of nodes and restarts that subset of nodes at the same time. Each subset of nodes attempts to make a reservation for their turn to upgrade until all nodes are upgraded. Node subsets and reservations are based on diskpool and node availability. During a parallel upgrade, node subsets that are not being upgraded remain online and can continue serving clients. However, clients that are connected to a restarting node are disconnected and reconnected. How the client connection behaves when a node is restarted depends on several factors including client type, client configuration (mount type, timeout settings), IP allocation method, and how the client connected to the cluster. In OneFS 9.2.0.0 and later, client connection behavior is managed by the disruption manager settings. Parallel upgrades are recommended whenever possible, as they require a smaller maintenance window than rolling upgrades, and do not require the interruption of service like simultaneous upgrades.
Rolling upgrades
A rolling upgrade installs the new operating system and restarts each node individually in the OneFS cluster so that only one node is offline at a time. A rolling upgrade takes longer to complete than a simultaneous upgrade. During a rolling upgrade, nodes that are not currently being upgraded remain online and can continue serving clients. However, clients that are connected to a restarting node are disconnected and reconnected. How the client connection behaves when a node is restarted depends on several factors including client type, client configuration (mount type, timeout settings), IP allocation method, and how the client connected to the cluster.
Simultaneous upgrades
A simultaneous upgrade installs the new operating system and restarts all nodes in the OneFS cluster at the same time. Simultaneous upgrades are faster than rolling upgrades but require a temporary interruption of service during the upgrade process. All client connections to the cluster must be terminated before completing the upgrade and data is inaccessible until the installation of the new OneFS operating system is complete and the cluster is back online.
SRS Summary
OneFS allows remote support through Secure Remote Services (SRS), which monitors the cluster, and with permission, provides remote access for PowerScale Technical Support personnel to gather cluster data and troubleshoot issues. SRS is a secure, Customer Support system that includes 24x7 remote monitoring and secure authentication with AES 256-bit encryption and RSA digital certificates. Although SRS is not a licensed feature, it must be enabled if you are using in product activation for your OneFS license. If you are using an evaluation license on the cluster, it is not possible to enable SRS. To evaluate SRS on an evaluation cluster, ask Technical Support for help with obtaining a signed license file. If you configure and enable remote support, PowerScale Technical Support personnel can establish a secure SSH session with the cluster through the SRS connection. Remote access to the cluster is only in the context of an open support case. You can allow or deny the remote session request by PowerScale Technical Support personnel. During remote sessions, support personnel can run remote support scripts that gather diagnostic data about cluster settings and operations. Diagnostic data is sent over the secure SRS connection to Dell Technologies SRS. The remote support user credentials are required for access to the cluster. The remote support user is a separate user, not a general cluster user, or a System Admin user. OneFS does not store the required remote support user credentials. A complete description of SRS features and functionality is available in the most recent version of the EMC Secure Remote Services Technical Description. More SRS documentation is available on Dell Technologies Support by Product.
Obtain signed OneFS license file for evaluation clusters
If a cluster was acquired for evaluation or proof of concept (POC) purposes, you still need a signed OneFS license file before SRS can be enabled. Evaluation licenses are used for evaluation or proof of concept purposes. To obtain a signed OneFS license file, follow these steps: 1. Generate a license activation file as described earlier in this guide.

General cluster administration

85

2. Open a support case with the Dell licensing team. 3. Include the following information:
 Sales order number(s)  Your license activation file The licensing team will generate the signed license file and send it in an email.
4. Upload the signed license file to your cluster, as described earlier in this guide.
Configuring and Enabling SRS Overview
You can configure support for Secure Remote Services (SRS) on the PowerScale cluster. SRS is now configured for the entire cluster with a single registration.
When you enable support for SRS on a cluster, you can optionally create rules for remote support connections to the PowerScale cluster with the SRS Policy Manager. Details on the Policy Manager are available in the most current Secure Remote Services Installation Guide.
You can implement firewall rules to block SSH from the SRS gateway(s) to the PowerScale nodes. Firewall rules ensure that Dell EMC has no remote access to the cluster, but outbound cluster alerts and telemetry can still be serviced by the SRS gateways and sent to Dell EMC.
Enable and configure Secure Remote Services support
You can enable support for Secure Remote Services (SRS) on a PowerScale cluster using the isi esrs modify command.
Pre-requisites:  If your cluster is running OneFS 9.0.0.0 and has PowerScale F200 or PowerScale F600 nodes installed, you must have SRS
v3 installed. For more information, see the Secure Remote Services documentation.  If your cluster is running OneFS 9.1.0.0 and later, you must have SRS v3 installed.  Install and configure a SRS 3.x gateway server before you enable SRS on a OneFS cluster. Complete details for installing and
upgrading SRS 3.x gateway server are available in the Secure Remote Services documentation. NOTE: The SRS Virtual Edition gateway (SRS 3.x) does not support installing software that is not already included in the appliance. While the customer has full access to the appliance, loading additional software or updating software already installed may require redeployment.
 If the SRS 3.x gateway server supports IPv6, then SRS is supported through IPv6 communications.  If the SRS 3.x gateway server is configured for IPv4, then, to support SRS transmissions and remote connections, at least
one subnet on the PowerScale cluster must be configured for IPv4 addresses. All nodes that are managed by SRS must have at least one network interface that is a member of an IPv4 address pool. Only nodes with an interface in this pool are eligible for election to be the SRS main node. All alerts on the cluster are routed through this main node.
 You must know the credentials (username and password) of a user account that can log into the SRS Gateway. Ask the installer of the SRS Gateway for these credentials. The username must be registered in MyService360 under the same customer site ID under which the hardware and software licenses are registered.
 The IP address pools that handle gateway connections must exist in the system and must belong to a subnet under groupnet0, which is the default system groupnet.
 For successful connections, all back-end records at Dell EMC must be correct and aligned to your customer site ID. Contact Dell EMC customer support to request verification before performing this procedure.
1. Run the isi esrs modify command to enable and modify the SRS configuration on the OneFS cluster:
isi esrs modify --enabled=true --primary-esrs-gateway=<gateway-server> --gatewayaccess-pools=subnetx:poolx
--username=<username> [--password=<password>]
Where:  gateway-server is the IP address or name of the primary gateway.  subnetx and poolx identify the network and pool for storing the collected data. Verify values with the site's storage
administrator.  username and password are the credentials for accessing the primary gateway.

86

General cluster administration

Ask the installer of the SRS Gateway for these credentials. The username must be registered in myService360 under the same customer site ID under which the hardware and software licenses are registered.
 The --password argument is optional on the command line. If not provided, the system prompts you for the password and does not display the response as you type it. Omitting password from the command line is preferable in most cases for security reasons.
For example:

isi esrs modify --enabled=true --primary-esrs-gateway=10.21.11.19 --gateway-access-

pools=subnet2:pool3

[email protected]

2. Review the messages in the output and take appropriate action. See the next section for details about possible errors. NOTE: In OneFS 9.0.0.0 and newer, when Secure Remote Services is enabled, SRS Telemetry is also enabled. The SRS Telemetry EULA must be agreed to before you can continue.

Troubleshoot error messages from isi esrs modify
This section describes the error messages that you might receive from the isi esrs modify command, and provides corrective actions. 1. Look specifically for the invalid username and password message.
u'message': u'invalid username and password'
This error could indicate that the username or password is not correct. It could also indicate that the user does not having appropriate access to the site ID.
 username and password are the credentials for accessing the primary gateway.  Ask the installer of the SRS Gateway for these credentials.  The username must be registered in myService360 under the same customer site ID under which the hardware and
software licenses are registered.  Contact Dell EMC support if you need help. The following figure shows the command output with the error message.

Figure 1. Invalid username and password error
2. Look for the Your OneFS license is unsigned message. Your OneFS license is unsigned. To enable ESRS, you must first have a signed OneFS license.

General cluster administration

87

This error indicates that a signed license file is not found on the cluster. Follow the instructions in Licensing to generate an activation file, get the license file signed, and upload the signed license file to your cluster. 3. Look for the Device match not found message.
: u'Connected', u'message': u'Device match not found for input device with Serial Number ELMISL0310CBSQ and Product ISILON-GW'
This message might indicate that the license was recently created, but the related software ID (SWID) is not yet propagated through the Dell EMC backend systems. In this case, SRS is configured but will not be enabled and connected to the gateway until the backend processing completes. There is no action to take except to wait for the backend processing at Dell EMC to propagate the SWID. This may take up to 4 hours.
NOTE: Also see the following paragraph. There are two possible reasons for receiving the Device match not found message.
4. Here is another reason for the Device match not found message. If you recently relocated a cluster or performed a data erasure and reimaged or reinstalled the cluster, a new cluster ID was generated by Dell EMC. The old cluster ID is no longer valid. In this case, notify Dell EMC support that you need a new cluster id associated with your SWID.
Verify that SRS Telemetry is enabled
This task ensures that the SRS Telemetry feature (previously called phone home) is enabled. SRS Telemetry replaces phone home:  isi_phone_home was deprecated in OneFS 8.2.1.  isi_phone_home was disabled in OneFS 8.2.2. In OneFS 9.0.0.0 and newer, SRS Telemetry is enabled when Secure Remote Services is enabled. In OneFS 9.0.0.0 and newer, the SRS Telemetry EULA must be agreed to before you can begin using the feature. SRS Telemetry gathers configuration data (gconfig), system controls (sysctls), directory paths, and statistics at the cluster level. SRS Telemetry also gathers API endpoints and statistics at the node level. This data is sent through Secure Remote Services for use by CloudIQ. For more information about SRS Telemetry, contact your OneFS support representative. You can verify that SRS Telemetry is enabled as follows: 1. Open an SSH connection on any node in the cluster. 2. Log in using the root account. 3. To view the state of SRS telemetry:
# isi esrs telemetry view
4. To enable or disable SRS telemetry, use the following command:
# isi esrs telemetry modify --enabled=[true|false]
NOTE: If SRS Telemetry is disabled, CloudIQ cannot properly monitor the health of your cluster.
Diagnostic commands and scripts
After you enable remote support through SRS, PowerScale Technical Support personnel can request diagnostic commands and scripts that gather cluster data and then upload the data. Scripts are based on the isi diagnostics gather and isi diagnostics netlogger tools. They are located in the /ifs/data/Isilon_Support/ directory on each node.
NOTE: The isi diagnostics gather and isi diagnostics netlogger commands replace the isi_gather_info command.

88

General cluster administration

This tool sends information about a cluster to PowerScale Technical Support.
The following tables list the data-gathering activities that remote support scripts perform. At the request of a PowerScale Technical Support representative, these scripts can be run automatically to collect information about the configuration settings and operations of a cluster. Information is sent to SRS over the secure SRS connection, so that it is available for PowerScale Technical Support personnel to analyze. The remote support scripts do not affect cluster services or data availability.

Table 1. Commands and subcommands for isi diagnostics gather

Command isi diagnostics gather start
isi diagnostics gather stop

Description
Begins the collection, and uploads all recent cluster log information.
Stops the collection of cluster information.

isi diagnostics gather status

Provides the cluster status.

isi diagnostics gather settings view

Views the gather settings.

isi diagnostics gather settings modify --esrs Sets SRS for gather upload.

isi diagnostics gather settings modify -ftp-upload

Controls whether or not to use FTP upload on completed gather

isi diagnostics gather settings modify --ftp- Sets the FTP host to upload to. upload-host

isi diagnostics gather settings modify --ftp- Sets the FTP user's password. Also see --set-ftp-

upload-pass

upload-pass.

isi diagnostics gather settings modify --ftp- Sets the path on the FTP server for the upload. upload-path

isi diagnostics gather settings modify --ftp- Sets the proxy server for FTP. upload-proxy

isi diagnostics gather settings modify --ftp- Sets the port for proxy server for FTP. upload-proxy-port

isi diagnostics gather settings modify --ftp- Sets the FTP user. upoad-user

isi diagnostics gather settings modify -gather-mode

Sets the type of gather: incremental or full.

isi diagnostics gather settings modify -http-upload

Sets whether or not to use HTTP upload on completed gather.

isi diagnostics gather settings modify --help Displays help for this command.

isi diagnostics gather settings modify -http-upload-host

Sets the HTTP host to upload to.

isi diagnostics gather settings modify -http-upload-path

Sets the path for the upload.

isi diagnostics gather settings modify -http-upload-proxy

Sets the proxy to use for HTTP upload.

isi diagnostics gather settings modify -http-upload-proxy-port

Sets the proxy port to use for HTTP upload.

isi diagnostics gather settings modify --set- Specify --ftp-upload-pass interactively. ftp-upload-pass

isi diagnostics gather settings modify -upload

Enables gather upload.

isi diagnostics gather settings modify -verbose

Displays more detailed information.

General cluster administration

89

Table 2. Commands and subcommands for isi diagnostics netlogger

Command

Description

isi diagnostics netlogger start

Starts the netlogger process.

isi diagnostics netlogger stop

Stops the netlogger process.

isi diagnostics netlogger status

Provides the netlogger status.

isi diagnostics netlogger settings view

Views the netlogger settings.

isi diagnostics netlogger settings modify -- Sets the Client IP address or addresses to filter. clients

isi diagnostics netlogger settings modify -- Sets the number of capture files to keep after they reach the

count

duration limit. Defaults to the last 3 files.

isi diagnostics netlogger settings modify -- Sets how long to run the capture before rotating the capture

duration

file. The default is10 minutes.

isi diagnostics netlogger settings modify -- Displays help for this command. help

isi diagnostics netlogger settings modify -- Specifies the network interfaces on which to capture. interfaces

isi diagnostics netlogger settings modify -- Sets a list of nodes on which to run capture. nodelist

isi diagnostics netlogger settings modify -- Sets the TCP or UDP port or ports to filter. ports

isi diagnostics netlogger settings modify -- Sets the protocols to filter: TCP, UDP, IP, ARP. protocols

isi diagnostics netlogger settings modify -- Specifies how much of the packet to display. The default is

snaplength

320 bytes. A 0 value shows all of the packet.

Table 3. Cluster information scripts Action Clean watch folder Get application data
Generate dashboard file daily Generate dashboard file sequence
Get ABR data (as built record) Get ATA control and GMirror status
Get cluster data
Get cluster events
Get cluster status Get contact info
Get contents (var/crash)

Description
Clears the contents of /var/crash.
Collects and uploads information about OneFS application programs.
Generates daily dashboard information.
Generates dashboard information in the sequence that it occurred.
Collects as-built information about hardware.
Collects system output and invokes a script when it receives an event that corresponds to a predetermined eventid.
Collects and uploads information about overall cluster configuration and operations.
Gets the output of existing critical events and uploads the information.
Collects and uploads cluster status details.
Extracts contact information and uploads a text file that contains it.
Uploads the contents of /var/crash.

90

General cluster administration

Table 3. Cluster information scripts (continued) Action Get job status Get domain data Get file system data Get IB data
Get logs data Get messages Get network data
Get NFS clients Get node data Get protocol data
Get Pcap client stats Get readonly status Get usage data

Description
Collects and uploads details on a job that is being monitored.
Collects and uploads information about the cluster's Active Directory Services (ADS) domain membership.
Collects and uploads information about the state and health of the OneFS /ifs/ file system.
Collects and uploads information about the configuration and operation of the InfiniBand back-end network.
Collects and uploads only the most recent cluster log information.
Collects and uploads active /var/log/messages files.
Collects and uploads information about cluster-wide and node-specific network configuration settings and operations.
Runs a command to check if nodes are being used as NFS clients.
Collects and uploads node-specific configuration, status, and operational information.
Collects and uploads network status information and configuration settings for the NFS, SMB, HDFS, FTP, and HTTP protocols.
Collects and uploads client statistics.
Warns if the chassis is open and uploads a text file of the event information.
Collects and uploads current and historical information about node performance and resource usage.

Disable SRS support
You can disable support for SRS on the PowerScale cluster. Disable SRS on a PowerScale OneFS cluster by running the following command:
isi esrs modify --enabled=false

View SRS configuration settings
You can view SRS settings that are specified on a PowerScale cluster. The output for the following commands includes Primary and Secondary SRS Gateways (SRS v3), SMTP status (enabled, or disabled) if email notification is enabled for failover, and Gateway Access Pools details. Run the isi esrs view command to view SRS configuration details.

General cluster administration

91

PowerScale SRS Managed File Transfer support
Managed File Transfer (MFT) support for OneFS provides the ability for customers to download suggested files and updates directly from Dell Technologies PowerScale by using SRS commands. MFT is not a licensed feature of OneFS, but it does require that SRS is enabled. Currently, MFT configuration and usage is only available from the CLI, and is integrated with the OneFS job engine. No more than one file at a time is downloaded to the cluster. Files can include packages, patches, and scripts.
Configure the MFT download feature
Use the isi esrs modify command to enable MFT.
isi esrs modify --download-enabled=true
The MFT feature has configurable options, with default settings that might not need any adjustments. Use the isi esrs view command to check the current MFT option settings. The following figure is an example of the command output.

Figure 2. The isi esrs view command output

The following table describes the configurable options. Table 4. Configurable MFT options MFT option Download Enabled ESRS File Download Timeout Period
ESRS File Download Error Retries ESRS File Download Chunk Size ESRS Download Filesystem Limit
To change any of the settings, use isi esrs modify.

Description Download is disabled by default.
Specifies the length of time in seconds for each file chunk to finish downloading.
Specifies the number of retries before the job fails.
Sets the size in Kb for each file chunk.
Sets the file system limit (percentage) at which MFT does not send any more files.

Download files with MFT
Use the isi esrs download start /ifs/<destination_location> command to begin downloading a file from the secure locker.

92

General cluster administration

NOTE: A secure locker is a customer-specific directory that is located in the Dell Technologies back end infrastructure. Access is only granted to users who are assigned permissions to do so.
The following figure is an example of the output for isi esrs download start.

Figure 3. The isi esrs download start command output
Use the isi esrs download list command to view the list of files to be downloaded. Use the isi job view command to view job details. A checksum is used to verify the contents and integrity of the file after it is downloaded. If the checksum fails, the file is quarantined. The following figure is an example of the command output.

Figure 4. isi job view command output Use the isi job reports view command to view job status.

General cluster administration

93

Figure 5. The isi job reports view command output Troubleshooting data is written to the /var/log/isi_job_d.log and the /var/log/isi_esrs_d.log log files.

94

General cluster administration

5
Access zones
This section contains the following topics:
Topics:
· Data Security overview · Base directory guidelines · Access zones best practices · Access zones on a SyncIQ secondary cluster · Access zone limits · Quality of service · Zone-based Role-based Access Control (zRBAC) · Zone-specific authentication providers · Managing access zones
Data Security overview
The default view of a PowerScale cluster is that of one physical machine. But you can partition a cluster into multiple virtual containers called access zones. Access zones allow you to isolate data and control who can access data in each zone.
Access zones support configuration settings for authentication and identity management services on a cluster. Access zones enable you to configure authentication providers and provision protocol directories such as SMB shares and NFS exports on a zone-by-zone basis. Creating an access zone automatically creates a local provider, which allows you to configure each access zone with a list of local users and groups. You can also authenticate through a different authentication provider in each access zone.
To control data access, you associate the access zone with a groupnet. A groupnet is a top-level networking container that manages DNS client connection settings and contains subnets and IP address pools. When you create an access zone, you must specify a groupnet. If a groupnet is not specified, the access zone references the default groupnet. Multiple access zones can reference a single groupnet. You can direct incoming connections to the access zone through a specific IP address pool in the groupnet. Associating an access zone with an IP address pool restricts authentication to the associated access zone and reduces the number of available and accessible SMB shares and NFS exports.
An advantage to multiple access zones is the ability to configure audit protocol access for individual access zones. You can modify the default list of successful and failed protocol audit events and then generate reports through a third-party tool for an individual access zone.
A cluster includes an access zone that is named System where you manage all aspects of a cluster and other access zones. By default, all cluster IP addresses connect to the System zone. Role-based access, which primarily allows configuration actions, is available through only the System zone. All administrators, including those given privileges by a role, must connect to the System zone to configure a cluster. The System zone is automatically configured to reference the default groupnet on the cluster, which is groupnet0.
Configuration management of a non-System access zone is not permitted through SSH, the OneFS API, or the web administration interface. However, you can create and delete SMB shares in an access zone through the Microsoft Management Console (MMC).
Base directory guidelines
A base directory defines the file system tree that is exposed by an access zone. The access zone cannot grant access to any files outside of the base directory. Assign a base directory to each access zone.
Base directories restrict path options for several features such as SMB shares, NFS exports, the HDFS root directory, and the local provider home directory template.

Access zones

95

Data isolation is required within an access zone. It is recommended that you create a unique base directory path that is not identical to or does not overlap another base directory, except for the System access zone. For example, do not specify /ifs/ data/hr as the base directory for both the zone2 and zone3 access zones. Or if /ifs/data/hr is assigned to zone2, do not assign /ifs/data/hr/personnel to zone3.
OneFS supports overlapping data between access zones for cases where your workflows require shared data. However, the added complexity to the access zone configuration might lead to future issues with client access. For the best results from overlapping data between access zones, it is recommended that the access zones also share the same authentication providers. Shared providers ensures that users will have consistent identity information when accessing the same data through different access zones.
If you cannot configure the same authentication providers for access zones with shared data, ensure the following:
 Select Active Directory as the authentication provider in each access zone. This causes files to store globally unique SIDs as the on-disk identity, eliminating the chance of users from different zones gaining access to each other's data.
 Avoid selecting local, LDAP, and NIS as the authentication providers in the access zones. These authentication providers use UIDs and GIDs, which are not guaranteed to be globally unique. This results in a high probability that users from different zones will be able to access each other's data.
 Set the on-disk identity to native, or preferably, to SID. When user mappings exist between Active Directory and UNIX users or if the Services for Unix option is enabled for the Active Directory provider, OneFS stores SIDs as the on-disk identity instead of UIDs.

Access zones best practices

You can avoid configuration problems on the PowerScale cluster when creating access zones by following best practices guidelines.

Best practice Create unique base directories.
Separate the function of the System zone from other access zones.
Create access zones to isolate data access for different clients or users. Assign only one authentication provider of each type to each access zone.
Avoid overlapping UID or GID ranges for authentication providers in the same access zone.

Details
To achieve data isolation, the base directory path of each access zone should be unique and should not overlap or be nested inside the base directory of another access zone. Overlapping is allowed, but should only be used if your workflows require shared data.
Reserve the System zone for configuration access, and create additional zones for data access. Move current data out of the System zone and into a new access zone.
Do not create access zones if a workflow requires data sharing between different classes of clients or users.
An access zone is limited to a single Active Directory provider; however, OneFS allows multiple LDAP, NIS, and file authentication providers in each access zone. It is recommended that you assign only one type of each provider per access zone in order to simplify administration.
The potential for zone access conflicts is slight but possible if overlapping UIDs/GIDs are present in the same access zone.

Access zones on a SyncIQ secondary cluster
You can create access zones on a SyncIQ secondary cluster that is used for backup and disaster recovery, with some limitations.
If you have an active SyncIQ license, you can maintain a secondary PowerScale cluster for backup and failover purposes in case your primary server should go offline. When you run a replication job on the primary server, file data is replicated to the backup server. That includes directory paths and other metadata that is associated with those files.
However, system configuration settings, such as access zones, are not replicated to the secondary server. In a failover scenario, the configuration settings of the primary and secondary clusters should be similar, if not identical.
Usually, including with access zones, it is recommended that you configure system settings before you run a SyncIQ replication job. The reason is that a replication job places target directories in read-only mode. If you attempt to create an access zone where the base directory is already in read-only mode, OneFS generates an error message.

96

Access zones

Access zone limits
You can follow access zone limits guidelines to help size the workloads on the OneFS system.
If you configure multiple access zones on a PowerScale cluster, limits guidelines are recommended for best system performance. The limits that are described in the PowerScale OneFS Technical Specifications Guide are recommended for heavy enterprise workflows on a cluster, treating each access zone as a separate physical server. The Technical Specifications Guide and related PowerScale documentation are available on Dell EMC Online Support.

Quality of service

You can set upper bounds on quality of service by assigning specific physical resources to each access zone.
Quality of service addresses physical hardware performance characteristics that can be measured, improved, and sometimes guaranteed. Characteristics that are measured for quality of service include but are not limited to throughput rates, CPU usage, and disk capacity. When you share physical hardware in a PowerScale cluster across multiple virtual instances, competition exists for the following services:  CPU  Memory  Network bandwidth  Disk I/O  Disk capacity
Access zones do not provide logical quality of service guarantees to these resources, but you can partition these resources between access zones on a single cluster. The following table describes a few ways to partition resources to improve quality of service:

Use NICs
SmartPools
SmartQuotas

Notes
You can assign specific NICs on specific nodes to an IP address pool that is associated with an access zone. By assigning these NICs, you can determine the nodes and interfaces that are associated with an access zone. This enables the separation of CPU, memory, and network bandwidth.
SmartPools are separated into multiple tiers of high, medium, and low performance. The data written to a SmartPool is written only to the disks in the nodes of that pool.
Associating an IP address pool with only the nodes of a single SmartPool enables partitioning of disk I/O resources.
Through SmartQuotas, you can limit disk capacity by a user or a group or in a directory. By applying a quota to an access zone's base directory, you can limit disk capacity that is used in that access zone.

Zone-based Role-based Access Control (zRBAC)
You can assign roles and a subset of privileges to users on a per-access-zone basis.
Role-based Access Control (RBAC) supports granting users with privileges and the ability to perform certain tasks. Tasks can be performed through the Platform API, such as creating or modifying or viewing NFS exports, SMB shares, authentication providers, and various cluster settings.
Users may want to perform these tasks inside a single access zone, enabling a local administrator to create SMB shares for a specific access zone, for example, but disallowing that administrator from modifying configuration that would affect other access zones.
Previous to zRBAC, only users in the System Access Zone were given privileges. These users could view and modify configuration in all other access zones. Thus, a user with a specific privilege was a global administrator for configuration that was accessible through that privilege.

Access zones

97

zRBAC enables you to assign roles and a subset of privileges that must be assigned on a per-access-zone basis. Administrative tasks that the zone-aware privileges covers can be delegated to an administrator of a specific access zone. As a result, you get the ability to create a local administrator who is responsible for a single access zone. A user in the System Access Zone can affect all other access zones, and remains a global administrator.

Non-System access zone privileges

The privileges available in non-System access zones are listed in the following table.

Privilege

Description

ISI_PRIV_AUDIT

 Add/remove your zone from list of audited zones  View/modify zone-specific audit settings for your zone  View global audit settings

ISI_PRIV_AUTH

 View/edit your access zone  Create/modify/view roles in your access zone  View/modify auth mapping settings for your zone  View global settings that are related to authentication

ISI_PRIV_FILE_FILTER

Use all functionalities that are associated with this privilege, in your own zone.

ISI_PRIV_HDFS

Use all functionalities that are associated with this privilege, in your own zone.

ISI_PRIV_NFS

 View global NFS settings, but do not modify them.  Otherwise, use all functionality that is associated with this privilege, in your own zone.

ISI_PRIV_RESTRICTED_AUTH  Create/modify/delete users and groups with the same or less privilege.  View users and groups.

ISI_PRIV_ROLE

Use all functionalities that are associated with this privilege, but only in your own zone.

ISI_PRIV_SMB

 View global SMB settings, but do not modify them.  Otherwise, use all functionalities that are associated with this privilege, in your own zone.

ISI_PRIV_SWIFT

Use all functionalities that are associated with this privilege, in your own zone.

ISI_PRIV_VCENTER

Configure VMware vCenter

ISI_PRIV_LOGIN_PAPI

Access the WebUI from a non-System access zone

ISI_PRIV_BACKUP

Bypass file permission checks and grant all read permissions.

ISI_PRIV_RESTORE

Bypass file permission checks and grant all write permissions.

ISI_PRIV_NS_TRAVERSE

Traverse and view directory metadata inside the zone base path.

ISI_PRIV_NS_IFS_ACCESS Access directories inside the zone base path through RAN

Built-in roles in non-System zones

In the non-System access zone, two integrated roles are provided. The roles with their privileges are listed below:

Role ZoneAdmin

Description
Allows administration of configuration aspects that are related to the current access zone.

Privileges
 ISI_PRIV_LOGIN_PAPI  ISI_PRIV_AUDIT  ISI_PRIV_FILE_FILTER  ISI_PRIV_HDFS  ISI_PRIV_NFS  ISI_PRIV_S3  ISI_PRIV_SMB  ISI_PRIV_SWIFT  ISI_PRIV_VCENTER

98

Access zones

Role ZoneSecurityAdmin

Description

Privileges

 ISI_PRIV_NS_TRAVERSE  ISI_PRIV_NS_IFS_ACCESS

Allows administration of security

 ISI_PRIV_LOGIN_PAPI

configuration aspects that are related to  ISI_PRIV_AUTH

the current access zone.

 ISI_PRIV_ROLE

NOTE: These roles do not have any default users who are automatically assigned to them.

Zone-specific authentication providers
Some information about how authentication providers work with zRBAC. Authentication providers are global objects in a OneFS cluster. However, as part of the zRBAC feature, an authentication provider is implicitly associated with the access zone from which it was created, and has certain behaviors that are based on that association.  All access zones can view and use an authentication provider that is created from the System zone. However, only a request
from the System access zone can modify or delete it.  An authentication provider that is created from (or on behalf of) a non-System access zone can only be viewed or modified
or deleted by that access zone and the System zone.  A local authentication provider is implicitly created whenever an access zone is created, and is associated with that access
zone.  A local authentication provider for a non-System access zone may no longer be used by another access zone. If you would
like to share a local authentication provider among access zones, then it must be the System zone's local provider.  The name of an authentication provider is still global. Therefore, authentication providers must have unique names. Thus, you
cannot create two LDAP providers named ldap5 in different access zones, for example.  The Kerberos provider can only be created from the System access zone.  Creating two distinct Active Directory (AD) providers to the same AD may require the use of the AD multi-instancing
feature. To assign a unique name to the AD provider, use --instance.
Managing access zones
You can create access zones on a PowerScale cluster, view and modify access zone settings, and delete access zones.
Create an access zone
You can create an access zone to isolate data and restrict which users can access the data. Run the isi zone zones create command. The following command creates an access zone named zone3 and sets the base directory to /ifs/hr/data:
isi zone zones create zone3 /ifs/hr/data
The following command creates an access zone named zone3, sets the base directory to /ifs/hr/data and creates the directory on the cluster if it does not already exist:
isi zone zones create zone3 /ifs/hr/data --create-path
The following command creates an access zone named zone3, sets the base directory to /ifs/hr/data, and associates the access zone with groupnet2:
isi zone zones create zone3 /ifs/hr/data --groupnet=groupnet2

Access zones

99

Assign an overlapping base directory
You can create overlapping base directories between access zones for cases where your workflows require shared data. Run the isi zone zones create command. The following command creates an access zone named zone5 and sets the base directory to /ifs/hr/data even though the same base directory was set for zone3:
isi zone zones create zone5 --path=/ifs/hr/data --force-overlap
Manage authentication providers in an access zone
You modify an access zone to add and remove authentication providers. When you add an authentication provider, it must belong to the same groupnet referenced by the access zone. When you remove an authentication provider from an access zone, the provider is not removed from the system and remains available for future use. The order in which authentication providers are added to access zone designates the order in which providers are searched during authentication and user lookup. 1. To add an authentication provider, run the isi zone zones modify command with the --add-auth-providers
option. You must specify the name of the authentication provider in the following format: <provider-type>:<provider-name>. The following command adds a file authentication provider named HR-Users to the zone3 access zone:
isi zone zones modify zone3 --add-auth-providers=file:hr-users 2. To remove an authentication provider, run the isi zone zones modify command with the --remove-auth-
providers option. You must specify the name of the authentication provider in the following format: <provider-type>:<provider-name>. The following command removes the file authentication provider named HR-Users from the zone3 access zone:
isi zone zones modify zone3 --remove-auth-providers=file:hr-users The following command removes all authentication providers from the zone3 access zone:
isi zone zones modify zone3 --clear-auth-providers
Associate an IP address pool with an access zone
You can associate an IP address pool with an access zone to ensure that clients can connect to the access zone only through the range of IP addresses assigned to the pool. The IP address pool must belong to the same groupnet referenced by the access zone. Run the isi network pools modify command. Specify the pool ID you want to modify in the following format:
<groupnet_name>.<subnet_name>.<pool_name>
The following command associates zone3 with pool1 which is under groupnet1 and subnet1: isi network pools modify groupnet1.subnet1.pool1 --access-zone=zone3
Modify an access zone
You can modify the properties of any access zone except the name of the built-in System zone. Run the isi zone zones modify command.
100 Access zones

The following command renames the zone3 access zone to zone5 and removes all current authentication providers from the access zone:
isi zone zones modify zone3 --name=zone5 --clear-auth-providers
Delete an access zone
You can delete any access zone except the built-in System zone. When you delete an access zone, all associated authentication providers remain available to other access zones, but IP addresses are not reassigned to other access zones. SMB shares, NFS exports, and HDFS data paths are deleted when you delete an access zone; however, the directories and data still exist, and you can map new shares, exports, or paths in another access zone. Run the isi zone zones delete command. The following command deletes the zone3 access zone :
isi zone zones delete zone3
View a list of access zones
You can view a list of all access zones on a cluster, or you can view details for a specific access zone. 1. To view a list of all access zones on the cluster, run the isi zone zones list command.
The system displays output similar to the following example:
Name Path -----------------------System /ifs zone3 /ifs/hr/benefits zone5 /ifs/marketing/collateral ------------------------
2. To view the details of a specific access zone, run the isi zone zones view command and specify the zone name. The following command displays the setting details of zone5:
isi zone zones view zone5
The system displays output similar to the following example:
Name: zone5 Path: /ifs/marketing/collateral Groupnet: groupnet0 Map Untrusted: Auth Providers: lsa-local-provider:zone5 NetBIOS Name: User Mapping Rules: Home Directory Umask: 0077 Skeleton Directory: /usr/share/skel Cache Entry Expiry: 4H Zone ID: 3
Create one or more access zones
You can create one or more access zones. 1. Run the isi zone zones create command.
Access zones 101

The following commands create three access zones, named zone1, zone2, and zone3, sets the base directory to /ifs/ access-zones/zone1,ifs/access-zones/zone2, and ifs/access-zones/zone3 and creates the directory on the cluster in case it does not exist:
isi zone zones create zone1 /ifs/access-zones/zone1 --create-path isi zone zones create zone2 /ifs/access-zones/zone2 --create-path isi zone zones create zone3 /ifs/access-zones/zone3 --create-path
2. Run the isi zone list command to view all the zones you have created:
isi zone list Name Path -----------------------------System /ifs zone1 /ifs/access-zones/zone1 zone2 /ifs/access-zones/zone2 zone3 /ifs/access-zones/zone3 -----------------------------Total: 4
Create local users in an access zone
You can create local users for different access zones. 1. Run the isi auth users create command.
The following commands create three users in each access zone.
isi auth users create z1-user1 --enabled yes --password a --zone zone1 isi auth users create z1-user2 --enabled yes --password a --zone zone1 isi auth users create z1-user3 --enabled yes --password a --zone zone1 isi auth users create z2-user1 --enabled yes --password a --zone zone2 isi auth users create z2-user2 --enabled yes --password a --zone zone2 isi auth users create z2-user3 --enabled yes --password a --zone zone2 isi auth users create z3-user1 --enabled yes --password a --zone zone3 isi auth users create z3-user2 --enabled yes --password a --zone zone3 isi auth users create z3-user3 --enabled yes --password a --zone zone3
2. Run the isi auth users list command to view all users created in zone1:
isi auth users list --zone zone1 Name -------Guest z1-user1 z1-user2 z1-user3 root nobody -------Total: 6
NOTE: To view users in zone2 and zone3, use the isi auth users list --zone zone2 and isi auth users list --zone zone3 commands respectively.
Access files through the RESTful Access to Namespace (RAN) in non-System zones
You can access data on the OneFS file system through RAN. 1. To access a file, you must make an HTTP call to /namespace/<path>.
curl -u user:password -k https://cluster.ip.addr:8080/namespace/ifs/data/file.txt
2. When accessing files through a non-System zone, the path name must be within the base path of the access zone through which you are accessing the data.
102 Access zones

For example, if IP address 1.2.3.4 is in zone2, which has a base path of /ifs/data/zone2, and then the following error is displayed:
NOTE: The user: password must be a valid user and password in access zone, zone2 in order to access RAN through zone2. # curl -u user:password -k 'https://1.2.3.4:8080/namespace/ifs/data/other-zone' { "errors" : [ { "code" : "AEC_FORBIDDEN", "message" : "Cannot access file/directory path outside base path of access zone" } ] }
Access zones 103

6

Authentication

Topics:
· Authentication overview · Authentication provider features · Security Identifier (SID) history overview · Supported authentication providers · Active Directory · LDAP · NIS · Kerberos authentication · File provider · Local provider · Multi-factor Authentication (MFA) · Multi-instance active directory · LDAP public keys · Managing Active Directory providers · Managing LDAP providers · Managing NIS providers · Managing MIT Kerberos authentication · Managing file providers · Managing local users and groups · SSH Authentication and Configuration
Authentication overview

You can manage authentication settings for your cluster, including authentication providers, Active Directory domains, LDAP, NIS, and Kerberos authentication, file and local providers, multi-factor authentication, and more.

Authentication provider features

You can configure authentication providers for your environment. Authentication providers support a mix of the features described in the following table.

Feature Authentication
Users and groups Netgroups UNIX-centric user and group properties
Windows-centric user and group properties

Description
All authentication providers support cleartext authentication. You can configure some providers to support NTLM or Kerberos authentication also.
OneFS provides the ability to manage users and groups directly on the cluster.
Specific to NFS, netgroups restrict access to NFS exports.
Login shell, home directory, UID, and GID. Missing information is supplemented by configuration templates or additional authentication providers.
NetBIOS domain and SID. Missing information is supplemented by configuration templates.

104 Authentication

Security Identifier (SID) history overview
SID history preserves the membership and access rights of users and groups during an Active Directory domain migration.
Security identifier (SID) history preserves the membership and access rights of users and groups during an Active Directory domain migration. When an object is moved to a new domain, the new domain generates a new SID with a unique prefix and records the previous SID information in an LDAP field. This process ensures that users and groups retain the same access rights and privileges in the new domain that they had in the previous domain.
Note the following when working with historical SIDS.  Use historical SIDs only to maintain historical file access and authentication privileges.  Do not use historical SIDs to add new users, groups, or roles.  Always use the current object SID as defined by the domain to modify a user or to add a user to any role or group.

Supported authentication providers

You can configure local and remote authentication providers to authenticate or deny user access to a cluster.
The following table compares features that are available with each of the authentication providers that OneFS supports. In the following table, an x indicates that a feature is fully supported by a provider; an asterisk (*) indicates that additional configuration or support from another provider is required.

Authentication provider
Active Directory LDAP NIS Local File MIT Kerberos

NTLM

Kerberos

x

x

*

x

x x
x

User/group management
x

Netgroup s

UNIX propertie s (RFC 2307)

Windows propertie s

*

x

x

x

*

x

x

x

x

x

x

*

*

*

Active Directory
Active Directory is a Microsoft implementation of Lightweight Directory Access Protocol (LDAP), Kerberos, and DNS technologies that can store information about network resources. Active Directory can serve many functions, but the primary reason for joining the cluster to an Active Directory domain is to perform user and group authentication.
You can join the cluster to an Active Directory (AD) domain by specifying the fully qualified domain name, which can be resolved to an IPv4 or an IPv6 address, and a user name with join permission. When the cluster joins an AD domain, a single AD machine account is created. The machine account establishes a trust relationship with the domain and enables the cluster to authenticate and authorize users in the Active Directory forest. By default, the machine account is named the same as the cluster. If the cluster name is more than 15 characters long, the name is hashed and displayed after joining the domain.
OneFS supports NTLM and Microsoft Kerberos for authentication of Active Directory domain users. NTLM client credentials are obtained from the login process and then presented in an encrypted challenge/response format to authenticate. Microsoft Kerberos client credentials are obtained from a key distribution center (KDC) and then presented when establishing server connections. For greater security and performance, we recommend that you implement Kerberos, according to Microsoft guidelines, as the primary authentication protocol for Active Directory.
Each Active Directory provider must be associated with a groupnet. The groupnet is a top-level networking container that manages hostname resolution against DNS nameservers and contains subnets and IP address pools. The groupnet specifies which networking properties the Active Directory provider will use when communicating with external servers. The groupnet associated with the Active Directory provider cannot be changed. Instead you must delete the Active Directory provider and create it again with the new groupnet association.
You can add an Active Directory provider to an access zone as an authentication method for clients connecting through the access zone. OneFS supports multiple instances of Active Directory on a PowerScale cluster; however, you can assign

Authentication 105

only one Active Directory provider per access zone. The access zone and the Active Directory provider must reference the same groupnet. Configure multiple Active Directory instances only to grant access to multiple sets of mutually-untrusted domains. Otherwise, configure a single Active Directory instance if all domains have a trust relationship. You can discontinue authentication through an Active Directory provider by removing the provider from associated access zones.
LDAP
The Lightweight Directory Access Protocol (LDAP) is a networking protocol that enables you to define, query, and modify directory services and resources. OneFS can authenticate users and groups against an LDAP repository in order to grant them access to the cluster. OneFS supports Kerberos authentication for an LDAP provider. The LDAP service supports the following features:  Users, groups, and netgroups.  Configurable LDAP schemas. For example, the ldapsam schema allows NTLM authentication over the SMB protocol for users
with Windows-like attributes.  Simple bind authentication, with and without TLS.  Redundancy and load balancing across servers with identical directory data.  Multiple LDAP provider instances for accessing servers with different user data.  Encrypted passwords.  IPv4 and IPv6 server URIs. Each LDAP provider must be associated with a groupnet. The groupnet is a top-level networking container that manages hostname resolution against DNS nameservers and contains subnets and IP address pools. The groupnet specifies which networking properties the LDAP provider will use when communicating with external servers. The groupnet associated with the LDAP provider cannot be changed. Instead you must delete the LDAP provider and create it again with the new groupnet association. You can add an LDAP provider to an access zone as an authentication method for clients connecting through the access zone. An access zone may include at most one LDAP provider. The access zone and the LDAP provider must reference the same groupnet. You can discontinue authentication through an LDAP provider by removing the provider from associated access zones.
NIS
The Network Information Service (NIS) provides authentication and identity uniformity across local area networks. OneFS includes an NIS authentication provider that enables you to integrate the cluster with your NIS infrastructure. NIS, designed by Sun Microsystems, can authenticate users and groups when they access the cluster. The NIS provider exposes the passwd, group, and netgroup maps from an NIS server. Hostname lookups are also supported. You can specify multiple servers for redundancy and load balancing. Each NIS provider must be associated with a groupnet. The groupnet is a top-level networking container that manages hostname resolution against DNS nameservers and contains subnets and IP address pools. The groupnet specifies which networking properties the NIS provider will use when communicating with external servers. The groupnet associated with the NIS provider cannot be changed. Instead you must delete the NIS provider and create it again with the new groupnet association. You can add an NIS provider to an access zone as an authentication method for clients connecting through the access zone. An access zone may include at most one NIS provider. The access zone and the NIS provider must reference the same groupnet. You can discontinue authentication through an NIS provider by removing the provider from associated access zones.
NOTE: NIS is different from NIS+, which OneFS does not support.
Kerberos authentication
Kerberos is a network authentication provider that negotiates encryption tickets for securing a connection. OneFS supports Microsoft Kerberos and MIT Kerberos authentication providers on a cluster. If you configure an Active Directory provider, support for Microsoft Kerberos authentication is provided automatically. MIT Kerberos works independently of Active Directory.
106 Authentication

For MIT Kerberos authentication, you define an administrative domain known as a realm. Within this realm, an authentication server has the authority to authenticate a user, host, or service; the server can resolve to either IPv4 or IPv6 addresses. You can optionally define a Kerberos domain to allow additional domain extensions to be associated with a realm. The authentication server in a Kerberos environment is called the Key Distribution Center (KDC) and distributes encrypted tickets. When a user authenticates with an MIT Kerberos provider within a realm, a cryptographic ticket-granting ticket (TGT) is created to enable the user request an access to a service principal name (SPN). Each MIT Kerberos provider must be associated with a groupnet. The groupnet is a top-level networking container that manages hostname resolution against DNS nameservers and contains subnets and IP address pools. The groupnet specifies which networking properties the Kerberos provider will use when communicating with external servers. The groupnet associated with the Kerberos provider cannot be changed. Instead you must delete the Kerberos provider and create it again with the new groupnet association. You can add an MIT Kerberos provider to an access zone as an authentication method for clients connecting through the access zone. An access zone may include at most one MIT Kerberos provider. The access zone and the Kerberos provider must reference the same groupnet. You can discontinue authentication through an MIT Kerberos provider by removing the provider from associated access zones.
NOTE: Do not use the NULL account with Kerberos authentication. Using the NULL account for Kerberos authentication can cause issues.
Keytabs and SPNs overview
A Key Distribution Center (KDC) is an authentication server that stores accounts and keytabs for users connecting to a network service within a cluster. A keytab is a key table that stores keys to validate and encrypt Kerberos tickets. One of the fields in a keytab entry is a service principal name (SPN). An SPN identifies a unique service instance within a cluster. Each SPN is associated with a specific key in the KDC. Users can use the SPN and its associated keys to obtain Kerberos tickets that enable access to various services on the cluster. A member of the SecurityAdmin role can create new keys for the SPNs and modify them later as necessary. An SPN for a service typically appears as <service>/<fqdn>@<realm>.
NOTE: SPNs must match the SmartConnect zone name and the FQDN hostname of the cluster. If the SmartConnect zone settings are changed, you must update the SPNs on the cluster to match the changes.
MIT Kerberos protocol support
MIT Kerberos supports certain standard network communication protocols such as HTTP, HDFS, and NFS. MIT Kerberos does not support SMB, SSH, and FTP protocols. For the NFS protocol support, MIT Kerberos must be enabled for an export and also a Kerberos provider must be included within the access zone.
File provider
A file provider enables you to supply an authoritative third-party source of user and group information to a PowerScale cluster. A third-party source is useful in UNIX and Linux environments that synchronize /etc/passwd, /etc/group, and etc/netgroup files across multiple servers. Standard BSD /etc/spwd.db and /etc/group database files serve as the file provider backing store on a cluster. You generate the spwd.db file by running the pwd_mkdb command in the OneFS command-line interface (CLI). You can script updates to the database files. On a PowerScale cluster, a file provider hashes passwords with libcrypt. For the best security, it is recommended that you use the Modular Crypt Format in the source /etc/passwd file to determine the hashing algorithm. OneFS supports the following algorithms for the Modular Crypt Format:  MD5  NT-Hash  SHA-256  SHA-512 For information about other available password formats, run the man 3 crypt command in the CLI to view the crypt man pages.
Authentication 107

NOTE: The built-in System file provider includes services to list, manage, and authenticate against system accounts such as root, admin, and nobody. It is recommended that you do not modify the System file provider.
Local provider
The local provider provides authentication and lookup facilities for user accounts added by an administrator. Local authentication is useful when Active Directory, LDAP, or NIS directory services are not configured or when a specific user or application needs access to the cluster. Local groups can include built-in groups and Active Directory groups as members. In addition to configuring network-based authentication sources, you can manage local users and groups by configuring a local password policy for each node in the cluster. OneFS settings specify password complexity, password age and re-use, and password-attempt lockout policies.
Multi-factor Authentication (MFA)
Multi-factor authentication (MFA) is a method of computer access control in which you are only granted access after successfully presenting several separate pieces of evidence to an authentication mechanism. Typically, authentication uses at least two of the following categories: Knowledge (something you know); possession (something you have), and inherence (something you are). MFA is a great way to increase the security of a cluster. Increasing the security of privileged account access (For example, administrators) to a cluster is the best way to prevent unauthorized access. MFA enables the LSASS daemon to require and accept multiple forms of credentials other than a username or password combination for some forms of authentication. There exist many ways to implement MFA with the most common being public or private key authentication. The MFA feature adds PAPI support for SSH configuration using public keys that are stored in LDAP and Multi-Factor Authentication support for SSH through the Duo security platform. Duo MFA supports the Duo App, SMS, and Voice. The use of Duo requires an account with the Duo service. Duo provides a host, ikey, and skey to use for configuration (skey should be treated as a secure credential). Duo MFA is on top of existing password and/or public key requirements. If the SSH configuration type is set to any or custom, Duo cannot be configured. Only specific users or groups may be enabled to bypass MFA if specified on the Duo server. Duo enables the creation of one time or date/time limited bypass keys for a specific user. Also, the bypass keys can be permanent.
Multi-instance active directory
If you are a zone-local administrator, you can create your own AD instance, even if the AD instance for the same domain is already created globally or in another access zone. Previously, only one connection to Active Directory was enabled, and the name of the Active Directory provider had to be the same as the name of the domain to which it was connecting. With the introduction of zone-local authentication providers, zone-local administrators can create their own Active Directory provider, and be able to modify its parameters. To perform this action, you must do two things:  Create a new provider instance name for this provider  Create a new machine account for this provider connection An AD provider may have a name different than its domain name, using -instance. Then commands can use the instance name to find the particular AD provider. Each access zone can have only one AD provider.
LDAP public keys
You can now use public SSH keys from LDAP rather than that of user's home directory on the OneFS cluster. The LDAP create and modify commands support the --ssh-public-key-attribute option. You can view your public key by adding --show-ssh-key.
108 Authentication

Multiple keys may be specified in the LDAP configuration. The key that corresponds to the private key that is presented in the ssh session is used. Nonetheless, you need a home directory on the cluster or you could get an error when you log in.
Managing Active Directory providers
You can view, configure, modify, and delete Active Directory providers. OneFS includes a Kerberos configuration file for Active Directory in addition to the global Kerberos configuration file, both of which you can configure through the command-line interface. You can discontinue authentication through an Active Directory provider by removing it from all access zones that are using it.
Configure an Active Directory provider
You can configure one or more Active Directory providers, each of which must be joined to a separate Active Directory domain. By default, when you configure an Active Directory provider, it is automatically added to the System access zone.
NOTE: Consider the following information when you configure an Active Directory (AD) provider:  When you join Active Directory from OneFS, cluster time is updated from the Active Directory server, as long as an NTP
server has not been configured for the cluster.  The Active Directory provider must be associated with a groupnet.  The Active Directory domain can be resolved to an IPv4 or an IPv6 address. Run the isi auth ads create command to create an Active Directory provider by specifying the domain name of the Active Directory server and the name of an AD user that has permission to join machines to the AD domain. The following command specifies adserver.company.com as the fully-qualified domain name of the Active Directory server to be created in the system, specifies "administrator" as the AD user that has permission to join the cluster to the AD domain, and associates the provider with groupnet3: isi auth ads create --name=adserver.company.com \ --user=administrator --groupnet=groupnet3
Modify an Active Directory provider
You can modify the advanced settings for an Active Directory provider. Run the following command to modify an Active Directory provider, where <provider-name> is a placeholder for the name of the provider that you want to modify.
isi auth ads modify <provider-name>
Delete an Active Directory provider
When you delete an Active Directory provider, you disconnect the cluster from the Active Directory domain that is associated with the provider, disrupting service for users who are accessing it. After you leave an Active Directory domain, users can no longer access the domain from the cluster. Run the following command to Delete an Active Directory provider, where <name> is a placeholder for the Active Directory name that you want to delete.
isi auth ads delete <name>
Authentication 109

Managing LDAP providers
You can view, configure, modify, and delete LDAP providers. You can discontinue authentication through an LDAP provider by removing it from all access zones that are using it.
Configure an LDAP provider
By default, when you configure an LDAP provider, it is automatically added to the System access zone. Run the isi auth ldap create command to create an LDAP provider. The following command creates an LDAP provider called test-ldap and associates it with groupnet3. The command also sets a base distinguished name, which specifies the root of the tree in which to search for identities, and specifies ldap:// 2001:DB8:170:7cff::c001 as the server URI:
isi auth ldap create test-ldap \ --base-dn="dc=test-ldap,dc=example,dc=com" \ --server-uris="ldap://[2001:DB8:170:7cff::c001]" \ --groupnet=groupnet3
NOTE: The base distinguished name is specified as a sequence of relative distinguished name values, separated by commas. Specify a base distinguished name if the LDAP server allows anonymous queries. The following command creates an LDAP provider called test-ldap and associates it with groupnet3. It also specifies a bind distinguished name and bind password, which are used to join the LDAP server, and specifies ldap://test-ldap.example.com as the server URI:
isi auth ldap create test-ldap \ --base-dn="dc=test-ldap,dc=example,dc=com" \ --bind-dn="cn=test,ou=users,dc=test-ldap,dc=example,dc=com" \ --bind-password="mypasswd" \ --server-uris="ldap://test-ldap.example.com" \ --groupnet=groupnet3
NOTE: The bind distinguished name is specified as a sequence of relative distinguished name values, separated by commas, and must have the proper permissions to join the LDAP server to the cluster. Specify a bind distinguished name if the LDAP server does not allow anonymous queries.
Modify an LDAP provider
You can modify any setting for an LDAP provider except its name. You must specify at least one server for the provider to be enabled. Run the following command to modify an LDAP provider, where <provider-name> is a placeholder for the name of the provider that you want to modify:
isi auth ldap modify <provider-name>
Delete an LDAP provider
When you delete an LDAP provider, it is removed from all access zones. As an alternative, you can stop using an LDAP provider by removing it from each access zone that contains it so that the provider remains available for future use. For information about the parameters and options that are available for this procedure, run the isi auth ldap delete --help command. Run the following command to delete an LDAP provider, where <name> is a placeholder for the name of the LDAP provider that you want to delete.
isi auth ldap delete <name>
110 Authentication

Configure the LDAP provider to use TLS connections
This procedure describes how to configure LDAP to use TLS connections. This requires some manual setup by a cluster administrator. 1. Obtain the X.509 Certificate Authority (CA) file for the LDAP server and upload it to the cluster. 2. Move the CA file to a directory under /ifs to distribute the file, such as /ifs/ldap-ca.pem. 3. Run the following command to copy the CA file locally to every node in the cluster, assuming the file is in the /ifs/ldap-
ca.pem directory: isi_for_array cp /ifs/ldap-ca.pem /etc/ssl/ldap-ca.pem
4. Optional: Run the following command to remove the ldap-ca.pem file that was created under /ifs since it is no longer needed: rm /ifs/ldap-ca.pem
5. Configure the LDAP provider to use the X.509 CA, where <LDAP URL> is the LDAP server: isi auth ldap {create | modify} --certificate-authority-file=/etc/ssl/ldap-ca.pem isi auth ldap create --name=tlsldap \ --server-uris=ldaps://<LDAP URL> --base-dn=dc=example,dc=com \ --bind-dn=cn=admin,dc=example,dc=com \ --set-bind-password \ --certificate-authority-file=/etc/ssl/ldap-ca.pem
6. Test the new configuration by listing some LDAP users: isi auth users list --provider=ldap:tlsldap --limit=10
Managing NIS providers
You can view, configure, and modify NIS providers or delete providers that are no longer needed. You can discontinue authentication through an NIS provider by removing it from all access zones that are using it.
Configure an NIS provider
You can configure multiple NIS providers, each with its own settings, and add them to one or more access zones. Configure an NIS provider by running the isi auth nis create command. The following example creates an NIS provider called nistest that is associated with groupnet3, specifies nistest.company.com as the NIS server and company.com as the domain:
isi auth nis create nistest --groupnet=groupnet3\ --servers="nistest.example.com" --nis-domain="example.com"
Modify an NIS provider
You can modify any setting for an NIS provider except its name. You must specify at least one server for the provider to be enabled. Run the following command to modify an NIS provider, where <provider-name> is a placeholder for provider that you want to modify.
isi auth nis modify <provider-name>
Authentication 111

Delete an NIS provider
When you delete an NIS provider, it is removed from all access zones. As an alternative, you can stop using an NIS provider by removing it from each access zone that contains it, so that the provider remains available for future use. Run the following command to delete an NIS provider, where <name> is a placeholder for the name of the NIS provider that you want to delete.
isi auth nis delete <name>
Managing MIT Kerberos authentication
You can configure an MIT Kerberos provider for authentication without Active Directory. Configuring an MIT Kerberos provider involves creating an MIT Kerberos realm, creating a provider, and joining a predefined realm. Optionally, you can configure an MIT Kerberos domain for the provider. You can also update the encryption keys if there are any configuration changes to the Kerberos provider. You can include the provider in one or more access zones.
Managing MIT Kerberos realms
An MIT Kerberos realm is an administrative domain that defines the boundaries within which an authentication server has the authority to authenticate a user or service. You can create, view, edit, or delete a realm. As a best practice, specify a realm name using uppercase characters.
Create an MIT Kerberos realm
You can create an MIT Kerberos realm by defining a Key Distribution Center (KDC) and an administrative server. You must be a member of a role that has ISI_PRIV_AUTH privileges to create an MIT Kerberos realm. Run the isi auth krb5 realm create command to create an MIT Kerberos realm. The following command creates an MIT Kerberos realm called TEST.COMPANY.COM, specifies admin.test.company.com as the administrative server, and specifies keys.test.company.com as a key distribution center:
isi auth krb5 realm create --realm=TEST.COMPANY.COM \ --kdc=keys.test.company.com --admin-server=admin.test.company.com
The realm name is case-sensitive and must be specified in uppercase letters. The administrative server and key distribution center can be specified as an IPv4 address, an IPv6 address, or a hostname.
Modify an MIT Kerberos realm
You can modify an MIT Kerberos realm by modifying the Key Distribution Center (KDC), the domain (optional), and the administrative server settings for that realm. You must be a member of a role that has ISI_PRIV_AUTH privileges to delete an MIT Kerberos provider. Run the isi auth krb5 realm modify command to modify an MIT Kerberos realm. The following command modifies the MIT Kerberos realm called TEST.COMPANY.COM by adding a KDC specified as an IPv6 address:
isi auth krb5 realm modify --realm=TEST.COMPANY.COM \ --kdc=2001:DB8:170:7cff::c001
The realm name is case-sensitive and must be specified in uppercase letters. The key distribution center can be specified as an IPv4 address, an IPv6 address, or a host name.
112 Authentication

View an MIT Kerberos realm
You can view details related to the name, Key Distribution Centers (KDCs), and the administrative server associated with an MIT Kerberos realm. 1. To view a list of all Kerberos realms configured on the cluster, run the isi auth krb5 realm list command.
The system displays output similar to the following example:
Realm --------------TEST.COMPANY.COM ENGKERB.COMPANY.COM OPSKERB.COMPANY.COM --------------Total: 3
2. To view the setting details for a specific Kerberos realm, run the isi auth krb5 realm view command followed by the realm name. The specified realm name is case-sensitive. The following command displays setting details for the realm called TEST.COMPANY.COM:
isi auth krb realm view TEST.COMPANY.COM
The systems displays output similar to the following example:
Realm: TEST.COMPANY.COM Is Default Realm: Yes
KDC: 2001:DB8:170:7cff::c001, keys.test.company.com Admin Server: admin.test.company.com
NOTE: The KDC and the admin server can be specified as an IPv4 or IPv6 address, or a hostname.
Delete an MIT Kerberos realm
You can delete one or more MIT Kerberos realms and all the associated MIT Kerberos domains. Kerberos realms are referenced by Kerberos providers. Before you can delete a realm for which you have created a provider, you must first delete that provider. You must be a member of a role that has ISI_PRIV_AUTH privileges to delete an MIT Kerberos realm. Run the isi auth krb5 realm delete command to delete an MIT Kerberos realm. For example, run the following command to delete a realm:
isi auth krb5 realm delete <realm>
Managing MIT Kerberos providers
You can create view, delete, or modify an MIT Kerberos provider. You can also configure the Kerberos provider settings.
Creating an MIT Kerberos provider
You can create an MIT Kerberos provider by obtaining the credentials for accessing a cluster through the Key Distribution Center (KDC) of the Kerberos realm. This process is also known as joining a realm. Thus when you create a Kerberos provider you also join a realm that has been previously defined. Depending on how OneFS manages your Kerberos environment, you can create a Kerberos provider through one of the following methods:  Accessing the Kerberos administration server and creating keys for services on the OneFS cluster.  Manually transferring the Kerberos key information in the form of keytabs.
Authentication 113

Create an MIT Kerberos provider and join a realm with administrator credentials
You can create an MIT Kerberos provider and join an MIT Kerberos realm using the credentials authorized to access the Kerberos administration server. You can then create keys for the various services on the cluster. This is the recommended method for creating a Kerberos provider and joining a Kerberos realm. You must be a member of a role that has ISI_PRIV_AUTH privileges to access the Kerberos administration server. Run the isi auth krb5 create command to create a Kerberos provider and join a Kerberos realm; , where <realm> is the name of the Kerberos realm which already exists or is created if it does not exist: The realm name is case-sensitive and must be specified in uppercase letters. In the following example command, the Kerberos realm TEST.COMPANY.COM is created and joined to the provider, which is associated with groupnet3. The command also specifies admin.test.company.com as the administrative server and keys.test.company.com as the KDC, and specifies a username and password that are authorized to access to the administration server:
isi auth krb5 create --realm=TEST.COMPANY.COM \ --user=administrator --password=secretcode \ --kdc=keys.test.company.com \ --admin-server=admin.test.company.com \ --groupnet=groupnet3
NOTE: The KDC and the admin server can be specified as an IPv4 or IPv6 address, or a hostname.
Create an MIT Kerberos provider and join a realm with a keytab file
You can create an MIT Kerberos provider and join an MIT Kerberos realm through a keytab file. Follow this method only if your Kerberos environment is managed by manually transferring the Kerberos key information through the keytab files. Make sure that the following prerequisites are met:  The Kerberos realm must already exist on the cluster  A keytab file must exist on the cluster.  You must be a member of a role that has ISI_PRIV_AUTH privileges to access the Kerberos administration server. Run the isi auth krb5 create command. The following command creates a Kerberos provider that is associated with groupnet3, joins the Kerberos realm called clustername.company.com and specifies a keytab file located at /tmp/krb5.keytab:
isi auth krb5 create cluster-name.company.com \ --keytab-file=/tmp/krb5.keytab --groupnet=groupnet3
View an MIT Kerberos provider
You can view the properties of an MIT Kerberos provider after creating it. Run the following command to view the properties of a Kerberos provider:
isi auth krb5 view <provider-name>
List the MIT Kerberos providers
You can list one or more MIT Kerberos providers and display the list in a specific format. You can also specify a limit for the number of providers to be listed. Run the isi auth krb5 list command to list one or more Kerberos providers.
114 Authentication

For example, run the following command to list the first five Kerberos providers in a tabular format without any headers or footers:
isi auth krb5 list -l 5 --format table --no-header --no-footer
Delete an MIT Kerberos provider
You can delete an MIT Kerberos provider and remove it from all the referenced access zones. When you delete a provider, you also leave an MIT Kerberos realm. You must be a member of a role that has ISI_PRIV_AUTH privileges to delete a Kerberos provider. Run the isi auth krb5 delete command as follows to delete a Kerberos provider.
isi auth krb5 delete <provider-name>
Configure MIT Kerberos provider settings
You can configure the settings of a Kerberos provider to allow the DNS records to locate the Key Distribution Center (KDC), Kerberos realms, and the authentication servers associated with a Kerberos realm. These settings are global to all Kerberos users across all nodes, services, and zones. Some settings are applicable only to client-side Kerberos and are independent of the Kerberos provider. You must be a member of a role that has ISI_PRIV_AUTH privileges to view or modify the settings of a Kerberos provider. 1. Run the isi auth settings krb5 command with the view or modify subcommand. 2. Specify the settings to modify.
Managing MIT Kerberos domains
You can optionally define MIT Kerberos domains to allow additional domain extensions to be associated with an MIT Kerberos realm. You can always specify a default domain for a realm. You can create, modify, delete, and view an MIT Kerberos domain. A Kerberos domain name is a DNS suffix that you specify typically using lowercase characters.
Add an MIT Kerberos domain to a realm
You can optionally add an MIT Kerberos domain to an MIT Kerberos realm to enable additional Kerberos domain extensions to be associated with a Kerberos realm. You must be a member of a role that has ISI_PRIV_AUTH privileges to associate a Kerberos domain with a Kerberos realm. Add a Kerberos domain by running the isi auth krb5 domain create command. For example, run the following command to add a Kerberos domain to a Kerberos realm:
isi auth krb5 domain create <domain>
Modify an MIT Kerberos domain
You can modify an MIT Kerberos domain by modifying the realm settings. You must be a member of a role that has ISI_PRIV_AUTH privileges to modify an MIT Kerberos domain. Run the isi auth krb5 domain modify command to modify a Kerberos domain.
Authentication 115

For example, run the following command to modify a Kerberos domain by specifying an alternate Kerberos realm: isi auth krb5 domain modify <domain> --realm <realm>
View an MIT Kerberos domain mapping
You can view the properties of an MIT Kerberos domain mapping. Run the isi auth krb5 domain view command with a value specified for the <domain> variable to view the properties of a Kerberos domain mapping:
isi auth krb5 domain view <domain>
List MIT Kerberos domains
You can list one or more MIT Kerberos domains and display the list in a tabular, JSON, CSV, or list format. You can also specify a limit for the number of domains to be listed. Run the isi auth krb5 domain list command to list one or more MIT Kerberos domains. For example, run the following command to list the first ten MIT Kerberos domains in a tabular format without any headers or footers:
isi auth krb5 domain list -l=10 --format=table --no-header --no-footer
Delete an MIT Kerberos domain mapping
You can delete one or more MIT Kerberos domain mappings. You must be a member of a role that has ISI_PRIV_AUTH privileges to delete an MIT Kerberos domain mapping. Run the isi auth krb5 domain delete command to delete an MIT Kerberos domain mapping. For example, run the following command to delete a domain mapping:
isi auth krb5 domain delete <domain>
Managing SPNs and keys
A service principal name (SPN) is the name referenced by a client to identify an instance of a service on a cluster. An MIT Kerberos provider authenticates services on a cluster through SPNs. You can perform the following operations on SPNs and their associated keys:  Update the SPNs if there are any changes to the SmartConnect zone settings that are based on those SPNs  List the registered SPNs to compare them against a list of discovered SPNs  Update keys associated with the SPNs either manually or automatically  Import keys from a keytab table  Delete specific key versions or delete all the keys associated with an SPN
View SPNs and keys
You can view the service principal names (SPNs) and their associated keys that are registered for an MIT Kerberos provider. Clients obtain Kerberos tickets and access services on clusters through SPNs and their associated keys. You must be a member of a role that has ISI_PRIV_AUTH privileges to view SPNs and keys. Run the isi auth krb5 spn list command to list one or more SPNs and their associated keys and the Key version numbers (Kvnos).
116 Authentication

For example, run the following command to list the first five SPNs for an MIT Kerberos provider in a tabular format without any headers or footers:
isi auth krb5 list <provider-name> -l 5 --format table --no-header --no-footer <spn-list>
Delete keys
You can delete specific key versions or all the keys associated with a service principal name (SPN). You must be a member of a role that has ISI_PRIV_AUTH privileges to delete keys. After creating new keys due to security reasons or to match configuration changes, follow this procedure to delete older version of the keys so that the keytab table is not populated with redundant keys. Run the isi auth krb5 spn delete command to delete all keys for a specified SPN or a specific version of a key. For example, run the following command to delete all the keys associated with an SPN for an MIT Kerberos provider:
isi auth krb5 spn delete <provider-name> <spn> --all The <provider-name> is the name of the MIT Kerberos provider. You can delete a specific version of the key by specifying a key version number value for the kvno argument and including that value in the command syntax.
Manually add or update a key for an SPN
You can manually add or update keys for a service principal name (SPN). This process creates a new key for the specified SPN. You must be a member of a role that has ISI_PRIV_AUTH privileges to add or update a key for an SPN. Run the isi auth krb5 spn create command to add or update keys for an SPN. For example, run the following command to add or update a key for an SPN by specifying the <provider-name>, <user>, and <spn> positional arguments:
isi auth krb5 spn create <provider-name> <user> <spn>
Automatically update an SPN
You can automatically update or add a service principal name (SPN) if it is registered with an MIT Kerberos provider but does not appear in the list of discovered SPNs. You must be a member of a role that has ISI_PRIV_AUTH privileges to automatically update an SPN. 1. Run the isi auth krb5 spn check command to compare the list of registered SPNs against the list of discovered
SPNs. Proceed to the next step if the comparison does not show similar results. 2. Run the isi auth krb5 spn fix command to fix the missing SPNs. For example, run the following command to add missing SPNs for an MIT Kerberos service provider:
isi auth krb5 spn fix <provider-name> <user>
You can optionally specify a password for <user> which is the placeholder for a user who has the permission to join clients to the given domain.
Import a keytab file
An MIT Kerberos provider joined through a legacy keytab file might not have the ability to manage keys through the Kerberos admin credentials. In such a case, import a new keytab file and then add the keytab file keys to the provider. Make sure that the following pre-requisites are met before you import a keytab file:  You must create and copy a keytab file to a node on the cluster where you will perform this procedure.
Authentication 117

 You must be a member of a role that has ISI_PRIV_AUTH privileges to import a keytab file. Import the keys of a keytab file by running the isi auth krb5 spn import command. For example, run the following command to import the keys of the <keytab-file> to the provider referenced as <providername>:
isi auth krb5 spn import <provider-name> <keytab-file>
Managing file providers
You can configure one or more file providers, each with its own combination of replacement files, for each access zone. Password database files, which are also called user database files, must be in binary format. Each file provider pulls directly from up to three replacement database files: a group file that has the same format as /etc/ group; a netgroups file; and a binary password file, spwd.db, which provides fast access to the data in a file that has the /etc/master.passwd format. You must copy the replacement files to the cluster and reference them by their directory path.
NOTE: If the replacement files are located outside the /ifs directory tree, you must distribute them manually to every node in the cluster. Changes that are made to the system provider's files are automatically distributed across the cluster.
Configure a file provider
You can specify replacement files for any combination of users, groups, and netgroups. Run the following command to configure a file provider, where <name> is your name for the file provider.
isi auth file create <name>
Generate a password file
Password database files, which are also called user database files, must be in binary format. This procedure must be performed through the command-line interface (CLI). For command-usage guidelines, run the man pwd_mkdb command. 1. Establish an SSH connection to any node in the cluster. 2. Run the pwd_mkdb <file> command, where <file> is the location of the source password file.
NOTE: By default, the binary password file, spwd.db, is created in the /etc directory. You can override the location to store the spwd.db file by specifying the -d option with a different target directory.
The following command generates an spwd.db file in the /etc directory from a password file that is located at /ifs/ test.passwd:
pwd_mkdb /ifs/test.passwd The following command generates an spwd.db file in the /ifs directory from a password file that is located at /ifs/ test.passwd:
pwd_mkdb -d /ifs /ifs/test.passwd
Modify a file provider
You can modify any setting for a file provider, including its name. NOTE: Although you can rename a file provider, there are two caveats: you can rename a file provider through only the web administration interface and you cannot rename the System file provider.
118 Authentication

Run the following command to modify a file provider, where <provider-name> is a placeholder for the name that you supplied for the provider.
isi auth file modify <provider-name>

Delete a file provider
To stop using a file provider, you can clear all of its replacement file settings or you can permanently delete the provider. NOTE: You cannot delete the System file provider.
Run the following command to delete a file provider, where <name> is a placeholder for the name of the provider that you want to delete.
isi auth file delete <name>

Password file format

The file provider uses a binary password database file, spwd.db. You can generate a binary password file from a master.passwd-formatted file by running the pwd_mkdb command.
The master.passwd file contains ten colon-separated fields, as shown in the following example:

admin:*:10:10::0:0:Web UI Administrator:/ifs/home/admin:/bin/zsh

The fields are defined below in the order in which they appear in the file.
NOTE: UNIX systems often define the passwd format as a subset of these fields, omitting the Class, Change, and Expiry fields. To convert a file from passwd to master.passwd format, add :0:0: between the GID field and the Gecos field.

Username
Password
UID
GID
Class Change Expiry Gecos Home Shell

The user name. This field is case-sensitive. OneFS does not limit the length; many applications truncate the name to 16 characters, however.
The user's encrypted password. If authentication is not required for the user, you can substitute an asterisk (*) for a password. The asterisk character is guaranteed to not match any password.
The UNIX user identifier. This value must be a number in the range 0-4294967294 that is not reserved or already assigned to a user. Compatibility issues occur if this value conflicts with an existing account's UID.
The group identifier of the user's primary group. All users are a member of at least one group, which is used for access checks and can also be used when creating files.
This field is not supported by OneFS and should be left empty.
OneFS does not support changing the passwords of users in the file provider. This field is ignored.
OneFS does not support the expiration of user accounts in the file provider. This field is ignored.
This field can store a variety of information but is usually used to store the user's full name.
The absolute path to the user's home directory.
The absolute path to the user's shell. If this field is set to /sbin/nologin, the user is denied commandline access.

Group file format
The file provider uses a group file in the format of the /etc/group file that exists on most UNIX systems. The group file consists of one or more lines containing four colon-separated fields, as shown in the following example:
admin:*:10:root,admin

Authentication 119

The fields are defined below in the order in which they appear in the file.

Group name Password GID
Group members

The name of the group. This field is case-sensitive. Although OneFS does not limit the length of the group name, many applications truncate the name to 16 characters.
This field is not supported by OneFS and should contain an asterisk (*).
The UNIX group identifier. Valid values are any number in the range 0-4294967294 that is not reserved or already assigned to a group. Compatibility issues occur if this value conflicts with an existing group's GID.
A comma-delimited list of user names.

Netgroup file format
A netgroup file consists of one or more netgroups, each of which can contain members. Hosts, users, or domains, which are members of a netgroup, are specified in a member triple. A netgroup can also contain another netgroup. Each entry in a netgroup file consists of the netgroup name, followed by a space-delimited set of member triples and nested netgroup names. If you specify a nested netgroup, it must be defined on a separate line in the file. A member triple takes the following form:
(<host>, <user>, <domain>)
Where <host> is a placeholder for a machine name, <user> is a placeholder for a user name, and <domain> is a placeholder for a domain name. Any combination is valid except an empty triple: (,,). The following sample file contains two netgroups. The rootgrp netgroup contains four hosts: two hosts are defined in member triples and two hosts are contained in the nested othergrp netgroup, which is defined on the second line.
rootgrp (myserver, root, somedomain.com) (otherserver, root, somedomain.com) othergrp othergrp (other-win,, somedomain.com) (other-linux,, somedomain.com)
NOTE: A new line signifies a new netgroup. You can continue a long netgroup entry to the next line by typing a backslash character (\) in the right-most position of the first line.

Managing local users and groups
When you create an access zone, each zone includes a local provider that allows you to create and manage local users and groups. Although you can view the users and groups of any authentication provider, you can create, modify, and delete users and groups in the local provider only.
View a list of users and groups by provider
You can view users and groups by a provider type. 1. Run the following command to view a list of users and groups for a specified provider, where <provider-type> is a
placeholder for your provider-type string and <provider-name> is a placeholder for the name that you assigned the specific provider:
isi auth users list --provider="<provider-type>:<provider-name>"
2. To list users and groups for an LDAP provider type that is named Unix LDAP, run a command similar to the following example:
isi auth users list --provider="lsa-ldap-provider:Unix LDAP"

120 Authentication

Create a local user
Each access zone includes a local provider that allows you to create and manage local users and groups. When creating a local user account, you can configure its name password, home directory, UNIX user identifier (UID), UNIX login shell, and group memberships. Run the following command to create a local user, where <name> is your name for the user, <provider-name> specifies the provider for this user, and <string> is the password for this user.
isi auth users create <name> --provider="local:<provider-name>" \ --password="<string>"
NOTE: A user account is disabled if no password is specified. If you do not create a password when you create the user account, you can add a password later by running the isi auth users modify command, specifying the appropriate user by username, UID, or SID.
Create a local group
In the local provider of an access zone, you can create groups and assign members to them. Run the following command to create a local group, where <name> and <provider-name> are values that you provide to define the group.
isi auth groups create <name> --provider "local:<provider-name>"
Naming rules for local users and groups
Local user and group names must follow naming rules in order to ensure proper authentication and access to the cluster. You must adhere to the following naming rules when creating and modifying local users and groups:  The maximum name length is 104 characters. It is recommended that names do not exceed 64 characters.  Names cannot contain the following invalid characters:
" / \ [ ] : ; | = , + * ? < >  Names can contain any special character that is not in the list of invalid characters. It is recommend that names do not
contain spaces.  Names are not case sensitive.
Configure or modify a local password policy
You can configure and modify a local password policy for a local provider. This procedure must be performed through the command-line interface (CLI).
NOTE: Separate password policies are configured for each access zone. Each access zone in the cluster contains a separate instance of the local provider, which allows each access zone to have its own list of local users who can authenticate. Password complexity is configured for each local provider, not for each user. 1. Establish an SSH connection to any node in the cluster. 2. Optional: Run the following command to view the current password settings: isi auth local view system
3. Run the isi auth local modify command, choosing from the parameters described in Local password policy default settings.
Authentication 121

The --password-complexity parameter must be specified for each setting.
isi auth local modify system --password-complexity=lowercase \ --password-complexity=uppercase -­password-complexity=numeric \ --password-complexity=symbol
The following command configures a local password policy for a local provider:
isi auth local modify <provider-name> \ --min-password-length=20 \ --lockout-duration=20m \ --lockout-window=5m \ --lockout-threshold=5 \ --add-password-complexity=uppercase \ --add-password-complexity=numeric

Local password policy settings

You can configure local password policy settings and specify the default for each setting through the isi auth local modify command. Password complexity increases the number of possible passwords that an attacker must check before the correct password is guessed.

Setting min-password-length password-complexity
min-password-age max-password-age password-history-length
lockout-duration
lockout-threshold

Description

Comments

Minimum password length in characters.

Long passwords are best. The minimum length should not be so long that users have a difficult time entering or remembering the password.

A list of cases that a new password must contain. By default, the list is empty.

You can specify as many as four cases. The following cases are valid:
 uppercase  lowercase  numeric  symbol (excluding # and @)

The minimum password age. You can A minimum password age ensures that a user

set this value using characters for cannot enter a temporary password and then

units; for example, 4W for 4 weeks, immediately change it to the previous password.

2d for 2 Days.

Attempts to check or set a password before the

time expires are denied.

The maximum password age. You can set this value using characters for units; for example, 4W for 4 weeks, 2d for 2 Days.

Attempts to login after a password expires forces a password change. If a password change dialog cannot be presented, the user is not allowed to login.

The number of historical passwords to keep. New passwords are checked against this list and rejected if the password is already present. The max history length is 24.

To avoid recycling of passwords, you can specify the number of previous passwords to remember. If a new password matches a remembered previous password, it is rejected.

The length of time in seconds that an account is locked after a configurable number of bad passwords are entered.

After an account is locked, it is unavailable from all sources until it is unlocked. OneFS provides two configurable options to avoid administrator interaction for every locked account:
 Specify how much time must elapse before the account is unlocked.
 Automatically reset the incorrect-password counter after a specified time, in seconds.

The number of incorrect password After an account is locked, it is unavailable from

attempts before an account is

all sources until it is unlocked.

122 Authentication

Setting lockout-window

Description
locked. A value of zero disables account lockout.
The time that elapses before the incorrect password attempts count is reset.

Comments
If the configured number of incorrect password attempts is reached, the account is locked and lockout-duration determines the length of time that the account is locked. A value of zero disables the window.

Modify a local user
You can modify any setting for a local user account except the user name. Run the following command to modify a local group, where <name> or <gid> or <sid> are placeholders for the user identifiers and <provider-name> is a placeholder for the name of the local provider associated with the user:
isi auth users modify (<name> or --gid <gid> or --sid <sid>) \ --provider "local:<provider-name>"

Modify a local group
You can add or remove members from a local group. Run the following command to modify a local group, where <name> or <gid> or <sid> are placeholders for the group identifiers and <provider-name> is a placeholder for the name of the local provider associated with the group:
isi auth groups modify (<name> or --gid <gid> or --sid <sid>) \ --provider "local:<provider-name>"

Delete a local user
A deleted user can no longer access the cluster through the command-line interface, web administration interface, or file access protocol. When you delete a local user account, its home directory remains in place. Run the following command to delete a local user, where <uid> and <sid> are placeholders for the UID and SID of the user that you want to delete, and <provider-name> is a placeholder for the local provider associated with the user.
isi auth users delete <name> --uid <uid> --sid <sid> \ --provider "local:<provider-name>"

Delete a local group
You can delete a local group even if members are assigned to it. Deleting a group does not affect the members of that group. Run the following command to delete a local group, where <group> is a placeholder for the name of the group that you want to delete:
isi auth groups delete <group>
NOTE: You can run the command with <gid> or <sid> instead of <group>.

SSH Authentication and Configuration
Multi-Factor Authentication (MFA) adds PAPI support for SSH configuration using public keys stored in LDAP and Multi-Factor Authentication support for SSH via the Duo security platform.

Authentication 123

Pre-requisites for Multi-factor Authentication (MFA)
In order to successfully authenticate through the MFA feature, the following must be true regarding the target user:  The user identity on cluster must belong to a role that enables SSH access.  The auth-settings-template SSH setting must be set to anything but any or custom.  If the user-auth-method SSH setting is set to publickey, all users that need SSH access must have a valid public key value
for sshPublicKey in their LDAP entry.  If the user-auth-method SSH setting is set to password, all users that need SSH access must have a valid password value
for userPassword in their LDAP entry. Also, the host, ikey, and skey must be set. You must set the enabled option in the isi auth duo command.
NOTE: This feature will only work as expected if the previous conditions are met. If any of the conditions above are not met, you could risk locking yourself out of your node.
SSH configuration using password
You can provide CLI support for SSH configuration using a password. 1. To provide support for SSH configuration using a password, run the isi ssh settings modify command.
isi ssh settings modify --auth-settings-template=password 2. To configure the Duo security platform, run the isi auth duo modify command.
isi auth duo modify --ikey=<key> --host=api-example.duosecurity.com
You are asked to enter the secret key and confirm the same.
Enter skey: Confirm:
3. To enable the Duo provider, run the isi auth duo modify command.
isi auth duo modify --enabled=true 4. To establish an SSH connection to a cluster node, run the following commands:
$ ssh [email protected] Duo two-factor login for someuser Enter a passcode or select one of the following options: 1. Duo Push to XXX-XXX-XXXX 2. Phone call to XXX-XXX-XXXX 3. SMS passcodes to XXX-XXX-XXXX Passcode or option (1-3): 1
If the SSH connection is established, it logs you in and asks for the password.
Success. Logging you in... Password:
SSH Configuration using public keys
You can provide CLI support for SSH configuration using public keys that are stored in LDAP. 1. To provide support for SSH configuration using a public key, run the isi ssh settings modify command.
isi ssh settings modify --auth-settings-template=publickey 2. To configure the Duo security platform, run the isi auth duo modify command.
isi auth duo modify --ikey=<key> --host=api-example.duosecurity.com
124 Authentication

You are asked to enter the secret key and confirm the same. Enter skey: Confirm:
3. To enable the Duo provider, run the isi auth duo modify command. isi auth duo modify --enabled=true
4. To establish an SSH connection to a cluster node, run the following commands: $ ssh [email protected] -i<location of the public key> Duo two-factor login for someuser Enter a passcode or select one of the following options: 1. Duo Push to XXX-XXX-XXXX 2. Phone call to XXX-XXX-XXXX 3. SMS passcodes to XXX-XXX-XXXX Passcode or option (1-3): 1
If the SSH connection is established, it logs you in. Success. Logging you in...
Authentication 125

7
Administrative roles and privileges
This section contains the following topics:
Topics:
· Role-based access · Roles · Privileges · Managing roles
Role-based access
You can assign role-based access to delegate administrative tasks to selected users. Role based access control (RBAC) allows the right to perform particular administrative actions to be granted to any user who can authenticate to a cluster. Roles are created by a Security Administrator, assigned privileges, and then assigned members. All administrators, including those given privileges by a role, must connect to the System zone to configure the cluster. When these members log in to the cluster through a configuration interface, they have these privileges. All administrators can configure settings for access zones, and they always have control over all access zones on the cluster. Roles also give you the ability to assign privileges to member users and groups. By default, only the root user and the admin user can log in to the web administration interface through HTTP or the command-line interface through SSH. Using roles, the root and admin users can assign others to built-in or custom roles that have login and administrative privileges to perform specific administrative tasks.
NOTE: As a best practice, assign users to roles that contain the minimum set of necessary privileges. For most purposes, the default permission policy settings, system access zone, and built-in roles are sufficient. You can create role-based access management policies as necessary for your particular environment.
Roles
You can permit and limit access to administrative areas of your cluster on a per-user basis through roles. OneFS includes several built-in administrator roles with predefined sets of privileges that cannot be modified. You can also create custom roles and assign privileges. The following list describes what you can and cannot do through roles:  You can assign privileges to a role.  You can assign privileges to a role as read-only, even if the privilege is read/write by default.  You can create custom roles and assign privileges to those roles.  You can copy an existing role.  You can add any user or group of users, including well-known groups, to a role as long as the users can authenticate to the
cluster.  You can add a user or group to more than one role.  You cannot assign privileges directly to users or groups.
NOTE: When OneFS is first installed, only users with root- or admin-level access can log in and assign users to roles.
Custom roles
Custom roles supplement built-in roles. You can create custom roles and assign privileges mapped to administrative areas in your cluster environment. For example, you can create separate administrator roles for security, auditing, storage provisioning, and backup.
126 Administrative roles and privileges

You can designate certain privileges as read-only or read/write when adding the privilege to a role. You can modify this option at any time to add or remove privileges as user responsibilities grow and change.
Built-in roles
Built-in roles are included in OneFS and have been configured with the most likely privileges necessary to perform common administrative functions. You cannot modify the list of privileges assigned to each built-in role; however, you can assign users and groups to built-in roles.

SecurityAdmin built-in role

The SecurityAdmin built-in role enables security configuration on the cluster, including authentication providers, local users and groups, and role membership.

Privileges ISI_PRIV_LOGIN_CONSOLE ISI_PRIV_LOGIN_PAPI ISI_PRIV_LOGIN_SSH ISI_PRIV_AUTH ISI_PRIV_ROLE

Read/write access N/A N/A N/A Read/write Read/write

SystemAdmin built-in role

The SystemAdmin built-in role enables administration of all cluster configuration that is not specifically handled by the SecurityAdmin role.

Privileges ISI_PRIV_LOGIN_CONSOLE ISI_PRIV_LOGIN_PAPI ISI_PRIV_LOGIN_SSH ISI_PRIV_SYS_SHUTDOWN ISI_PRIV_SYS_SUPPORT ISI_PRIV_SYS_TIME ISI_PRIV_SYS_UPGRADE ISI_PRIV_ANTIVIRUS ISI_PRIV_AUDIT ISI_PRIV_CLOUDPOOLS ISI_PRIV_CLUSTER ISI_PRIV_DEVICES ISI_PRIV_EVENT ISI_PRIV_FILE_FILTER ISI_PRIV_FTP ISI_PRIV_HARDENING ISI_PRIV_HDFS ISI_PRIV_HTTP

Read/write access N/A N/A N/A N/A N/A Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write

Administrative roles and privileges 127

Privileges ISI_PRIV_JOB_ENGINE ISI_PRIV_LICENSE ISI_PRIV_MONITORING ISI_PRIV_NDMP ISI_PRIV_NETWORK ISI_PRIV_NFS ISI_PRIV_NTP ISI_PRIV_QUOTA ISI_PRIV_REMOTE_SUPPORT ISI_PRIV_SMARTPOOLS ISI_PRIV_SMB ISI_PRIV_SNAPSHOT ISI_PRIV_SNMP ISI_PRIV_STATISTICS ISI_PRIV_SWIFT ISI_PRIV_SYNCIQ ISI_PRIV_VCENTER ISI_PRIV_WORM ISI_PRIV_NS_TRAVERSE ISI_PRIV_NS_IFS_ACCESS

Read/write access Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write N/A N/A

AuditAdmin built-in role

The AuditAdmin built-in role enables you to view all system configuration settings.
Because the AuditAdmin built-in role is designed only for viewing system configuration settings, normally read-write privileges are granted as read-only.

Privileges ISI_PRIV_LOGIN_CONSOLE ISI_PRIV_LOGIN_PAPI ISI_PRIV_LOGIN_SSH ISI_PRIV_SYS_TIME ISI_PRIV_SYS_UPGRADE ISI_PRIV_ANTIVIRUS ISI_PRIV_AUDIT ISI_PRIV_CLOUDPOOLS ISI_PRIV_CLUSTER ISI_PRIV_DEVICES ISI_PRIV_EVENT ISI_PRIV_FILE_FILTER

Read/write access N/A N/A N/A Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only

128 Administrative roles and privileges

Privileges ISI_PRIV_FTP ISI_PRIV_HARDENING ISI_PRIV_HDFS ISI_PRIV_HTTP ISI_PRIV_JOB_ENGINE ISI_PRIV_LICENSE ISI_PRIV_MONITORING SI_PRIV_NDMP ISI_PRIV_NETWORK ISI_PRIV_NFS ISI_PRIV_NTP ISI_PRIV_QUOTA ISI_PRIV_REMOTE_SUPPORT ISI_PRIV_SMARTPOOLS ISI_PRIV_SMB ISI_PRIV_SNAPSHOT ISI_PRIV_SNMP ISI_PRIV_STATISTICS ISI_PRIV_SWIFT ISI_PRIV_SYNCIQ ISI_PRIV_VCENTER ISI_PRIV_WORM

Read/write access Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only Read-only

BackupAdmin built-in role

The BackupAdmin built-in role enables backup and restore of files from /ifs.

Privileges ISI_PRIV_IFS_BACKUP ISI_PRIV_IFS_RESTORE

Read/write access Read-only Read/write

VMwareAdmin built-in role

The VMwareAdmin built-in role enables remote administration of storage needed by VMware vCenter.

Privileges ISI_PRIV_LOGIN_PAPI ISI_PRIV_NETWORK ISI_PRIV_SMARTPOOLS ISI_PRIV_SNAPSHOT ISI_PRIV_SYNCIQ

Read/write access N/A Read/write Read/write Read/write Read/write

Administrative roles and privileges 129

Privileges ISI_PRIV_VCENTER ISI_PRIV_NS_TRAVERSE ISI_PRIV_NS_IFS_ACCESS

Read/write access Read/write N/A N/A

Privileges

Privileges permit users to complete tasks on a cluster. Privileges are associated with an area of cluster administration such as Job Engine, SMB, or statistics. Privileges have one of two forms:

Action Read/Write

Allows a user to perform a specific action on a cluster. For example, the ISI_PRIV_LOGIN_SSH privilege allows a user to log in to a cluster through an SSH client.
Allows a user to view or modify a configuration subsystem such as statistics, snapshots, or quotas. For example, the ISI_PRIV_SNAPSHOT privilege allows an administrator to create and delete snapshots and snapshot schedules. A read/write privilege can grant either read-only or read/write access. Read-only access allows a user to view configuration settings; read/write access allows a user to view and modify configuration settings.

Privileges are granted to the user on login to a cluster through the OneFS API, the web administration interface, SSH, or a console session. A token is generated for the user, which includes a list of all privileges that are granted to the user. Each URI, web-administration interface page, and command requires a specific privilege to view or modify the information available through any of these interfaces.
Sometimes, privileges cannot be granted or there are privilege limitations.
 Privileges are not granted to users that do not connect to the System Zone during login or to users that connect through the deprecated Telnet service, even if they are members of a role.
 Privileges do not provide administrative access to configuration paths outside of the OneFS API. For example, the ISI_PRIV_SMB privilege does not grant a user the right to configure SMB shares using the Microsoft Management Console (MMC).
 Privileges do not provide administrative access to all log files. Most log files require root access.
The privilege ISI_PRIV_RESTRICTED_AUTH provides limited administrative privileges. Administrators with the ISI_PRIV_RESTRICTED_AUTH privilege can modify only those users and groups that have the same or less privilege. For example, a help desk administrator can be granted ISI_PRIV_RESTRICTED_AUTH privileges to enable performing basic user and group management operations without having the full abilities of the ISI_PRIV_AUTH privilege.

Supported OneFS privileges
Privileges supported by OneFS are categorized by the type of action or access that is granted to the user--for example, login, security, and configuration privileges.

Login privileges

The login privileges listed in the following table either allow the user to perform specific actions or grants read or write access to an area of administration on the cluster.

Privilege ISI_PRIV_LOGIN_CONSOLE ISI_PRIV_LOGIN_PAPI
ISI_PRIV_LOGIN_SSH

Description Log in from the console.
Log in to the Platform API and the web administration interface.
Log in through SSH.

Type Action Action
Action

130 Administrative roles and privileges

System privileges

The system privileges listed in the following table either allow the user to perform specific actions or grants read or write access to an area of administration on the cluster.

Privilege ISI_PRIV_SYS_SHUTDOWN ISI_PRIV_SYS_SUPPORT ISI_PRIV_SYS_TIME ISI_PRIV_SYS_UPGRADE

Description Shut down the system. Run cluster diagnostic tools. Change the system time. Upgrades the OneFS system.

Type Action Action Read/write Read/write

Security privileges

The security privileges listed in the following table either allow the user to perform specific actions or grants read or write access to an area of administration on the cluster.

Privilege ISI_PRIV_AUTH ISI_PRIV_RESTRICTED_AUTH
ISI_PRIV_ROLE

Description

Type

Configure external authentication

Read/write

providers, including root-level accounts.

Find and list users, set user passwords, unlock user accounts, add and remove users and groups. Administrators with this privilege can modify only users and groups that have the same or less privilege.

Read/write

Create new roles and assign privileges, Read/write including root-level accounts.

Configuration privileges

The configuration privileges listed in the following table either allow the user to perform specific actions or grant read or write access to an area of administration on the cluster.

Privilege ISI_PRIV_ANTIVIRUS ISI_PRIV_AUDIT ISI_PRIV_CERTIFICATE ISI_PRIV_CLOUDPOOLS ISI_PRIV_CLUSTER
ISI_PRIV_DEVICES ISI_PRIV_EVENT ISI_PRIV_FILE_FILTER ISI_PRIV_FTP ISI_PRIV_GET_SET ISI_PRIV_HARDENING ISI_PRIV_HDFS ISI_PRIV_HTTP

Description Configure antivirus scanning. Configure audit capabilities. Configure cluster TLS certificates. Configure CloudPools. Configure cluster identity and general settings. Create new roles and assign privileges. View and modify system events. Configure file filtering settings. Configure FTP server. View and set per-file OneFS metadata. Harden cluster security profile. Configure HDFS server. Configure HTTP server.

Type Read/write Read/write Read/write Read/write Read/write
Read/write Read/write Read/write Read/write Read/write Read/write Read/write Read/write

Administrative roles and privileges 131

Privilege ISI_PRIV_IPMI
ISI_PRIV_JOB_ENGINE ISI_PRIV_KEY_MANAGER ISI_PRIV_LICENSE ISI_PRIV_MONITORING
ISI_PRIV_NDMP ISI_PRIV_NETWORK ISI_PRIV_NFS ISI_PRIV_NTP ISI_PRIV_PERFORMANCE
ISI_PRIV_QUOTA ISI_PRIV_REMOTE_SUPPORT ISI_PRIV_S3 ISI_PRIV_SMARTPOOLS ISI_PRIV_SMB ISI_PRIV_SNAPSHOT ISI_PRIV_SNMP ISI_PRIV_STATISTICS ISI_PRIV_SWIFT ISI_PRIV_SYNCIQ ISI_PRIV_VCENTER ISI_PRIV_WORM

Description

Type

Configure remote IPMI management settings.

Read/write

Schedule cluster-wide jobs.

Read/write

Configure key management settings.

Read/write

Activate OneFS software licenses.

Read/write

Register applications monitoring the cluster.

Read/write

Configure NDMP server.

Read/write

Configure network interfaces.

Read/write

Configure the NFS server.

Read/write

Configure NTP.

Read/write

Configure performance resource accounting.

Read/write

Configure file system quotas.

Read/write

Configure remote support.

Read/write

Configure the S3 server.

Read/write

Configure storage pools.

Read/write

Configure the SMB server.

Read/write

Schedule, take, and view snapshots.

Read/write

Configure SNMP server.

Read/write

View file system performance statistics. Read/write

Configure Swift.

Read/write

Configure SyncIQ.

Read/write

Configure VMware for vCenter.

Read/write

Configure SmartLock directories.

Read/write

File access privileges

The file access privileges listed in the following table either allow the user to perform specific actions or grants read or write access to an area of administration on the cluster.

Privilege ISI_PRIV_IFS_BACKUP
ISI_PRIV_IFS_RESTORE
ISI_PRIV_IFS_WORM_DELETE

Description
Back up files from /ifs. NOTE: This privilege circumvents traditional file access checks, such as mode bits or NTFS ACLs.
Restore files from /ifs. NOTE: This privilege circumvents traditional file access checks, such as mode bits or NTFS ACLs.
Perform privileged delete operation on WORM committed files.
NOTE: If you are not logged in through the root user

Type Action
Action
Action

132 Administrative roles and privileges

Privilege ISI_PRIV_ESRS_DOWNLOAD

Description
account, you must also have the ISI_PRIV_NS_IFS_ACCESS privilege.

Type

Schedule file downloads through ESRS. Read/write

Namespace privileges

The namespace privileges listed in the following table either allow the user to perform specific actions or grants read or write access to an area of administration on the cluster.

Privilege ISI_PRIV_NS_TRAVERSE ISI_PRIV_NS_IFS_ACCESS

Description
Traverse and view directory metadata.
Access the /ifs directory through the OneFS API.

Type Action Action

Data backup and restore privileges
You can assign privileges to a user that are explicitly for cluster data backup and restore actions.
Two privileges allow a user to backup and restore cluster data over supported client-side protocols: ISI_PRIV_IFS_BACKUP and ISI_PRIV_IFS_RESTORE.
CAUTION: These privileges circumvent traditional file access checks, such as mode bits or NTFS ACLs.
Most cluster privileges allow changes to cluster configuration in some manner. The backup and restore privileges allow access to cluster data from the System zone, the traversing of all directories, and reading of all file data and metadata regardless of file permissions.
Users assigned these privileges use the protocol as a backup protocol to another machine without generating access-denied errors and without connecting as the root user. These two privileges are supported over the following client-side protocols:
 SMB  NFS  OneFS API  FTP  SSH Over SMB, the ISI_PRIV_IFS_BACKUP and ISI_PRIV_IFS_RESTORE privileges emulate the Windows privileges SE_BACKUP_NAME and SE_RESTORE_NAME. The emulation means that normal file-open procedures are protected by file system permissions. To enable the backup and restore privileges over the SMB protocol, you must open files with the FILE_OPEN_FOR_BACKUP_INTENT option, which occurs automatically through Windows backup software such as Robocopy. Application of the option is not automatic when files are opened through general file browsing software such as Windows File Explorer.
Both ISI_PRIV_IFS_BACKUP and ISI_PRIV_IFS_RESTORE privileges primarily support Windows backup tools such as Robocopy. A user must be a member of the BackupAdmin built-in role to access all Robocopy features, which includes copying file DACL and SACL metadata.
Command-line interface privileges
You can perform most tasks granted by a privilege through the command-line interface (CLI). Some OneFS commands require root access.

Administrative roles and privileges 133

Command-to-privilege mapping

Each CLI command is associated with a privilege. Some commands require root access.

isi command isi antivirus isi audit isi auth, excluding isi auth roles isi auth roles isi batterystatus isi certificate isi cloud isi cluster isi config isi dedupe, excluding isi dedupe stats isi dedupe stats isi devices isi diagnostics isi email isi event isi fc isi file-filter isi filepool isi ftp isi get isi hardening isi hdfs isi http isi ipmi isi job isi license isi ndmp isi network isi nfs isi ntp isi performance isi quota isi readonly isi remotesupport isi s3

Privilege ISI_PRIV_ANTIVIRUS ISI_PRIV_AUDIT ISI_PRIV_AUTH ISI_PRIV_ROLE ISI_PRIV_DEVICES ISI_PRIV_CERTIFICATE ISI_PRIV_CLOUDPOOLS ISI_PRIV_CLUSTER root ISI_PRIV_JOB_ENGINE ISI_PRIV_STATISTICS ISI_PRIV_DEVICES ISI_PRIV_SYS_SUPPORT ISI_PRIV_CLUSTER ISI_PRIV_EVENT ISI_PRIV_NDMP ISI_PRIV_FILE_FILTER ISI_PRIV_SMARTPOOLS ISI_PRIV_FTP root ISI_PRIV_HARDENING ISI_PRIV_HDFS ISI_PRIV_HTTP ISI_PRIV_IPMI ISI_PRIV_JOB_ENGINE ISI_PRIV_LICENSE ISI_PRIV_NDMP ISI_PRIV_NETWORK ISI_PRIV_NFS ISI_PRIV_NTP ISI_PRIV_PERFORMANCE ISI_PRIV_QUOTA ISI_PRIV_DEVICES ISI_PRIV_REMOTE_SUPPORT ISI_PRIV_S3

134 Administrative roles and privileges

isi command isi servicelight isi services isi set isi smb isi snapshot isi snmp isi ssh settings modify isi statistics isi status
isi storagepool isi swift isi sync isi tape isi time isi upgrade isi version isi worm excluding isi worm files delete isi worm files delete isi zone

Privilege ISI_PRIV_DEVICES root root ISI_PRIV_SMB ISI_PRIV_SNAPSHOT ISI_PRIV_SNMP ISI_PRIV_AUTH ISI_PRIV_STATISTICS ISI_PRIV_EVENT ISI_PRIV_DEVICES ISI_PRIV_JOB_ENGINE ISI_PRIV_NETWORK ISI_PRIV_SMARTPOOLS ISI_PRIV_STATISTICS
ISI_PRIV_SMARTPOOLS ISI_PRIV_SWIFT ISI_PRIV_SYNCIQ ISI_PRIV_NDMP ISI_PRIV_SYS_TIME ISI_PRIV_SYS_UPGRADE ISI_PRIV_CLUSTER ISI_PRIV_WORM ISI_PRIV_IFS_WORM_DELETE ISI_PRIV_AUTH

Privilege-to-command mapping

Each privilege is associated with one or more commands. Some commands require root access.

Privilege ISI_PRIV_ANTIVIRUS ISI_PRIV_AUDIT ISI_PRIV_AUTH
ISI_PRIV_CLOUDPOOLS ISI_PRIV_CLUSTER
ISI_PRIV_DEVICES

isi commands isi antivirus isi audit isi auth - excluding isi auth role isi zone
isi cloud isi email isi version
isi batterystatus isi devices isi readonly

Administrative roles and privileges 135

Privilege
ISI_PRIV_EVENT ISI_PRIV_FILE_FILTER ISI_PRIV_FTP ISI_PRIV_HARDENING ISI_PRIV_HDFS ISI_PRIV_HTTP ISI_PRIV_JOB_ENGINE
ISI_PRIV_LICENSE ISI_PRIV_NDMP
ISI_PRIV_NETWORK ISI_PRIV_NFS ISI_PRIV_NTP ISI_PRIV_QUOTA ISI_PRIV_REMOTE_SUPPORT ISI_PRIV_RESTRICTED_AUTH
ISI_PRIV_ROLE ISI_PRIV_SMARTPOOLS
ISI_PRIV_SMB ISI_PRIV_SNAPSHOT ISI_PRIV_SNMP ISI_PRIV_STATISTICS
ISI_PRIV_SWIFT
136 Administrative roles and privileges

isi commands isi servicelight isi status
isi event isi status
isi file-filter isi ftp isi hardening isi hdfs isi http isi job isi dedupe isi status
isi license isi fc isi tape isi ndmp
isi network isi status
isi nfs isi ntp isi quota isi remotesupport isi auth isi auth users isi auth groups isi auth status isi auth mapping token
isi auth role isi filepool isi storagepool isi status
isi smb isi snapshot isi snmp isi status isi statistics isi dedupe stats
isi swift

Privilege ISI_PRIV_SYNCIQ ISI_PRIV_SYS_TIME ISI_PRIV_SYS_UPGRADE ISI_PRIV_WORM ISI_PRIV_IFS_WORM_DELETE root

isi commands
isi sync
isi time
isi upgrade
isi worm excluding isi worm files delete
isi worm files delete
 isi config  isi get  isi services  isi set

Managing roles
You can view, add, or remove members of any role. Except for built-in roles, whose privileges you cannot modify, you can add or remove OneFS privileges on a role-by-role basis.
NOTE: Roles take both users and groups as members. If a group is added to a role, all users who are members of that group are assigned the privileges associated with the role. Similarly, members of multiple roles are assigned the combined privileges of each role.

View roles
You can view information about built-in and custom roles. Run one of the following commands to view roles.  To view a basic list of all roles on the cluster, run the following command:
isi auth roles list  To view detailed information about each role on the cluster, including member and privilege lists, run the following command:
isi auth roles list --verbose  To view detailed information about a single role, run the following command, where <role> is the name of the role:
isi auth roles view <role>

View privileges
You can view user privileges. This procedure must be performed through the command-line interface (CLI). You can view a list of your privileges or the privileges of another user using the following commands: 1. Establish an SSH connection to any node in the cluster. 2. To view privileges, run one of the following commands.
 To view a list of all privileges, run the following command:
isi auth privileges --verbose  To view a list of your privileges, run the following command:
isi auth id

Administrative roles and privileges 137

 To view a list of privileges for another user, run the following command, where <user> is a placeholder for another user by name:
isi auth mapping token <user>
Create and modify a custom role
You can create an empty custom role and then add users and privileges to the role. 1. Establish an SSH connection to any node in the cluster. 2. Run the following command to create a role, where <name> is the name that you want to assign to the role and <string>
specifies an optional description: isi auth roles create <name> [--description <string>]
3. Run the following command to add a user to the role, where <role> is the name of the role and <string> is the name of the user: isi auth roles modify <role> [--add-user <string>]
NOTE: You can also modify the list of users assigned to a built-in role.
4. Run the following command to add a privilege with read/write access to the role, where <role> is the name of the role and <string> is the name of the privilege: isi auth roles modify <role> [--add-priv <string>]
5. Run the following command to add a privilege with read-only access to the role, where <role> is the name of the role and <string> is the name of the privilege: isi auth roles modify <role> [--add-priv-ro <string>]
Delete a custom role
Deleting a role does not affect the privileges or users that are assigned to it. Built-in roles cannot be deleted. Run the following command to delete a custom role, where <name> is the name of the role that you want to delete:
isi auth roles delete <name>
Add a user to built-in roles
You can assign a built-in role to a new user. 1. To view the list of roles, run the isi auth roles list command.
The following authentication roles' list is displayed:
isi auth roles list Name --------------AuditAdmin BackupAdmin SecurityAdmin StatisticsAdmin SystemAdmin VMwareAdmin --------------Total: 6 2. Run the isi auth roles list --zone zone1 command to view the roles available in zone1
138 Administrative roles and privileges

The roles available in zone1 are displayed:
isi auth roles list --zone zone1 Name ----------------ZoneAdmin ZoneSecurityAdmin ----------------Total: 2
3. Run the isi auth roles view ZoneAdmin --zone zone1 command to view the privileges associated with the ZoneAdmin role in zone1. The details that are associated with the ZoneAdmin role are displayed:
isi auth roles view ZoneAdmin --zone zone1 Name: ZoneAdmin
Description: Administer aspects of configuration related to current access zone. Members: -
Privileges ID: ISI_PRIV_LOGIN_PAPI
Read Only: True
ID: ISI_PRIV_AUDIT Read Only: False
ID: ISI_PRIV_FILE_FILTER Read Only: False
ID: ISI_PRIV_HDFS Read Only: False
ID: ISI_PRIV_NFS Read Only: False
ID: ISI_PRIV_SMB Read Only: False
ID: ISI_PRIV_SWIFT Read Only: False
ID: ISI_PRIV_VCENTER Read Only: False
ID: ISI_PRIV_NS_TRAVERSE Read Only: True
ID: ISI_PRIV_NS_IFS_ACCESS Read Only: True
4. Run the isi auth roles view ZoneSecurityAdmin --zone zone1 command to view the privileges associated with the ZoneSecurityAdmin role in zone1. The details that are associated with the ZoneSecurityAdmin role are displayed:
isi auth roles view ZoneSecurityAdmin --zone zone1 Name: ZoneSecurityAdmin
Description: Administer aspects of security configuration related to current access zone.
Members: Privileges
ID: ISI_PRIV_LOGIN_PAPI Read Only: True
ID: ISI_PRIV_AUTH Read Only: False
ID: ISI_PRIV_ROLE Read Only: False
5. Run the isi auth roles modify command to add a user to the ZoneAdmin role.
isi auth roles modify --zone zone1 ZoneAdmin --add-user z1-user1
Administrative roles and privileges 139

6. Run the isi auth roles view command to view whether the new user has been added to the ZoneAdmin role.
isi auth roles view --zone zone1 ZoneAdmin Name: ZoneAdmin
Description: Administer aspects of configuration related to current access zone. Members: z1-user1
Privileges ID: ISI_PRIV_LOGIN_PAPI
Read Only: True
ID: ISI_PRIV_AUDIT Read Only: False
ID: ISI_PRIV_FILE_FILTER Read Only: False
ID: ISI_PRIV_HDFS Read Only: False
ID: ISI_PRIV_NFS Read Only: False
ID: ISI_PRIV_SMB Read Only: False
ID: ISI_PRIV_SWIFT Read Only: False
ID: ISI_PRIV_VCENTER Read Only: False
ID: ISI_PRIV_NS_TRAVERSE Read Only: True
ID: ISI_PRIV_NS_IFS_ACCESS Read Only: True
7. Run the isi auth roles modify command to add a user to the ZoneSecurityAdmin role.
isi auth roles modify --zone zone1 ZoneSecurityAdmin --add-user z1-user2
8. Run the isi auth roles view command to view whether the new user has been added to the ZoneSecurityAdmin role.
isi auth roles view ZoneSecurityAdmin --zone zone1 Name: ZoneSecurityAdmin
Description: Administer aspects of security configuration related to current access zone.
Members: z1-user2 Privileges
ID: ISI_PRIV_LOGIN_PAPI Read Only: True
ID: ISI_PRIV_AUTH Read Only: False
ID: ISI_PRIV_ROLE Read Only: False
Create a new role and add a user
You can create a new role and then add a user to the new role. 1. To create a new role, run the isi auth roles create command in zone1.
isi auth roles create --name Zone1SMBAdmin --zone zone1
140 Administrative roles and privileges

2. To view the newly added role in the authentication list, run the isi auth roles list command.
isi auth roles list --zone zone1 Name ----------------Zone1SMBAdmin ZoneAdmin ZoneSecurityAdmin ----------------Total: 3
3. To view the details associated with the new role, run the isi auth roles view command.
isi auth roles view Zone1SMBAdmin --zone zone1 Name: Zone1SMBAdmin
Description: Members: -
Privileges ID: -
Read Only: -
4. To add a privilege to the new role, run the isi auth roles modify command.
isi auth roles modify --zone zone1 Zone1SMBAdmin --add-priv ISI_PRIV_SMB
NOTE: You can also add a description to the new role using the isi auth roles modify command.
isi auth roles modify --zone zone1 Zone1SMBAdmin --description "Zone1 SMB Admin"
5. To view whether the privilege and description of the new role were added, run the following command.
isi auth roles view Zone1SMBAdmin --zone zone1 Name: Zone1SMBAdmin
Description: Zone1 SMB Admin Members: -
Privileges ID: ISI_PRIV_SMB
Read Only: False
6. To add a user to the new role, run the isi auth roles modify command again.
isi auth roles modify --zone zone1 SMBAdmin --add-user z1-user3
NOTE: You can also add a read-only privilege to the new user using the isi auth roles modify command.
isi auth roles modify --zone zone1 Zone1SMBAdmin --add-priv-ro ISI_PRIV_LOGIN_PAPI
7. To view whether the new user was assigned the new role along with the read-only privilege, run the following command:
isi auth roles view Zone1SMBAdmin --zone zone1 Name: Zone1SMBAdmin
Description: Zone1 SMB Admin Members: -
Privileges ID: ISI_PRIV_LOGIN_PAPI
Read Only: True
ID: ISI_PRIV_SMB Read Only: False
Administrative roles and privileges 141

8
Identity management
This section contains the following topics:
Topics:
· Identity management overview · Identity types · Access tokens · Access token generation · Managing ID mappings · Managing user identities
Identity management overview
In environments with several different types of directory services, OneFS maps the users and groups from the separate services to provide a single unified identity on a cluster and uniform access control to files and directories, regardless of the incoming protocol. This process is called identity mapping. PowerScale clusters are frequently deployed in multiprotocol environments with multiple types of directory services, such as Active Directory and LDAP. When a user with accounts in multiple directory services logs in to a cluster, OneFS combines the user's identities and privileges from all the directory services into a native access token. You can configure OneFS settings to include a list of rules for access token manipulation to control user identity and privileges. For example, you can set a user mapping rule to merge an Active Directory identity and an LDAP identity into a single token that works for access to files stored over both SMB and NFS. The token can include groups from Active Directory and LDAP. The mapping rules that you create can solve identity problems by manipulating access tokens in many ways, including the following examples:  Authenticate a user with Active Directory but give the user a UNIX identity.  Select a primary group from competing choices in Active Directory or LDAP.  Disallow login of users that do not exist in both Active Directory and LDAP. For more information about identity management, see the white paper OneFS User Mapping - Mapping Identities across Authentication Providers.
Identity types
OneFS supports three primary identity types, each of which you can store directly on the file system. Identity types are user identifier and group identifier for UNIX, and security identifier for Windows. When you log on to a cluster, the user mapper expands your identity to include your other identities from all the directory services, including Active Directory, LDAP, and NIS. After OneFS maps your identities across the directory services, it generates an access token that includes the identity information associated with your accounts. A token includes the following identifiers:  A UNIX user identifier (UID) and a group identifier (GID). A UID or GID is a 32-bit number with a maximum value of
4,294,967,295.  A security identifier (SID) for a Windows user account. A SID is a series of authorities and sub-authorities ending with
a 32-bit relative identifier (RID). Most SIDs have the form S-1-5-21-<A>-<B>-<C>-<RID>, where <A>, <B>, and <C> are specific to a domain or computer and <RID> denotes the object in the domain.  A primary group SID for a Windows group account.  A list of supplemental identities, including all groups in which the user is a member. The token also contains privileges that stem from administrative role-based access control. On a PowerScale cluster, a file contains permissions, which appear as an access control list (ACL). The ACL controls access to directories, files, and other securable system objects.
142 Identity management

When a user tries to access a file, OneFS compares the identities in the user's access token with the file's ACL. OneFS grants access when the file's ACL includes an access control entry (ACE) that allows the identity in the token to access the file and that does not include an ACE that denies the identity access. OneFS compares the access token of a user with the ACL of a file.
NOTE: For more information about access control lists, including a description of the permissions and how they correspond to POSIX mode bits, see OneFS: Authentication, Identity Management, and Authorization: Multi-protocol data access and the Unified Permission Model.
When a name is provided as an identifier, it is converted into the corresponding user or group object and the correct identity type. You can enter or display a name in various ways:
 UNIX assumes unique case-sensitive namespaces for users and groups. For example, Name and name represent different objects.
 Windows provides a single, case-insensitive namespace for all objects and also specifies a prefix to target an Active Directory domain; for example, domain\name.
 Kerberos and NFSv4 define principals, which require names to be formatted the same way as email addresses; for example, [email protected].
Multiple names can reference the same object. For example, given the name support and the domain example.com, support, EXAMPLE\support and [email protected] are all names for a single object in Active Directory.

Access tokens

An access token is created when the user first makes a request for access.
Access tokens represent who a user is when performing actions on the cluster and supply the primary owner and group identities during file creation. Access tokens are also compared against the ACL or mode bits during authorization checks.
During user authorization, OneFS compares the access token, which is generated during the initial connection, with the authorization data on the file. All user and identity mapping occurs during token generation; no mapping takes place during permissions evaluation.
An access token includes all UIDs, GIDs, and SIDs for an identity, in addition to all OneFS privileges. OneFS reads the information in the token to determine whether a user has access to a resource. It is important that the token contains the correct list of UIDs, GIDs, and SIDs. An access token is created from one of the following sources:

Source Username
Privilege Attribute Certificate (PAC) User identifier (UID)

Authentication
 SMB impersonate user  Kerberized NFSv3  Kerberized NFSv4  NFS export user mapping  HTTP  FTP  HDFS
 SMB NTLM  Active Directory Kerberos
 NFS AUTH_SYS mapping

Access token generation

For most protocols, the access token is generated from the username or from the authorization data that is retrieved during authentication.
The following steps present a simplified overview of the complex process through which an access token is generated:

Step 1: User identity lookup

Using the initial identity, the user is looked up in all configured authentication providers in the access zone, in the order in which they are listed. The user identity and group list are retrieved from the authenticating provider. Next, additional group memberships that are associated with the user and group list are looked up for all other authentication providers. All of these SIDs, UIDs, or GIDs are added to the initial token.

Identity management 143

Step 2: ID mapping
Step 3: User mapping
Step 4: Ondisk identity calculation

NOTE: An exception to this behavior occurs if the AD provider is configured to call other providers, such as LDAP or NIS.
The user's identifiers are associated across directory services. All SIDs are converted to their equivalent UID/GID and vice versa. These ID mappings are also added to the access token.
Access tokens from other directory services are combined. If the username matches any user mapping rules, the rules are processed in order and the token is updated accordingly.
The default on-disk identity is calculated from the final token and the global setting. These identities are used for newly created files.

ID mapping
The Identity (ID) mapping service maintains relationship information between mapped Windows and UNIX identifiers to provide consistent access control across file sharing protocols within an access zone.
NOTE: ID mapping and user mapping are different services, despite the similarity in names.
During authentication, the authentication daemon requests identity mappings from the ID mapping service in order to create access tokens. Upon request, the ID mapping service returns Windows identifiers mapped to UNIX identifiers or UNIX identifiers mapped to Windows identifiers. When a user authenticates to a cluster over NFS with a UID or GID, the ID mapping service returns the mapped Windows SID, allowing access to files that another user stored over SMB. When a user authenticates to the cluster over SMB with a SID, the ID mapping service returns the mapped UNIX UID and GID, allowing access to files that a UNIX client stored over NFS.
Mappings between UIDs or GIDs and SIDs are stored according to access zone in a cluster-distributed database called the ID map. Each mapping in the ID map is stored as a one-way relationship from the source to the target identity type. Two-way mappings are stored as complementary one-way mappings.
Mapping Windows IDs to UNIX IDs
When a Windows user authenticates with an SID, the authentication daemon searches the external Active Directory provider to look up the user or group associated with the SID. If the user or group has only an SID in the Active Directory, the authentication daemon requests a mapping from the ID mapping service.
NOTE: User and group lookups may be disabled or limited, depending on the Active Directory settings. You enable user and group lookup settings through the isi auth ads modify command.
If the ID mapping service does not locate and return a mapped UID or GID in the ID map, the authentication daemon searches other external authentication providers configured in the same access zone for a user that matches the same name as the Active Directory user.
If a matching user name is found in another external provider, the authentication daemon adds the matching user's UID or GID to the access token for the Active Directory user, and the ID mapping service creates a mapping between the UID or GID and the Active Directory user's SID in the ID map. This is referred to as an external mapping.
NOTE: When an external mapping is stored in the ID map, the UID is specified as the on-disk identity for that user. When the ID mapping service stores a generated mapping, the SID is specified as the on-disk identity.
If a matching user name is not found in another external provider, the authentication daemon assigns a UID or GID from the ID mapping range to the Active Directory user's SID, and the ID mapping service stores the mapping in the ID map. This is referred to as a generated mapping. The ID mapping range is a pool of UIDs and GIDs allocated in the mapping settings.
After a mapping has been created for a user, the authentication daemon retrieves the UID or GID stored in the ID map upon subsequent lookups for the user.
Mapping UNIX IDs to Windows IDs
The ID mapping service creates temporary UID-to-SID and GID-to-SID mappings only if a mapping does not already exist. The UNIX SIDs that result from these mappings are never stored on disk.
UIDs and GIDs have a set of predefined mappings to and from SIDs.

144 Identity management

If a UID-to-SID or GID-to-SID mapping is requested during authentication, the ID mapping service generates a temporary UNIX SID in the format S-1-22-1-<UID> or S-1-22-2-<GID> by applying the following rules:  For UIDs, the ID mapping service generates a UNIX SID with a domain of S-1-22-1 and a resource ID (RID) matching the UID.
For example, the UNIX SID for UID 600 is S-1-22-1-600.  For GIDs, the ID mapping service generates a UNIX SID with a domain of S-1-22-2 and an RID matching the GID. For
example, the UNIX SID for GID 800 is S-1-22-2-800.
ID mapping ranges
In access zones with multiple external authentication providers, such as Active Directory and LDAP, it is important that the UIDs and GIDs from different providers that are configured in the same access zone do not overlap. Overlapping UIDs and GIDs between providers within an access zone might result in some users gaining access to other users' directories and files. The range of UIDs and GIDs that can be allocated for generated mappings is configurable in each access zone through the isi auth settings mappings modify command. The default range for both UIDs and GIDs is 1000000­2000000 in each access zone. Do not include commonly used UIDs and GIDs in your ID ranges. For example, UIDs and GIDs below 1000 are reserved for system accounts and should not be assigned to users or groups.
User mapping
User mapping provides a way to control permissions by specifying a user's security identifiers, user identifiers, and group identifiers. OneFS uses the identifiers to check file or group ownership. With the user-mapping feature, you can apply rules to modify which user identity OneFS uses, add supplemental user identities, and modify a user's group membership. The user-mapping service combines a user's identities from different directory services into a single access token and then modifies it according to the rules that you create.
NOTE: You can configure mapping rules on a per-zone basis. Mapping rules must be configured separately in each access zone that uses them. OneFS maps users only during login or protocol access.
Default user mappings
Default user mappings determine access if explicit user-mapping rules are not created. If you do not configure rules, a user who authenticates with one directory service receives the identity information in other directory services when the account names are the same. For example, a user who authenticates with an Active Directory domain as Desktop\jane automatically receives identities in the final access token for the corresponding UNIX user account for jane from LDAP or NIS. In the most common scenario, OneFS is connected to two directory services, Active Directory and LDAP. In such a case, the default mapping provides a user with the following identity attributes:  A UID from LDAP  The user SID from Active Directory  An SID from the default group in Active Directory The user's groups come from Active Directory and LDAP, with the LDAP groups and the autogenerated group GID added to the list. To pull groups from LDAP, the mapping service queries the memberUid attribute. The user's home directory, gecos, and shell come from Active Directory.
Elements of user-mapping rules
You combine operators with user names to create a user-mapping rule. The following elements affect how the user mapper applies a rule:  The operator, which determines the operation that a rule performs  Fields for usernames  Options  A parameter  Wildcards
Identity management 145

User-mapping best practices

Follow best practices to simplify user mapping.

Use Active Directory with RFC 2307 and Windows Services for UNIX

Use Microsoft Active Directory with Windows Services for UNIX and RFC 2307 attributes to manage Linux, UNIX, and Windows systems. Integrating UNIX and Linux systems with Active Directory centralizes identity management and eases interoperability, reducing the need for user-mapping rules. Make sure your domain controllers are running Windows Server 2003 or later.

Employ a consistent username strategy

The simplest configurations name users consistently, so that each UNIX user corresponds to a similarly named Windows user. Such a convention allows rules with wildcard characters to match names and map them without explicitly specifying each pair of accounts.

Do not use overlapping ID ranges

In networks with multiple identity sources, such as LDAP and Active Directory with RFC 2307 attributes, you should ensure that UID and GID ranges do not overlap. It is also important that the range from which OneFS automatically allocates UIDs and GIDs does not overlap with any other ID range. OneFS automatically allocates UIDs and GIDs from the range 1,000,000-2,000,000. If UIDs and GIDs overlap multiple directory services, some users might gain access to other users' directories and files.

Avoid common UIDs and GIDs

Do not include commonly used UIDs and GIDs in your ID ranges. For example, UIDs and GIDs below 1000 are reserved for system accounts; do not assign them to users or groups.

Do not use UPNs in mapping rules

You cannot use a user principal name (UPN) in a user mapping rule. A UPN is an Active Directory domain and username that are combined into an Internet-style name with an @ symbol, such as an email address: jane@example. If you include a UPN in a rule, the mapping service ignores it and may return an error. Instead, specify names in the format DOMAIN\user.com.

Group rules by type and order them

The system processes every mapping rule by default, which can present problems when you apply a deny-all rule--for example, to deny access to all unknown users. In addition, replacement rules might interact with rules that contain wildcard characters. To minimize complexity, it is recommended that you group rules by type and organize them in the following order: 1. Replacement rules: Specify all rules that replace an identity first to ensure that OneFS replaces all
instances of the identity. 2. Join, add, and insert rules: After the names are set by any replacement operations, specify join, add,
and insert rules to add extra identifiers. 3. Allow and deny rules: Specify rules that allow or deny access last.
NOTE: Stop all processing before applying a default deny rule. To do so, create a rule that
matches allowed users but does nothing, such as an add operator with no field options, and has
the break option. After enumerating the allowed users, you can place a catchall deny at the end to
replace anybody unmatched with an empty user.

To prevent explicit rules from being skipped, in each group of rules, order explicit rules before rules that contain wildcard characters.

Add the LDAP or NIS primary group to the supplemental groups

When a PowerScale cluster is connected to Active Directory and LDAP, add the LDAP primary group to the list of supplemental groups. This enables OneFS to honor group permissions on files created over NFS or migrated from other UNIX storage systems. The same practice is advised when a PowerScale cluster is connected to Active Directory as well as and NIS.

On-disk identity
After the user mapper resolves a user's identities, OneFS determines an authoritative identifier for it, which is the preferred on-disk identity.
OnesFS stores either UNIX or Windows identities in file metadata on disk. On-disk identity types are UNIX, SID, and native. Identities are set when a file is created or a file's access control data is modified. Almost all protocols require some level of mapping to operate correctly, so choosing the preferred identity to store on disk is important. You can configure OneFS to store either the UNIX or the Windows identity, or you can allow OneFS to determine the optimal identity to store.
On-disk identity types are UNIX, SID, and native. Although you can change the type of on-disk identity, the native identity is best for a network with UNIX and Windows systems. In native on-disk identity mode, setting the UID as the on-disk identity improves NFS performance.

146 Identity management

NOTE: The SID on-disk identity is for a homogeneous network of Windows systems managed only with Active Directory. When you upgrade the on-disk identity setting is preserved. On new installations, the on-disk identity is set to native.
The native on-disk identity type allows the OneFS authentication daemon to select the correct identity to store on disk by checking for the identity mapping types in the following order:

Order 1 2 3
4

Mapping type Algorithmic mapping External mapping Persistent mapping
No mapping

Description
An SID that matches S-1-22-1-UID or S-1-22-2-GID in the internal ID mapping database is converted back to the corresponding UNIX identity, and the UID and GID are set as the on-disk identity.
A user with an explicit UID and GID defined in a directory service (such as Active Directory with RFC 2307 attributes, LDAP, NIS, or the OneFS file provider or local provider) has the UNIX identity set as the on-disk identity.
Mappings are stored persistently in the identity mapper database. An identity with a persistent mapping in the identity mapper database uses the destination of that mapping as the on-disk identity, which occurs primarily with manual ID mappings. For example, if there is an ID mapping of GID:10000 to S-1-5-32-545, a request for the on-disk storage of GID:10000 returns S-1-5-32-545.
If a user lacks a UID or GID even after querying the other directory services and identity databases, its SID is set as the on-disk identity. In addition, to make sure a user can access files over NFS, OneFS allocates a UID and GID from a preset range of 1,000,000 to 2,000,000. In native on-disk identity mode, a UID or GID that OneFS generates is never set as the on-disk identity.

NOTE: If you change the on-disk identity type, you should run the PermissionRepair job with the Convert repair type selected to make sure that the disk representation of all files is consistent with the changed setting. For more information, see the Run the PermissionRepair job section.

Managing ID mappings
You can create, modify, and delete identity mappings and configure ID mapping settings.

Create an identity mapping
You can create a manual identity mapping between source and target identities or automatically generate a mapping for a source identity.
This procedure is available only through the command-line interface. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Run the isi auth mapping create command.
The following command specifies IDs of source and target identities in the zone3 access zone to create a two-way mapping between the identities:
isi auth mapping create --2way --source-sid=S-1-5-21-12345 \ --target-uid=5211 --zone=zone3

Modify an identity mapping
You can modify the configuration of an identity mapping. This procedure is available only through the command-line interface. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Run the isi auth mapping modify command.

Identity management 147

The following command modifies the mapping of the user with UID 4236 in the zone3 access zone to include a reverse, 2-way mapping between the source and target identities:
isi auth mapping modify --source-uid=4236 \ --target-sid=S-1-5-21-12345 --zone=zone3 --2way
Delete an identity mapping
You can delete one or more identity mappings. This procedure is available only through the command-line interface. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Run the isi auth mapping delete command.
The following command deletes all identity mappings in the zone3 access zone:
isi auth mapping delete --all --zone=zone3
The following command deletes all identity mappings in the zone3 access zone that were both created automatically and include a UID or GID from an external authentication source:
isi auth mapping delete --all --only-external --zone=zone3
The following command deletes the identity mapping of the user with UID 4236 in the zone3 access zone:
isi auth mapping delete --source-uid=4236 --zone=zone3
View an identity mapping
You can display mapping information for a specific identity. This procedure is available only through the command-line interface. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Run the isi auth mapping view command.
The following command displays mappings for the user with UID 4236 in the zone3 access zone:
isi auth mapping view --uid=4236 --zone=zone3
The system displays output similar to the following example:
Name: user_36 On-disk: UID: 4236
Unix uid: 4236 Unix gid: -100000
SMB: S-1-22-1-4236
Flush the identity mapping cache
You can flush the ID map cache to remove in-memory copies of all or specific identity mappings. Modifications to ID mappings may cause the cache to become out of sync and users might experience slowness or stalls when authenticating. You can flush the cache to synchronize the mappings. This procedure is available only through the command-line interface. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Run the isi auth mapping flush command.
The following command flushes all identity mappings on the cluster:
isi auth mapping flush --all
148 Identity management

The following command flushes the mapping of the user with UID 4236 in the zone3 access zone:
isi auth mapping flush --source-uid-4236 --zone=zone3
View a user token
You can view the contents of an access token generated for a user during authentication. This procedure is available only through the command-line interface. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Run the isi auth mapping token command.
The following command displays the access token of a user with UID 4236 in the zone3 access zone:
isi auth mapping token --uid=4236 --zone=zone3
The system displays output similar to the following example:
User Name: user_36 UID: 4236 SID: S-1-22-1-4236 On Disk: 4236
ZID: 3 Zone: zone3 Privileges: Primary Group
Name: user_36 GID: 4236 SID: S-1-22-2-4236 On Disk: 4236
Configure identity mapping settings
You can enable or disable automatic allocation of UIDs and GIDS and customize the range of ID values in each access zone. The default range is 1000000­2000000. This procedure is available only through the command-line interface. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Run the isi auth settings mapping modify command.
The following command enables automatic allocation of both UIDs and GIDs in the zone3 access zone and sets their allocation ranges to 25000­50000:
isi auth settings mapping modify --gid-range-enabled=yes \ --gid-range-min=25000 --gid-range-max=50000 --uid-range-enabled=yes \ --uid-range-min=25000 --uid-range-max=50000 --zone=zone3
View identity mapping settings
You can view the current configuration of identity mapping settings in each zone. This procedure is available only through the command-line interface. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Run the isi auth settings mapping view command.
The following command displays the current settings in the zone3 access zone:
isi auth settings mapping view --zone=zone3
The system displays output similar to the following example:
Identity management 149

GID Range Enabled: Yes GID Range Min: 25000 GID Range Max: 50000
UID Range Enabled: Yes UID Range Min: 25000 UID Range Max: 50000
Managing user identities
You can manage user identities by creating user-mapping rules.
When you create user-mapping rules, it is important to remember the following information:
 You can only create user-mapping rules if you are connected to the cluster through the System zone; however, you can apply user-mapping rules to specific access zones. If you create a user-mapping rule for a specific access zone, the rule applies only in the context of its zone.
 When you change user-mapping on one node, OneFS propagates the change to the other nodes.  After you make a user-mapping change, the OneFS authentication service reloads the configuration.
View user identity
You can view the identities and group membership that a specified user has within the Active Directory and LDAP directory services, including the user's security identifier (SID) history.
This procedure must be performed through the command-line interface (CLI).
NOTE: The OneFS user access token contains a combination of identities from Active Directory and LDAP if both directory services are configured. You can run the following commands to discover the identities that are within each specific directory service.
1. Establish an SSH connection to any node in the cluster. 2. View a user identity from Active Directory only by running the isi auth users view command.
The following command displays the identity of a user named stand in the Active Directory domain named YORK:
isi auth users view --user=YORK\\stand --show-groups
The system displays output similar to the following example:
Name: YORK\stand DN: CN=stand,CN=Users,DC=york,DC=hull,DC=example,DC=com
DNS Domain: york.hull.example.com Domain: YORK
Provider: lsa-activedirectory-provider:YORK.HULL.EXAMPLE.COM Sam Account Name: stand
UID: 4326 SID: S-1-5-21-1195855716-1269722693-1240286574-591111 Primary Group
ID : GID:1000000 Name : YORK\york_sh_udg Additional Groups: YORK\sd-york space group
YORK\york_sh_udg YORK\sd-york-group YORK\sd-group YORK\domain users
3. View a user identity from LDAP only by running the isi auth users view command. The following command displays the identity of an LDAP user named stand:
isi auth user view --user=stand --show-groups
The system displays output similar to the following example:
Name: stand DN: uid=stand,ou=People,dc=colorado4,dc=hull,dc=example,dc=com
150 Identity management

DNS Domain: Domain: LDAP_USERS
Provider: lsa-ldap-provider:Unix LDAP Sam Account Name: stand
UID: 4326 SID: S-1-22-1-4326 Primary Group
ID : GID:7222 Name : stand Additional Groups: stand
sd-group sd-group2
Create a user-mapping rule
You can create user-mapping rules to manage user identities on the cluster.
You can create the first mapping rule with the --user-mapping-rules option for the isi zone zones modify System command. If you try to add a second rule with the command above, however, it replaces the existing rule rather than adding the new rule to the list of rules. To add more rules to the list of rules, you must use the --add-user-mapping-rules option with the isi zone zones modify System command.
NOTE: If you do not specify an access zone, user-mapping rules are created in the System zone.
1. To create a rule to merge the Active Directory user with a user from LDAP, run the following command, where <user-a> and <user-b> are placeholders for the identities to be merged; for example, user_9440 and lduser_010, respectively:
isi zone zones modify System --add-user-mapping-rules \ "<DOMAIN> <user-a> &= <user-b>"
Run the following command to view the rule:
isi zone zones view System
If the command runs successfully, the system displays the mapping rule, which is visible in the User Mapping Rules line of the output:
Name: System Cache Size: 4.77M Map Untrusted: SMB Shares: Auth Providers: Local Provider: Yes NetBIOS Name: All SMB Shares: Yes All Auth Providers: Yes User Mapping Rules: <DOMAIN>\<user_a> &= <user_b> Home Directory Umask: 0077 Skeleton Directory: /usr/share/skel
Zone ID: 1
2. To verify the changes to the token, run a command similar to the following example:
isi auth mapping token <DOMAIN>\\<user-a>
If the command runs successfully, the system displays output similar to the following example:
User Name : <DOMAIN>\<user-a> UID : 1000201 SID : S-1-5-21-1195855716-1269722693-1240286574-11547
ZID: 1 Zone: System Privileges: Primary Group
Name : <DOMAIN>\domain users GID : 1000000
Identity management 151

SID : S-1-5-21-1195855716-1269722693-1240286574-513 Supplemental Identities
Name : Users GID : 1545 SID : S-1-5-32-545
Name : lduser_010 UID : 10010 SID : S-1-22-1-10010
Name : example GID : 10000 SID : S-1-22-2-10000
Name : ldgroup_20user GID : 10026 SID : S-1-22-2-10026
Merge Windows and UNIX tokens
You can use either the join or append operator to merge two user names into a single token. When Windows and UNIX user names do not match across directory services, you can write user-mapping rules that use either the join or the append operator to merge two user names into a single token. For example, if a user's Windows username is win_bob and the users UNIX username is UNIX_bob, you can join or append them. When you append an account to another account, the append operator adds information from one identity to another. OneFS appends the fields that the options specify from the source identity to the target identity. OneFS appends the identifiers to the additional group list. 1. Establish an SSH connection to any node in the cluster. 2. Write a rule similar to the following example to join the Windows and UNIX user names, where <win-username> and
<UNIX-username> are placeholders for the user's Windows and UNIX accounts: MYDOMAIN\<win-username> &= <UNIX-username> []
3. Write a rule similar to the following example to append the UNIX account to the Windows account with the groups option: MYDOMAIN\<win-username> ++ <UNIX-username> [groups]
Retrieve the primary group from LDAP
You can create a user-mapping rule to insert or append primary group information from LDAP into a user's access token. By default, the user-mapping service combines information from AD and LDAP but gives precedence to the information from AD. Mapping rules control how OneFS combines the information. You can retrieve the primary group information from LDAP instead of AD. 1. Establish an SSH connection to any node in the cluster. 2. Write a rule similar to the following example to insert information from LDAP into a user's access token:
*\* += * [group]
3. Write a rule similar to the following example to append other information from LDAP to a user's access token: *\* ++ * [user,groups]
152 Identity management

Mapping rule options

Mapping rules can contain options that target the fields of an access token.
A field represents an aspect of a cross-domain access token, such as the primary UID and primary user SID from a user that you select. You can see some of the fields in the OneFS web administration interface. User in the web administration interface is the same as username. You can also see fields in an access token by running the command isi auth mapping token.
When you create a rule, you can add an option to manipulate how OneFS combines aspects of two identities into a single token. For example, an option can force OneFS to append the supplement groups to a token.
A token includes the following fields that you can manipulate with user mapping rules:
 username  unix_name  primary_uid  primary_user_sid  primary_gid  primary_group_sid  additional_ids (includes supplemental groups)
Options control how a rule combines identity information in a token. The break option is the exception: It stops OneFS from processing additional rules.
Although several options can apply to a rule, not all options apply to all operators. The following table describes the effect of each option and the operators that they work with.

Option user groups groups default_user
break

Operator insert, append insert, append insert, append
all operators except remove groups
all operators

Description
Copies the primary UID and primary user SID, if they exist, to the token.
Copies the primary GID and primary group SID, if they exist, to the token.
Copies all the additional identifiers to the token. The additional identifiers exclude the primary UID, the primary GID, the primary user SID, and the primary group SID.
If the mapping service fails to find the second user in a rule, the service tries to find the username of the default user. The name of the default user cannot include wildcards. When you set the option for the default user in a rule with the command-line interface, you must set it with an underscore: default_user.
Stops the mapping service from applying rules that follow the insertion point of the break option. The mapping service generates the final token at the point of the break.

Mapping rule operators
The operator determines what a mapping rule does.
You can create user-mapping rules through either the web-administration interface, where the operators are spelled out in a list, or from the command-line interface.
When you create a mapping rule with the OneFS command-line interface (CLI), you must specify an operator with a symbol. The operator affects the direction in which the mapping service processes a rule. For more information about creating a mapping rule, see the white paper Managing identities with the PowerScale OneFS user mapping service. The following table describes the operators that you can use in a mapping rule.
A mapping rule can contain only one operator.

Identity management 153

Operator Web interface

CLI

append

Append fields from a

++

user

Direction Left-to-right

insert

Insert fields from a user +=

Left-to-right

replace

Replace one user with a => different user

Left-to-right

remove groups
join

Remove supplemental -groups from a user
Join two users together &=

Unary Bidirectional

Description
Modifies an access token by adding fields to it. The mapping service appends the fields that are specified in the list of options (user, group, groups) to the first identity in the rule. The fields are copied from the second identity in the rule. All appended identifiers become members of the additional groups list. An append rule without an option performs only a lookup operation; you must include an option to alter a token.
Modifies an existing access token by adding fields to it. Fields specified in the options list (user, group, groups) are copied from the new identity and inserted into the identity in the token. When the rule inserts a primary user or primary group, it become the new primary user and primary group in the token. The previous primary user and primary group move to the additional identifiers list. Modifying the primary user leaves the token's username unchanged. When inserting the additional groups from an identity, the service adds the new groups to the existing groups.
Removes the token and replaces it with the new token that is identified by the second username. If the second username is empty, the mapping service removes the first username in the token, leaving no username. If a token contains no username, OneFS denies access with a no such user error.
Modifies a token by removing the supplemental groups.
Inserts the new identity into the token. If the new identity is the second user, the mapping service inserts it after the existing identity; otherwise, the service inserts it before the existing identity. The location of the insertion point is relevant when the existing identity is already the first in the list because OneFS uses the first identity to determine the ownership of new file system objects.

154 Identity management

9
Home directories
This section contains the following topics:
Topics:
· Home directories overview · Home directory permissions · Authenticating SMB users · Home directory creation through SMB · Home directory creation through SSH and FTP · Home directory creation in a mixed environment · Interactions between ACLs and mode bits · Default home directory settings in authentication providers · Supported expansion variables · Domain variables in home directory provisioning
Home directories overview
When you create a local user, OneFS automatically creates a home directory for the user. OneFS also supports dynamic home directory provisioning for users who access the cluster by connecting to an SMB share or by logging in through FTP or SSH. Regardless of the method by which a home directory was created, you can configure access to the home directory through a combination of SMB, SSH, and FTP.
Home directory permissions
You can set up a user's home directory with a Windows ACL or with POSIX mode bits, which are then converted into a synthetic ACL. The method by which a home directory is created determines the initial permissions that are set on the home directory. When you create a local user, the user's home directory is created with mode bits by default. For users who authenticate against external sources, you can specify settings to create home directories dynamically at login time. If a home directory is created during a login through SSH or FTP, it is set up with mode bits; if a home directory is created during an SMB connection, it receives either mode bits or an ACL. For example, if an LDAP user first logs in through SSH or FTP, the user's home directory is created with mode bits. If the same user first connects through an SMB share, the home directory is created with the permissions indicated by the configured SMB settings. If the --inheritable-path-acl option is enabled, an ACL is generated; otherwise, mode bits are used.
Authenticating SMB users
You can authenticate SMB users from authentication providers that can handle NT hashes. SMB sends an NT password hash to authenticate SMB users, so only users from authentication providers that can handle NT hashes can log in over SMB. The following OneFS-supported authentication providers can handle NT hashes:  Active Directory  Local  LDAPSAM (LDAP with Samba extensions enabled)
Home directories 155

Home directory creation through SMB
You can create SMB shares by including expansion variables in the share path. Expansion variables give users to access their home directories by connecting to the share. You can also enable dynamic provisioning of home directories that do not exist at SMB connection time.
NOTE: Share permissions are checked when files are accessed, before the underlying file system permissions are checked. Either of these permissions can prevent access to the file or directory.

Create home directories with expansion variables

You can configure settings with expansion variables to create SMB share home directories.
When users access the cluster over SMB, home directory access is through SMB shares. You can configure settings with a path that uses a variable expansion syntax, allowing a user to connect to their home directory share.
NOTE: Home directory share paths must be in the root path of the access zone in which the home directory SMB share is created.
In the following commands, the --allow-variable-expansion option is enabled to indicate that %U should be expanded to the user name, which is user411 in this example. The --auto-create-directory option is enabled to create the directory if it does not exist:

isi smb shares create HOMEDIR --path=/ifs/home/%U \ --allow-variable-expansion=yes --auto-create-directory=yes
isi smb shares permission modify HOMEDIR --wellknown Everyone \ --permission-type allow --permission full
isi smb shares view HOMEDIR

The system displays output similar to the following example:

Share Name: HOMEDIR

Path: /ifs/home/%U

Description:

Client-side Caching Policy: manual

Automatically expand user names or domain names: True

Automatically create home directories for users: True

Browsable: True

Permissions:

Account Account Type Run as Root Permission Type Permission

------------------------------------------------------------

Everyone wellknown False

allow

full

------------------------------------------------------------

Total: 1

...

When user411 connects to the share with the net use command, the user's home directory is created at /ifs/home/ user411. On user411's Windows client, the net use m: command connects /ifs/home/user411 through the HOMEDIR share:

net use m: \\cluster.company.com\HOMEDIR /u:user411

1. Run the following commands on the cluster with the --allow-variable-expansion option enabled. The %U expansion variable expands to the user name, and the --auto-create-directory option is enabled to create the directory if it does not exist:

isi smb shares create HOMEDIR --path=/ifs/home/%U \ --allow-variable-expansion=yes --auto-create-directory=yes
isi smb shares permission modify HOMEDIR --wellknown Everyone \ --permission-type allow --permission full

2. Run the following command to view the home directory settings: isi smb shares view HOMEDIR

156 Home directories

The system displays output similar to the following example:

Share Name: HOMEDIR

Path: /ifs/home/%U

Description:

Client-side Caching Policy: manual

Automatically expand user names or domain names: True

Automatically create home directories for users: True

Browsable: True

Permissions:

Account Account Type Run as Root Permission Type Permission

------------------------------------------------------------

Everyone wellknown False

allow

full

------------------------------------------------------------

Total: 1

...

If user411 connects to the share with the net use command, user411's home directory is created at /ifs/home/ user411. On user411's Windows client, the net use m: command connects /ifs/home/user411 through the HOMEDIR share, mapping the connection similar to the following example:

net use m: \\cluster.company.com\HOMEDIR /u:user411

Create home directories with the --inheritable-path-acl option
You can enable the --inheritable-path-acl option on a share to specify that it is to be inherited on the share path if the parent directory has an inheritable ACL. To perform most configuration tasks, you must log on as a member of the SecurityAdmin role. By default, an SMB share's directory path is created with a synthetic ACL based on mode bits. You can enable the -inheritable-path-acl option to use the inheritable ACL on all directories that are created, either at share creation time or for those dynamically provisioned when connecting to that share. 1. Run commands similar to the following examples to enable the --inheritable-path-acl option on the cluster to
dynamically provision a user home directory at first connection to a share on the cluster:
isi smb shares create HOMEDIR_ACL --path=/ifs/home/%U \ --allow-variable-expansion=yes --auto-create-directory=yes \ --inheritable-path-acl=yes
isi smb shares permission modify HOMEDIR_ACL \ --wellknown Everyone \ --permission-type allow --permission full
2. Run a net use command, similar to the following example, on a Windows client to map the home directory for user411:
net use q: \\cluster.company.com\HOMEDIR_ACL /u:user411
3. Run a command similar to the following example on the cluster to view the inherited ACL permissions for the user411 share:
cd /ifs/home/user411 ls -lde .
The system displays output similar to the following example:
drwx------ + 2 user411 PowerScale Users 0 Oct 19 16:23 ./ OWNER: user:user411 GROUP: group:PowerScale Users CONTROL:dacl_auto_inherited,dacl_protected 0: user:user411 allow dir_gen_all,object_inherit,container_inherit

Home directories 157

Create special home directories with the SMB share %U variable
The special SMB share name %U enables you to create a home-directory SMB share that appears the same as a user's user name. You typically set up a %U SMB share with a share path that includes the %U expansion variable. If a user attempts to connect to a share matching the login name and it does not exist, the user connects to the %U share instead and is directed to the expanded path for the %U share.
NOTE: If another SMB share exists that matches the user's name, the user connects to the explicitly named share rather than to the %U share. Run the following command to create a share that matches the authenticated user login name when the user connects to the share:
isi smb share create %U /ifs/home/%U \ --allow-variable-expansion=yes --auto-create-directory=yes \ --zone=System
After running this command, user Zachary will see a share named 'zachary' rather than '%U', and when Zachary tries to connect to the share named 'zachary', he will be directed to /ifs/home/zachary. On a Windows client, if Zachary runs the following commands, he sees the contents of his /ifs/home/zachary directory:
net use m: \\cluster.ip\zachary /u:zachary cd m: dir
Similarly, if user Claudia runs the following commands on a Windows client, she sees the directory contents of /ifs/home/ claudia:
net use m: \\cluster.ip\claudia /u:claudia cd m: dir
Zachary and Claudia cannot access one another's home directory because only the share 'zachary' exists for Zachary and only the share 'claudia' exists for Claudia.
Home directory creation through SSH and FTP
You can configure home directory support for users who access the cluster through SSH or FTP by modifying authentication provider settings.
Set the SSH or FTP login shell
You can use the --login-shell option to set the default login shell for the user. By default, the --login-shell option, if specified, overrides any login-shell information provided by the authentication provider, except with Active Directory. If the --login-shell option is specified with Active Directory, it simply represents the default login shell if the Active Directory server does not provide login-shell information.
NOTE: The following examples refer to setting the login shell to /bin/bash. You can also set the shell to /bin/rbash. 1. Run the following command to set the login shell for all local users to /bin/bash:
isi auth local modify System --login-shell /bin/bash
2. Run the following command to set the default login shell for all Active Directory users in your domain to /bin/bash:
isi auth ads modify YOUR.DOMAIN.NAME.COM --login-shell /bin/bash
158 Home directories

Set SSH/FTP home directory permissions
You can specify home directory permissions for a home directory that is accessed through SSH or FTP by setting a umask value. To perform most configuration tasks, you must log on as a member of the SecurityAdmin role. When a user's home directory is created at login through SSH or FTP, it is created using POSIX mode bits. The permissions setting on a user's home directory is set to 0755, then masked according to the umask setting of the user's access zone to further limit permissions. You can modify the umask setting for a zone with the --home-directory-umask option, specifying an octal number as the umask value. 1. Run the following command to view umask setting:
isi zone zones view System
The system displays output similar to the following example:

Name: System Path: /ifs Groupnet: groupnet0 Map Untrusted: Auth Providers: lsa-local-provider:System, lsa-file-provider:System
NetBIOS Name: User Mapping Rules: Home Directory Umask: 0077 Skeleton Directory: /usr/share/skel Cache Entry Expiry: 4H Negative Cache Entry Expiry: 1m
Zone ID: 1
In the command result, you can see the default setting for Home Directory Umask for the created home directory is 0700, which is equivalent to (0755 & ~(077)). You can modify the Home Directory Umask setting for a zone with the --home-directory-umask option, specifying an octal number as the umask value. This value indicates the permissions that are to be disabled, so larger mask values indicate fewer permissions. For example, a umask value of 000 or 022 yields created home directory permissions of 0755, whereas a umask value of 077 yields created home directory permissions of 0700.
2. Run a command similar to the following example to allow a group/others write/execute permission in a home directory:
isi zone zones modify System --home-directory-umask=022
In this example, user home directories will be created with mode bits 0755 masked by the umask field, set to the value of 022. Therefore, user home directories will be created with mode bits 0755, which is equivalent to (0755 & ~(022)).

Set SSH/FTP home directory creation options

You can configure home directory support for a user who accesses the cluster through SSH or FTP by specifying authentication provider options. 1. Run the following command to view settings for an Active Directory authentication provider on the cluster:
isi auth ads list

The system displays output similar to the following example:

Name

Authentication Status DC Name Site

---------------------------------------------------------

YOUR.DOMAIN.NAME.COM Yes

online -

SEA

---------------------------------------------------------

Total: 1

Home directories 159

2. Run the isi auth ads modify command with the --home-directory-template and --create-homedirectory options.
isi auth ads modify YOUR.DOMAIN.NAME.COM \ --home-directory-template=/ifs/home/ADS/%D/%U \ --create-home-directory=yes 3. Run the isi auth ads view command with the --verbose option. The system displays output similar to the following example:
Name: YOUR.DOMAIN.NAME.COM NetBIOS Domain: YOUR ... Create Home Directory: Yes Home Directory Template: /ifs/home/ADS/%D/%U
Login Shell: /bin/sh 4. Run the id command.
The system displays output similar to the following example:
uid=1000008(<your-domain>\user_100) gid=1000000(<your-domain>\domain users) groups=1000000(<your-domain>\domain users),1000024(<your-domain>\c1t),1545(Users) 5. Optional: To verify this information from an external UNIX node, run the ssh command from an external UNIX node. For example, the following command would create /ifs/home/ADS/<your-domain>/user_100 if it did not previously exist:
ssh <your-domain>\\[email protected]
Provision home directories with dot files
You can provision home directories with dot files. To perform most configuration tasks, you must log on as a member of the SecurityAdmin role. The skeleton directory, which is located at /usr/share/skel by default, contains a set of files that are copied to the user's home directory when a local user is created or when a user home directory is dynamically created during login. Files in the skeleton directory that begin with dot. are renamed to remove the dot prefix when they are copied to the user's home directory. For example, dot.cshrc is copied to the user's home directory as .cshrc. This format enables dot files in the skeleton directory to be viewable through the command-line interface without requiring the ls -a command. For SMB shares that might use home directories that were provisioned with dot files, you can set an option to prevent users who connect to the share through SMB from viewing the dot files. 1. Run the following command to display the default skeleton directory in the System access zone:
isi zone zones view System
The system displays output similar to the following example:
Name: System ...
Skeleton Directory: /usr/share/skel
2. Run the isi zone zones modify command to modify the default skeleton directory. The following command modifies the default skeleton directory, /usr/share/skel, in an access zone, where System is the value for the <zone> option and /usr/share/skel2 is the value for the <path> option:
isi zone zones modify System --skeleton-directory=/usr/share/skel2
160 Home directories

Home directory creation in a mixed environment
If a user logs in through both SMB and SSH, it is recommended that you configure home directory settings so the path template is the same for the SMB share and each authentication provider against which the user is authenticating through SSH.

Interactions between ACLs and mode bits
Home directory setup is determined by several factors, including how users authenticate and the options that specify home directory creation.
A user's home directory may be set up with either ACLs or POSIX mode bits, which are converted into a synthetic ACL. The directory of a local user is created when the local user is created, and the directory is set up with POSIX mode bits by default. Directories can be dynamically provisioned at log in for users who authenticate against external sources, and in some cases for users who authenticate against the File provider. In this situation, the user home directory is created according to how the user first logs in.
For example, if an LDAP user first logs in through SSH or FTP and the user home directory is created, it is created with POSIX mode bits. If that same user first connects through an SMB home directory share, the home directory is created as specified by the SMB option settings. If the --inherited-path-acl option is enabled, ACLs are generated. Otherwise, POSIX mode bits are used.

Default home directory settings in authentication providers

The default settings that affect how home directories are set up differ, based on the authentication provider that the user authenticates against.

Authentication provider Local
File Active Directory

Home directory
 --home-directorytemplate=/ifs/ home/%U
 --create-homedirectory=yes
 --loginshell=/bin/sh
 --home-directorytemplate=""
 --create-homedirectory=no
 --home-directorytemplate=/ifs/ home/%D/%U
 --create-homedirectory=no
 --loginshell=/bin/sh NOTE: If available, provider information overrides this value.

Home directory creation Enabled
Disabled Disabled

UNIX login shell /bin/sh
None /bin/sh

LDAP

 --home-directorytemplate=""
 --create-homedirectory=no

Disabled

None

Home directories 161

Authentication provider NIS

Home directory
 --home-directorytemplate=""
 --create-homedirectory=no

Home directory creation Disabled

UNIX login shell None

Related references Supported expansion variables on page 162

Supported expansion variables

You can include expansion variables in an SMB share path or in an authentication provider's home directory template.
OneFS supports the following expansion variables. You can improve performance and reduce the number of shares to be managed when you configure shares with expansion variables. For example, you can include the %U variable for a share rather than create a share for each user. When a %U is included in the name so that each user's path is different, security is still ensured because each user can view and access only his or her home directory.
NOTE: When you create an SMB share through the web administration interface, you must select the Allow Variable Expansion check box or the string is interpreted literally by the system.

Variable %U %D
%Z %L %0 %1 %2

Value

Description

User name (for example, user_001)

Expands to the user name to allow different users to use different home directories. This variable is typically included at the end of the path. For example, for a user named user1, the path /ifs/home/%U is mapped to /ifs/home/user1.

NetBIOS domain name (for example, YORK for YORK.EAST.EXAMPLE.COM)

Expands to the user's domain name, based on the authentication provider:
 For Active Directory users, %D expands to the Active Directory NetBIOS name.
 For local users, %D expands to the cluster name in uppercase characters. For example, for a cluster named cluster1, %D expands to CLUSTER1.
 For users in the System file provider, %D expands to UNIX_USERS.
 For users in other file providers, %D expands to FILE_USERS.
 For LDAP users, %D expands to LDAP_USERS.  For NIS users, %D expands to NIS_USERS.

Zone name (for example, ZoneABC)

Expands to the access zone name. If multiple zones are activated, this variable is useful for differentiating users in separate zones. For example, for a user named user1 in the System zone, the path /ifs/home/%Z/%U is mapped to /ifs/home/System/user1.

Host name (cluster host name Expands to the host name of the cluster, normalized to

in lowercase)

lowercase. Limited use.

First character of the user name

Expands to the first character of the user name.

Second character of the user Expands to the second character of the user name. name

Third character of the user name

Expands to the third character of the user name.

NOTE: If the user name includes fewer than three characters, the %0, %1, and %2 variables wrap around. For example, for a user named ab, the variables maps to a, b, and a, respectively. For a user named a, all three variables map to a.

162 Home directories

Domain variables in home directory provisioning

You can use domain variables to specify authentication providers when provisioning home directories.
The domain variable (%D) is typically used for Active Directory users, but it has a value set that can be used for other authentication providers. %D expands as described in the following table for the various authentication providers.

Authenticated user Active Directory user Local user
File user
LDAP user NIS user

%D expansion
Active Directory NetBIOS name--for example, YORK for provider YORK.EAST.EXAMPLE.COM.
The cluster name in all-uppercase characters--for example, if the cluster is named MyCluster, %D expands to MYCLUSTER.
 UNIX_USERS (for System file provider)  FILE_USERS (for all other file providers) LDAP_USERS (for all LDAP authentication providers)
NIS_USERS (for all NIS authentication providers)

Related references Supported expansion variables on page 162

Home directories 163

10
Data access control
This section contains the following topics:
Topics:
· Data access control overview · ACLs · UNIX permissions · Mixed-permission environments · Managing access permissions
Data access control overview
OneFS supports two types of permissions data on files and directories that control who has access: Windows-style access control lists (ACLs) and POSIX mode bits (UNIX permissions). You can configure global policy settings that enable you to customize default ACL and UNIX permissions to best support your environment. The OneFS file system installs with UNIX permissions as the default. You can give a file or directory an ACL by using Windows Explorer or OneFS administrative tools. Typically, files created over SMB or in a directory that has an ACL, receive an ACL. If a file receives an ACL, OneFS stops enforcing the file's mode bits; the mode bits are provided for only protocol compatibility, not for access control. OneFS supports multiprotocol data access over Network File System (NFS) and Server Message Block (SMB) with a unified security model. A user is granted or denied the same access to a file when using SMB for Windows file sharing as when using NFS for UNIX file sharing. NFS enables Linux and UNIX clients to remotely mount any subdirectory, including subdirectories created by Windows or SMB users. Linux and UNIX clients also can mount ACL-protected subdirectories created by a OneFS administrator. SMB provides Windows users access to files, directories and other file system resources stored by UNIX and Linux systems. In addition to Windows users, ACLs can affect local, NIS, and LDAP users. By default, OneFS maintains the same file permissions regardless of the client's operating system, the user's identity management system, or the file sharing protocol. When OneFS must transform a file's permissions from ACLs to mode bits or vice versa, it merges the permissions into an optimal representation that uniquely balances user expectations and file security.
ACLs
In Windows environments, file and directory permissions, referred to as access rights, are defined in access control lists (ACLs). Although ACLs are more complex than mode bits, ACLs can express much more granular sets of access rules. OneFS checks the ACL processing rules commonly associated with Windows ACLs. A Windows ACL contains zero or more access control entries (ACEs), each of which represents the security identifier (SID) of a user or a group as a trustee. In OneFS, an ACL can contain ACEs with a UID, GID, or SID as the trustee. Each ACE contains a set of rights that allow or deny access to a file or folder. An ACE can optionally contain an inheritance flag to specify whether the ACE should be inherited by child folders and files.
NOTE: Instead of the standard three permissions available for mode bits, ACLs have 32 bits of fine-grained access rights. Of these, the upper 16 bits are general and apply to all object types. The lower 16 bits vary between files and directories but are defined in a way that allows most applications to apply the same bits for files and directories. Rights grant or deny access for a given trustee. You can block user access explicitly through a deny ACE or implicitly by ensuring that a user does not directly, or indirectly through a group, appear in an ACE that grants the right.
164 Data access control

UNIX permissions
In a UNIX environment, file and directory access is controlled by POSIX mode bits, which grant read, write, or execute permissions to the owning user, the owning group, and everyone else. OneFS supports the standard UNIX tools for viewing and changing permissions, ls, chmod, and chown. For more information, run the man ls, man chmod, and man chown commands. All files contain 16 permission bits, which provide information about the file or directory type and the permissions. The lower 9 bits are grouped as three 3-bit sets, called triples, which contain the read, write, and execute (rwx) permissions for each class of users--owner, group, and other. You can set permissions flags to grant permissions to each of these classes. Unless the user is root, OneFS checks the class to determine whether to grant or deny access to the file. The classes are not cumulative: The first class matched is applied. It is therefore common to grant permissions in decreasing order.
Mixed-permission environments
When a file operation requests an object's authorization data, for example, with the ls -l command over NFS or with the Security tab of the Properties dialog box in Windows Explorer over SMB, OneFS attempts to provide that data in the requested format. In an environment that mixes UNIX and Windows systems, some translation may be required when performing create file, set security, get security, or access operations.
NFS access of Windows-created files
If a file contains an owning user or group that is a SID, the system attempts to map it to a corresponding UID or GID before returning it to the caller. In UNIX, authorization data is retrieved by calling stat(2) on a file and examining the owner, group, and mode bits. Over NFSv3, the GETATTR command functions similarly. The system approximates the mode bits and sets them on the file whenever its ACL changes. Mode bit approximations need to be retrieved only to service these calls.
NOTE: SID-to-UID and SID-to-GID mappings are cached in both the OneFS ID mapper and the stat cache. If a mapping has recently changed, the file might report inaccurate information until the file is updated or the cache is flushed.
SMB access of UNIX-created files
No UID-to-SID or GID-to-SID mappings are performed when creating an ACL for a file; all UIDs and GIDs are converted to SIDs or principals when the ACL is returned. OneFS initiates a two-step process for returning a security descriptor, which contains SIDs for the owner and primary group of an object: 1. The current security descriptor is retrieved from the file. If the file does not have a discretionary access control list (DACL),
a synthetic ACL is constructed from the file's lower 9 mode bits, which are separated into three sets of permission triples-- one each for owner, group, and everyone. For details about mode bits, see the UNIX permissions topic. 2. Two access control entries (ACEs) are created for each triple: the allow ACE contains the corresponding rights that are granted according to the permissions; the deny ACE contains the corresponding rights that are denied. In both cases, the trustee of the ACE corresponds to the file owner, group, or everyone. After all of the ACEs are generated, any that are not needed are removed before the synthetic ACL is returned.
Data access control 165

Managing access permissions
The internal representation of identities and permissions can contain information from UNIX sources, Windows sources, or both. Because access protocols can process the information from only one of these sources, the system may need to make approximations to present the information in a format the protocol can process.
View expected user permissions
You can view the expected permissions for user access to a file or directory. This procedure must be performed through the command-line interface (CLI). 1. Establish an SSH connection to any node in the cluster. 2. View expected user permissions by running the isi auth access command.
The following command displays permissions in /ifs/ for the user that you specify in place of <username>:
isi auth access <username> /ifs/
The system displays output similar to the following example:
User Name : <username> UID : 2018 SID : SID:S-1-5-21-2141457107-1514332578-1691322784-1018
File Owner : user:root Group : group:wheel Mode : drwxrwxrwx
Relevant Mode : d---rwx--Permissions
Expected : user:<username> \ allow dir_gen_read,dir_gen_write,dir_gen_execute,delete_child
3. View mode-bits permissions for a user by running the isi auth access command. The following command displays verbose-mode file permissions information in /ifs/ for the user that you specify in place of <username>:
isi auth access <username> /ifs/ -v
The system displays output similar to the following example:
User Name : <username> UID \ : 2018 SID : SID:S-1-5-21-2141457107-1514332578-1691322784-1018 File Owner : user:root Group : group:wheel Mode : drwxrwxrwx Relevant Mode : d---rwx--- Permissions Expected : user:<username> allow dir_gen_read,dir_gen_write,dir_gen_execute,delete_child
4. View expected ACL user permissions on a file for a user by running the isi auth access command. The following command displays verbose-mode ACL file permissions for the file file_with_acl.tx in /ifs/data/ for the user that you specify in place of <username>:
isi auth access <username> /ifs/data/file_with_acl.tx -v
The system displays output similar to the following example:
User Name : <username> \ UID : 2097 SID : SID:S-1-7-21-2141457107-1614332578-1691322789-1018 File Owner : user:<username> Group : group:wheel Permissions Expected : user:<username> allow file_gen_read,file_gen_write,std_write_dac Relevant Acl: group:<group-name> Users allow file_gen_read user:<username> allow std_write_dac,file_write,
166 Data access control

append,file_write_ext_attr,file_write_attr group:wheel allow file_gen_read,file_gen_write
Configure access management settings
Default access settings include whether to send NTLMv2 responses for SMB connections, the identity type to store on disk, the Windows workgroup name for running in local mode, and character substitution for spaces encountered in user and group names. Configure access management settings by running the isi auth settings global modify command. The following command modifies global settings for a workgroup:
isi auth settings global modify \ --send-ntlmv2=false --on-disk-identity=native \ --space-replacement="_" --workgroup=WORKGROUP
Modify ACL policy settings
You can modify ACL policy settings but the default ACL policy settings are sufficient for most cluster deployments. CAUTION: Because ACL policies change the behavior of permissions throughout the system, they should be modified only as necessary by experienced administrators with advanced knowledge of Windows ACLs. This is especially true for the advanced settings, which are applied regardless of the cluster's environment.
For UNIX, Windows, or balanced environments, the optimal permission policy settings are selected and cannot be modified. You can choose to manually configure the cluster's default permission settings if necessary to support your particular environment, however. Run the following command to modify ACL policy settings:
isi auth settings acls modify
Run the PermissionsRepair job
You can update file and directory permissions or ownership by running the Repair Permissions job. To prevent permissions issues that can occur after changing the on-disk identity, run this job with the Convert Permissions job to ensure that the changes are fully propagated throughout the cluster. To prevent permissions issues that can occur after changing the on-disk identity, run this authentication and access control job with convert mode specified to ensure that the changes are fully propagated throughout the cluster. Update cluster permissions by running the isi job jobs start command with the following syntax. The following command updates cluster permissions, where permissionrepair specifies the job type, where variables in angle brackets are placeholders for values specific to your environment:
isi job start permissionrepair --priority <1-10> \ --policy <policy> --mode <clone | inherit | convert > \ --mapping-type=<system | sid | unix | native> --zone <zone-name>
NOTE: You cannot combine the --template parameter with the convert mode option, but you can combine the parameter with the clone and inherit mode options. Conversely, you cannot combine the --mapping-type and --zone parameters with the clone and inherit mode options, but you can combine the parameters with the convert mode option.
Data access control 167

The following example updates cluster permissions, where permissionrepair specifies the job type, the priority is 3, the chosen mode is convert, and the mapping type is unix:
isi job jobs start permissionrepair --priority=3 \ --policy myPolicy --mode=convert --mapping-type=unix \ --template <isi path> --path </ifs directory> --zone zone2
168 Data access control

11
File sharing
This section contains the following topics:
Topics:
· File sharing overview · SMB security · NFS security · FTP · HTTP and HTTPS security
File sharing overview
Multiprotocol support in OneFS enables accessing files and directories on the PowerScale cluster through SMB for Windows file sharing, NFS for UNIX file sharing, secure shell (SSH), FTP, and HTTP. By default, all file sharing protocols are disabled. You enable each protocol that you intend to use and configure the default share for each protocol. For example, you can configure the /ifs directory as an SMB share and an NFS export.
NOTE: It is recommended that you do not save data to the root /ifs file path but in directories below /ifs. The design of your data storage structure should be planned carefully. A well-designed directory structure optimizes cluster performance and administration. You can set Windows- and UNIX-based permissions on OneFS files and directories. With the required permissions and administrative privileges, you can create, modify, and read data on the cluster through one or more of the supported file sharing protocols.  SMB. Allows Microsoft Windows and MacOS X clients to access files that are stored on the cluster.  NFS. Allows Linux and UNIX clients that adhere to the RFC1813 (NFSv3) and RFC3530 (NFSv4) specifications to access files that are stored on the cluster.  HTTP and HTTPS (with optional DAV). Allows clients to access files that are stored on the cluster through a web browser.  FTP. Allows any client that is equipped with an FTP client program to access files that are stored on the cluster through the FTP protocol.
Mixed protocol environments
You enable the protocols that you intend to use for file sharing. You can configure your OneFS cluster to use SMB or NFS exclusively, or both. You can also enable HTTP, FTP, and SSH. You configure default shares and exports for each protocol that you enable. Use the isi services protocol enablecommand to enable each protocol. For example, to enable NFS, use the following command.
isi services nfs enable
Access rights are consistently enforced across access protocols on all security models. For example, a user is granted or denied the same rights to a file whether using SMB or NFS. Clusters running OneFS support a set of global policy settings that enable you to customize the default access control list (ACL) and UNIX permissions settings.
NOTE: Access control lists (ACLs) are not supported with HDFS.
OneFS is configured with standard UNIX permissions on the file tree. Through Windows Explorer or OneFS administrative tools, you can give any file or directory an ACL. In addition to Windows domain users and groups, ACLs in OneFS can include local, NIS, and LDAP users and groups. After a file is given an ACL, the mode bits are no longer enforced and exist only as an estimate of the effective permissions.
File sharing 169

NOTE: It is recommended that you configure ACL and UNIX permissions only if you fully understand how they interact with one another.

Write caching with SmartCache

Write caching accelerates the process of writing data to the cluster. OneFS includes a write-caching feature called SmartChache, which is enabled by default for all files and directories.
If write caching is enabled, OneFS writes data to a write-back cache instead of immediately writing the data to disk. OneFS can write the data to disk at a time that is more convenient.
NOTE: We recommend that you keep write caching enabled. You should also enable write caching for all file pool policies.
OneFS interprets writes to the cluster as either synchronous or asynchronous, depending on a client's specifications. The impacts and risks of write caching depend on what protocols clients use to write to the cluster, and whether the writes are interpreted as synchronous or asynchronous. If you disable write caching, client specifications are ignored and all writes are performed synchronously.
The following table explains how clients' specifications are interpreted, according to the protocol.

Protocol NFS
SMB

Synchronous

Asynchronous

The stable field is set to data_sync or The stable field is set to unstable. file_sync.

The write-through flag has been applied.

The write-through flag has not been applied.

Write caching for asynchronous writes

Writing to the cluster asynchronously with write caching is the fastest method of writing data to your cluster.
Write caching for asynchronous writes requires fewer cluster resources than write caching for synchronous writes, and will improve overall cluster performance for most workflows. However, there is some risk of data loss with asynchronous writes.
The following table describes the risk of data loss for each protocol when write caching for asynchronous writes is enabled:

Protocol NFS
SMB

Risk
If a node fails, no data will be lost except in the unlikely event that a client of that node also crashes before it can reconnect to the cluster. In that situation, asynchronous writes that have not been committed to disk will be lost.
If a node fails, asynchronous writes that have not been committed to disk will be lost.

We recommend that you do not disable write caching, regardless of the protocol that you are writing with. If you are writing to the cluster with asynchronous writes, and you decide that the risks of data loss are too great, we recommend that you configure your clients to use synchronous writes, rather than disable write caching.

Write caching for synchronous writes
Write caching for synchronous writes costs cluster resources, including a negligible amount of storage space. Although it is not as fast as write caching with asynchronous writes, unless cluster resources are extremely limited, write caching with synchronous writes is faster than writing to the cluster without write caching.
Write caching does not affect the integrity of synchronous writes; if a cluster or a node fails, none of the data in the write-back cache for synchronous writes is lost.

170 File sharing

SMB security

OneFS includes a configurable SMB service to create and manage SMB shares. SMB shares provide Windows clients with network access to file system resources on the cluster. You can grant permissions to users and groups to perform operations such as reading, writing, and setting access permissions on SMB shares. SMB is disabled by default. To enable SMB, use the following command:
isi services smb enable

You then configure the default SMB share. See the section Managing SMB shares for more information.
OneFS supports both user and anonymous security modes. If the user security mode is enabled, users who connect to a share from an SMB client must provide a valid user name with proper credentials.
SMB shares act as checkpoints, and users must have access to a share in order to access objects in a file system on a share. If a user has access granted to a file system, but not to the share on which it resides, that user will not be able to access the file system regardless of privileges. For example, assume a share named ABCDocs contains a file named file1.txt in a path such as: /ifs/data/ABCDocs/file1.txt. If a user attempting to access file1.txt does not have share privileges on ABCDocs, that user cannot access the file even if originally granted read and/or write privileges to the file.
The SMB protocol uses security identifiers (SIDs) for authorization data. All identities are converted to SIDs during retrieval and are converted back to their on-disk representation before they are stored on the cluster.
When a file or directory is created, OneFS checks the access control list (ACL) of its parent directory. If the ACL contains any inheritable access control entries (ACEs), a new ACL is generated from those ACEs. Otherwise, OneFS creates an ACL from the combined file and directory create mask and create mode settings.
OneFS supports the following SMB clients:

SMB version 3.0 - Multichannel only 2.1 2.0
1.0

Supported operating systems Windows 8 or later Windows Server 2012 or later
Windows 7 or later Windows Server 2008 R2 or later
Windows Vista or later Windows Server 2008 or later Mac OS X 10.9 or later
Windows 2000 or later Windows XP or later Mac OS X 10.5 or later

SMB shares in access zones
You can create and manage SMB shares within access zones.
You can create access zones that partition storage on the cluster into multiple virtual containers. Access zones support all configuration settings for authentication and identity management services on the cluster. That means that you can configure authentication providers and provision SMB shares on a zone-by-zone basis. When you create an access zone, a local provider is created automatically. That allows you to configure each access zone with a list of local users and groups. You can also authenticate through a different Active Directory provider in each access zone. You can also control data access by directing incoming connections to the access zone from a specific IP address in a pool. Associating an access zone with an IP address pool restricts authentication to the associated access zone and reduces the number of available and accessible SMB shares.
Here are a few ways to simplify SMB management with access zones:
 Migrate multiple SMB servers, such as Windows file servers or NetApp filers, to a single PowerScale cluster, and then configure a separate access zone for each SMB server.

File sharing 171

 Configure each access zone with a unique set of SMB share names that do not conflict with share names in other access zones, and then join each access zone to a different Active Directory domain.
 Reduce the number of available and accessible shares to manage by associating an IP address pool with an access zone to restrict authentication to the zone.
 Configure default SMB share settings that apply to all shares in an access zone.
The cluster includes an integrated access zone named System. The System access zone is where you manage all aspects of the cluster and other access zones. If you do not specify an access zone when managing SMB shares, OneFS defaults to the System zone.

SMB Multichannel

SMB Multichannel supports establishing a single SMB session over multiple network connections. SMB Multichannel is a feature of the SMB 3.0 protocol that provides the following capabilities:

Increased throughput
Connection failure tolerance
Automatic discovery

OneFS can transmit more data to a client through multiple connections over high speed network adapters or over multiple network adapters.
When an SMB Multichannel session is established over multiple network connections, the session is not lost if one of the connections has a network fault, which enables the client to continue to work.
SMB Multichannel automatically discovers supported hardware configurations on the client that have multiple available network paths and then negotiates and establishes a session over multiple network connections. You are not required to install components, roles, role services, or features.

SMB Multichannel requirements
You must meet software and NIC configuration requirements to support SMB Multichannel on a cluster.
OneFS can only support SMB Multichannel when the following software requirements are met:
 Windows Server 2012, 2012 R2 or Windows 8, 8.1 clients  SMB Multichannel must be enabled on both the cluster and the Windows client computer. It is enabled on the cluster by
default.
SMB Multichannel establishes a single SMB session over multiple network connections only on supported network interface card (NIC) configurations. SMB Multichannel requires at least one of the following NIC configurations on the client computer:
 Two or more network interface cards.  One or more network interface cards that support Receive Side Scaling (RSS).  One or more network interface cards configured with link aggregation. Link aggregation enables you to combine the
bandwidth of multiple NICs on a node into a single logical interface.

Client-side NIC configurations supported by SMB Multichannel

SMB Multichannel automatically discovers supported hardware configurations on the client that have multiple available network paths.
Each node on the cluster has at least one RSS-capable network interface card (NIC). Your client-side NIC configuration determines how SMB Multichannel establishes simultaneous network connections per SMB session.

Client-side NIC Configuration Single RSS-capable NIC
Multiple NICs

Description
SMB Multichannel establishes a maximum of four network connections to the PowerScale cluster over the NIC. The connections are more likely to be spread across multiple CPU cores, which reduces the likelihood of performance bottleneck issues and achieves the maximum speed capability of the NIC.
If the NICs are RSS-capable, SMB Multichannel establishes a maximum of four network connections to the PowerScale cluster over each NIC. If the NICs on the client are not RSScapable, SMB Multichannel establishes a single network connection to the PowerScale cluster over each NIC. Both configurations allow SMB Multichannel to leverage the combined bandwidth of multiple NICs and provides connection fault tolerance if a connection or a NIC fails.

172 File sharing

Client-side NIC Configuration
Aggregated NICs

Description
NOTE: SMB Multichannel cannot establish more than eight simultaneous network connections per session. In a multiple NIC configuration, this might limit the number connections allowed per NIC. For example, if the configuration contains three RSS-capable NICs, SMB Multichannel might establish three connections over the first NIC, three connections over the second NIC and two connections over the third NIC.
SMB Multichannel establishes multiple network connections to the PowerScale cluster over aggregated NICs, which results in balanced connections across CPU cores, effective consumption of combined bandwidth, and connection fault tolerance.
NOTE: The aggregated NIC configuration inherently provides NIC fault tolerance that is not dependent upon SMB.

SMB share management through MMC
OneFS supports the Shared Folders snap-in for the Microsoft Management Console (MMC), which allows SMB shares on the cluster to be managed using the MMC tool. Typically, you connect to the global System zone through the web administration interface or the command line interface to manage and configure shares. If you configure access zones, you can connect to a zone through the MMC Shared Folders snap-in to directly manage all shares in that zone. You can establish a connection through the MMC Shared Folders snap-in to a PowerScale node and perform the following SMB share management tasks:  Create and delete shared folders  Configure access permission to an SMB share  View a list of active SMB sessions  Close open SMB sessions  View a list of open files  Close open files When you connect to a zone through the MMC Shared Folders snap-in, you can view and manage all SMB shares assigned to that zone; however, you can only view active SMB sessions and open files on the specific node that you are connected to in that zone. Changes you make to shares through the MMC Shared Folders snap-in are propagated across the cluster.
MMC connection requirements
You can connect to a cluster through the MMC Shared Folders snap-in if you meet access requirements. The following conditions are required to establish a connection through the MMC Shared Folders snap-in:  You must run the Microsoft Management Console (MMC) from a Windows workstation that is joined to the domain of an
Active Directory (AD) provider configured on the cluster.  You must be a member of the local <cluster>\Administrators group.
NOTE: Role-based access control (RBAC) privileges do not apply to the MMC. A role with SMB privileges is not sufficient to gain access.  You must log in to a Windows workstation as an Active Directory user that is a member of the local <cluster>\Administrators group.
SMBv3 encryption
Certain Microsoft Windows and Apple Mac client/server combinations can support data encryption in SMBv3 environments. You can configure SMBv3 encryption on a per-share, per-zone, or cluster-wide basis. You can allow encrypted and unencrypted clients access. Globally and for access zones, you can also require that all client connections are encrypted. If you set encryption settings on a per-zone basis, those settings will override global server settings.
NOTE: Per-zone and per-share encryption settings can only be configured through the OneFS command line interface.

File sharing 173

Enable SMBv3 encryption for an SMB share
You can enable or disable SMBv3 encryption on a share. To enable SMBv3 encryption for a share:  isi smb settings shares modify --smb3-encryption-enabled yes
SMBv3 encryption is enabled. To disable SMBv3 encryption, use the --revert-smb3-encryption-enabled option.
Enable SMBv3 encryption for an access zone
You can enable SMBv3 encryption on a per access zone basis. Zone-specific encryption settings override global encryption settings. To enable SMBv3 encryption for an access zone:  isi smb settings zone modify --zone=<zone> --support-smb3-encryption yes
SMBv3 encryption is enabled for the specific access zone. To disable SMBv3 encryption, use the --revert-supportsmb3-encryption option.
Enable SMBv3 encryption globally
You can enable SMBv3 encryption on a global basis. However, if you later set or modify encryption settings on an access zone level, those settings will override the global settings. To enable SMBv3 encryption globally:  isi smb settings global modify --support-smb3-encryption yes
SMBv3 encryption is enabled globally on the cluster. To disable SMBv3 encryption, use the --revert-support-smb3encryption option.
Enforce SMBv3 encryption
You can require that all client connections to a cluster or access zone are encrypted for SMBv3. For example, to require that all connections to an access zone are encrypted:  isi smb settings zone modify --zone=<zone> --reject-unencrypted-access yes
SMBv3 encryption is required for a client to connect to the specific access zone. To disable SMBv3 encryption requirement, use the --revert-reject-unencrypted access option.
SMB server-side copy
In order to increase system performance, SMB 2 and later clients can utilize the server-side copy feature in OneFS. Windows clients making use of server-side copy support may experience performance improvements for file copy operations, because file data no longer needs to traverse the network. The server-side copy feature reads and writes files only on the server, avoiding the network round-trip and duplication of file data. This feature only affects file copy or partial copy operations in which the source and destination file handles are open on the same share, and does not work for cross-share operations. This feature is enabled by default across OneFS clusters, and can only be disabled system-wide across all zones. Additionally, server-side copy in OneFS is incompatible with the SMB continuous availability feature. If continuous availability is enabled for a share and the client opens a persistent file handle, server-side copy is automatically disabled for that file.
NOTE: You can only disable or enable SMB server-side copy for OneFS using the command line interface (CLI).
Enable or disable SMB server-side copy
You can enable or disable the SMB server-side copy feature. The SMB server-side copy feature is enabled in OneFS by default. 1. Open a secure shell (SSH) connection to the cluster. 2. Run the isi smb settings global modify command.
174 File sharing

3. Modify the --server-side-copy option as necessary. This feature is enabled by default.
For example, the following command disables SMB server-side copy:
isi smb settings global modify --server-side-copy=no
SMB continuous availability
If you are running OneFS in an SMB 3.0 environment, you allow certain Windows clients to open files on a server with continuous availability enabled. If a server is using Windows 8 or Windows Server 2012, clients can create persistent file handles that can be reclaimed after an outage such as a network-related disconnection or a server failure. You can specify how long the persistent handle is retained after a disconnection or server failure, and also force strict lockouts on users attempting to open a file belonging to another handle. Furthermore, through the OneFS command-line interface (CLI), you can configure write integrity settings to control the stability of writes to the share. If continuous availability is enabled for a share and the client opens a persistent file handle, server-side copy is automatically disabled for that file.
NOTE: You can only enable continuous availability when creating a share, but you can update timeout, lockout, and write integrity settings when creating or modifying a share.
Enable SMB continuous availability
You can enable SMB 3.0 continuous availability and configure settings when you create a share. You can also update continuous availability timeout, lockout, and write integrity settings when you modify a share.  Run isi smb shares create to enable this feature and configure settings, and isi smb shares modify or isi
smb settings shares modify to change settings. The following command enables continuous availability on a new share named Share4, sets the timeout for the handle to three minutes (180 seconds), enforces a strict lockout, and changes the write integrity setting to full:
isi smb shares create --name=Share4 --path=/ifs/data/Share4 \ --continuously-available=yes --ca-timeout=180 \ --strict-ca-lockout=yes --ca-write-integrity=full
SMB file filtering
You can use SMB file filtering to allow or deny file writes to a share or access zone. This feature enables you to deny certain types of files that might cause throughput issues, security problems, storage clutter, or productivity disruptions. You can restrict writes by allowing writes of certain file types to a share.  If you choose to deny file writes, you can specify file types by extension that are not allowed to be written. OneFS permits all
other file types to be written to the share.  If you choose to allow file writes, you can specify file types by extension that are allowed to be written. OneFS denies all
other file types to be written to the share. You can add or remove file extensions if your restriction policies change.
Enable SMB file filtering
You can enable or disable SMB file filtering when you create or modify a share.  Run isi smb shares create or isi smb shares modify.
The following command enables file filtering on a share named Share2 and denies writes by the file types .wav and .mpg:
isi smb shares modify Share2 --file-filtering-enabled=yes \ file-filter-extensions=.wav,.mpg
File sharing 175

The following command enables file filtering on a share named Share3, specifies the file type .xml, and specifies to allow writes for that file type:
isi smb shares modify Share3 --file-filtering-enabled=yes \ file-filter-extensions=.xml --file-filter-type=allow
Symbolic links and SMB clients
OneFS enables SMB2 clients to access symbolic links in a seamless manner. Many administrators deploy symbolic links to virtually reorder file system hierarchies, especially when crucial files or directories are scattered around an environment. In an SMB share, a symbolic link (also known as a symlink or a soft link) is a type of file that contains a path to a target file or directory. Symbolic links are transparent to applications running on SMB clients, and they function as typical files and directories. Support for relative and absolute links is enabled by the SMB client. The specific configuration depends on the client type and version. A symbolic link that points to a network file or directory that is not in the path of the active SMB session is referred to as an absolute (or remote) link. Absolute links always point to the same location on a file system, regardless of the present working directory, and usually contain the root directory as part of the path. Conversely, a relative link is a symbolic link that points directly to a user's or application's working directory, so you do not have to specify the full absolute path when creating the link. OneFS exposes symbolic links through the SMB2 protocol, enabling SMB2 clients to resolve the links instead of relying on OneFS to resolve the links on behalf of the clients. To transverse a relative or absolute link, the SMB client must be authenticated to the SMB shares that the link can be followed through. However, if the SMB client does not have permission to access the share, access to the target is denied and Windows will not prompt the user for credentials. SMB2 and NFS links are interoperable for relative links only. For maximum compatibility, create these links from a POSIX client.
NOTE: SMB1 clients (such as Windows XP or 2002) may still use relative links, but they are traversed on the server side and referred to as "shortcut files." Absolute links do not work in these environments.
Enabling symbolic links
Before you can fully use symbolic links in an SMB environment, you must enable them. For Windows SMB clients to traverse each type of symbolic link, you must enable them on the client. Windows supports the following link types:  local to local  remote to remote  local to remote  remote to local You must run the following Windows command to enable all four link types:
fsutil behavior set SymlinkEvaluation L2L:1 R2R:1 L2R:1 R2L:1 For POSIX clients using Samba, you must set the following options in the [global] section of your Samba configuration file (smb.conf) to enable Samba clients to traverse relative and absolute links:
follow symlinks=yes wide links=yes In this case, "wide links" in the smb.conf file refers to absolute links. The default setting in this file is no.
176 File sharing

Managing symbolic links
After enabling symbolic links, you can create or delete them from the Windows command prompt or a POSIX command line. Create symbolic links using the Windows mklink command on an SMB2 client or the ln command from a POSIX command-line interface. For example, an administrator may want to give a user named User1 access to a file named File1.doc in the /ifs/ data/ directory without giving specific access to that directory by creating a link named Link1:
mklink \ifs\home\users\User1\Link1 \ifs\data\Share1\File1.doc
When you create a symbolic link, it is designated as a file link or directory link. Once the link is set, the designation cannot be changed. You can format symbolic link paths as either relative or absolute. To delete symbolic links, use the del command in Windows, or the rm command in a POSIX environment. Keep in mind that when you delete a symbolic link, the target file or directory still exists. However, when you delete a target file or directory, a symbolic link continues to exist and still points to the old target, thus becoming a broken link.
Anonymous access to SMB shares
You can configure anonymous access to SMB shares by enabling the local Guest user and allowing impersonation of the guest user. For example, if you store files such as browser executables or other data that is public on the internet, anonymous access allows any user to access the SMB share without authenticating.
Managing SMB settings
You can enable or disable the SMB service, configure global settings for the SMB service, and configure default SMB share settings that are specific to each access zone.
View global SMB settings
You can view the global SMB settings that are applied to all nodes on the cluster. This task can only be performed through the OneFS command-line interface. 1. Establish an SSH connection to any node in the cluster. 2. Run the isi smb settings global view command.
The system displays output similar to the following example:
Access Based Share Enum: No Dot Snap Accessible Child: Yes
Dot Snap Accessible Root: Yes Dot Snap Visible Child: No Dot Snap Visible Root: Yes
Enable Security Signatures: No Guest User: nobody Ignore Eas: No
Onefs Cpu Multiplier: 4 Onefs Num Workers: 0
Require Security Signatures: No Server Side Copy: Yes Server String: PowerScale Server
Srv Cpu Multiplier: 4 Srv Num Workers: 0
Support Multichannel: Yes Support NetBIOS: No Support Smb2: Yes Support Smb3 Encryption: No
File sharing 177

Configure global SMB settings
You can configure global settings for SMB file sharing. This task can only be performed through the OneFS command-line interface.
CAUTION: Modifying global SMB file sharing settings could result in operational failures. Be aware of the potential consequences before modifying these settings. 1. Establish an SSH connection to any node in the cluster. 2. Run the isi smb settings global modify command. The following example command disables SMB server-side copy: isi smb settings global modify --server-side-copy=no
Enable or disable the SMB service
The SMB service is enabled by default. NOTE: You can determine whether the service is enabled or disabled by running the isi services -l command.
 Run the isi services command. The following command disables the SMB service: isi services smb disable The following command enables the SMB service: isi services smb enable
Enable or disable SMB Multichannel
SMB Multichannel is required for multiple, concurrent SMB sessions from a Windows client computer to a node in a cluster. SMB Multichannel is enabled in the cluster by default. You can enable or disable SMB Multichannel only through the command-line interface. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Run the isi smb settings global modify command.
The following command enables SMB Multichannel on the cluster: isi smb settings global modify ­-support-multichannel=yes
The following command disables SMB Multichannel on the cluster: isi smb settings global modify ­-support-multichannel=no
View default SMB share settings
You can view the default SMB share settings specific to an access zone.  Run the isi smb settings shares view command.
The following example command displays the default SMB share settings configured for zone5 : isi smb settings shares view --zone=zone5
The system displays output similar to the following example: Access Based Enumeration: No
Access Based Enumeration Root Only: No Allow Delete Readonly: No Allow Execute Always: No
178 File sharing

Ca Timeout: 120 Strict Ca Lockout: No
Change Notify: norecurse Create Permissions: default acl Directory Create Mask: 0700 Directory Create Mode: 0000
File Create Mask: 0700 File Create Mode: 0100 File Filtering Enabled: Yes File Filter Extensions: .wav File Filter Type: deny
Hide Dot Files: No Host ACL: -
Impersonate Guest: never Impersonate User: -
Mangle Byte Start: 0XED00 Mangle Map: 0x01-0x1F:-1, 0x22:-1, 0x2A:-1, 0x3A:-1,
0x3C:-1, 0x3E:-1, 0x3F:-1, 0x5C:-1 Ntfs ACL Support: Yes Oplocks: Yes Strict Flush: Yes Strict Locking: No
Configure default SMB share settings
You can configure SMB share settings specific to each access zone. The default settings are applied to all new shares that are added to the access zone.
CAUTION: If you modify the default settings, the changes are applied to all existing shares in the access zone.
Run the isi smb settings shares modify command. The following command specifies that guests are never allowed access to shares in zone5:
isi smb settings global modify --zone=zone5 --impersonate-guest=never
Managing SMB shares
You can configure the rules and other settings that govern the interaction between your Windows network and individual SMB shares on the cluster. OneFS supports %U, %D, %Z, %L, %0, %1, %2, and %3 variable expansion and automatic provisioning of user home directories. You can configure the users and groups that are associated with an SMB share, and view or modify their share-level permissions.
NOTE: We recommend that you configure advanced SMB share settings only if you have a solid understanding of the SMB protocol.
Create an SMB share
When you create an SMB share, you can override the default permissions, performance, and access settings. You can configure SMB home directory provisioning by including expansion variables in the share path to automatically create and redirect users to their own home directories. You must specify a path to use as the SMB share. Shares are specific to access zones and the share path must exist under the zone path. You can specify an existing path or create the path at the time you create the share. Create access zones before you create SMB shares. You can specify one or more expansion variables in the directory path but you must set the flags to true for both the --allow-variable-expansion and --auto-create-directory parameters. If you do not specify these settings, the variable expansion string is interpreted literally by the system. 1. Run the isi smb shares create command.
File sharing 179

The following commands creates a directory at /ifs/zone5/data/share1, creates a share named share1 using that path, and adds the share to the existing access zone named zone5:
mkdir /ifs/data/share1 isi smb shares create --name=share1 \ --path=/ifs/data/share1 --zone=zone5 --browsable=true \ --description="Example Share 1"
NOTE: Share names can contain up to 80 characters, except for the following: " \ / [ ] : | < > + = ; , * ? Also, if the cluster character encoding is not set to UTF-8, SMB share names are case-sensitive. The following command creates a directory at /ifs/data/share2, converts it to an SMB share, and adds the share to the default System zone because no zone is specified:
isi smb shares create share2 --path=/ifs/data/share2 \ --create-path --browsable=true --description="Example Share 2"
The following command creates a directory at /ifs/data/share3 and converts it to an SMB share. The command also applies an ACL to the share:
isi smb shares create share3 --path=/ifs/data/share3 \ --create-path --browsable=true --description="Example Share 3" \ --inheritable-path-acl=true --create-permissions="default acl"
NOTE: If no default ACL is configured and the parent directory does not have an inheritable ACL, an ACL is created for the share with the directory-create-mask and directory-create-mode settings. The following command creates the directory /ifs/data/share4 and converts it to a non-browsable SMB share. The command also configures the use of mode bits for permissions control:
isi smb shares create --name=share4 --path=/ifs/data/share4 \ --create-path --browsable=false --description="Example Share 4" \ --inheritable-path-acl=true --create-permissions="use create \ mask and mode"
2. The following command creates home directories for each user that connects to the share, based on the user's NetBIOS domain and user name. In this example, if a user is in a domain named DOMAIN and has a username of user_1, the path /ifs/home/%D/%U expands to /ifs/home/DOMAIN/user_1.
isi smb shares modify HOMEDIR --path=/ifs/home/%D/%U \ --allow-variable-expansion=yes --auto-create-directory=yes
The following command creates a share named HOMEDIR with the existing path /ifs/share/home:
isi smb shares create HOMEDIR /ifs/share/home
3. Run the isi smb shares permission modify command to enable access to the share. The following command allows the well-known user Everyone full permissions to the HOMEDIR share:
isi smb shares permission modify HOMEDIR --wellknown Everyone \ --permission-type allow --permission full
Modify an SMB share
You can modify the settings of individual SMB shares. SMB shares are zone-specific. When you modify a share, you must identify the access zone that the share belongs to. If you do not identify the access zone, OneFS defaults to the System zone. If the share you want to modify has the same name as a share in the System zone, the share in the System zone is modified. Run the isi smb shares modify command.
180 File sharing

In the following example, the file path for share1 in zone5 points to /ifs/zone5/data. The following commands modifies the file path of share1 to /ifs/zone5/etc, which is another directory in the zone5 path:
isi smb shares modify share1 --zone=zone5 \ --path=/ifs/zone5/etc
NOTE: If the cluster character encoding is not set to UTF-8, SMB share names are case-sensitive.
Delete an SMB share
You can delete SMB shares that are no longer needed. SMB shares are zone-specific. When you delete a share, you must identify the access zone that the share belongs to. If you do not identify the access zone, OneFS defaults to the System zone. If the share you want to delete has the same name as a share in the System zone, the share in the System zone is deleted. If you delete an SMB share, the share path is deleted but the directory it referenced still exists. If you create a new share with the same path as the share that was deleted, the directory that the previous share referenced will be accessible again through the new share. 1. Run the isi smb shares delete command.
The following command deletes a share named Share1 from the access zone named zone-5:
isi smb shares delete Share1 --zone=zone-5 2. Type yes at the confirmation prompt.
Limit access to /ifs share for the Everyone account
By default, the /ifs root directory is configured as an SMB share in the System access zone. It is recommended that you restrict the Everyone account of this share to read-only access. 1. Run the isi smb shares permission modify command.
The following example changes the Everyone account permissions to read-only on the SMB share configured for the /ifs directory:
isi smb shares permission modify ifs --wellknown=Everyone \ -d allow -p read
2. Optional: Verify the change by running the following command to list permissions on the share:
isi smb shares permission list ifs
Configure anonymous access to a single SMB share
You can configure anonymous access to data stored on a single share through Guest user impersonation. 1. Enable the Guest user account in the access zone that contains the share you want by running the isi auth users
modify command. The following command enables the guest user in the access zone named zone3:
isi auth users modify Guest --enabled=yes --zone=zone3 2. Set guest impersonation on the share you want to allow anonymous access to by running the isi smb share modify
command. The following command configures guest impersonation on a share named share1 in zone3:
isi smb share modify share1 --zone=zone3 \ --impersonate-guest=always 3. Verify that the Guest user account has permission to access the share by running the isi smb share permission list command.
File sharing 181

The following command list the permissions for share1 in zone3:

isi smb share permission list share1 --zone=zone3

The system displays output similar to the following example

Account Account Type Run as Root Permission Type Permission

----------------------------------------------------------------

Everyone wellknown

False

allow

read

Guest

user

False

allow

full

----------------------------------------------------------------

Configure anonymous access to all SMB shares in an access zone

You can configure anonymous access to data stored in an access zone through Guest user impersonation. 1. Enable the Guest user account in the access zone that contains the share you want by running the isi auth users
modify command. The following command enables the guest user in the access zone named zone3:
isi auth users modify Guest --enabled=yes --zone=zone3
2. Set guest impersonation as the default value for all shares in the access zone by running the isi smb settings share modify command. The following command configures guest impersonation for all shares in zone3:
isi smb settings share modify --zone=zone3 \ --impersonate-guest=always
3. Verify that the Guest user account has permission to each share in the access zone by running the isi smb share permission list command. The following command list the permissions for share1 in zone3:
isi smb share permission list share1 --zone=zone3

The system displays output similar to the following example

Account Account Type Run as Root Permission Type Permission

----------------------------------------------------------------

Everyone wellknown

False

allow

read

Guest

user

False

allow

full

----------------------------------------------------------------

Configure multi-protocol home directory access
For users who will access this share through FTP or SSH, you can make sure that their home directory path is the same whether they connect through SMB or they log in through FTP or SSH. This task may only be performed at the OneFS command-line interface.
This command directs the SMB share to use the home directory template that is specified in the user's authentication provider. This procedure is available only through the command-line interface. 1. Establish an SSH connection to any node in the cluster. 2. Run the following command, where <share> is the name of the SMB share and --path is the directory path of the home
directory template specified by the user's authentication provider:
isi smb shares modify <share> --path=""

182 File sharing

Supported expansion variables

You can include expansion variables in an SMB share path or in an authentication provider's home directory template.
OneFS supports the following expansion variables. You can improve performance and reduce the number of shares to be managed when you configure shares with expansion variables. For example, you can include the %U variable for a share rather than create a share for each user. When a %U is included in the name so that each user's path is different, security is still ensured because each user can view and access only his or her home directory.
NOTE: When you create an SMB share through the web administration interface, you must select the Allow Variable Expansion check box or the string is interpreted literally by the system.

Variable %U %D
%Z %L %0 %1 %2

Value

Description

User name (for example, user_001)

Expands to the user name to allow different users to use different home directories. This variable is typically included at the end of the path. For example, for a user named user1, the path /ifs/home/%U is mapped to /ifs/home/user1.

NetBIOS domain name (for example, YORK for YORK.EAST.EXAMPLE.COM)

Expands to the user's domain name, based on the authentication provider:
 For Active Directory users, %D expands to the Active Directory NetBIOS name.
 For local users, %D expands to the cluster name in uppercase characters. For example, for a cluster named cluster1, %D expands to CLUSTER1.
 For users in the System file provider, %D expands to UNIX_USERS.
 For users in other file providers, %D expands to FILE_USERS.
 For LDAP users, %D expands to LDAP_USERS.  For NIS users, %D expands to NIS_USERS.

Zone name (for example, ZoneABC)

Expands to the access zone name. If multiple zones are activated, this variable is useful for differentiating users in separate zones. For example, for a user named user1 in the System zone, the path /ifs/home/%Z/%U is mapped to /ifs/home/System/user1.

Host name (cluster host name Expands to the host name of the cluster, normalized to

in lowercase)

lowercase. Limited use.

First character of the user name

Expands to the first character of the user name.

Second character of the user Expands to the second character of the user name. name

Third character of the user name

Expands to the third character of the user name.

NOTE: If the user name includes fewer than three characters, the %0, %1, and %2 variables wrap around. For example, for a user named ab, the variables maps to a, b, and a, respectively. For a user named a, all three variables map to a.

NFS security
OneFS provides an NFS server so you can share files on your cluster with NFS clients that adhere to the RFC1813 (NFSv3) and RFC3530 (NFSv4) specifications. NFS is disabled by default. To enable NFS, use the following command:
isi services nfs enable

File sharing 183

In OneFS, the NFS server is fully optimized as a multithreaded service running in user space instead of the kernel. This architecture load balances the NFS service across all nodes of the cluster, providing the stability and scalability necessary to manage up to thousands of connections across multiple NFS clients. NFS mounts run and refresh quickly, and the server constantly monitors fluctuating demands on NFS services and makes adjustments across all nodes to ensure continuous, reliable performance. Using an integrated process scheduler, OneFS helps ensure fair allocation of node resources so that no client can seize more than its fair share of NFS services. The NFS server also supports access zones that are defined in OneFS, so that clients can access only the exports appropriate to their zone. For example, if NFS exports are specified for Zone 2, only clients that are assigned to Zone 2 can access these exports. To simplify client connections, especially for exports with large path names, the NFS server also supports aliases, which are shortcuts to mount points that clients can specify directly. For secure NFS file sharing, OneFS supports NIS and LDAP authentication providers.
NFS exports
You can manage individual NFS export rules that define mount-points (paths) available to NFS clients and how the server should perform with these clients. In OneFS, you can create, delete, list, view, modify, and reload NFS exports. NFS export rules are zone-aware. Each export is associated with a zone, can only be mounted by clients on that zone, and can only expose paths below the zone root. By default, any export command applies to the client's current zone. Each rule must have at least one path (mount-point), and can include additional paths. You can also specify that all subdirectories of the given path or paths are mountable. Otherwise, only the specified paths are exported, and child directories are not mountable. An export rule can specify a particular set of clients, enabling you to restrict access to certain mount-points or to apply a unique set of options to these clients. If the rule does not specify any clients, then the rule applies to all clients that connect to the server. If the rule does specify clients, then that rule is applied only to those clients.
NFS aliases
You can create and manage aliases as shortcuts for directory path names in OneFS. If those path names are defined as NFS exports, NFS clients can specify the aliases as NFS mount points. NFS aliases are designed to give functional parity with SMB share names within the context of NFS. Each alias maps a unique name to a path on the file system. NFS clients can then use the alias name in place of the path when mounting. Aliases must be formed as top-level Unix path names, having a single forward slash followed by name. For example, you could create an alias named /q4 that maps to /ifs/data/finance/accounting/winter2015 (a path in OneFS). An NFS client could mount that directory through either of:
mount cluster_ip:/q4
mount cluster_ip:/ifs/data/finance/accounting/winter2015
Aliases and exports are completely independent. You can create an alias without associating it with an NFS export. Similarly, an NFS export does not require an alias. Each alias must point to a valid path on the file system. While this path is absolute, it must point to a location beneath the zone root (/ifs on the System zone). If the alias points to a path that does not exist on the file system, any client trying to mount the alias would be denied in the same way as attempting to mount an invalid full pathname. NFS aliases are zone-aware. By default, an alias applies to the client's current access zone. To change this, you can specify an alternative access zone as part of creating or modifying an alias. Each alias can only be used by clients on that zone, and can only apply to paths below the zone root. Alias names are unique per zone, but the same name can be used in different zones--for example, /home. When you create an alias in the web administration interface, the alias list displays the status of the alias. Similarly, using the --check option of the isi nfs aliases command, you can check the status of an NFS alias (status can be: good, illegal path, name conflict, not exported, or path not found).
184 File sharing

NFS log files

OneFS writes log messages associated with NFS events to a set of files in /var/log.
With the log level option, you can now specify the detail at which log messages are output to log files. The following table describes the log files associated with NFS.

Log file nfs.log rpc_lockd.log rpc_statd.log isi_netgroup_d.log

Description Primary NFS server functionality (v3, v4, mount) NFS v3 locking events through the NLM protocol NFS v3 reboot detection through the NSM protocol Netgroup resolution and caching

Managing the NFS service
You can enable or disable the NFS service and specify the NFS versions to support, including NFSv3 and NFSv4. NFS settings are applied across all nodes in the cluster.
NOTE: NFSv4 can be enabled non-disruptively on a OneFS cluster, and it will run concurrently with NFSv3. Any existing NFSv3 clients will not be impacted by enabling NFSv4.
View NFS settings
You can view the global NFS settings that are applied to all nodes in the cluster.  Run the isi nfs settings global view command.
The system displays output similar to the following example:
NFSv3 Enabled: Yes NFSv4 Enabled: No NFS Service Enabled: Yes

Configure NFS file sharing
You can enable or disable the NFS service, and set the lock protection level and security type. These settings are applied across all nodes in the cluster. You can change the settings for individual NFS exports that you define.  Run the isi nfs settings global modify command.
The following command enables NFSv4 support:
isi nfs settings global modify --nfsv4-enabled=yes

Enable or disable the NFS service
In OneFS, the NFSv3 and NFSv4 services are disabled by default. You can enable NFSv3 and NFSv4. NOTE: You can determine whether NFS services are enabled or disabled by running the isi nfs settings global view command.
 Run the isi nfs settings global modify command. The following command disables the NFSv3 service:
isi nfs settings global modify --nfsv3-enabled=no
The following command enables the NFSv4 service:
isi nfs settings global modify --nfsv4-enabled=yes

File sharing 185

Managing NFS exports
You can create NFS exports, view and modify export settings, and delete exports that are no longer needed. You configure the default export after enabling NFS.
NOTE: It is recommended that you configure your default export to limit access only to trusted clients, or to restrict access completely. To help ensure that sensitive data is not compromised, avoid creating other exports in readily accessible or visible points in the OneFS file hierarchy. Ensure that your exports can be protected by access zones or limited to specific clients with either root, read-write, or read-only access, as appropriate.
Configure default NFS export settings
The default NFS export settings are applied to new NFS exports. You can override these settings when you create or modify an export. You can view the current default export settings by running the isi nfs settings export view command.
CAUTION: We recommend that you not modify default export settings unless you are sure of the result.  Run the isi nfs settings export modify command.
The following command specifies a maximum export file size of one terabyte: isi nfs settings export modify --max-file-size 1099511627776
The following command restores the maximum export file size to the system default: isi nfs settings export modify --revert-max-file-size
Create a root-squashing rule for an export
By default, the NFS service implements a root-squashing rule for the default NFS export. The root-squashing rule prevents root users on NFS clients from exercising root privileges on the NFS server. 1. Use the isi nfs exports view command to view the current settings of the default export.
The following command displays the settings of the default export: isi nfs exports view 1
2. Confirm the following default values for these settings, which show that root is mapped to nobody, restricting root access: Map Root Enabled: True User: Nobody Primary Group: Secondary Groups: -
3. If the root-squashing rule is not in effect, you can implement it for the default NFS export. Run the isi nfs export modify command, as follows: isi nfs exports modify 1 --map-root-enabled true --map-root nobody
Users cannot gain root privileges on the NFS server regardless of their credentials on the NFS client.
Create an NFS export
You can create NFS exports to share files in OneFS with UNIX-based clients. Each directory path that you designate for an export must exist. Multiple exports can use a particular directory path, provided those exports do not have any of the same explicit clients.
186 File sharing

The NFS service runs in user space and distributes the load across all nodes in the cluster. This enables the service to be highly scalable and support thousands of exports. As a best practice, however, you should avoid creating a separate export for each client on your network. It is more efficient to create fewer exports, and to use access zones and user mapping to control access.
NOTE: The default security flavor (UNIX) relies upon having a trusted network. If you do not completely trust everything on your network, create the NFS export with Kerberos using the isi nfs exports create command option [-security-flavors (unix | krb5 | krb5i | krb5p)]. If the system does not support Kerberos, it will not be fully protected. NFS without Kerberos trusts everything on the network and sends all packets in cleartext. If you cannot use Kerberos, find another way to protect the Internet connection. At a minimum, do the following:  Limit root access to the cluster to trusted host IP addresses.  Ensure that all new devices that you add to the network are trusted. Methods for ensuring trust include, but are not
limited to:  Use an IPsec tunnel. This option is secure because it authenticates the devices using secure keys.  Configure all switch ports to go inactive if they are physically disconnected. Ensure that the switch ports are MAC
limited.
1. Run the isi nfs exports create command. The following command creates an export supporting client access to multiple paths and their subdirectories:
isi nfs exports create /ifs/data/projects,/ifs/home --all-dirs=yes
2. Optional: To view the export ID, which is required for modifying or deleting the export, run the isi nfs exports list command.

Check NFS exports for errors

You can check for errors in NFS exports, such as conflicting export rules, invalid paths, and unresolvable hostnames and netgroups. This task may be performed only through the OneFS command-line interface. 1. Establish an SSH connection to any node in the cluster. 2. Run the isi nfs exports check command.
In the following example output, no errors were found:

ID Message ------------------Total: 0

In the following example output, export 1 contains a directory path that does not currently exist:

ID Message

-----------------------------------

1

'/ifs/test' does not exist

-----------------------------------

Total: 1

Modify an NFS export
You can modify the settings for an existing NFS export. CAUTION: Changing export settings may cause performance issues. Make sure you understand the potential impact of any settings alterations prior to committing any changes.
Run the isi nfs exports modify command. For example, the following adds a client with read-write access to NFS export 2:
isi nfs exports modify 2 --add-read-write-clients 10.1.249.137
This command would override the export's access-restriction setting if there was a conflict. For example, if the export was created with read-write access disabled, the client, 10.1.249.137, would still have read-write permissions on the export.

File sharing 187

Delete an NFS export
You can delete unneeded NFS exports. Any current NFS client connections to these exports become invalid. You need the export ID number to delete the export. Run the isi nfs exports list command to display a list of exports and their ID numbers. 1. Run the isi nfs exports delete command.
In the following example, the command deletes an export whose ID is 2: isi nfs exports delete 2
In the following example, isi nfs exports delete deletes an export whose ID is 3 without displaying a confirmation prompt. Be careful when using the --force option.
isi nfs exports delete 3 --force 2. If you did not specify the --force option, type yes at the confirmation prompt.
Managing NFS aliases
You can create NFS aliases to simplify exports that clients connect to. An NFS alias maps an absolute directory path to a simple directory path. For example, suppose you created an NFS export to /ifs/data/hq/home/archive/first-quarter/finance. You could create the alias /finance1 to map to that directory path. NFS aliases can be created in any access zone, including the System zone.
Create an NFS alias
You can create an NFS alias to map a long directory path to a simple pathname. Aliases must be formed as a simple Unix-style directory path, for example, /home. Run the isi nfs aliases create command. The following command creates an alias to a full pathname in OneFS in an access zone named hq-home:
isi nfs aliases create /home /ifs/data/offices/hq/home --zone hq-home When you create an NFS alias, OneFS performs a health check. If, for example, the full path that you specify is not a valid path, OneFS issues a warning:
Warning: health check on alias '/home' returned 'path not found' Nonetheless, the alias is created, and you can create the directory that the alias points to at a later time.
Modify an NFS alias
You can modify an NFS alias, for example, if an export directory path has changed. Aliases must be formed as a simple Unix-style directory path, for example, /home. Run the isi nfs aliases modify command. The following command changes the name of an alias in the access zone hq-home:
isi nfs aliases modify /home --zone hq-home --name /home1 When you modify an NFS alias, OneFS performs a health check. If, for example, the path to the alias is not valid, OneFS issues a warning:
Warning: health check on alias '/home' returned 'not exported'
188 File sharing

Nonetheless, the alias is modified, and you can create the export at a later time.

Delete an NFS alias
You can delete an NFS alias. If an NFS alias is mapped to an NFS export, deleting the alias can disconnect clients that used the alias to connect to the export. 1. Run the isi nfs aliases delete command.
The following command deletes the alias /home in an access zone named hq-home:
isi nfs aliases delete /home --zone hq-home
When you delete an NFS alias, OneFS asks you to confirm the operation:
Are you sure you want to delete NFS alias /home? (yes/[no])
2. Type yes, and then press ENTER. The alias is deleted, unless an error condition was found, for example, you typed the name of the alias incorrectly.

List NFS aliases

You can view a list of NFS aliases that have already been defined for a particular zone. Aliases in the system zone are listed by default. Run the isi nfs aliases list command. In the following example, the command lists aliases that have been created in the system zone (the default):
isi nfs aliases list
In the following example, isi nfs aliases list lists aliases that have been created in an access zone named hq-home:
isi nfs aliases list --zone hq-home

Output from isi nfs aliases list looks similar to the following example:

Zone Name

Path

-------------------------------------------------

hq-home /home

/ifs/data/offices/newyork

hq-home /root_alias

/ifs/data/offices

hq-home /project

/ifs/data/offices/project

-------------------------------------------------

Total: 3

View an NFS alias

You can view the settings of an NFS alias in the specified access zone. Run the isi nfs aliases view command. The following command provides information on an alias in the access zone, hq-home, including the health of the alias:
isi nfs aliases view /projects --zone hq-home --check

Output from the command looks similar to the following example:

Zone Name

Path

Health

--------------------------------------------------------

hq-home /projects

/ifs/data/offices/project good

--------------------------------------------------------

Total: 1

File sharing 189

FTP
OneFS includes a secure FTP service that is called Very Secure FTP Daemon (VSFTPD), that you can configure for standard FTP and FTPS file transfers. FTP is disabled by default. To enable FTP, use the following command:
isi services ftp enable
View FTP settings
You can view a list of current FTP configuration settings.  Run the isi ftp settings view command.
The system displays output similar to the following example:
Accept Timeout: 1m Allow Anon Access: No Allow Anon Upload: Yes
Allow Dirlists: Yes Allow Downloads: Yes Allow Local Access: Yes
Allow Writes: Yes Always Chdir Homedir: Yes
Anon Chown Username: root Anon Password List: Anon Root Path: /ifs/home/ftp Anon Umask: 0077 Ascii Mode: off
Chroot Exception List: Chroot Local Mode: none Connect Timeout: 1m Data Timeout: 5m Denied User List: Dirlist Localtime: No Dirlist Names: hide File Create Perm: 0666
Limit Anon Passwords: Yes Local Root Path: Local Umask: 0077
Server To Server: No Session Support: Yes Session Timeout: 5m User Config Dir: -
FTP Service Enabled: Yes
Enable FTP file sharing
The FTP service, vsftpd, is disabled by default.
NOTE: You can determine whether the service is enabled or disabled by running the isi services -l command.
Run the following command:
isi services vsftpd enable
The system displays the following confirmation message:
The service 'vsftpd' has been enabled.
You can configure FTP settings by running the isi ftp command.
190 File sharing

Configure FTP file sharing
You can set the FTP service to allow any node in the cluster to respond to FTP requests through a standard user account. You must enable the FTP service before you can use it. You can enable the transfer of files between remote FTP servers and enable anonymous FTP service on the root by creating a local user named anonymous or ftp. When configuring FTP access, make sure that the specified FTP root is the home directory of the user who logs in. For example, the FTP root for local user jsmith should be ifs/home/jsmith.  Run the isi ftp settings modify command.
You must run this command separately for each action. The following command enables server-to-server transfers:
isi ftp settings modify --server-to-server=yes The following command disables anonymous uploads:
isi ftp settings modify --allow-anon-upload=no You must run this command separately for each action.
HTTP and HTTPS security
OneFS includes a configurable Hypertext Transfer Protocol (HTTP) service. Use HTTP to request files that are stored on the cluster and to interact with the web administration interface. HTTP and HTTPS are disabled by default. To enable them, use the following commands:
isi http settings modify --service=enabled isi http settings modify --https=true
NOTE: Set the file and directory permissions to allow HTTP or HTTPS to access them. OneFS supports both HTTP and its secure variant, HTTPS. Each node in the cluster runs an instance of the Apache HTTP Server to provide HTTP access. You can configure the HTTP service to run in different modes. Both HTTP and HTTPS are supported for file transfer, but only HTTPS is supported for API calls. The HTTPS-only requirement includes the web administration interface. OneFS supports a form of the web-based DAV (WebDAV) protocol that enables users to modify and manage files on remote web servers. OneFS performs distributed authoring, but does not support versioning and does not perform security checks. You can enable DAV in the web administration interface.
Enable and configure HTTP
You can configure HTTP and WebDAV to enable users to edit and manage files collaboratively across remote web servers. You can use the isi http settings modify command to configure HTTP-related settings.  Run the isi http settings modify command.
The following command enables the HTTP service, WebDAV, and basic authentication: isi http settings modify --service=enabled --dav=yes \ basic-authentication=yes
NOTE: Basic authentication is disabled by default.
File sharing 191

Enable HTTPS through the Apache service
You can access a PowerScale cluster through the Apache service over HTTPS.  To enable HTTPS, run the following command:
isi_gconfig -t http-config https_enabled=true The HTTPS service is enabled.
NOTE: It might take up to 10 seconds for the configuration change to take effect. As a result, data transfers that are in progress over HTTP might be interrupted.
Disable HTTPS through the Apache service
You can disable access to a PowerScale cluster through the Apache service over HTTPS.  To disable HTTPS, run the following command:
isi_gconfig -t http-config https_enabled=false The HTTPS service is disabled.
NOTE: It might take up to 10 seconds for the configuration change to take effect. As a result, data transfers that are in progress over HTTP might be interrupted.
192 File sharing

12
File filtering
This section contains the following topics:
Topics:
· File filtering in an access zone · Enable and configure file filtering in an access zone · Disable file filtering in an access zone · View file filtering settings
File filtering in an access zone
In an access zone, you can use file filtering to allow or deny file writes based on file type. If some file types might cause throughput issues, security problems, storage clutter, or productivity disruptions on your cluster, or if your organizations must adhere to specific file policies, you can restrict writes to specified file types or only allow writes to a specified list of file types. When you enable file filtering in an access zone, OneFS applies file filtering rules only to files in that access zone.  If you choose to deny file writes, you can specify file types by extension that are not allowed to be written. OneFS permits all
other file types to be written.  If you choose to allow file writes, you can specify file types by extension that are allowed to be written. OneFS denies all
other file types to be written. OneFS does not take into consideration which file sharing protocol was used to connect to the access zone when applying file filtering rules; however, you can apply additional file filtering at the SMB share level. See "SMB file filtering" in the File sharing chapter of this guide.
Enable and configure file filtering in an access zone
You can enable file filtering per access zone and specify which file types users are denied or allowed write access to within the access zone. Run the isi file-filter settings modify command. The following command enables file filtering in the zone3 access zone and allows users to write only to specific file types:
isi file-filter settings modify --zone=zone3 \ file-filtering-enabled=yes file-filter-type=allow \ file-filter-extensions=.xml,.html,.txt
File types are designated by their extension and should start with a "." such as .txt. The following command enables file filtering in zone3 and denies users write access only to specific file types:
isi file-filter settings modify --zone=zone3 \ file-filtering-enabled=yes file-filter-type=deny \ file-filter-extensions=.xml,.html,.txt
Disable file filtering in an access zone
You can disable file filtering per access zone. Previous settings that specify filter type and file type extensions are preserved but no longer applied. Run the isi file-filter settings modify command.
File filtering 193

The following command disables file filtering in the zone3 access zone: isi file-filter settings modify --zone=zone3 \ file-filtering-enabled=no
View file filtering settings
You can view file filtering settings in an access zone. Run the isi file-filter settings view command. The following command displays file filtering settings in the zone3 access zone:
isi file-filter settings view --zone=zone3 The system displays output similar to the following example:
File Filtering Enabled: Yes File Filter Extensions: xml, html, txt
File Filter Type: deny
194 File filtering

13
Auditing and logging
This section contains the following topics:
Topics:
· Auditing overview · Syslog · Protocol audit events · Supported audit tools · Delivering protocol audit events to multiple CEE servers · Supported event types · Sample audit log · Audit log purging · Managing audit settings · Integrating with the Common Event Enabler · Tracking the delivery of protocol audit events
Auditing overview
You can audit system configuration changes and protocol activity on an PowerScale cluster. All audit data is stored and protected in the cluster file system and organized by audit topics. Auditing can detect many potential sources of data loss, including fraudulent activities, inappropriate entitlements, and unauthorized access attempts. Customers in industries such as financial services, health care, life sciences, and media and entertainment, as well as in governmental agencies, must meet stringent regulatory requirements developed to protect against these sources of data loss. System configuration auditing tracks and records all configuration events that are handled by the OneFS HTTP API. The process involves auditing the command-line interface (CLI), web administration interface, and OneFS APIs. When you enable system configuration auditing, no additional configuration is required. System configuration auditing events are stored in the config audit topic directories. Protocol auditing tracks and stores activity performed through SMB, NFS, and HDFS protocol connections. You can enable and configure protocol auditing for one or more access zones in a cluster. If you enable protocol auditing for an access zone, file-access events through the SMB, NFS, and HDFS protocols are recorded in the protocol audit topic directories. You can specify which events to log in each access zone. For example, you might want to audit the default set of protocol events in the System access zone but audit only successful attempts to delete files in a different access zone. The audit events are logged on the individual nodes where the SMB, NFS, or HDFS client initiated the activity. The events are then stored in a binary file under /ifs/.ifsvar/audit/logs. The logs automatically roll over to a new file after the size reaches 1 GB. The logs are then compressed to reduce space. The protocol audit log file is consumable by auditing applications that support the Common Event Enabler (CEE).
Syslog
Syslog is a protocol that is used to convey certain event notification messages. You can configure a PowerScale cluster to log audit events and forward them to syslog by using the syslog forwarder. By default, all protocol events that occur on a particular node are forwarded to the /var/log/audit_protocol.log file, regardless of the access zone the event originated from. All the config audit events are logged to /var/log/ audit_config.log by default.
Auditing and logging 195

Syslog is configured with an identity that depends on the type of audit event that is being sent to it. It uses the facility daemon and a priority level of info. The protocol audit events are logged to syslog with the identity audit_protocol. The config audit events are logged to syslog with the identity audit_config. To configure auditing on a PowerScale cluster, you must either be a root user or you must be assigned to an administrative role that includes auditing privileges (ISI_PRIV_AUDIT).
CAUTION: If the size of an audit event is greater than 65,535, that log event is discarded and the file access operation that caused the event fails.
Syslog forwarding
The syslog forwarder is a daemon that, when enabled, retrieves configuration changes and protocol audit events in an access zone and forwards the events to syslog. Only user-defined audit success and failure events are eligible for being forwarded to syslog. On each node there is an audit syslog forwarder daemon running that will log audit events to the same node's syslog daemon.
Protocol audit events
By default, audited access zones track only certain events on the PowerScale cluster, including successful and failed attempts to access files and directories. The default tracked events are create, close, delete, rename, and set_security. The names of generated events are loosely based on the Windows I/O request packet (IRP) model in which all operations begin with a create event to obtain a file handle. A create event is required before all I/O operations, including the following: close, create, delete, get_security, read, rename, set_security, and write. A close event marks when the client is finished with the file handle that was produced by a create event.
NOTE: For the NFS and HDFS protocols, the rename and delete events might not be enclosed with the create and close events. These internally stored events are translated to events that are forwarded through the CEE to the auditing application. The CEE export facilities on OneFS perform this mapping. The CEE can be used to connect to any third party application that supports the CEE. NOTE: The CEE does not support forwarding HDFS protocol events to a third-party application. Different SMB, NFS, and HDFS clients issue different requests, and one particular version of a platform such as Windows or Mac OS X using SMB might differ from another. Similarly, different versions of an application such as Microsoft Word or Windows Explorer might make different protocol requests. For example, a client with a Windows Explorer window open might generate many events if an automatic or manual refresh of that window occurs. Applications issue requests with the logged-in user's credentials, but you should not assume that all requests are purposeful user actions. When enabled, OneFS audit will track all changes that are made to the files and directories in SMB shares, NFS exports, and HDFS data.
Supported audit tools
You can configure OneFS to send protocol auditing logs to servers that support the Common Event Enabler (CEE). CEE has been tested and verified to work on several third-party software vendors.
NOTE: We recommend that you install and configure third-party auditing applications before you enable the OneFS auditing feature. Otherwise, all the events that are logged are forwarded to the auditing application, and a large backlog causes a delay in receiving the most current events.
196 Auditing and logging

Delivering protocol audit events to multiple CEE servers
OneFS supports concurrent delivery of protocol audit events to multiple CEE servers running the CEE service.
You can establish up to 20 HTTP 1.1 connections across a subset of CEE servers. Each node in a PowerScale cluster can select up to five CEE servers for delivery. The CEE servers are shared in a global configuration and are configured with OneFS by adding the URI of each server to the OneFS configuration.
After configuring the CEE servers, a node in a PowerScale cluster automatically selects the CEE servers from a sorted list of CEE URIs. The servers are selected starting from the node's logical node number offset within the sorted list. When a CEE server is unavailable, the next available server is selected in the sorted order. All the connections are evenly distributed between the selected servers. When a node is moved because a CEE server was previously unavailable, checks are made every 15 minutes for the availability of the CEE server. The node is moved back as soon as the CEE Server is available.
Follow some of these best practices before configuring the CEE servers:
 We recommend that you provide only one CEE server per node. You can use extra CEE servers beyond the PowerScale cluster size only when the selected CEE server goes offline. NOTE: In a global configuration, there should be one CEE server per node.
 Configure the CEE server and enable protocol auditing at the same time. If not, a backlog of events might accumulate causing stale delivery for a period of time.
You can either receive a global view of the progress of delivery of the protocol audit events or you can receive a logical node number view of the progress by running the isi audit progress view command.

Supported event types

You can view or modify the event types that are audited in an access zone. Event name Example protocol activity Audited by default

create
close rename delete set_security read write

 Create a file or directory X  Open a file, directory, or
share
 Mount a share  Delete a file
NOTE: While the SMB protocol allows you to set a file for deletion with the create operation, you must enable the delete event in order for the auditing tool to log the event.

 Close a directory

X

 Close a modified or unmodified file

Rename a file or directory

X

Delete a file or directory

X

Attempt to modify file or

X

directory permissions

The first read request on an open file handle

The first write request on an open file handle

Can be exported through CEE X
X X X X X X

Cannot be exported through CEE

Auditing and logging 197

Event name Example protocol activity Audited by default

get_security
logon logoff tree_connect

The client reads security information for an open file handle
SMB session create request by a client
SMB session logoff
SMB first attempt to access a share

Can be exported through CEE

Cannot be exported through CEE
X

X
X X

Sample audit log
You can view both configuration audit and protocol audit logs by running the isi_audit_viewer command on any node in the PowerScale cluster.
You can view protocol access audit logs by running isi_audit_viewer -t protocol. You can view system configuration logs by running isi_audit_viewer -t config. The following output is an example of a system configuration log:
[0: Fri Jan 23 16:17:03 2015] {"id":"524e0928a35e-11e4-9d0c-005056302134","timestamp":1422058623106323,"payload":"PAPI config logging started."}
[1: Fri Jan 23 16:17:03 2015] {"id":"5249b99da35e-11e4-9d0c-005056302134","timestamp":1422058623112992,"payload":{"user":{"token": {"UID":0, "GID":0, "SID": "SID:S-1-22-1-0", "GSID": "SID:S-1-22-2-0", "GROUPS": ["SID:S-1-5-11", "GID:5", "GID:20", "GID:70", "GID:10"], "protocol": 17, "zone id": 1, "client": "10.7.220.97", "local": "10.7.177.176" }},"uri":"/1/protocols/smb/ shares","method":"POST","args":"","body":{"path": "/ifs/data", "name": "Test"}}}
[2: Fri Jan 23 16:17:05 2015] {"id":"5249b99d-a35e-11e4-9d0c-005056302134","timestamp":1422058625144567,"payload": {"status":201,"statusmsg":"Created","body":{"id":"Test"}}}
[3: Fri Jan 23 16:17:39 2015] {"id":"67e7ca62a35e-11e4-9d0c-005056302134","timestamp":1422058659345539,"payload":{"user":{"token": {"UID":0, "GID":0, "SID": "SID:S-1-22-1-0", "GSID": "SID:S-1-22-2-0", "GROUPS": ["SID:S-1-5-11", "GID:5", "GID:20", "GID:70", "GID:10"], "protocol": 17, "zone id": 1, "client": "10.7.220.97", "local": "10.7.177.176" }},"uri":"/1/ audit/settings","method":"PUT","args":"","body":{"config_syslog_enabled": true}}}
[4: Fri Jan 23 16:17:39 2015] {"id":"67e7ca62-a35e-11e4-9d0c-005056302134","timestamp":1422058659387928,"payload": {"status":204,"statusmsg":"No Content","body":{}}}
Configuration audit events come in pairs; a pre event is logged before the command is carried out and a post event is logged after the event is triggered. Protocol audit events are logged as post events after an operation has been carried out. Configuration audit events can be correlated by matching the id field.
The pre event always comes first, and contains user token information, the PAPI path, and whatever arguments were passed to the PAPI call. In event 1, a POST request was made to /1/protocols/smb/shares with arguments path=/ifs/data and name=Test. The post event contains the HTTP return status and any output returned from the server.
Audit log purging
You can delete the audit logs.
In OneFS , the audit system writes audit logs in the /ifs/.ifsvar/audit/logs directory. After the audit log file reaches 1G, a new file is created, and the old file is compressed. As time goes on, the audit log file exhausts all space on the file system.
The audit log purging feature is now supported on the cluster. There are two ways to delete the audit log. In the first method, automatic deletion, the audit log gets deleted on its own after passing the retention period.

198 Auditing and logging

The retention period works like a window, any audit log out of the window is deleted. For example, if you configure the retention period as 90 days, and the current date is 2019-5-30. Then, any log before 2019-3-1 will be deleted, since from 2019-3-1 to 2019-5-30 is a 90 days window. The second method is manual deletion, and here you can manually delete the audit log before a specified period. Both automatic deletion and manual deletion apply to configuration audit and protocol audit logs simultaneously.
Managing audit settings
You can enable and disable system configuration and protocol access audit settings, in addition to configuring integration with the Common Event Enabler.
Enable protocol access auditing
You can audit SMB, NFS, and HDFS protocol access to generate events on a per-access zone basis and forward the events to the Common Event Enabler (CEE) for export to third-party products.
NOTE: Because each audited event consumes system resources, we recommend that you only configure zones for events that are needed by your auditing application. In addition, we recommend that you install and configure third-party auditing applications before you enable the OneFS auditing feature. Otherwise, the large backlog performed by this feature may cause results to not be up-to-date for a considerable amount of time. Additionally, you can manually configure the time that you want audit events to be forwarded by running the isi audit settings global modify --cee-log-time command.
Run the isi audit settings global modify command. The following command enables auditing of protocol access events in the zone3 and zone5 access zones, and forwards logged events to a CEE server:
isi audit settings global modify --protocol-auditing-enabled=yes \ --cee-server-uris=http://sample.com:12228/cee \ --hostname=cluster.domain.com --audited-zones=zone3,zone5
OneFS logs audited protocol events to a binary file within /ifs/.ifsvar/audit/logs. The CEE service forwards the logged events through an HTTP PUT operation to a defined endpoint. You can modify the types of protocol access events to be audited by running the isi audit settings modify command. You can also enable forwarding of protocol access events to syslog by running the isi audit settings modify command with the --syslog-forwarding-enabled option.
Forward protocol access events to syslog
You can enable or disable forwarding of audited protocol access events to syslog in each access zone. Forwarding is not enabled by default when protocol access auditing is enabled. This procedure is available only through the command-line interface. To enable forwarding of protocol access events in an access zone, you must first enable protocol access auditing in the access zone. The --audit-success and --audit-failure options define the event types that are audited, and the --syslogaudit-events option defines the event types that are forwarded to syslog. Only the audited event types are eligible for forwarding to syslog. If syslog forwarding is enabled, protocol access events are written to the /var/log/ audit_protocol.log file. 1. Open a Secure Shell (SSH) connection to any node in the cluster and log in. 2. Run the isi audit settings modify command with the --syslog-forwarding-enabled option to enable or
disable audit syslog. The following command enables forwarding of the audited protocol access events in the zone3 access zone and specifies that the only event types forwarded are close, create, and delete events:
isi audit settings modify --syslog-forwarding-enabled=yes --syslog-auditevents=close,create,delete --zone=zone3
Auditing and logging 199

The following command disables forwarding of audited protocol access events from the zone3 access zone: isi audit settings modify --syslog-forwarding-enabled=no --zone=zone3
Enable system configuration auditing
OneFS can audit system configuration events on your PowerScale cluster. When enabled, OneFS records all system configuration events that are handled by the platform API including writes, modifications, and deletions. System configuration change logs are populated in the config topic in the audit back-end store under /ifs/.ifsvar/audit.
NOTE: Configuration events are not forwarded to the Common Event Enabler (CEE). 1. Open a Secure Shell (SSH) connection to any node in the cluster and log in. 2. Run the isi audit settings global modify command.
The following command enables system configuration auditing on the cluster: isi audit settings global modify --config-auditing-enabled=yes
You can enable forwarding of system configuration changes to syslog by running the isi audit settings global modify command with the --config-syslog-enabled option.
Set the audit hostname
You can optionally set the audit hostname for some of the third-party auditing applications that require a unified hostname. If you do not set a hostname for these applications, each node in a PowerScale cluster sends its hostname as the server name to the CEE server. Otherwise, the configured audit hostname is used as the global server name. 1. Open a Secure Shell (SSH) connection to any node in the cluster and log in. 2. Run the isi audit settings global modify command with the --hostname option to set the audit hostname.
The following command sets mycluster as the audit hostname: isi audit settings global modify --hostname=mycluster
Configure protocol audited zones
Only the protocol audit events within an audited zone are captured and sent to the CEE server. Therefore, you must configure a protocol audited zone to send audit events. 1. Open a Secure Shell (SSH) connection to any node in the cluster and log in. 2. Run the isi audit settings global modify command with the --audited-zones option to configure protocol
audited zones. The following command configures HomeDirectory and Misc as the protocol audited zones:
isi audit settings global modify --audited-zones=HomeDirectory,Misc
Forward system configuration changes to syslog
You can enable or disable forwarding of system configuration changes on the PowerScale cluster to syslog, which is saved to /var/log/audit_config.log. This procedure is available only through the command-line interface. Forwarding is not enabled by default when system configuration auditing is enabled. To enable forwarding of system configuration changes to syslog, you must first enable system configuration auditing on the cluster. 1. Open a Secure Shell (SSH) connection to any node in the cluster and log in. 2. Run the isi audit settings global modify command with the --config-syslog-enabled option to enable
or disable forwarding of system configuration changes.
200 Auditing and logging

The following command enables forwarding of system configuration changes to syslog:
isi audit settings global modify --config-syslog-enabled=yes
The following command disables forwarding of system configuration changes to syslog:
isi audit settings global modify --config-syslog-enabled=no
Configure protocol event filters
You can filter the types of protocol access events to be audited in an access zone. You can create filters for successful events and failed events. The following protocol events are collected for audited access zones by default: create, delete, rename, close, and set_security. This procedure is available only through the command-line interface. To create protocol event filters, you should first enable protocol access auditing in the access zone. 1. Open a Secure Shell (SSH) connection to any node in the cluster and log in. 2. Run the isi audit settings modify command
The following command creates a filter that audits the failure of create, close, and delete events in the zone3 access zone:
isi audit settings modify --audit-failure=create,close,delete --zone=zone3
The following command creates a filter that audits the success of create, close, and delete events in the zone5 access zone:
isi audit settings modify --audit-success=create,close,delete --zone=zone5
Integrating with the Common Event Enabler
OneFS integration with the Common Event Enabler (CEE) enables third-party auditing applications to collect and analyze protocol auditing logs. OneFS supports the Common Event Publishing Agent (CEPA) component of CEE for Windows. For integration with OneFS, you must install and configure CEE for Windows on a supported Windows client.
NOTE: We recommend that you install and configure third-party auditing applications before you enable the OneFS auditing feature. Otherwise, the large backlog performed by this feature may cause results to not be up-to-date for a considerable time.
Install CEE for Windows
To integrate CEE with OneFS, you must first install CEE on a computer that is running the Windows operating system. Be prepared to extract files from the .iso file, as described in the following steps. If you are not familiar with the process, consider choosing one of the following methods: 1. Install WinRAR or another suitable archival program that can open .iso files as an archive, and copy the files. 2. Burn the image to a CD-ROM, and then copy the files. 3. Install SlySoft Virtual CloneDrive, which allows you to mount an ISO image as a drive that you can copy files from.
NOTE: You should install a minimum of two servers. We recommend that you install CEE 6.6.0 or later.
1. Download the CEE framework software from Online Support: a. Go to Online Support. b. In the search field, type Common Event Enabler for Windows, and then click the Search icon. c. Click Common Event Enabler <Version> for Windows, where <Version> is 6.2 or later, and then follow the instructions to open or save the .iso file.
2. From the .iso file, extract the 32-bit or 64-bit EMC_CEE_Pack executable file that you need. After the extraction completes, the CEE installation wizard opens.
3. Click Next to proceed to the License Agreement page. 4. Select the I accept... option to accept the terms of the license agreement, and then click Next.
Auditing and logging 201

5. On the Customer Information page, type your user name and organization, select your installation preference, and then click Next.
6. On the Setup Type page, select Complete, and then click Next.
7. Click Install to begin the installation. The progress of the installation is displayed. When the installation is complete, the InstallShield Wizard Completed page appears.
8. Click Finish to exit the wizard.
9. Restart the system.

Configure CEE for Windows

After you install CEE for Windows on a client computer, you must configure additional settings through the Windows Registry Editor (regedit.exe).
1. Open the Windows Registry Editor. 2. Configure the following registry keys, if supported by your audit application:

Setting
CEE HTTP listen port
Enable audit remote endpoints
Audit remote endpoints

Registry location [HKEY_LOCAL_MACHINE\SOFTWARE\EMC\CEE\Configuration]

Key HttpPort

[HKEY_LOCAL_MACHINE\SOFTWARE\EMC\CEE\CEPP\Audit\Co Enabled nfiguration]

[HKEY_LOCAL_MACHINE\SOFTWARE\EMC\CEE\CEPP\Audit\Co EndPoint nfiguration]

Value 12228 1
<EndPoint>

NOTE:  The HttpPort value must match the port in the CEE URIs that you specify during OneFS protocol audit configuration.  The EndPoint value must be in the format <EndPoint_Name>@<IP_Address>. You can specify multiple endpoints by
separating each value with a semicolon (;).
The following key specifies a single remote endpoint:
[HKEY_LOCAL_MACHINE\SOFTWARE\EMC\CEE\CEPP\Audit\Configuration] EndPoint = [email protected]
The following key specifies multiple remote endpoints:
[HKEY_LOCAL_MACHINE\SOFTWARE\EMC\CEE\CEPP\Audit\Configuration] EndPoint = [email protected];[email protected] 3. Close the Windows Registry Editor.

Configure CEE servers to deliver protocol audit events
You can configure CEE servers with OneFS to deliver protocol audit events by adding the URI of each server to the OneFS configuration.  Run the isi audit settings global modify command with the --cee-server-uris option to add the URIs of
the CEE servers to the OneFS configuration. The following command adds the URIs of three CEE servers to the OneFS configuration:
isi audit settings global modify --cee-server-uris=http://server1.example.com:12228/ vee,http://server2.example.com:12228/vee,http://server3.example.com:12228/vee

202 Auditing and logging

Tracking the delivery of protocol audit events
The processes of capturing protocol audit events and their delivery to the CEE server do not happen simultaneously. Therefore, even when no CEE servers are available, protocol audit events are still captured and stored for delivery to the CEE server at a later time. You can view the time of the last captured protocol audit event and the event time of the last event that was sent to the CEE server. You can also move the log position of the CEE forwarder to a desired time.
View the time stamps of delivery of events to the CEE server and syslog
You can view the time stamps of delivery of events to the CEE server and syslog on the node on which you are running the isi audit progress view command. This setting is available only through the command-line interface.  Run the isi audit progress view command to view the time stamps of delivery of events to the CEE server and
syslog on the node on which you are running the command. A sample output of the isi audit progress view is shown:
Protocol Audit Log Time: Tue Mar 29 13:32:38 2016 Protocol Audit Cee Time: Tue Mar 29 13:32:38 2016 Protocol Audit Syslog Time: Fri Mar 25 17:00:28 2016
You can run the isi audit progress view command with the --lnn option to view the time stamps of delivery of the audit events on a node specified through its logical node number. The following command displays the progress of delivery of the audit events on a node with logical node number 2:
isi audit progress view --lnn=2
The output appears as shown:
Protocol Audit Log Time: Tue Mar 29 13:32:38 2016 Protocol Audit Cee Time: Tue Mar 29 13:32:38 2016 Protocol Audit Syslog Time: Fri Mar 25 17:00:28 2016
Display a global view of delivery of protocol audit events to the CEE server and syslog
You can display the latest protocol audit event log time for a cluster. You can also view the time stamp of delivery of the oldest unsent protocol audit event to the CEE server and the time stamp of delivery of the oldest non-forwarded protocol audit event to syslog in the cluster. This setting is available only through the command-line interface.  Run the isi audit progress global view command to view the time stamps of delivery of the oldest unsent
protocol audit events to the CEE server and the oldest unsent syslog in the cluster. A sample output of the isi audit progress global view is shown:
Protocol Audit Latest Log Time: Fri Sep 2 10:06:36 2016 Protocol Audit Oldest Cee Time: Fri Sep 2 10:02:28 2016 Protocol Audit Oldest Syslog Time: Fri Sep 2 10:02:28 2016
Auditing and logging 203

Move the log position of the CEE forwarder
You can manually move the log position of the CEE forwarder if the event time in the audit log indicates a lag in comparison to the current time. This action globally moves the event time in all of the logs of the CEE forwarder within a PowerScale cluster to the closest time.
NOTE: The events that are skipped will not be forwarded to the CEE server even though they might still be available on the cluster.
 Run the isi audit settings global modify command with the --cee-log-time option to move the log position of the CEE forwarder. The following command moves the log position of the CEE forwarder manually:
isi audit settings global modify --cee-log-time='protocol@2016-01-27 01:03:02'

View the rate of delivery of protocol audit events to the CEE server

You can view the rate of delivery of protocol audit events to the CEE server.
 Run the isi statistics query command to view the current rate of delivery of the protocol audit events to the CEE server on a node. The following command displays the current rate of delivery of the protocol audit events to the CEE server:

isi statistics query current list --keys=node.audit.cee.export.rate

The output appears as shown:

Node node.audit.cee.export.rate

---------------------------------

1

3904.600000

---------------------------------

Total: 1

Automatic deletion
The audit logs are deleted on its own from the command-line interface.
The automatic deletion runs periodically (once every hour) . It iterates over the audit directories and compares the date of the file to the current date to determine if it should be deleted. If the file passes the retention period, it gets deleted. The default retention period value is 180 days. The automatic deletion function is disabled by default. If you enable automatic purging, deletion is triggered immediately. When automatic purging is enabled and you modify the retention period, deletion occurs immediately. You can check the current audit settings using the isi audit settings global view command.
View audit settings
You can view the existing audit settings on the cluster.  Run the isi audit settings global view command to view the current audit settings.
The following details appear:
Protocol Auditing Enabled: No Audited Zones: -
CEE Server URIs: Hostname:
Config Auditing Enabled: No Config Syslog Enabled: No Config Syslog Servers: -
Protocol Syslog Servers: Auto Purging Enabled: No Retention Period: 180
The auto purging feature is disabled by default.

204 Auditing and logging

Enable automatic purging
You can enable automatic purging of audit log files. 1. Optional: To enable automatic purging, run the following command:
isi audit settings global modify --auto-purging-enabled=yes
The following message appears:
You are enabling the automatic log purging. Automatic log purging will run in background to delete audit log files. Please check the retention period before enabling automatic log purging. Are you sure you want to do this?? (yes/[no])
2. Enter "yes." The automatic purging feature is enabled.
3. You can check if the automatic purging feature is enabled using the isi audit settings global view command.
isi audit settings global view Protocol Auditing Enabled: No
Audited Zones: CEE Server URIs: -
Hostname: Config Auditing Enabled: No
Config Syslog Enabled: No Config Syslog Servers: Protocol Syslog Servers: -
Auto Purging Enabled: Yes Retention Period: 180
Disable automatic purging
You can disable automatic purging of audit log files. 1. Optional: To disable automatic purging, run the following command:
isi audit settings global modify --auto-purging-enabled=no
The automatic purging feature is disabled. 2. You can check if the automatic purging feature is disabled using the isi audit settings global view command.
Protocol Auditing Enabled: No Audited Zones: -
CEE Server URIs: Hostname:
Config Auditing Enabled: No Config Syslog Enabled: No Config Syslog Servers: -
Protocol Syslog Servers: Auto Purging Enabled: No Retention Period: 180
Modify retention period
At any time, you can modify the retention period value. 1. Optional: To modify the retention period value, run the following command:
isi audit settings global modify --retention-period=50
The retention period is now changed to 50 days. NOTE: The default retention value is 180 days.
Auditing and logging 205

2. You can check if the retention period has changed from 180 to 50 days using the isi audit settings global view command. isi audit settings global view Protocol Auditing Enabled: No Audited Zones: CEE Server URIs: Hostname: Config Auditing Enabled: No Config Syslog Enabled: No Config Syslog Servers: Protocol Syslog Servers: Auto Purging Enabled: Yes Retention Period: 50
Manual deletion
You can delete audit logs manually from the command-line interface. By using this method, you can delete audit logs before a certain day forcibly. There is no way to delete audit logs for a time span. The deletion deletes the audit logs of all the nodes present on the cluster. The deletion runs in background, and you can only run one instance of manual deletion at one time. If a manual deletion task is running, any other deletion request will be rejected.
Check status of manual deletion
You can check the status of the manual deletion. Optional: To check if the audit logs for the specified time period is deleted, run the isi audit logs check command. The deletion is successful, and the following message appears:
Purging Status: Using Before Value: 2019-11-01 Currently Manual Purging Status: COMPLETED
NOTE: If there are some audit logs that cannot be deleted, the output displays the reason.
Delete audit logs manually
You can delete the audit logs manually for a specified time period. 1. Optional: To delete audit logs manually, run the isi audit logs delete --before=<date> command:
isi audit logs delete --before=2019-11-1
The following message appears: You are going to delete the audit logs before 2019-11-01. Are you sure you want to do this?? (yes/[no]):
2. Enter "yes." The deletion request is triggered, and the following message appears: The purging request has been triggered. `isi audit logs check` can be used to monitor the process.
206 Auditing and logging

14
Snapshots
This section contains the following topics:
Topics:
· Snapshots overview · Data protection with SnapshotIQ · Snapshot disk-space usage · Snapshot schedules · Snapshot aliases · File and directory restoration · Best practices for creating snapshots · Best practices for creating snapshot schedules · File clones · Snapshot locks · Snapshot reserve · SnapshotIQ license functionality · Creating snapshots with SnapshotIQ · Managing snapshots · Restoring snapshot data · Managing snapshot schedules · Managing snapshot aliases · Managing with snapshot locks · Configure SnapshotIQ settings · Set the snapshot reserve · Managing changelists
Snapshots overview
A OneFS snapshot is a logical pointer to data that is stored on a cluster at a specific point in time. A snapshot references a directory on a cluster, including all data stored in the directory and its subdirectories. If the data referenced by a snapshot is modified, the snapshot stores a physical copy of the data that was modified. Snapshots are created according to user specifications or are automatically generated by OneFS to facilitate system operations. To create and manage snapshots, you must activate a SnapshotIQ license on the cluster. Some applications must generate snapshots to function but do not require you to activate a SnapshotIQ license; by default, these snapshots are automatically deleted when OneFS no longer needs them. However, if you activate a SnapshotIQ license, you can retain these snapshots. You can view snapshots that are generated by other modules without activating a SnapshotIQ license. You can identify and locate snapshots by name or ID. A snapshot name is specified by a user and assigned to the virtual directory that contains the snapshot. A snapshot ID is a numerical identifier that OneFS automatically assigns to a snapshot.
Data protection with SnapshotIQ
You can create snapshots to protect data with the SnapShotIQ software module. Snapshots protect data against accidental deletion and modification by enabling you to restore deleted and modified files. To use SnapshotIQ, you must activate a SnapshotIQ license on the cluster. Snapshots are less costly than backing up your data on a separate physical storage device in terms of both time and storage consumption. The time required to move data to another physical device depends on the amount of data being moved, whereas snapshots are always created almost instantaneously regardless of the amount of data referenced by the snapshot. Also,
Snapshots 207

because snapshots are available locally, end-users can often restore their data without requiring assistance from a system administrator. Snapshots require less space than a remote backup because unaltered data is referenced rather than recreated. Snapshots do not protect against hardware or file-system issues. Snapshots reference data that is stored on a cluster, so if the data on the cluster becomes unavailable, the snapshots will also be unavailable. Because of this, it is recommended that you back up your data to separate physical devices in addition to creating snapshots.
Snapshot disk-space usage
The amount of disk space that a snapshot consumes depends on both the amount of data stored by the snapshot and the amount of data the snapshot references from other snapshots. Immediately after OneFS creates a snapshot, the snapshot consumes a negligible amount of disk space. The snapshot does not consume additional disk space unless the data referenced by the snapshot is modified. If the data that a snapshot references is modified, the snapshot stores read-only copies of the original data. A snapshot consumes only the space that is necessary to restore the contents a directory to the state it was in when the snapshot was taken. To reduce disk-space usage, snapshots that reference the same directory reference each other, with older snapshots referencing newer snapshots. If a file is deleted, and several snapshots reference the file, a single snapshot stores a copy the file, and the other snapshots reference the file from the snapshot that stored the copy. The reported size of a snapshot reflects only the amount of data stored by the snapshot and does not include the amount of data referenced by the snapshot. Because snapshots do not consume a set amount of storage space, there is no available-space requirement for creating a snapshot. The size of a snapshot grows according to how the data referenced by the snapshot is modified. A cluster cannot contain more than 20,000 snapshots.
Snapshot schedules
You can automatically generate snapshots according to a snapshot schedule. With snapshot schedules, you can periodically generate snapshots of a directory without having to manually create a snapshot every time. You can also assign an expiration period that determines when SnapshotIQ deletes each automatically generated snapshot.
Snapshot aliases
A snapshot alias is a logical pointer to a snapshot. If you specify an alias for a snapshot schedule, the alias will always point to the most recent snapshot generated by that schedule. Assigning a snapshot alias allows you to quickly identify and access the most recent snapshot generated according to a snapshot schedule. If you allow clients to access snapshots through an alias, you can reassign the alias to redirect clients to other snapshots. In addition to assigning snapshot aliases to snapshots, you can also assign snapshot aliases to the live version of the file system. This can be useful if clients are accessing snapshots through a snapshot alias, and you want to redirect the clients to the live version of the file system.
File and directory restoration
You can restore the files and directories that are referenced by a snapshot alias. You can copy the data from the snapshot, clone a file from the snapshot, or revert the entire snapshot. Copying a file from a snapshot duplicates the file, which roughly doubles the amount of storage space consumed. Even if you delete the original file from the nonsnapshot directory, the copy of the file remains in the snapshot. Cloning a file from a snapshot also duplicates the file. However, a clone does not consume additional space on the cluster unless the clone or cloned file is modified. Reverting a snapshot replaces the contents of a directory with the data that is stored in the snapshot. Before a snapshot is reverted, SnapshotIQ creates a snapshot of the directory that is being replaced, which enables you to undo the snapshot revert later. Reverting a snapshot can be useful if you want to undo many changes that you made to files and directories. If new files or directories have been created in a directory since a snapshot of the directory was created, those files and directories are deleted when the snapshot is reverted.
208 Snapshots

NOTE: If you move a directory, you cannot revert snapshots of the directory that were taken before the directory was moved. Deleting and then re-creating a directory has the same effect as a move. You cannot revert snapshots of a directory that were taken before the directory was deleted and then re-created.

Best practices for creating snapshots
Consider the following snapshot best practices when working with a large number of snapshots.
It is recommended that you do not create more than 1,000 snapshots of a single directory to avoid performance degradation. If you create a snapshot of a root directory, that snapshot counts towards the total number of snapshots for any subdirectories of the root directory. For example, if you create 500 snapshots of /ifs/data and 500 snapshots of /ifs/data/media, you have created 1,000 snapshots of /ifs/data/media. Avoid creating snapshots of directories that are already referenced by other snapshots.
It is recommended that you do not create more than 1,000 hard links per file in a snapshot to avoid performance degradation. Always attempt to keep directory paths as shallow as possible. The deeper the depth of directories referenced by snapshots, the greater the performance degradation.
Creating snapshots of directories higher on a directory tree will increase the amount of time it takes to modify the data referenced by the snapshot and require more cluster resources to manage the snapshot and the directory. However, creating snapshots of directories lower on directories trees will require more snapshot schedules, which can be difficult to manage. It is recommended that you do not create snapshots of /ifs or /ifs/data.
You can create up to 20,000 snapshots on a cluster at a time. If your workflow requires a large number of snapshots on a consistent basis, you might find that managing snapshots through the OneFS command-line interface is preferable to managing snapshots through the OneFS web administration Interface. In the CLI, you can apply a wide variety of sorting and filtering options and redirect lists into text files.
You should mark snapshots for deletion when they are no longer needed, and make sure that the SnapshotDelete system job is enabled. Disabling the SnapshotDelete job prevents unused disk space from being recaptured and can also cause performance degradation over time.
If the system clock is set to a time zone other than Coordinated Universal Time (UTC), SnapShotIQ modifies snapshot duration periods to match Daylight Savings Time (DST). Upon entering DST, snapshot durations are increased by an hour to adhere to DST; when exiting DST, snapshot durations are decreased by an hour to adhere to standard time.

Best practices for creating snapshot schedules

Snapshot schedule configurations can be categorized by how they delete snapshots: ordered deletions and unordered deletions.
An ordered deletion is the deletion of the oldest snapshot of a directory. An unordered deletion is the deletion of a snapshot that is not the oldest snapshot of a directory. Unordered deletions take approximately twice as long to complete and consume more cluster resources than ordered deletions. However, unordered deletions can save space by retaining a smaller total number of snapshots.
The benefits of unordered deletions versus ordered deletions depend on how often the data referenced by the snapshots is modified. If the data is modified frequently, unordered deletions will save space. However, if data remains unmodified, unordered deletions will most likely not save space, and it is recommended that you perform ordered deletions to free cluster resources.
To implement ordered deletions, assign the same duration period for all snapshots of a directory. The snapshots can be created by one or multiple snapshot schedules. Always ensure that no more than 1000 snapshots of a directory are created.
To implement unordered snapshot deletions, create several snapshot schedules for a single directory, and then assign different snapshot duration periods for each schedule. Ensure that all snapshots are created at the same time when possible.

NOTE: Snapshot schedules with frequency of "Every Minute" are not recommended and are to be avoided.

The following table describes snapshot schedules that follow snapshot best practices:

Table 5. Snapshot schedule configurations

Deletion type Snapshot frequency

Snapshot time

Ordered

Every hour

deletion (for

Beginning at 12:00 AM Ending at 11:59 AM

Snapshot expiration Max snapshots retained

1 month

720

Snapshots 209

Table 5. Snapshot schedule configurations (continued)

Deletion type Snapshot frequency

Snapshot time

mostly static data)

Unordered deletion (for frequently modified data)

Every other hour
Every day Every week

Beginning at 12:00 AM Ending at 11:59 PM
At 12:00 AM
Saturday at 12:00 AM

Every month

The first Saturday of the month at 12:00 AM

Snapshot expiration Max snapshots retained

1 day

27

1 week 1 month 3 months

File clones
SnapshotIQ enables you to create file clones that share blocks with existing files in order to save space on the cluster. A file clone usually consumes less space and takes less time to create than a file copy. Although you can clone files from snapshots, clones are primarily used internally by OneFS.
The blocks that are shared between a clone and cloned file are contained in a hidden file called a shadow store. Immediately after a clone is created, all data originally contained in the cloned file is transferred to a shadow store. Because both files reference all blocks from the shadow store, the two files consume no more space than the original file; the clone does not take up any additional space on the cluster. However, if the cloned file or clone is modified, the file and clone will share only blocks that are common to both of them, and the modified, unshared blocks will occupy additional space on the cluster.
Over time, the shared blocks contained in the shadow store might become useless if neither the file nor clone references the blocks. The cluster routinely deletes blocks that are no longer needed. You can force the cluster to delete unused blocks at any time by running the ShadowStoreDelete job.
Clones cannot contain alternate data streams (ADS). If you clone a file that contains alternate data streams, the clone will not contain the alternate data streams.
Shadow-store considerations
Shadow stores are hidden files that are referenced by cloned and deduplicated files. Files that reference shadow stores behave differently than other files.
 Reading shadow-store references might be slower than reading data directly. Reading noncached shadow-store references is slower than reading noncached data. Reading cached shadow-store references takes no more time than reading cached data.
 When files that reference shadow stores are replicated to another PowerScale cluster or backed up to a Network Data Management Protocol (NDMP) backup device, the shadow stores are not transferred to the target PowerScale cluster or backup device. The files are transferred as if they contained the data that they reference from shadow stores. On the target PowerScale cluster or backup device, the files consume the same amount of space as if they had not referenced shadow stores.
 When OneFS creates a shadow store, OneFS assigns the shadow store to a storage pool of a file that references the shadow store. If you delete the storage pool that a shadow store resides on, the shadow store is moved to a pool that contains another file that references the shadow store.
 OneFS does not delete a shadow-store block immediately after the last reference to the block is deleted. Instead, OneFS waits until the ShadowStoreDelete job is run to delete the unreferenced block. If many unreferenced blocks exist on the cluster, OneFS might report a negative deduplication savings until the ShadowStoreDelete job is run.
 Shadow stores are protected at least as much as the most protected file that references it. For example, if one file that references a shadow store resides in a storage pool with +2 protection and another file that references the shadow store resides in a storage pool with +3 protection, the shadow store is protected at +3.
 Quotas account for files that reference shadow stores as if the files contained the data that is referenced from shadow stores. From the perspective of a quota, shadow-store references do not exist. However, if a quota includes data protection overhead, the quota does not account for the data protection overhead of shadow stores.

210 Snapshots

Snapshot locks
A snapshot lock prevents a snapshot from being deleted. If a snapshot has one or more locks applied to it, the snapshot cannot be deleted and is referred to as a locked snapshot. If the duration period of a locked snapshot expires, OneFS will not delete the snapshot until all locks on the snapshot have been deleted.
OneFS applies snapshot locks to ensure that snapshots generated by OneFS applications are not deleted prematurely. For this reason, it is recommended that you do not delete snapshot locks or modify the duration period of snapshot locks.
A limited number of locks can be applied to a snapshot at a time. If you create snapshot locks, the limit for a snapshot might be reached, and OneFS could be unable to apply a snapshot lock when necessary. For this reason, it is recommended that you do not create snapshot locks.

Snapshot reserve
The snapshot reserve enables you to set aside a minimum percentage of the cluster storage capacity specifically for snapshots. If specified, all other OneFS operations are unable to access the percentage of cluster capacity that is reserved for snapshots.
NOTE: The snapshot reserve does not limit the amount of space that snapshots can consume on the cluster. Snapshots can consume a greater percentage of storage capacity specified by the snapshot reserve. It is recommended that you do not specify a snapshot reserve.

SnapshotIQ license functionality

You can create snapshots only if you activate a SnapshotIQ license on a cluster. However, you can view snapshots and snapshot locks that are created for internal use by OneFS without activating a SnapshotIQ license.
The following table describes what snapshot functionality is available depending on whether the SnapshotIQ license is active:

Functionality Create snapshots and snapshot schedules Configure SnapshotIQ settings View snapshot schedules Delete snapshots Access snapshot data View snapshots

Inactive No
No Yes Yes Yes Yes

Active Yes
Yes Yes Yes Yes Yes

If you a SnapshotIQ license becomes inactive, you will no longer be able to create new snapshots, all snapshot schedules will be disabled, and you will not be able to modify snapshots or snapshot settings. However, you will still be able to delete snapshots and access data contained in snapshots.

Creating snapshots with SnapshotIQ
To create snapshots, you must configure the SnapshotIQ licence on the cluster. You can create snapshots either by creating a snapshot schedule or manually generating an individual snapshot.
Manual snapshots are useful if you want to create a snapshot immediately, or at a time that is not specified in a snapshot schedule. For example, if you plan to make changes to your file system, but are unsure of the consequences, you can capture the current state of the file system in a snapshot before you make the change.
Before creating snapshots, consider that reverting a snapshot requires that a SnapRevert domain exist for the directory that is being reverted. If you intend on reverting snapshots for a directory, it is recommended that you create SnapRevert domains for those directories while the directories are empty. Creating a domain for a directory that contains less data takes less time.

Snapshots 211

Create a SnapRevert domain
Before you can revert a snapshot that contains a directory, you must create a SnapRevert domain for the directory. It is recommended that you create SnapRevert domains for a directory while the directory is empty.
The root path of the SnapRevert domain must be the same root path of the snapshot. For example, a domain with a root path of /ifs/data/media cannot be used to revert a snapshot with a root path of /ifs/data/media/archive. To revert /ifs/data/media/archive, you must create a SnapRevert domain with a root path of /ifs/data/media/ archive.
Run the isi job jobs start command. The following command creates a SnapRevert domain for /ifs/data/media:
isi job jobs start domainmark --root /ifs/data/media \ --dm-type SnapRevert

Create a snapshot schedule
You can create a snapshot schedule to continuously generate snapshots of directories.
Run the isi snapshot schedules create command. The following command creates a snapshot schedule for /ifs/data/media:
isi snapshot schedules create hourly /ifs/data/media \ HourlyBackup_%m-%d-%Y_%H:%M "Every day every hour" \ --duration 1M
The following commands create multiple snapshot schedules for /ifs/data/media that generate and expire snapshots at different rates:
isi snapshot schedules create every-other-hour \ /ifs/data/media EveryOtherHourBackup_%m-%d-%Y_%H:%M \ "Every day every 2 hours" --duration 1D isi snapshot schedules create daily /ifs/data/media \ Daily_%m-%d-%Y_%H:%M "Every day at 12:00 AM" --duration 1W isi snapshot schedules create weekly /ifs/data/media \ Weekly_%m-%d-%Y_%H:%M "Every Saturday at 12:00 AM" --duration 1M isi snapshot schedules create monthly /ifs/data/media \ Monthly_%m-%d-%Y_%H:%M \ "The 1 Saturday of every month at 12:00 AM" --duration 3M

Create a snapshot
You can create a snapshot of a directory. Run the isi snapshot snapshots create command. The following command creates a snapshot for /ifs/data/media:
isi snapshot snapshots create /ifs/data/media --name media-snap

Snapshot naming patterns

If you schedule snapshots to be automatically generated, either according to a snapshot schedule or a replication policy, you must assign a snapshot naming pattern that determines how the snapshots are named. Snapshot naming patterns contain variables that include information about how and when the snapshot was created.
The following variables can be included in a snapshot naming pattern:

Variable %A

Description The day of the week.

212 Snapshots

Variable %a %B %b %C %c %d %e %F %G
%g
%H
%h %I
%j %k %l
%M %m %p %{PolicyName}

Description
The abbreviated day of the week. For example, if the snapshot is generated on a Sunday, %a is replaced with Sun.
The name of the month.
The abbreviated name of the month. For example, if the snapshot is generated in September, %b is replaced with Sep.
The first two digits of the year. For example, if the snapshot is created in 2014, %C is replaced with 20.
The time and day. This variable is equivalent to specifying %a %b %e %T %Y.
The two digit day of the month.
The day of the month. A single-digit day is preceded by a blank space.
The date. This variable is equivalent to specifying %Y-%m-%d.
The year. This variable is equivalent to specifying %Y. However, if the snapshot is created in a week that has less than four days in the current year, the year that contains the majority of the days of the week is displayed. The first day of the week is calculated as Monday. For example, if a snapshot is created on Sunday, January 1, 2017, %G is replaced with 2016, because only one day of that week is in 2017.
The abbreviated year. This variable is equivalent to specifying %y. However, if the snapshot was created in a week that has less than four days in the current year, the year that contains the majority of the days of the week is displayed. The first day of the week is calculated as Monday. For example, if a snapshot is created on Sunday, January 1, 2017, %g is replaced with 16, because only one day of that week is in 2017.
The hour. The hour is represented on the 24-hour clock. Single-digit hours are preceded by a zero. For example, if a snapshot is created at 1:45 AM, %H is replaced with 01.
The abbreviated name of the month. This variable is equivalent to specifying %b.
The hour represented on the 12-hour clock. Single-digit hours are preceded by a zero. For example, if a snapshot is created at 1:45 PM, %I is replaced with 01.
The numeric day of the year. For example, if a snapshot is created on February 1, %j is replaced with 32.
The hour represented on the 24-hour clock. Single-digit hours are preceded by a blank space.
The hour represented on the 12-hour clock. Single-digit hours are preceded by a blank space. For example, if a snapshot is created at 1:45 AM, %I is replaced with 1.
The two-digit minute.
The two-digit month.
AM or PM.
The name of the replication policy that the snapshot was created for. This variable is valid only if you are specifying a snapshot naming pattern for a replication policy.
Snapshots 213

Variable %R %r %S %s %{SrcCluster}
%T %U %u
%V
%v %W %w
%X %Y %y %Z %z
%+ %%
214 Snapshots

Description
The time. This variable is equivalent to specifying %H:%M.
The time. This variable is equivalent to specifying %I:%M:%S %p.
The two-digit second.
The second represented in UNIX or POSIX time.
The name of the source cluster of the replication policy that the snapshot was created for. This variable is valid only if you are specifying a snapshot naming pattern for a replication policy.
The time. This variable is equivalent to specifying %H:%M:%S
The two-digit numerical week of the year. Numbers range from 00 to 53. The first day of the week is calculated as Sunday.
The numerical day of the week. Numbers range from 1 to 7. The first day of the week is calculated as Monday. For example, if a snapshot is created on Sunday, %u is replaced with 7.
The two-digit numerical week of the year that the snapshot was created in. Numbers range from 01 to 53. The first day of the week is calculated as Monday. If the week of January 1 is four or more days in length, then that week is counted as the first week of the year.
The day that the snapshot was created. This variable is equivalent to specifying %e-%b-%Y.
The two-digit numerical week of the year that the snapshot was created in. Numbers range from 00 to 53. The first day of the week is calculated as Monday.
The numerical day of the week that the snapshot was created on. Numbers range from 0 to 6. The first day of the week is calculated as Sunday. For example, if the snapshot was created on Sunday, %w is replaced with 0.
The time that the snapshot was created. This variable is equivalent to specifying %H:%M:%S.
The year that the snapshot was created in.
The last two digits of the year that the snapshot was created in. For example, if the snapshot was created in 2014, %y is replaced with 14.
The time zone that the snapshot was created in.
The offset from coordinated universal time (UTC) of the time zone that the snapshot was created in. If preceded by a plus sign, the time zone is east of UTC. If preceded by a minus sign, the time zone is west of UTC.
The time and date that the snapshot was created. This variable is equivalent to specifying %a %b %e %X %Z %Y.
Escapes a percent sign. For example, 100%% is replaced with 100%.

Managing snapshots
You can delete and view snapshots. You can also modify the name, duration period, and snapshot alias of an existing snapshot. However, you cannot modify the data contained in a snapshot; the data contained in a snapshot is read-only.
Reducing snapshot disk-space usage
If multiple snapshots contain the same directories, deleting one of the snapshots might not free the entire amount of space that the system reports as the size of the snapshot. The size of a snapshot is the maximum amount of data that might be freed if the snapshot is deleted. Deleting a snapshot frees only the space that is taken up exclusively by that snapshot. If two snapshots reference the same stored data, that data is not freed until both snapshots are deleted. Remember that snapshots store data contained in all subdirectories of the root directory; if snapshot_one contains /ifs/data/, and snapshot_two contains /ifs/data/dir, the two snapshots most likely share data. If you delete a directory, and then re-create it, a snapshot containing the directory stores the entire re-created directory, even if the files in that directory are never modified. Deleting multiple snapshots that contain the same directories is more likely to free data than deleting multiple snapshots that contain different directories. If multiple snapshots contain the same directories, deleting older snapshots is more likely to free disk-space than deleting newer snapshots. Snapshots that are assigned expiration dates are automatically marked for deletion by the snapshot daemon. If the daemon is disabled, snapshots will not be automatically deleted by the system. It is recommended that you do not disable the snapshot daemon.
Delete a snapshot
You can delete a snapshot if you no longer want to access the data contained in the snapshot. OneFS frees disk space occupied by deleted snapshots when the SnapshotDelete job is run. Also, if you delete a snapshot that contains clones or cloned files, data in a shadow store might no longer be referenced by files on the cluster; OneFS deletes unreferenced data in a shadow store when the ShadowStoreDelete job is run. OneFS routinely runs both the shadow store delete and SnapshotDelete jobs. However, you can also manually run the jobs at any time. 1. Delete a snapshot by running the isi snapshot snapshots delete command.
The following command deletes a snapshot named OldSnapshot:
isi snapshot snapshots delete OldSnapshot 2. Optional: To increase the speed at which deleted snapshot data is freed on the cluster, start the SnapshotDelete job by
running the following command:
isi job jobs start snapshotdelete
3. To increase the speed at which deleted data shared between deduplicated and cloned files is freed on the cluster, start the ShadowStoreDelete job by running the following command:
isi job jobs start shadowstoredelete
Modify snapshot attributes
You can modify the name and expiration date of a snapshot. Run the isi snapshot snapshots modify command.
Snapshots 215

The following command causes HourlyBackup_07-15-2014_22:00 to expire on 1:30 PM on July 25th, 2014:
isi snapshot snapshots modify HourlyBackup_07-15-2014_22:00 \ --expires 2014-07-25T01:30

Modify a snapshot alias
You can modify the alias of a snapshot to assign an alternative name for the snapshot. Run the isi snapshot snapshots modify command. The following command assigns an alias of LastKnownGood to HourlyBackup_07-15-2013_22:00:
isi snapshot snapshots modify HourlyBackup_07-15-2013_22:00 \ --alias LastKnownGood

View snapshots

You can view a list of snapshots or detailed information about a specific snapshot. 1. View all snapshots by running the following command:
isi snapshot snapshots list

The system displays output similar to the following example:

ID Name

Path

-------------------------------------------------------------------

2

SIQ-c68839394a547b3fbc5c4c4b4c5673f9-latest /ifs/data/source

6

SIQ-c68839394a547b3fbc5c4c4b4c5673f9-restore /ifs/data/target

8

SIQ-Failover-newPol-2013-07-11_18-47-08

/ifs/data/target

12 HourlyBackup_07-15-2013_21:00

/ifs/data/media

14 HourlyBackup_07-15-2013_22:00

/ifs/data/media

16 EveryOtherHourBackup_07-15-2013_22:00

/ifs/data/media

18 HourlyBackup_07-15-2013_23:00

/ifs/data/media

20 HourlyBackup_07-16-2013_15:00

/ifs/data/media

22 EveryOtherHourBackup_07-16-2013_14:00

/ifs/data/media

-------------------------------------------------------------------

2. Optional: To view detailed information about a specific snapshot, run the isi snapshot snapshots view command. The following command displays detailed information about HourlyBackup_07-15-2013_22:00:

isi snapshot snapshots view HourlyBackup_07-15-2013_22:00

The system displays output similar to the following example:

ID: 14 Name: HourlyBackup_07-15-2013_22:00 Path: /ifs/data/media Has Locks: No Schedule: hourly Alias: Created: 2013-07-15T22:00:10 Expires: 2013-08-14T22:00:00 Size: 0b Shadow Bytes: 0b % Reserve: 0.00% % Filesystem: 0.00% State: active

216 Snapshots

Snapshot information

You can view information about snapshots through the output of the isi snapshot snapshots list command.

ID Name Path

The ID of the snapshot. The name of the snapshot. The path of the directory contained in the snapshot.

Restoring snapshot data
You can restore snapshot data through various methods. You can revert a snapshot or access snapshot data through the snapshots directory.
From the snapshots directory, you can either clone a file or copy a directory or a file. The snapshots directory can be accessed through Windows Explorer or a UNIX command line. You can disable and enable access to the snapshots directory for any of these methods through snapshots settings.
Revert a snapshot
You can revert a directory back to the state it was in when a snapshot was taken.
 Create a SnapRevert domain for the directory.  Create a snapshot of a directory. 1. Optional: To identify the ID of the snapshot you want to revert, run the isi snapshot snapshots view command.
The following command displays the ID of HourlyBackup_07-15-2014_23:00:
isi snapshot snapshots view HourlyBackup_07-15-2014_23:00
The system displays output similar to the following example:
ID: 18 Name: HourlyBackup_07-15-2014_23:00 Path: /ifs/data/media Has Locks: No Schedule: hourly Alias: Created: 2014-07-15T23:00:05 Expires: 2014-08-14T23:00:00 Size: 0b Shadow Bytes: 0b % Reserve: 0.00% % Filesystem: 0.00% State: active
2. Revert a snapshot by running the isi job jobs start command. The following command reverts HourlyBackup_07-15-2014_23:00:
isi job jobs start snaprevert --snapid 18

Restore a file or directory using Windows Explorer
If the Microsoft Shadow Copy Client is installed on your computer, you can use it to restore files and directories that are stored in snapshots.
This method of restoring files and directories does not preserve the original permissions. Instead, this method assigns the file or directory the same permissions as the directory you are copying that file or directory into. To preserve permissions while restoring data from a snapshot, run the cp command with the -a option on a UNIX command line.
NOTE: You can access up to 64 snapshots of a directory through Windows explorer, starting with the most recent snapshot. To access more than 64 snapshots for a directory, access the cluster through a UNIX command line.

Snapshots 217

1. In Windows Explorer, navigate to the directory that you want to restore or the directory that contains the file that you want to restore. If the directory has been deleted, you must recreate the directory.
2. Right-click the folder, and then click Properties. 3. In the Properties window, click the Previous Versions tab. 4. Select the version of the folder that you want to restore or the version of the folder that contains the version of the file that
you want to restore. 5. Restore the version of the file or directory.
 To restore all files in the selected directory, click Restore.  To copy the selected directory to another location, click Copy, and then specify a location to copy the directory to.  To restore a specific file, click Open, and then copy the file into the original directory, replacing the existing copy with
the snapshot version.
Restore a file or directory through a UNIX command line
You can restore a file or directory from a snapshot through a UNIX command line. 1. Open a connection to the cluster through a UNIX command line. 2. Optional: To view the contents of the snapshot you want to restore a file or directory from, run the ls command for a
directory contained in the snapshots root directory. For example, the following command displays the contents of the /archive directory contained in Snapshot2014Jun04:
ls /ifs/.snapshot/Snapshot2014Jun04/archive 3. Copy the file or directory by using the cp command.
For example, the following command creates a copy of the file1 file: cp -a /ifs/.snapshot/Snapshot2014Jun04/archive/file1 \ /ifs/archive/file1_copy
Clone a file from a snapshot
You can clone a file from a snapshot. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. To view the contents of the snapshot you want to restore a file or directory from, run the ls command for a subdirectory of
the snapshots root directory. For example, the following command displays the contents of the /archive directory contained in Snapshot2014Jun04:
ls /ifs/.snapshot/Snapshot2014Jun04/archive 3. Clone a file from the snapshot by running the cp command with the -c option.
For example, the following command clones test.txt from Snapshot2014Jun04: cp -c /ifs/.snapshot/Snapshot2014Jun04/archive/test.txt \ /ifs/archive/test_clone.text
218 Snapshots

Managing snapshot schedules
You can modify, delete, and view snapshot schedules.
Modify a snapshot schedule
You can modify a snapshot schedule. Any changes to a snapshot schedule are applied only to snapshots generated after the modifications are made. Existing snapshots are not affected by schedule modifications. If you modify the alias of a snapshot schedule, the alias is assigned to the next snapshot generated based on the schedule. However, if you do this, the old alias is not removed from the last snapshot that it was assigned to. Unless you manually remove the old alias, the alias will remain attached to the last snapshot that it was assigned to. Run the isi snapshot schedules modify command. The following command causes snapshots created according to the snapshot schedule hourly_media_snap to be deleted 15 days after they are created:
isi snapshot schedules modify hourly_media_snap --duration 15D

Delete a snapshot schedule
You can delete a snapshot schedule. Deleting a snapshot schedule will not delete snapshots that were previously generated according to the schedule. Run the isi snapshot schedules delete command. The following command deletes a snapshot schedule named hourly_media_snap:
isi snapshot schedules delete hourly_media_snap

View snapshot schedules

You can view snapshot schedules. 1. View snapshot schedules by running the following command:
isi snapshot schedules list

The system displays output similar to the following example:

ID Name

---------------------

1

every-other-hour

2

daily

3

weekly

4

monthly

---------------------

2. Optional: View detailed information about a specific snapshot schedule by running the isi snapshot schedules view command. The following command displays detailed information about the snapshot schedule every-other-hour:

isi snapshot schedules view every-other-hour

The system displays output similar to the following example:

ID: 1 Name: every-other-hour Path: /ifs/data/media Pattern: EveryOtherHourBackup_%m-%d-%Y_%H:%M Schedule: Every day every 2 hours Duration: 1D Alias: -

Snapshots 219

Next Run: 2013-07-16T18:00:00 Next Snapshot: EveryOtherHourBackup_07-16-2013_18:00
Managing snapshot aliases
You can configure snapshot schedules to assign a snapshot alias to the most recent snapshot created by a snapshot schedule. You can also manually assign snapshot aliases to specific snapshots or the live version of the file system.
Configure a snapshot alias for a snapshot schedule
You can configure a snapshot schedule to assign a snapshot alias to the most recent snapshot created by the schedule. If you configure an alias for a snapshot schedule, the alias is assigned to the next snapshot generated based on the schedule. However, if you do this, the old alias is not removed from the last snapshot that it was assigned to. Unless you manually remove the old alias, the alias will remain attached to the last snapshot that it was assigned to. Run the isi snapshot schedules modify command. The following command configures the alias LatestWeekly for the snapshot schedule WeeklySnapshot:
isi snapshot schedules modify WeeklySnapshot --alias LatestWeekly
Assign a snapshot alias to a snapshot
You can assign a snapshot alias to a snapshot. Run the isi snapshot aliases create command. The following command creates a snapshot alias for Weekly-01-30-2015:
isi snapshot aliases create latestWeekly Weekly-01-30-2015
Reassign a snapshot alias to the live file system
You can reassign a snapshot alias to redirect clients from a snapshot to the live file system. This procedure is available only through the command-line interface (CLI). 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Run the isi snapshot aliases modify command.
The following command reassigns the latestWeekly alias to the live file system: isi snapshot aliases modify latestWeekly --target LIVE
View snapshot aliases
You can view a list of all snapshot aliases. This procedure is available only through the command-line interface (CLI). 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. View a list of all snapshot aliases by running the following command:
isi snapshot aliases list If a snapshot alias references the live version of the file system, the Target ID is -1. 3. Optional: View information about a specific snapshot by running the isi snapshot aliases view command.
220 Snapshots

The following command displays information about latestWeekly: isi snapshot aliases view latestWeekly

Snapshot alias information

You can view information about snapshot aliases through the output of the isi snapshot aliases view command.

ID Name Target ID Target Name Created

The numerical ID of the snapshot alias. The name of the snapshot alias. The numerical ID of the snapshot that is referenced by the alias. The name of the snapshot that is referenced by the alias. The date that the snapshot alias was created.

Managing with snapshot locks
You can delete, create, and modify the expiration date of snapshot locks. CAUTION:
It is recommended that you do not create, delete, or modify snapshot locks unless you are instructed to do so by PowerScale Technical Support.
Deleting a snapshot lock that was created by OneFS might result in data loss. If you delete a snapshot lock that was created by OneFS, it is possible that the corresponding snapshot might be deleted while it is still in use by OneFS. If OneFS cannot access a snapshot that is necessary for an operation, the operation will malfunction and data loss might result. Modifying the expiration date of a snapshot lock created by OneFS can also result in data loss because the corresponding snapshot can be deleted prematurely.
Create a snapshot lock
You can create snapshot locks that prevent snapshots from being deleted. Although you can prevent a snapshot from being automatically deleted by creating a snapshot lock, it is recommended that you do not create snapshot locks. To prevent a snapshot from being automatically deleted, it is recommended that you extend the duration period of the snapshot. This procedure is available only through the command-line interface (CLI). 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Create a snapshot lock by running the isi snapshot locks create command.
For example, the following command applies a snapshot lock to SnapshotApril2016, sets the lock to expire in one month, and adds a description of "Maintenance Lock":
isi snapshot locks create SnapshotApril2016 --expires 1M \ --comment "Maintenance Lock"

Modify a snapshot lock expiration date
You can modify the expiration date of a snapshot lock. CAUTION: It is recommended that you do not modify the expiration dates of snapshot locks.
This procedure is available only through the command-line interface (CLI). 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Run the isi snapshot locks modify command.

Snapshots 221

The following command sets an expiration date two days from the present date for a snapshot lock with an ID of 1 that is applied to a snapshot named SnapshotApril2014:
isi snapshot locks modify SnapshotApril2014 1 --expires 2D

Delete a snapshot lock
You can delete a snapshot lock.
CAUTION: It is recommended that you do not delete snapshot locks.
This procedure is available only through the command-line interface (CLI). 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Delete a snapshot lock by running the isi snapshot locks delete command.
The following command deletes a snapshot lock that is applied to SnapshotApril2014 and has a lock ID of 1:
isi snapshot locks delete Snapshot2014Apr16 1
The system prompts you to confirm that you want to delete the snapshot lock. 3. Type yes and then press ENTER.

Snapshot lock information

You can view snapshot lock information through the isi snapshot locks view and isi snapshot locks list commands.

ID Comment Expires Count

Numerical identification number of the snapshot lock.
Description of the snapshot lock. This can be any string specified by a user.
The date that the snapshot lock will be automatically deleted by OneFS.
The number of times the snapshot lock is held.
The file clone operation can hold a single snapshot lock multiple times. If multiple file clones are created simultaneously, the file clone operation holds the same lock multiple times, rather than creating multiple locks. If you delete a snapshot lock that is held more than once, you will delete only one of the instances that the lock is held. In order to delete a snapshot lock that is held multiple times, you must delete the snapshot lock the same number of times as displayed in the count field.

Configure SnapshotIQ settings
You can configure SnapshotIQ settings that determine how snapshots can be created and the methods that users can access snapshot data. 1. Optional: View current SnapshotIQ settings by running the following command:
isi snapshot settings view
The system displays output similar to the following example:
Service: Yes Autocreate: Yes Autodelete: Yes
Reserve: 0.00% Global Visible Accessible: Yes
NFS Root Accessible: Yes NFS Root Visible: Yes
NFS Subdir Accessible: Yes SMB Root Accessible: Yes SMB Root Visible: Yes
SMB Subdir Accessible: Yes

222 Snapshots

Local Root Accessible: Yes Local Root Visible: Yes
Local Subdir Accessible: Yes
2. Configure SnapshotIQ settings by running the isi snapshot settings modify command: The following command prevents snapshots from being created on the cluster:
isi snapshot settings modify --service disable

SnapshotIQ settings

SnapshotIQ settings determine how snapshots behave and can be accessed. The following settings are displayed in the output of the isi snapshot settings view command:

Service Autocreate

Determines whether SnapshotIQ is enabled on the cluster.
Determines whether snapshots are automatically generated according to snapshot schedules. NOTE: Disabling snapshot generation might cause some OneFS operations to fail. It is recommended that you do not disable this setting.

Autodelete

Determines whether snapshots are automatically deleted according to their expiration dates.

Reserve

Specifies the percentage of disk space on the cluster that is reserved for snapshots.

NFS Root Accessible

Determines whether snapshot directories are accessible through NFS.

NFS Root Visible Determines whether snapshot directories are visible through NFS.

NFS Subdir Accessible

Determines whether snapshot subdirectories are accessible through NFS.

SMB Root Accessible

Determines whether snapshot directories are accessible through SMB.

SMB Root Visible Determines whether snapshot directories are visible through SMB.

SMB Subdir Accessible

Determines whether snapshot subdirectories are accessible through SMB.

Local Root Accessible

Determines whether snapshot directories are accessible through an SSH connection or the local console.

Local Root Visible

Determines whether snapshot directories are visible through the an SSH connection or the local console.

Local Subdir Accessible

Determines whether snapshot subdirectories are accessible through an SSH connection or the local console.

Set the snapshot reserve
You can specify a minimum percentage of cluster-storage capacity that you want to reserve for snapshots. The snapshot reserve does not limit the amount of space that snapshots are allowed to consume on the cluster. Snapshots can consume more than the percentage of capacity specified by the snapshot reserve. It is recommended that you do not specify a snapshot reserve. This procedure is available only through the command-line interface (CLI). 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Set the snapshot reserve by running the isi snapshot settings modify command with the --reserve option.
For example, the following command sets the snapshot reserve to 20%:
isi snapshot settings modify --reserve 20

Snapshots 223

Managing changelists
You can create and view changelists that describe the differences between two snapshots. You can create a changelist for any two snapshots that have a common root directory. Changelists are most commonly accessed by applications through the OneFS Platform API. For example, a custom application could regularly compare the two most recent snapshots of a critical directory path to determine whether to back up the directory, or to trigger other actions.
Create a changelist
You can create a changelist that shows what data was changed between snapshots. 1. Optional: To view the IDs of the snapshots you want to create a changelist for, run the following command:
isi snapshot snapshots list
2. Create a changelist by running the isi job jobs start command with the ChangelistCreate option. The following command creates a changelist: isi job jobs start ChangelistCreate --older-snapid 2 --newer-snapid 6
Delete a changelist
You can delete a changelist Run the isi_changelist_mod command with the -k option. The following command deletes changelist 22_24:
isi_changelist_mod -k 22_24

View a changelist
You can view a changelist that describes the differences between two snapshots. This procedure is available only through the command-line interface (CLI). 1. View the IDs of changelists by running the following command:
isi_changelist_mod -l
Changelist IDs include the IDs of both snapshots used to create the changelist. If OneFS is still in the process of creating a changelist, inprog is appended to the changelist ID.
2. Optional: View all contents of a changelist by running the isi_changelist_mod command with the -a option. The following command displays the contents of a changelist named 2_6:
isi_changelist_mod -a 2_6

Changelist information

You can view the information contained in changelists.

NOTE: The information contained in changelists is meant to be consumed by applications through the OneFS Platform API.

The following information is displayed for each item in the changelist when you run the isi_changelist_mod command:

st_ino

Displays the inode number of the specified item.

224 Snapshots

st_mode st_size st_atime st_mtime st_ctime cl_flags
path

Displays the file type and permissions for the specified item. Displays the total size of the item in bytes.

Displays the POSIX timestamp of when the item was last accessed. Displays the POSIX timestamp of when the item was last modified. Displays the POSIX timestamp of when the item was last changed.

Displays information about the item and what kinds of changes were made to the item.

01

The item was added or moved under the root directory of the snapshots.

02

The item was removed or moved out of the root directory of the snapshots.

04

The path of the item was changed without being removed from the root directory

of the snapshot.

10

The item either currently contains or at one time contained Alternate Data Streams

(ADS).

20

The item is an ADS.

40

The item has hardlinks.

NOTE: These values are added together in the output. For example, if an ADS was added, the code would be cl_flags=021.

The absolute path of the specified file or directory.

Snapshots 225

15
Deduplication with SmartDedupe
This section contains the following topics:
Topics:
· Deduplication overview · Deduplication jobs · Data replication and backup with deduplication · Snapshots with deduplication · Deduplication considerations · Shadow-store considerations · SmartDedupe license functionality · Managing deduplication
Deduplication overview
SmartDedupe enables you to save storage space on your cluster by reducing redundant data. Deduplication maximizes the efficiency of your cluster by decreasing the amount of storage that is required to store multiple files with identical blocks. The SmartDedupe software module deduplicates data by scanning a PowerScale cluster for identical data blocks. Each block is 8 KB. If SmartDedupe finds duplicate blocks, SmartDedupe moves a single copy of the blocks to a hidden file called a shadow store. SmartDedupe then deletes the duplicate blocks from the original files and replaces the blocks with pointers to the shadow store. Deduplication is applied at the directory level, targeting all files and directories underneath one or more root directories. SmartDedupe not only deduplicates identical blocks in different files, it also deduplicates identical blocks within a single file. Before you deduplicate a directory, you can get an estimate of the amount of space you can expect to save. After you begin deduplicating a directory, you can monitor the amount of space that deduplication is saving in real time. To enable deduplicating two or more files, the files must have the same disk pool policy ID and protection policy. If either or both of these attributes differ between two or more identical files, or files with identical 8 K blocks, the files are not deduplicated. Because it is possible to specify protection policies on a per-file or per-directory basis, deduplication can be further affected. Consider the example of two files, /ifs/data/projects/alpha/logo.jpg and /ifs/data/projects/ beta/logo.jpg. Even if the logo.jpg files in both directories are identical, they would not be deduplicated if they have different protection policies. If you have activated a SmartPools license on your cluster, you can also specify custom file pool policies. These file pool policies might result in identical files or files with identical 8 K blocks being stored in different node pools. Those files would have different disk pool policy IDs and would not be deduplicated. SmartDedupe also does not deduplicate files that are 32 KB or smaller, because doing so would consume more cluster resources than the storage savings are worth. The default size of a shadow store is 2 GB. Each shadow store can contain up to 256,000 blocks. Each block in a shadow store can be referenced up to 32,000 times.
Deduplication jobs
A deduplication system maintenance job deduplicates data on a cluster. You can monitor and control deduplication jobs as you would any other maintenance job on the cluster. Although the overall performance impact of deduplication is minimal, the deduplication job consumes 400 MB of memory per node. When a deduplication job runs for the first time on a cluster, SmartDedupe samples blocks from each file and creates index entries for those blocks. If the index entries of two blocks match, SmartDedupe scans the blocks that are next to the matching pair and then deduplicates all duplicate blocks. After a deduplication job samples a file once, new deduplication jobs will not sample the file again until the file is modified.
226 Deduplication with SmartDedupe

The first deduplication job that you run can take longer to complete than subsequent deduplication jobs. The first deduplication job must scan all files under the specified directories to generate the initial index. If subsequent deduplication jobs take a long time to complete, the most likely cause is that a large amount of data is being deduplicated. However, it can also indicate that users are storing large amounts of new data on the cluster. If a deduplication job is interrupted during the deduplication process, the job automatically restarts the scanning process from where the job was interrupted.
NOTE: Run deduplication jobs when users are not modifying data on the cluster. If users continually modify files on the cluster, the storage savings are minimal because the deduplicated blocks are constantly removed from the shadow store.
How frequently you should run a deduplication job on your PowerScale cluster varies, depending on the size of your dataset, the rate of changes, and opportunity. For most clusters, it is recommended that you start a deduplication job every 7 to 10 days. You can start a deduplication job manually or schedule a recurring job at specified intervals. By default, the deduplication job is configured to run at a low priority. However, you can specify job controls, such as priority and impact, on deduplication jobs that run manually or by schedule.
The permissions required to modify deduplication settings are not the same as the permissions required to run a deduplication job. Although a user must have the maintenance job permission to run a deduplication job, the user must have the deduplication permission to modify deduplication settings. By default, the root user and SystemAdmin user have the necessary permissions for all deduplication operations.
Data replication and backup with deduplication
When deduplicated files are replicated to another PowerScale cluster or backed up to a tape device, the deduplicated files no longer share blocks on the target PowerScale cluster or backup device. However, although you can deduplicate data on a target PowerScale cluster, you cannot deduplicate data on an NDMP backup device.
Shadows stores are not transferred to target clusters or backup devices. Because of this, deduplicated files do not consume less space than non-deduplicated files when they are replicated or backed up. To avoid running out of space, you must ensure that target clusters and tape devices have enough free space to store deduplicated data as if the data had not been deduplicated. To reduce the amount of storage space consumed on a target PowerScale cluster, you can configure deduplication for the target directories of your replication policies. Although this will deduplicate data on the target directory, it will not allow SyncIQ to transfer shadow stores. Deduplication is still performed by deduplication jobs running on the target cluster.
The amount of cluster resources required to backup and replicate deduplicated data is the same as for non-deduplicated data. You can deduplicate data while the data is being replicated or backed up.
Snapshots with deduplication
You cannot deduplicate the data stored in a snapshot. However, you can create snapshots of deduplicated data.
If you create a snapshot for a deduplicated directory, and then modify the contents of that directory, the references to shadow stores will be transferred to the snapshot over time. Therefore, if you enable deduplication before you create snapshots, you will save more space on your cluster. If you implement deduplication on a cluster that already has a significant amount of data stored in snapshots, it will take time before the snapshot data is affected by deduplication. Newly created snapshots can contain deduplicated data, but snapshots created before deduplication was implemented cannot.
If you plan on reverting a snapshot, it is best to revert the snapshot before running a deduplication job. Restoring a snapshot can overwrite many of the files on the cluster. Any deduplicated files are reverted back to normal files if they are overwritten by a snapshot revert. However, after the snapshot revert is complete, you can deduplicate the directory and the space savings persist on the cluster.
Deduplication considerations
Deduplication can significantly increase the efficiency at which you store data. However, the effect of deduplication varies depending on the cluster.
You can reduce redundancy on a cluster by running SmartDedupe. Deduplication creates links that can impact the speed at which you can read from and write to files. In particular, sequentially reading chunks smaller than 512 KB of a deduplicated file can be significantly slower than reading the same small, sequential chunks of a non-deduplicated file. This performance degradation applies only if you are reading non-cached data. For cached data, the performance for deduplicated files is potentially better than non-deduplicated files. If you stream chunks larger than 512 KB, deduplication does not significantly
Deduplication with SmartDedupe 227

impact the read performance of the file. If you intend on streaming 8 KB or less of each file at a time, and you do not plan on concurrently streaming the files, it is recommended that you do not deduplicate the files. Deduplication is most effective when applied to static or archived files and directories. The less files are modified, the less negative effect deduplication has on the cluster. For example, virtual machines often contain several copies of identical files that are rarely modified. Deduplicating a large number of virtual machines can greatly reduce consumed storage space.
Shadow-store considerations
Shadow stores are hidden files that are referenced by cloned and deduplicated files. Files that reference shadow stores behave differently than other files.  Reading shadow-store references might be slower than reading data directly. Reading noncached shadow-store references
is slower than reading noncached data. Reading cached shadow-store references takes no more time than reading cached data.  When files that reference shadow stores are replicated to another PowerScale cluster or backed up to a Network Data Management Protocol (NDMP) backup device, the shadow stores are not transferred to the target PowerScale cluster or backup device. The files are transferred as if they contained the data that they reference from shadow stores. On the target PowerScale cluster or backup device, the files consume the same amount of space as if they had not referenced shadow stores.  When OneFS creates a shadow store, OneFS assigns the shadow store to a storage pool of a file that references the shadow store. If you delete the storage pool that a shadow store resides on, the shadow store is moved to a pool that contains another file that references the shadow store.  OneFS does not delete a shadow-store block immediately after the last reference to the block is deleted. Instead, OneFS waits until the ShadowStoreDelete job is run to delete the unreferenced block. If many unreferenced blocks exist on the cluster, OneFS might report a negative deduplication savings until the ShadowStoreDelete job is run.  Shadow stores are protected at least as much as the most protected file that references it. For example, if one file that references a shadow store resides in a storage pool with +2 protection and another file that references the shadow store resides in a storage pool with +3 protection, the shadow store is protected at +3.  Quotas account for files that reference shadow stores as if the files contained the data that is referenced from shadow stores. From the perspective of a quota, shadow-store references do not exist. However, if a quota includes data protection overhead, the quota does not account for the data protection overhead of shadow stores.
SmartDedupe license functionality
You can deduplicate data only if you activate a SmartDedupe license on a cluster. However, you can assess deduplication savings without activating a SmartDedupe license. If you activate a SmartDedupe license, and then deduplicate data, the space savings are not lost if the license becomes inactive. You can also still view deduplication savings while the license is inactive. However, you will not be able to deduplicate additional data until you re-activate the SmartDedupe license.
Managing deduplication
You can manage deduplication on a cluster by first assessing how much space you can save by deduplicating individual directories. After you determine which directories are worth deduplicating, you can configure SmartDedupe to deduplicate those directories specifically. You can then monitor the actual amount of disk space you are saving.
Assess deduplication space savings
You can assess the amount of disk space you will save by deduplicating a directory. 1. Specify which directory to assess by running the isi dedupe settings modify command.
The following command configures SmartDedupe to assess deduplication savings for /ifs/data/archive:
isi dedupe settings modify --assess-paths /ifs/data/archive
If you assess multiple directories, disk savings will not be differentiated by directory in the deduplication report.
228 Deduplication with SmartDedupe

2. Start the assessment job by running the following command: isi job jobs start DedupeAssessment
3. Identify the ID of the assessment report by running the following command: isi dedupe reports list
4. View prospective space savings by running the isi dedupe reports view command: The following command displays the prospective savings recorded in a deduplication report with an ID of 46: isi dedupe reports view 46

Specify deduplication settings
You can specify which directories you want to deduplicate. 1. Specify which directories you want to deduplicate by running the isi dedupe settings modify command.
The following command targets /ifs/data/archive and /ifs/data/media for deduplication:
isi dedupe settings modify --paths /ifs/data/media, /ifs/data/archive
2. Optional: To modify the settings of the deduplication job, run the isi job types modify command. The following command configures the deduplication job to be run every Friday at 10:00 PM:
isi job types modify Dedupe --schedule "Every Friday at 10:00 PM"

View deduplication space savings
You can view the amount of disk space that you are currently saving with deduplication. Run the following command:
isi dedupe stats

View a deduplication report
After a deduplication job completes, you can view information about the job in a deduplication report. 1. Optional: To identify the ID of the deduplication report you want to view, run the following command:
isi dedupe reports list
2. View a deduplication report by running the isi dedupe reports view command. The following command displays a deduplication report with an ID of 44: isi dedupe reports view 44

Deduplication job report information

You can view the following deduplication specific information in deduplication job reports:

Start time

The time the deduplication job started.

Deduplication with SmartDedupe 229

End time

The time the deduplication job ended.

Iteration Count

The number of times that SmartDedupe interrupted the sampling process. If SmartDedupe is sampling a large amount of data, SmartDedupe might interrupt sampling in order to start deduplicating the data. After SmartDedupe finishes deduplicating the sampled data, SmartDedupe will continue sampling the remaining data.

Scanned blocks The total number of blocks located underneath the specified deduplicated directories.

Sampled blocks The number of blocks that SmartDedupe created index entries for.

Deduped blocks The number of blocks that were deduplicated.

Dedupe percent The percentage of scanned blocks that were deduplicated.

Created dedupe requests

The total number of deduplication requests created. A deduplication request is created for each matching pair of data blocks. For example, if you have 3 data blocks that all match, SmartDedupe creates 2 requests. One of the requests could pair file1 and file2 together and the other request could pair file2 and file3 together.

Successful

The number of deduplication requests that completed successfully.

dedupe requests

Failed dedupe requests

The number of deduplication requests that failed. If a deduplication request fails, it doesn't mean that the job failed too. A deduplication request can fail for any number of reasons. For example, the file might have been modified since it was sampled.

Skipped files

The number of files that were not scanned by the deduplication job. SmartDedupe skips files for a number of reasons. For example, SmartDedupe skips files that have already been scanned and haven't been modified since. SmartDedupe also skips all files that are smaller than 4 KB.

Index entries

The number of entries that currently exist in the index.

Index lookup attempts

The total number of lookups that have been done by earlier deduplication jobs plus the number of lookups done by this deduplication job. A lookup is when the deduplication job attempts to match a block that was indexed with a block that hasn't been indexed.

Index lookup hits The number of blocks that matched index entries.

Deduplication information

You can view information about how much disk space is being saved by deduplication. The following information is displayed in the output of the isi dedupe stats command:

Cluster Physical Size Cluster Used Size Logical Size Deduplicated Logical Saving
Estimated Size Deduplicated
Estimated Physical Saving

The total amount of physical disk space on the cluster.
The total amount of disk space currently occupied by data on the cluster.
The amount of disk space that has been deduplicated in terms of reported file sizes. For example, if you have three identical files that are all 5 GB, the logical size deduplicated is 15 GB.
The amount of disk space saved by deduplication in terms of reported file sizes. For example, if you have three identical files that are all 5 GB, the logical saving is 10 GB.
The total amount of physical disk space that has been deduplicated, including protection overhead and metadata. For example, if you have three identical files that are all 5 GB, the estimated size deduplicated would be greater than 15 GB, because of the disk space consumed by file metadata and protection overhead.
The total amount of physical disk space saved by deduplication, including protection overhead and metadata. For example, if you have three identical files that are all 5 GB, the estimated physical saving would be greater than 10 GB, because deduplication saved space that would have been occupied by file metadata and protection overhead.

230 Deduplication with SmartDedupe

16
Inline Data Deduplication
Inline data deduplication performs deduplication of data before the data is committed to disk.
Topics:
· Inline Data Deduplication overview · Inline deduplication interoperability · Considerations for using inline deduplication · Enable inline deduplication · Verify inline deduplication is enabled · View inline deduplication reports · Disable or pause inline deduplication · Remove deduplication · Troubleshoot index allocation issues
Inline Data Deduplication overview
Inline data deduplication for Isilon F810, H5600, and PowerScale F200 and F600 nodes deduplicates data before the data is committed to disk. Deduplicating data before it is committed avoids redundant writes to disk. Inline data deduplication (inline deduplication) includes inline zero block elimination, asynchronous data deduplication, and an in-memory, nonpersistent index table. Inline deduplication is supported as follows:  Isilon F810 disk pools with OneFS 8.2.1 or later on all nodes  Isilon H5600 disk pools with OneFS 8.2.2 or later on all nodes  PowerScale F200 and F600 disk pools with OneFS 9.0.0.0 on all nodes Depending on workload, the data reduction rate with the inline compression and inline data deduplication features enabled is typically around 3:1. No license is required for inline data deduplication. Inline deduplication is a cluster-wide setting and is disabled by default. When enabled, the feature is always active, applies globally, and applies to all files on disk pools that support data reduction. Exceptions include:  Packed files  Writes to snapshots, though deduplicated data can be copied on write to snapshots.  Shadow stores  Stubbed files, such as CloudPools files  Files with the no_dedupe attribute set You cannot selectively enable inline deduplication on individual files. To be deduplicated, two files with identical data blocks or a file and a shadow store with identical data blocks must have the same disk pool policy ID. OneFS deduplicates data to a shadow store. OneFS uses a protection policy that is at least as high as the protection policy of the files being deduplicated.
NOTE: The "always on" aspect of inline deduplication can affect performance. Inline deduplication may not be right for performance-sensitive workloads. More guidance is available in Considerations for using inline deduplication. You must have the ISI_PRIV_CLUSTER privilege to enable or disable inline deduplication. You enable inline deduplication from the command line:
isi dedupe inline settings modify --mode enabled
Inline Data Deduplication 231

Comparing inline deduplication with SmartDedupe

The following table compares inline deduplication with the SmartDedupe service.

Inline deduplication Globally enabled Processes all regular files Deduplicates sequential runs of blocks of matching data to single blocks Per node, nonpersistent in-memory index Can convert copy operations to clone. Opportunistic No license required

SmartDedupe Directory tree based Skips files less than 32 KB by default Can only deduplicate between files.
Large persistent on-disk index Post process only Exhaustive License required

Inline deduplication workflow
Inline deduplication begins when data is flushed from the SmartCache (also known as the coalescer). The stages are:
 SmartCache (coalescer) flush.  Determine the data to copy on write to snapshots.  Remove zero blocks.  Replace duplicate data with shadow store references.  Compress the remaining data.  Write to storage. Zero block elimination is performed before inline deduplication. Files that are not eligible for deduplication may still have zero blocks that are removed. Data blocks that contain only zeros are detected and prevented from being written to disk. Skipping zero blocks can reduce the work that inline deduplication and data compression require.
The in-memory index table
Inline deduplication uses an in-memory index table to track dedupable data blocks. The index table is allocated on each node that supports the feature. Allocating the index table depends on available resources.
Inline deduplication is an opportunistic best effort service and is not a substitute for the SmartDedupe service. However, inline deduplication can reduce the amount of work that SmartDedupe has to do.
The default size of the index table is 10% of RAM up to a maximum of 16 GB. Each node has its own index: there is no sharing between nodes. Because the index is in-memory only, its contents are lost on reboot.
If you enable inline deduplication on a system that is booting, index allocation should happen quickly. If the system has been running for a while, locating the memory required for the index table may be difficult. In that case, index allocation can take longer and, if there is insufficient memory, can fail. See Troubleshoot index allocation issues for guidance.
The newly allocated index table is empty. Inline deduplication hashes data blocks as they are read and written and records the results in the index table. If inline deduplication encounters matching data blocks, data is deduplicated immediately. Over time, finding matching data becomes more effective as the index accumulates file system data.
The following describes the deduplication process when an initial data match is found between two files.
1. The data being written is redirected to a shadow store. 2. Shadow references are inserted into the current file. 3. Inline deduplication queues an asynchronous worker process to deduplicate the matching file with the shadow store.
After the initial match, inline deduplication compares data being written with the data in the shadow store. If it finds a match, it updates the current file with shadow references and does not write data to storage. Subsequent data matches are typically faster than the initial match since they involve less work.

232 Inline Data Deduplication

Inline deduplication upgrade considerations
The following are upgrade considerations for using inline deduplication.
 Specific versions of OneFS must run on all nodes in the cluster, as follows:  F810 requires 8.2.1 or later.  H5600 requires 8.2.2 or later.  F200 and F600 require 9.0.0.0 or later.
 Disk pools that can support inline deduplication must have the data_reduce flag set.  The data_reduce flag is set automatically on upgrade commit on all disk pools that support compression and inline
deduplication.

Inline deduplication interoperability

Inline deduplication interoperates with OneFS as follows.

SmartDedupe Data compression Snapshots
Packing (Small File Storage Efficiency) Backup and restore

Use to find deduplication matches not found by inline deduplication. Inline deduplication operates on uncompressed data. Data written to shadow stores is compressed.
Writes to snapshots are not inline deduplicated, but deduplicated data can be copied on write to snapshots. Packed files are skipped by inline deduplication, though zero block elimination still occurs.
Backup and inline deduplication interoperate in the same way that backup and SmartDedupe interoperate. Files that were deduplicated using inline deduplication are indistinguishable from files that were deduplicated using SmartDedupe, and the same conditions apply. The local file remains deduplicated on disk. Files are rehydrated on read and the full file contents are backed up. As with SmartDedupe, ensure that the target clusters or backup devices have enough space to accommodate un-deduplicated files. However, transferring data to another PowerScale cluster that has inline deduplication enabled can help you to avoid requiring the full rehydrated capacity on the target cluster.

Considerations for using inline deduplication
This section describes considerations for using (or not using) inline deduplication. Enabling inline deduplication can be advantageous if your users or workload have characteristics such as the following.  If your users frequently copy files, either large files or whole data sets. In this case, inline deduplication can effectively turn
these operations into clone operations.  If your workloads involve lots of small files (such as EDA), those small files are deduplicated more efficiently with inline
deduplication. By default, SmartDedupe skips small files.  If your data sets contain a large amount of zeroed data, storage savings are available from that alone. You may not need inline deduplication if:  Your data sets have little or no deduplication.  You prefer to run SmartDedupe during off hours. Inline deduplication is always on.  Your workload is performance sensitive. Inline deduplication may add too much overhead.
Enable inline deduplication
Enable inline deduplication from the command line. You can enable inline deduplication on Isilon F810 nodes, H5600 nodes, and on PowerScale F200 and F600 nodes. After inline deduplication is enabled, it is always on and applies globally, cluster-wide. You must have the ISI_PRIV_CLUSTER privilege to administer inline deduplication.

Inline Data Deduplication 233

1. Log in to your cluster as a user with the administrator role. 2. Enter the following command:
isi dedupe inline settings modify --mode enabled
Inline deduplication is enabled.
Verify inline deduplication is enabled
You can verify that inline deduplication is enabled either globally or by checking each node manually. 1. Log in to your cluster as a user with the administrator role. 2. To check whether inline deduplication is enabled globally, enter the following command:
# isi_for_array isi_inline_dedupe_status
If the command returns OK, inline deduplication is globally enabled. 3. To check each node manually, enter the following command:
# isi_for_array sysctl efs.sfm.inline_dedupe.mode
Each node returns its status. If the status is enabled, inline deduplication is active on that node. For example: node-1: efs.sfm.inline_dedupe.mode:enabled node-2: efs.sfm.inline_dedupe.mode:enabled node-3: efs.sfm.inline_dedupe.mode:enabled

View inline deduplication reports

You can view reports to monitor the results of inline deduplication on your clusters. Inline deduplication is supported on Isilon F810 nodes, H5600 nodes, and on PowerScale F200 and F600 nodes. You must have the ISI_PRIV_STATISTICS privilege to run the isi statistics data-reduction or isi dedupe stats commands. 1. Log in to your cluster as a user with the administrator role. 2. To view data reduction statistics, enter the following command:
# isi statistics data-reduction

A report similar to the following appears.

Recent Writes Cluster Data Reduction

(5 mins)

--------------------- ------------- ----------------------

Logical data

3.20M

47.74M

Zero-removal saved

0

-

Deduplication saved

0

0

Compression saved

0

0

Preprotected physical

3.20M

47.74M

Protection overhead

6.39M

94.81M

Protected physical

9.59M

152.79M

Zero removal ratio

1.00 : 1

-

Deduplication ratio

1.00 : 1

1.00 : 1

Compression ratio

1.00 : 1

1.00 : 1

Data reduction ratio

1.00 : 1

1.00 : 1

Efficiency ratio

0.33 : 1

0.31 : 1

--------------------- ------------- ----------------------

234 Inline Data Deduplication

3. To view inline deduplication statistics, enter the following command: # isi dedupe stats
Statistics similar to the following appear: Cluster Physical Size: 86.14T Cluster Used Size: 248.93G Logical Size Deduplicated: 1.17T Logical Saving: 611.10G Estimated Size Deduplicated: 124.11G Estimated Physical Saving: 63.20G
Disable or pause inline deduplication
You can disable or pause inline deduplication. Disabling inline deduplication deactivates inline deduplication and deallocates the index table. Pausing inline deduplication deactivates inline deduplication but leaves the index table intact.
NOTE: Disabling inline deduplication does not remove the effects of deduplication. See Remove deduplication. If you plan to temporarily deactivate inline deduplication, it is recommended that you pause instead of disabling the feature. You must have the ISI_PRIV_CLUSTER privilege to administer inline deduplication. 1. Log in to your cluster as a user with the administrator role. 2. To disable inline deduplication, enter the following command:
# isi dedupe inline settings modify --mode disabled
3. To pause deduplication, enter the following command: # isi dedupe inline settings modify --mode paused
Remove deduplication
You can remove the effects of inline deduplication. Disabling inline deduplication does not remove its effects. To reverse deduplication, you must manually undeduplicate the affected data by running the undedupe job on each affected path.
NOTE: Running the undedupe job marks every file with the no_dedupe attribute. The no_dedupe attribute prevents all future deduplication, whether inline deduplication or through SmartDedupe. You must have the ISI_PRIV_JOB_ENGINE privilege to run isi job start undedupe. 1. Log in to the cluster as a user with the administrator role. 2. Run the undedupe job on each affected path, similar to the following: # isi job start undedupe --paths <path> Where <path> is an absolute path within the /ifs file system.
Troubleshoot index allocation issues
This section describes what to do if index allocation fails.
Inline Data Deduplication 235

The in-memory index requires allocating chunks of physically contiguous memory. On a running system this may not be possible. Check the inline deduplication state:
isi_inline_dedupe_status The isi_inline_dedupe_status command reports whether the index failed to allocate or if its allocation is not optimal. If the index cannot be allocated, inline deduplication will attempt a non-optimal layout. If the non-optimal layout fails, inline deduplication will reduce the size of the index until it can be successfully allocated. Running isi_flush on the node may clear enough memory to allocate a full index.
236 Inline Data Deduplication

17
Data replication with SyncIQ
This section contains the following topics:
Topics:
· SyncIQ data replication overview · Replication policies and jobs · Replication snapshots · Data failover and failback with SyncIQ · Recovery times and objectives for SyncIQ · Replication policy priority · SyncIQ license functionality · Replication for nodes with multiple interfaces · Restrict SyncIQ source nodes · Creating replication policies · Managing replication to remote clusters · Initiating data failover and failback with SyncIQ · Performing disaster recovery for older SmartLock directories · Managing replication policies · Managing replication to the local cluster · Managing replication performance rules · Managing replication reports · Managing failed replication jobs · Restrict SyncIQ to use the interfaces in the IP address pool · Modify the Restrict Target Network value
SyncIQ data replication overview
OneFS enables you to replicate data from one PowerScale cluster to another through the SyncIQ software module. Activate a SyncIQ license on both PowerScale clusters before you can replicate data between them. You can replicate data at the directory level while optionally excluding specific files and subdirectories from being replicated. SyncIQ creates and references snapshots to replicate a consistent point-in-time image of a source directory. Metadata, such as access control lists (ACL) and alternate data streams (ADS), are replicated along with data. SyncIQ enables you to maintain a consistent replica of your data on another PowerScale cluster and to control the frequency of data replication. For example, you could configure SyncIQ to back up data from your primary cluster to a secondary cluster once a day at 10 PM. Depending on the size of your dataset, the first replication operation could take considerable time. After that, however, replication operations would complete more quickly. SyncIQ also offers automated failover and failback capabilities so that you can continue operations on the secondary PowerScale cluster should your primary cluster become unavailable.
Replication policies and jobs
Data replication is coordinated according to replication policies and replication jobs. Replication policies specify what data is replicated, where the data is replicated to, and how often the data is replicated. Replication jobs are the operations that replicate data from one PowerScale cluster to another. SyncIQ generates replication jobs according to replication policies. A replication policy specifies two clusters: the source and the target. The cluster on which the replication policy exists is the source cluster. The cluster that data is being replicated to is the target cluster. When a replication policy starts, SyncIQ generates a replication job for the policy. When a replication job runs, files from a directory tree on the source cluster are replicated to a directory tree on the target cluster; these directory trees are known as source and target directories.
Data replication with SyncIQ 237

After the first replication job created by a replication policy finishes, the target directory and all files contained in the target directory are set to a read-only state, and can be modified only by other replication jobs belonging to the same replication policy. We recommend that you do not create more than 1,000 policies on a cluster.
NOTE: To prevent permissions errors, make sure that ACL policy settings are the same across source and target clusters.
You can create two types of replication policies: synchronization policies and copy policies. A synchronization policy maintains an exact replica of the source directory on the target cluster. If a file or sub-directory is deleted from the source directory, the file or directory is deleted from the target cluster when the policy is run again.
You can use synchronization policies to fail over and fail back data between source and target clusters. When a source cluster becomes unavailable, you can fail over data on a target cluster and make the data available to clients. When the source cluster becomes available again, you can fail back the data to the source cluster.
A copy policy maintains recent versions of the files that are stored on the source cluster. However, files that are deleted on the source cluster are not deleted from the target cluster. Failback is not supported for copy policies. Copy policies are most commonly used for archival purposes.
Copy policies enable you to remove files from the source cluster without losing those files on the target cluster. Deleting files on the source cluster improves performance on the source cluster while maintaining the deleted files on the target cluster. This can be useful if, for example, your source cluster is being used for production purposes and your target cluster is being used only for archiving.
After creating a job for a replication policy, SyncIQ must wait until the job completes before it can create another job for the policy. Any number of replication jobs can exist on a cluster at a given time; however, no more than 50 replication jobs can run on a source cluster at the same time. If more than 50 replication jobs exist on a cluster, the first 50 jobs run while the others are queued to run.
There is no limit to the number of replication jobs that a target cluster can support concurrently. However, because more replication jobs require more cluster resources, replication will slow down as more concurrent jobs are added.
When a replication job runs, OneFS generates workers on the source and target cluster. Workers on the source cluster read and send data while workers on the target cluster receive and write data.
You can replicate any number of files and directories with a single replication job. You can prevent a large replication job from overwhelming the system by limiting the amount of cluster resources and network bandwidth that data synchronization is allowed to consume. Because each node in a cluster is able to send and receive data, the speed at which data is replicated increases for larger clusters.
SmartLock considerations for SyncIQ
There are restrictions for using SyncIQ policies with SmartLock directories. These restrictions apply to all SmartLock directories: compliance and enterprise.
The restrictions are:  There can be only one SmartLock directory for each SyncIQ policy.  The SmartLock directory and the SyncIQ policy directory must be configured at the same root directory level. For example, if the SyncIQ policy root directory is /ifs/data, you create the SmartLock directory in that same root directory: /ifs/data. Do not create SmartLock subdirectories under /ifs/data.
Automated replication policies
You can manually start a replication policy at any time. You can also configure replication policies to start automatically based on source directory modifications or schedules.
You can configure a replication policy to run according to a schedule so that you can control when replication is performed. You can also configure policies to replicate the data captured in snapshots of a directory. You can also configure a replication policy to start when SyncIQ detects a modification to the source directory. Such a policy allows SyncIQ to maintain a more current version of your data on the target cluster.
Scheduling a policy can be useful under the following conditions:
 You want to replicate data when user activity is minimal.  You can accurately predict when modifications are made to the data. A scheduled policy can be reconfigured so that it does not run if there are no changes to the source directory between jobs. However, if changes are made to the parent directory of the source directory or a sibling directory of the source directory, and
238 Data replication with SyncIQ

then a snapshot of the parent directory is taken, SyncIQ creates a job for the policy even if no changes have been made to the source directory. If you monitor the cluster through the File System Analytics (FSA) feature of InsightIQ, the FSA job creates snapshots of /ifs. This process might cause a replication job to start whenever the FSA job is run. Replicating data that is contained in snapshots of a directory can be useful under the following conditions:  You want to replicate data according to a schedule, and you are already generating snapshots of the source directory
through a snapshot schedule.  You want to maintain identical snapshots on both the source and target cluster.  You want to replicate existing snapshots to the target cluster.
To enable archival snapshots on the target cluster. This setting can only be enabled when the policy is created.
If a policy is configured to replicate snapshots, you can configure SyncIQ to replicate only snapshots that match a specified naming pattern. Configuring a policy to start when changes are made to the source directory can be useful under the following conditions:  You want to retain an up-to-date copy of your data always.  You are expecting many changes at unpredictable intervals. For policies that are configured to start whenever changes are made to the source directory, SyncIQ checks the source directories every ten seconds. SyncIQ checks all files and directories underneath the source directory, regardless of whether those files or directories are excluded from replication. Consequently, SyncIQ might occasionally run a replication job unnecessarily. For example, assume that newPolicy replicates /ifs/data/media but excludes /ifs/data/media/temp. If a modification is made to /ifs/data/media/temp/file.txt, SyncIQ runs newPolicy even though /ifs/data/media/ temp/file.txt is not replicated. If a policy is configured to start whenever changes are made to the source directory and a replication job fails, SyncIQ waits one minute before attempting to run the policy again. SyncIQ increases this delay exponentially for each failure up to a maximum of eight hours. You can override the delay by running the policy manually at any time. After a job for the policy completes successfully, SyncIQ will resume checking the source directory every ten seconds. If a policy is configured to start whenever changes are made to the source directory, you can configure SyncIQ to wait a specified period after the source directory is modified before starting a job.
NOTE: To avoid frequent synchronization of minimal sets of changes and overtaxing system resources, you should not configure continuous replication when the source directory is highly active. It is better to configure continuous replication with a change-triggered delay of several hours to consolidate groups of changes.
Source and target cluster association
SyncIQ associates a replication policy with a target cluster by marking the target cluster when the job runs for the first time. Even if you modify the name or IP address of the target cluster, the mark persists on the target cluster. When a replication policy is run, SyncIQ checks the mark to ensure that data is being replicated to the correct location. On the target cluster, you can manually break an association between a replication policy and target directory. Breaking the association between a source and target cluster causes the mark on the target cluster to be deleted. You might want to manually break a target association if an association is obsolete. If you break the association of a policy, the policy is disabled on the source cluster and you cannot run the policy. If you want to run the disabled policy again, you must reset the replication policy. Breaking a policy association causes either a full replication or differential replication to occur the next time you run the replication policy. During a full or differential replication, SyncIQ creates a new association between the source and target clusters. Depending on the amount of data being replicated, a full or differential replication can take a very long time to complete.
CAUTION: Changes to the configuration of the target cluster outside of SyncIQ can introduce an error condition that effectively breaks the association between the source and target cluster. For example, changing the DNS record of the target cluster could cause this problem. If you need to make significant configuration changes to the target cluster outside of SyncIQ, make sure that your SyncIQ policies can still connect to the target cluster.
Data replication with SyncIQ 239

Configuring SyncIQ source and target clusters with NAT

Source and target clusters can use NAT (network address translation) for SyncIQ failover and failback purposes, but must be configured appropriately.
In this scenario, source and target clusters are typically at different physical locations, use private, non-routable address space, and do not have direct connections to the Internet. Each cluster typically is assigned a range of private IP addresses. For example, a cluster with 12 nodes might be assigned IP addresses 192.168.10.11 to 192.168.10.22.
To communicate over the public Internet, source and target clusters must have all incoming and outgoing data packets appropriately translated and redirected by a NAT-enabled firewall or router.
CAUTION: SyncIQ data is not encrypted by default. Running SyncIQ jobs over the public Internet provides no protection against data theft.
SyncIQ enables you to limit replication jobs to particular nodes within your cluster. For example, if your cluster was made up of 12 nodes, you could limit replication jobs to just three of those nodes. For NAT support, you would must establish a one-for-one association between the source and target clusters. So, if you are limiting replication jobs to three nodes on your source cluster, you must associate three nodes on your target cluster.
In this instance, you would need to configure static NAT, sometimes referred to as inbound mapping. On both the source and target clusters, for the private address assigned to each node, you would associate a static NAT address. For example:

Source cluster Node name source-1 source-2 source-3

Private address 192.168.10.11 192.168.10.12 192.168.10.13

NAT address 10.8.8.201 10.8.8.202 10.8.8.203

Target Cluster Node name target-1 target-2 target-3

Private address 192.168.55.101 192.168.55.102 192.168.55.103

NAT address 10.1.2.11 10.1.2.12 10.1.2.13

To configure static NAT, you would must edit the /etc/local/hosts file on all six nodes, and associate them with their counterparts by adding the appropriate NAT address and node name. For example, in the /etc/local/hosts file on the three nodes of the source cluster, the entries would look like:
10.1.2.11 target-1
10.1.2.12 target-2
10.1.2.13 target-3
Similarly, on the three nodes of the target cluster, you would edit the /etc/local/hosts file, and insert the NAT address and name of the associated node on the source cluster. For example, on the three nodes of the target cluster, the entries would look like:
10.8.8.201 source-1
10.8.8.202 source-2
10.8.8.203 source-3
When the NAT server receives packets of SyncIQ data from a node on the source cluster, the NAT server replaces the packet headers and the node's port number and internal IP address with the NAT server's own port number and external IP address. The NAT server on the source network then sends the packets through the Internet to the target network, where another NAT server performs a similar process to transmit the data to the target node. The process is reversed when the data fails back. With this type of configuration, SyncIQ can determine the correct addresses to connect with, so that SyncIQ can send and receive data. In this scenario, no SmartConnect zone configuration is required. For information about the ports used by SyncIQ, see the OneFS Security Configuration Guide for your OneFS version.

Full and differential replication
If a replication policy encounters an issue that cannot be fixed (for example, if the association was broken on the target cluster), you might need to reset the replication policy. If you reset a replication policy, SyncIQ performs either a full replication or a differential replication the next time the policy is run. You can specify the type of replication that SyncIQ performs.

240 Data replication with SyncIQ

During a full replication, SyncIQ transfers all data from the source cluster regardless of what data exists on the target cluster. A full replication consumes large amounts of network bandwidth and can take a very long time to complete. However, a full replication is less strenuous on CPU usage than a differential replication. During a differential replication, SyncIQ first checks whether a file already exists on the target cluster and then transfers only data that does not already exist on the target cluster. A differential replication consumes less network bandwidth than a full replication; however, differential replications consume more CPU. Differential replication can be much faster than a full replication if there is an adequate amount of available CPU for the replication job to consume.
Controlling replication job resource consumption
You can create rules that limit the network traffic created by replication jobs, the rate at which files are sent by replication jobs, the percent of CPU used by replication jobs, and the number of workers created for replication jobs. If you limit the percentage of total workers that SyncIQ can create, the limit is applied to the total amount of workers that SyncIQ could create, which is determined by cluster hardware. Workers on the source cluster read and send data while workers on the target cluster receive and write data.
NOTE: File-operation rules might not work accurately for files that can take more than a second to transfer and for files that are not predictably similar in size.
Replication policy priority
When creating a replication policy, you can configure a policy to have priority over other jobs. If multiple replication jobs are queued to be run because the maximum number of jobs are already running, jobs created by policies with priority will be run before jobs without priorities. For example, assume that 50 jobs are currently running. A job without priority is the created and queued to run; next, a job with priority is created and queued to run. The job with priority will run next, even though the job without priority has been queued for a longer period of time. SyncIQ will also pause replication jobs without priority to allow jobs with priority to run. For example, assume that 50 jobs are already running, and one of them does not have priority. If a replication job with priority is created, SyncIQ will pause the replication job without priority and run the job with priority.
Replication reports
After a replication job completes, SyncIQ generates a replication report that contains detailed information about the job, including how long the job ran, how much data was transferred, and what errors occurred. If a replication report is interrupted, SyncIQ might create a subreport about the progress of the job so far. If the job is then restarted, SyncIQ creates another subreport about the progress of the job until the job either completes or is interrupted again. SyncIQ creates a subreport each time the job is interrupted until the job completes successfully. If multiple subreports are created for a job, SyncIQ combines the information from the subreports into a single report. SyncIQ routinely deletes replication reports. You can specify the maximum number of replication reports that SyncIQ retains and the length of time that SyncIQ retains replication reports. If the maximum number of replication reports is exceeded on a cluster, SyncIQ deletes the oldest report each time a new report is created. You cannot customize the content of a replication report.
NOTE: If you delete a replication policy, SyncIQ automatically deletes any reports that were generated for that policy.
Data replication with SyncIQ 241

Replication snapshots
SyncIQ generates snapshots to facilitate replication, failover, and failback between PowerScale clusters. Snapshots generated by SyncIQ can also be used for archival purposes on the target cluster.
Source cluster snapshots
SyncIQ generates snapshots on the source cluster to ensure that a consistent point-in-time image is replicated and that unaltered data is not sent to the target cluster. Before running a replication job, SyncIQ creates a snapshot of the source directory. SyncIQ then replicates data according to the snapshot rather than the current state of the cluster, allowing users to modify source directory files while ensuring that an exact point-in-time image of the source directory is replicated. For example, if a replication job of /ifs/data/dir/ starts at 1:00 PM and finishes at 1:20 PM, and /ifs/data/dir/file is modified at 1:10 PM, the modifications are not reflected on the target cluster, even if /ifs/data/dir/file is not replicated until 1:15 PM. You can replicate data according to a snapshot generated with the SnapshotIQ software module. If you replicate data according to a SnapshotIQ snapshot, SyncIQ does not generate another snapshot of the source directory. This method can be useful if you want to replicate identical copies of data to multiple PowerScale clusters. SyncIQ generates source snapshots to ensure that replication jobs do not transfer unmodified data. When a job is created for a replication policy, SyncIQ checks whether it is the first job created for the policy. If it is not the first job created for the policy, SyncIQ compares the snapshot generated for the earlier job with the snapshot generated for the new job. SyncIQ replicates only data that has changed since the last time a snapshot was generated for the replication policy. When a replication job is completed, SyncIQ deletes the previous source-cluster snapshot and retains the most recent snapshot until the next job is run.
Target cluster snapshots
When a replication job is run, SyncIQ generates a snapshot on the target cluster to facilitate failover operations. When the next replication job is created for the replication policy, the job creates a new snapshot and deletes the old one. If a SnapshotIQ license has been activated on the target cluster, you can configure a replication policy to generate additional snapshots that remain on the target cluster even as subsequent replication jobs run. SyncIQ generates target snapshots to facilitate failover on the target cluster regardless of whether a SnapshotIQ license has been configured on the target cluster. Failover snapshots are generated when a replication job completes. SyncIQ retains only one failover snapshot per replication policy, and deletes the old snapshot after the new snapshot is created. If a SnapshotIQ license has been activated on the target cluster, you can configure SyncIQ to generate archival snapshots on the target cluster that are not automatically deleted when subsequent replication jobs run. Archival snapshots contain the same data as the snapshots that are generated for failover purposes. However, you can configure how long archival snapshots are retained on the target cluster. You can access archival snapshots the same way that you access other snapshots generated on a cluster.
Data failover and failback with SyncIQ
SyncIQ enables you to perform automated data failover and failback operations between PowerScale clusters. If your primary cluster goes offline, you can fail over to a secondary PowerScale cluster, enabling clients to continue accessing their data. If the primary cluster becomes operational again, you can fail back to the primary cluster. For the purposes of SyncIQ failover and failback, the cluster originally accessed by clients is referred to as the primary cluster. The cluster that client data is replicated to is referred to as the secondary cluster. Failover is the process that allows clients to access, view, modify, and delete data on a secondary cluster. Failback is the process that allows clients to resume their workflow on the primary cluster. During failback, any changes made to data on the secondary cluster are copied back to the primary cluster by means of a replication job using a mirror policy. Failover and failback can be useful in disaster recovery scenarios. For example, if a primary cluster is damaged by a natural disaster, you can migrate clients to a secondary cluster where they can continue normal operations. When the primary cluster is repaired and back online, you can migrate clients back to operations on the primary cluster.
242 Data replication with SyncIQ

You can fail over and fail back to facilitate scheduled cluster maintenance, as well. For example, if you are upgrading the primary cluster, you might want to migrate clients to a secondary cluster until the upgrade is complete and then migrate clients back to the primary cluster.
NOTE: Data failover and failback is supported both for enterprise and compliance SmartLock directories. Compliance SmartLock directories adhere to U.S. Securities and Exchange Commission (SEC) regulation 17a-4(f), which requires securities brokers and dealers to preserve records in a non-rewritable, non-erasable format. SyncIQ properly maintains compliance with the 17a-4(f) regulation during failover and failback.
Data failover
Failover is the process of preparing data on a secondary cluster and switching over to the secondary cluster for normal client operations. After you fail over to a secondary cluster, you can direct clients to access, view, and modify their data on the secondary cluster. Before failover is performed, you must create and run a SyncIQ replication policy on the primary cluster. You initiate the failover process on the secondary cluster. To migrate data from the primary cluster that is spread across multiple replication policies, you must initiate failover for each replication policy. If the action of a replication policy is set to copy, any file that was deleted on the primary cluster will still be present on the secondary cluster. When the client connects to the secondary cluster, all files that were deleted on the primary cluster will be available. If you initiate failover for a replication policy while an associated replication job is running, the failover operation completes but the replication job fails. Because data might be in an inconsistent state, SyncIQ uses the snapshot generated by the last successful replication job to revert data on the secondary cluster to the last recovery point. If a disaster occurs on the primary cluster, any modifications to data that were made after the last successful replication job started are not reflected on the secondary cluster. When a client connects to the secondary cluster, their data appears as it was when the last successful replication job was started.
Data failback
Failback is the process of restoring primary and secondary clusters to the roles that they occupied before a failover operation. After failback is complete, the primary cluster holds the latest data set and resumes normal operations, including hosting clients and replicating data to the secondary cluster through SyncIQ replication policies in place. The first step in the failback process is updating the primary cluster with all of the modifications that were made to the data on the secondary cluster. The next step is preparing the primary cluster to be accessed by clients. The final step is resuming data replication from the primary to the secondary cluster. At the end of the failback process, you can redirect users to resume data access on the primary cluster. To update the primary cluster with the modifications that were made on the secondary cluster, SyncIQ must create a SyncIQ domain for the source directory. You can fail back data with any replication policy that meets all of the following criteria:  The policy has been failed over.  The policy is a synchronization policy (not a copy policy).  The policy does not exclude any files or directories from replication.
SmartLock compliance mode failover and failback
Using OneFS 8.0.1 and later releases, you can replicate SmartLock compliance mode domains to a target cluster. This support includes failover and failback of these SmartLock domains. Because SmartLock compliance mode adheres to the U.S. Securities and Exchange Commission (SEC) regulation 17a-4(f), failover and failback of a compliance mode WORM domain requires some planning and setup. Most importantly, both your primary (source) and secondary (target) clusters must be configured at initial setup as compliance mode clusters. This process is described in the PowerScale installation guide for your node model (for example, the Generation 6 Installation Guide). Both clusters must have directories defined as WORM domains with the compliance type. For example, if you are storing your WORM files in the SmartLock compliance domain /ifs/financial-records/locked on the primary cluster, you
Data replication with SyncIQ 243

must have a SmartLock compliance domain on the target cluster to fail over to. Although the source and target SmartLock compliance domains can have the same pathname, this is not required.
The source WORM domain can have a retention policy set. However, SyncIQ requires that there is no retention policy set on the target WORM domain.
You must also start the compliance clock on both clusters.
SyncIQ handles conflicts during failover/failback operations on a SmartLock compliance mode domain by unlinking committed files from the user store and leaving a link of the file in the compliance store. The ComplianceStoreDelete job automatically tracks and removes expired files from the compliance store if they were put there as a result of SyncIQ conflict resolution. The job runs automatically once per month or when started manually. For information about how to start the ComplianceStoreDelete job, see Run the ComplianceStoreDelete job in a Smartlock compliance mode domain on page 254.

SmartLock replication limitations

Be aware of the limitations of replicating and failing back SmartLock directories with SyncIQ.
If the source directory or target directory of a SyncIQ policy is a SmartLock directory, replication and failback might not be allowed. For more information, see the following table:

Source directory type Target directory type

Non-SmartLock

Non-SmartLock

Non-SmartLock

SmartLock enterprise

Non-SmartLock SmartLock enterprise

SmartLock compliance Non-SmartLock

SmartLock enterprise SmartLock enterprise

SmartLock enterprise SmartLock compliance SmartLock compliance SmartLock compliance

SmartLock compliance Non-SmartLock SmartLock enterprise SmartLock compliance

Replication Allowed Failback allowed

Yes

Yes

Yes

Yes, unless files are committed to a

WORM state on the target cluster

No

No

Yes; however, retention Yes; however, the files do not have

dates and commit

WORM status

status of files are lost.

Yes

Yes; any newly committed WORM

files are included

No

No

No

No

No

No

Yes

Yes; any newly committed WORM

files are included

If you are replicating a SmartLock directory to another SmartLock directory, you must create the target SmartLock directory prior to running the replication policy. Although OneFS creates a target directory automatically if a target directory does not already exist, OneFS does not create a target SmartLock directory automatically. If you attempt to replicate an enterprise directory before the target directory has been created, OneFS creates a non-SmartLock target directory and the replication job succeeds. If you replicate a compliance directory before the target directory has been created, the replication job fails.
If you replicate SmartLock directories to another PowerScale cluster with SyncIQ, the WORM state of files is replicated. However, SmartLock directory configuration settings are not transferred to the target directory.
For example, if you replicate a directory that contains a committed file that is set to expire on March 4th, the file is still set to expire on March 4th on the target cluster. However, if the directory on the source cluster is set to prevent files from being committed for more than a year, the target directory is not automatically set to the same restriction.
In the scenario where a WORM exclusion domain has been created on an enterprise mode or compliance mode directory, replication of the SmartLock exclusion on the directory occurs only if the SyncIQ policy is rooted at the SmartLock domain that contains the exclusion. If this condition is not met, only data is replicated, and the SmartLock exclusion is not created on the target directory.

244 Data replication with SyncIQ

Recovery times and objectives for SyncIQ
The Recovery Point Objective (RPO) and the Recovery Time Objective (RTO) are measurements of the impacts that a disaster can have on business operations. You can calculate your RPO and RTO for a disaster recovery with replication policies. RPO is the maximum amount of time for which data is lost if a cluster suddenly becomes unavailable. For a PowerScale cluster, the RPO is the amount of time that has passed since the last completed replication job started. The RPO is never greater than the time it takes for two consecutive replication jobs to run and complete. If a disaster occurs while a replication job is running, the data on the secondary cluster is reverted to the state it was in when the last replication job completed. For example, consider an environment in which a replication policy is scheduled to run every three hours, and replication jobs take two hours to complete. If a disaster occurs an hour after a replication job begins, the RPO is four hours, because it has been four hours since a completed job began replicating data. RTO is the maximum amount of time required to make backup data available to clients after a disaster. The RTO is always less than or approximately equal to the RPO, depending on the rate at which replication jobs are created for a given policy. If replication jobs run continuously, meaning that another replication job is created for the policy before the previous replication job completes, the RTO is approximately equal to the RPO. When the secondary cluster is failed over, the data on the cluster is reset to the state it was in when the last job completed; resetting the data takes an amount of time proportional to the time it took users to modify the data. If replication jobs run on an interval, meaning that there is a period of time after a replication job completes before the next replication job for the policy starts, the relationship between RTO and RPO depends on whether a replication job is running when the disaster occurs. If a job is in progress when a disaster occurs, the RTO is roughly equal to the RPO. However, if a job is not running when a disaster occurs, the RTO is negligible because the secondary cluster was not modified since the last replication job ran, and the failover process is almost instantaneous.
RPO Alerts
You can configure SyncIQ to create OneFS events that alert you to the fact that a specified Recovery Point Objective (RPO) has been exceeded. You can view these events through the same interface as other OneFS events. The events have an event ID of 400040020. The event message for these alerts follows the following format:
SW_SIQ_RPO_EXCEEDED: SyncIQ RPO exceeded for policy <replication_policy>
For example, assume you set an RPO of 5 hours; a job starts at 1:00 PM and completes at 3:00 PM; a second job starts at 3:30 PM; if the second job does not complete by 6:00 PM, SyncIQ will create a OneFS event.
Replication policy priority
When creating a replication policy, you can configure a policy to have priority over other jobs. If multiple replication jobs are queued to be run because the maximum number of jobs are already running, jobs created by policies with priority will be run before jobs without priorities. For example, assume that 50 jobs are currently running. A job without priority is the created and queued to run; next, a job with priority is created and queued to run. The job with priority will run next, even though the job without priority has been queued for a longer period of time. SyncIQ will also pause replication jobs without priority to allow jobs with priority to run. For example, assume that 50 jobs are already running, and one of them does not have priority. If a replication job with priority is created, SyncIQ will pause the replication job without priority and run the job with priority.
SyncIQ license functionality
You can replicate data to another PowerScale cluster only if you activate a SyncIQ license on both the local cluster and the target cluster. If a SyncIQ license becomes inactive, you cannot create, run, or manage replication policies. Also, all previously created replication policies are disabled. Replication policies that target the local cluster are also disabled. However, data that was previously replicated to the local cluster is still available.
Data replication with SyncIQ 245

Replication for nodes with multiple interfaces
You can force replication and restrict target network settings to use only when specified pools or interfaces are selected. If you create a policy and specify a particular SmartConnect IP pool, SyncIQ traffic is restricted to the nodes that participate in that pool. It does not restrict the network ports that the pool uses. This situation can result in SyncIQ traffic using the correct nodes, but the wrong ports. You can force the policy to use only the ports in a specified pool for both source and target clusters using the command-line interface.
Restrict SyncIQ source nodes
You can restrict SyncIQ to use the interfaces in the specified IP address pool using the command-line interface. SyncIQ uses the front-end network ports of a node to send replication data from the source to the target cluster. By default, SyncIQ policies use all nodes and interfaces to allow for maximum throughput of a given policy. However, you may want to exclude certain nodes from a SyncIQ policy. Excluding nodes from a SyncIQ policy is beneficial for larger clusters where data replication jobs can be assigned to certain nodes. By selecting a predefined IP address pool, you can restrict replication processing to specific nodes on the source cluster. This option is useful to ensure that replication jobs are not competing with other applications for specific node resources. Specifying the IP address pool allows you to define which networks are used for replication data transfer. By default, SyncIQ uses all interfaces in the nodes that belong to the IP address pool, disregarding any interface membership settings in the pool. You must use the force replication and restrict target network options together to tie traffic from the source to the correct interface.
NOTE: SyncIQ only allows source node restrictions on subnets and pools from the default groupnet.
Creating replication policies
You can create replication policies that determine when data is replicated with SyncIQ.
Excluding directories in replication
You can exclude directories from being replicated by replication policies even if the directories exist under the specified source directory.
NOTE: Failback is not supported for replication policies that exclude directories.
By default, all files and directories under the source directory of a replication policy are replicated to the target cluster. However, you can prevent directories under the source directory from being replicated. If you specify a directory to exclude, files and directories under the excluded directory are not replicated to the target cluster. If you specify a directory to include, only the files and directories under the included directory are replicated to the target cluster; any directories that are not contained in an included directory are excluded. If you both include and exclude directories, any excluded directories must be contained in one of the included directories; otherwise, the excluded-directory setting has no effect. For example, consider a policy with the following settings:  The root directory is /ifs/data  The included directories are /ifs/data/media/music and /ifs/data/media/movies  The excluded directories are /ifs/data/archive and /ifs/data/media/music/working In this example, the setting that excludes the /ifs/data/archive directory has no effect because the /ifs/data/ archive directory is not under either of the included directories. The /ifs/data/archive directory is not replicated regardless of whether the directory is explicitly excluded. However, the setting that excludes the /ifs/data/media/music/ working directory does have an effect, because the directory would be replicated if the setting was not specified.
246 Data replication with SyncIQ

In addition, if you exclude a directory that contains the source directory, the exclude-directory setting has no effect. For example, if the root directory of a policy is /ifs/data, explicitly excluding the /ifs directory does not prevent /ifs/data from being replicated.
Any directories that you explicitly include or exclude must be contained in or under the specified root directory. For example, consider a policy in which the specified root directory is /ifs/data. In this example, you could include both the /ifs/data/ media and the /ifs/data/users/ directories because they are under /ifs/data.
Excluding directories from a synchronization policy does not cause the directories to be deleted on the target cluster. For example, consider a replication policy that synchronizes /ifs/data on the source cluster to /ifs/data on the target cluster. If the policy excludes /ifs/data/media from replication, and /ifs/data/media/file exists on the target cluster, running the policy does not cause /ifs/data/media/file to be deleted from the target cluster.

Excluding files in replication
If you do not want specific files to be replicated by a replication policy, you can exclude them from the replication process through file-matching criteria statements. You can configure file-matching criteria statements during the replication-policy creation process.
NOTE: You cannot fail back replication policies that exclude files.
A file-criteria statement can include one or more elements. Each file-criteria element contains a file attribute, a comparison operator, and a comparison value. You can combine multiple criteria elements in a criteria statement with Boolean "AND" and "OR" operators. You can configure any number of file-criteria definitions.
Configuring file-criteria statements can cause the associated jobs to run slowly. It is recommended that you specify file-criteria statements in a replication policy only if necessary.
Modifying a file-criteria statement will cause a full replication to occur the next time that a replication policy is started. Depending on the amount of data being replicated, a full replication can take a very long time to complete.
For synchronization policies, if you modify the comparison operators or comparison values of a file attribute, and a file no longer matches the specified file-matching criteria, the file is deleted from the target the next time the job is run. This rule does not apply to copy policies.

File criteria options

You can configure a replication policy to exclude files that meet or do not meet specific criteria. You can specify file criteria based on the following file attributes:

Date created

Includes or excludes files based on when the file was created. This option is available for copy policies only.
You can specify a relative date and time, such as "two weeks ago", or specific date and time, such as "January 1, 2012." Time settings are based on a 24-hour clock.

Date accessed

Includes or excludes files based on when the file was last accessed. This option is available for copy policies only, and only if the global access-time-tracking option of the cluster is enabled.
You can specify a relative date and time, such as "two weeks ago", or specific date and time, such as "January 1, 2012." Time settings are based on a 24-hour clock.

Date modified

Includes or excludes files based on when the file was last modified. This option is available for copy policies only.
You can specify a relative date and time, such as "two weeks ago", or specific date and time, such as "January 1, 2012." Time settings are based on a 24-hour clock.

File name

Includes or excludes files based on the file name. You can specify to include or exclude full or partial names that contain specific text.
The following wildcard characters are accepted:
NOTE: Alternatively, you can filter file names by using POSIX regular-expression (regex) text. PowerScale clusters support IEEE Std 1003.2 (POSIX.2) regular expressions. For more information about POSIX regular expressions, see the BSD man pages.

Data replication with SyncIQ 247

Path Size Type

Table 6. Replication file matching wildcards

Wildcard character *

Description Matches any string in place of the asterisk. For example, m* matches movies and m123.

[ ]

Matches any characters contained in the

brackets, or a range of characters separated by a

dash.

For example, b[aei]t matches bat, bet, and bit.

For example, 1[4-7]2 matches 142, 152, 162, and 172.

You can exclude characters within brackets by following the first bracket with an exclamation mark.

For example, b[!ie] matches bat but not bit or bet.

You can match a bracket within a bracket if it is either the first or last character.

For example, [[c]at matches cat and [at.

You can match a dash within a bracket if it is either the first or last character.

For example, car[-s] matches cars and car-.

?

Matches any character in place of the question

mark.

For example, t?p matches tap, tip, and top.

Includes or excludes files based on the file path. This option is available for copy policies only. You can specify to include or exclude full or partial paths that contain specified text. You can also include the wildcard characters *, ?, and [ ].
Includes or excludes files based on their size. NOTE: File sizes are represented in multiples of 1024, not 1000.
Includes or excludes files based on one of the following file-system object types:  Soft link  Regular file  Directory

Configure default replication policy settings
You can configure default settings for replication policies. If you do not modify these settings when creating a replication policy, the specified default settings are applied. Run the isi sync settings modify command. The following command configures SyncIQ to delete replication reports that are older than 2 years:
isi sync settings modify --report-max-age 2Y

248 Data replication with SyncIQ

Create a replication policy
You can create a replication policy with SyncIQ that defines how and when data is replicated to another PowerScale cluster. A replication policy specifies the target cluster, source and target directories, and directories and files to be excluded during replication.
CAUTION: In a SyncIQ replication policy, OneFS enables you to specify a source directory that is a target directory, or is contained within a target directory, from a different replication policy. Referred to as cascading replication, this use case is specifically for backup purposes, and should be configured carefully. OneFS does not allow failback in such cases.
If you modify any of the following policy settings after a policy is run, OneFS performs either a full or differential replication the next time the policy is run.  Source directory  Included or excluded directories  File-criteria statement  Target cluster name or address
This applies only if you modify a replication policy to specify a different target cluster. If you modify the IP or domain name of a target cluster, and then modify the replication policy on the source cluster to match the new IP or domain name, a full replication is not performed. Note also that SyncIQ does not support dynamically allocated IP address pools. If a replication job connects to a dynamically allocated IP address, SmartConnect might reassign the address while a replication job is running, which would cause the job to fail.
 Target directory NOTE: If you create a replication policy for a SmartLock directory, the SyncIQ and SmartLock domains must be configured at the same root directory level. A SmartLock directory cannot be nested inside a SyncIQ directory.
Run the isi sync policies create command. The following command creates a policy that replicates the directory /ifs/data/source on the source cluster to /ifs/ data/target on target cluster 10.1.99.36 every week. The command also creates archival snapshots on the target cluster:
isi sync policies create mypolicy sync /ifs/data/source 10.1.99.36 /ifs/data/target --schedule "Every Sunday at 12:00 AM" --target-snapshot-archive on --target-snapshot-expiration 1Y --target-snapshot-pattern "%{PolicyName}-%{SrcCluster}-%Y-%m-%d
Create a SyncIQ domain
You can create a SyncIQ domain to increase the speed at which failback is performed for a replication policy. Because you can fail back only synchronization policies, it is not necessary to create SyncIQ domains for copy policies. Failing back a replication policy requires that a SyncIQ domain be created for the source directory. OneFS automatically creates a SyncIQ domain during the failback process. However, if you intend on failing back a replication policy, it is recommended that you create a SyncIQ domain for the source directory of the replication policy while the directory is empty. Creating a domain for a directory that contains less data takes less time. Run the isi job jobs start command. The following command creates a SyncIQ domain for /ifs/data/source:
isi job jobs start DomainMark --root /ifs/data/media \ --dm-type SyncIQ
Assess a replication policy
Before running a replication policy for the first time, you can view statistics on the files that would be affected by the replication without transferring any files. This can be useful if you want to preview the size of the data set that will be transferred if you run the policy. You can assess only replication policies that have never been run before. 1. Run the isi sync jobs start command with the --test option.
Data replication with SyncIQ 249

The following command creates a report about how much data will be transferred when a sync job named weeklySync is run:
isi sync jobs start weeklySync --test 2. To view the assessment report, run the isi sync reports view command.
The following command displays the assessment report for weeklySync: isi sync reports view weeklySync 1
Managing replication to remote clusters
You can manually run, view, assess, pause, resume, cancel, resolve, and reset replication jobs that target other clusters. After a policy job starts, you can pause the job to suspend replication activities. Afterwards, you can resume the job, continuing replication from the point where the job was interrupted. You can also cancel a running or paused replication job if you want to free the cluster resources allocated for the job. A paused job reserves cluster resources whether or not the resources are in use. A cancelled job releases its cluster resources and allows another replication job to consume those resources. No more than five running and paused replication jobs can exist on a cluster at a time. However, an unlimited number of canceled replication jobs can exist on a cluster. If a replication job remains paused for more than a week, SyncIQ automatically cancels the job.
Start a replication job
You can manually start a replication job for a replication policy at any time. You can also replicate data according to a snapshot created by SnapshotIQ. You cannot replicate data according to a snapshot generated by SyncIQ. Run the isi sync jobs start command. The following command starts weeklySync:
isi sync jobs start weeklySync The following command replicates the source directory of weeklySync according to the snapshot HourlyBackup_07-15-2013_23:00:
isi sync jobs start weeklySync \ --source-snapshot HourlyBackup_07-15-2013_23:00
Pause a replication job
You can pause a running replication job and then resume the job later. Pausing a replication job temporarily stops data from being replicated, but does not free the cluster resources replicating the data. Run the isi sync jobs pause command. The following command pauses weeklySync:
isi sync jobs pause weeklySync
Resume a replication job
You can resume a paused replication job. Run the isi sync jobs resume command. The following command resumes weeklySync:
isi sync jobs resume weeklySync
250 Data replication with SyncIQ

Cancel a replication job
You can cancel a running or paused replication job. Cancelling a replication job stops data from being replicated and frees the cluster resources that were replicating data. You cannot resume a cancelled replication job; to restart replication, you must start the replication policy again. Run the isi sync jobs cancel command. The following command cancels weeklySync:
isi sync jobs cancel weeklySync

View active replication jobs
You can view information about replication jobs that are currently running or paused. 1. View all active replication jobs by running the following command:
isi sync jobs list
2. To view detailed information about a specific replication job, run the isi sync jobs view command. The following command displays detailed information about a replication job created by weeklySync:
isi sync jobs view weeklySync
The system displays output similar to the following example:
Policy Name: weeklySync ID: 3
State: running Action: run Duration: 5s Start Time: 2013-07-16T23:12:00

Restrict SyncIQ to use the interfaces in the IP address pool
You can restrict SyncIQ to use only the interfaces in the IP address pool. To restrict SyncIQ to use only the interfaces in the IP address pool, modify the SyncIQ policy:
isi sync policies modify --policy <my_policy> --force_interface=on The policy is modified, and SyncIQ now uses only the interfaces in the IP address pool.
Modify the Restrict Target Network value
You can set the value to determine whether replication jobs connect only to nodes in a given SmartConnect zone. To modify the Restrict Target Network value, modify the SyncIQ policy:
isi sync policies modify <policy-name> --restrict-targetnetwork=true

Replication job information

You can view information about replication jobs. The following information is displayed in the output of the isi snapshot settings view command:

Policy Name ID

The name of the associated replication policy. The ID of the replication job.

Data replication with SyncIQ 251

State Action

The status of the job. The type of replication policy.

Initiating data failover and failback with SyncIQ
You can fail over from one PowerScale cluster to another if, for example, your primary cluster becomes unavailable. You can fail back when the primary cluster becomes available again. You can revert failover if you decide that the failover was unnecessary or if you failed over for testing purposes.
NOTE: Data failover and failback are supported for both compliance SmartLock directories and enterprise SmartLock directories. Compliance SmartLock directories can be created only on clusters that have been set up as compliance mode clusters during initial configuration. You cannot rename folders after creation in compliance mode.

Fail over data to a secondary cluster
You can fail over to a secondary PowerScale cluster if, for example, your primary cluster becomes unavailable. You must have created and successfully run a replication policy on the primary cluster. This action replicated data to the secondary cluster.
NOTE: Data failover is supported both for compliance and enterprise SmartLock directories. SmartLock compliance directories require their own replication policies. Such directories cannot be nested inside non-compliance directories and replicated as part of an overall policy.
Complete the following procedure for each replication policy that you want to fail over. 1. If your primary cluster is still online, complete the following steps:
a. Stop all writes to the replication policy's path, including both local and client activity. This action ensures that new data is not written to the policy path as you prepare for failover to the secondary cluster.
b. Modify the replication policy so that it is set to run only manually. This action prevents the policy on the primary cluster from automatically running a replication job. If the policy on the primary cluster runs a replication job while writes are allowed to the target directory, the job fails and the replication policy is deactivated. If this happens, modify the policy so that it is set to run only manually, resolve the policy, and complete the failback process. After you complete the failback process, you can modify the policy to run according to a schedule again. The following command ensures that the policy weeklySync runs only manually:
isi sync policies modify weeklySync --schedule ""
2. On the secondary cluster, run the isi sync recovery allow-write command. The following command enables replicated directories and files specified in the weeklySync policy to be writable:
isi sync recovery allow-write weeklySync
NOTE: SmartLock compliance mode WORM files, although replicated, are stored in a non-writable, non-erasable format. 3. Direct your users to the secondary cluster for data access and normal operations.
Revert a failover operation
Failover reversion undoes a failover operation on a secondary cluster, enabling you to replicate data from the primary cluster to the secondary cluster again. Failover reversion is useful if the primary cluster becomes available before data is modified on the secondary cluster or if you failed over to a secondary cluster for testing purposes. Fail over a replication policy. Reverting a failover operation does not migrate modified data back to the primary cluster. To migrate data that clients have modified on the secondary cluster, you must fail back to the primary cluster.
NOTE:

252 Data replication with SyncIQ

Failover reversion is not supported for SmartLock directories. Complete the following procedure for each replication policy that you want to fail over. Run the isi sync recovery allow-write command with the --revert option. For example, the following command reverts a failover operation for newPolicy:
isi sync recovery allow-write newPolicy --revert
Fail back data to a primary cluster
After you fail over to a secondary cluster, you can fail back to the primary cluster. Before you can fail back to the primary cluster, you must already have failed over to the secondary cluster. Also, you must ensure that your primary cluster is back online. 1. Create mirror policies on the secondary cluster by running the isi sync recovery resync-prep command on the
primary cluster. The following command creates a mirror policy for weeklySync:
isi sync recovery resync-prep weeklySync
SyncIQ names mirror policies according to the following pattern:
<replication-policy-name>_mirror
2. Before beginning the failback process, prevent clients from accessing the secondary cluster. This action ensures that SyncIQ fails back the latest data set, including all changes that users made to data on the secondary cluster while the primary cluster was out of service. We recommend that you wait until clients are inactive before preventing access to the secondary cluster.
3. On the secondary cluster, run the isi sync jobs start command to run the mirror policy and replicate data to the primary cluster. The following command runs a mirror policy named weeklySync_mirror immediately:
isi sync jobs start weeklySync_mirror
Alternatively, you can modify the mirror policy to run on a particular schedule. The following command schedules a mirror policy named weeklySync_mirror to run daily at 12:01 AM:
isi sync policies modify weeklySync_mirror --enabled yes --schedule "every day at 12:01 AM"
If specifying a schedule for the mirror policy, you need only allow the mirror policy to run once at the scheduled time. After that, you should set the mirror policy back to a manual schedule. 4. On the primary cluster, allow writes to the target directories of the mirror policy by running the isi sync recovery allow-write command. The following command allows writes to the directories specified in the weeklySync_mirror policy:
isi sync recovery allow-write weeklySync_mirror 5. On the secondary cluster, complete the failback process by running the isi sync recovery resync-prep command
for the mirror policy. The following command completes the failback process for weeklySync_mirror by placing the secondary cluster back into read-only mode and ensuring that the data sets are consistent on both the primary and secondary clusters. :
isi sync recovery resync-prep weeklySync_mirror
Direct clients back to the primary cluster for normal operations. Although not required, it is safe to remove a mirror policy after failback has completed successfully.
Data replication with SyncIQ 253

Run the ComplianceStoreDelete job in a Smartlock compliance mode domain
SyncIQ handles conflicts during failover/failback operations on a SmartLock compliance mode domain by unlinking committed files from the user store and leaving a link of the file in the compliance store. The ComplianceStoreDelete job automatically tracks and removes expired files from the compliance store if they were put there as a result of SyncIQ conflict resolution. For example, you perform a SyncIQ failover or failback on a SmartLock compliance mode domain. The operation results in a committed file being reverted to an uncommitted state. For conflict resolution, a copy of the committed file is stored in the compliance store. The committed file in the compliance store eventually expires. The ComplianceStoreDelete job runs automatically once a month and deletes the expired file. Expired files that are in use (referenced from outside of compliance store) will not be deleted. The ComplianceStoreDelete job runs automatically once per month or when started manually. You can run the job manually from the CLI. Run the isi job jobs start ComplianceStoreDelete command.
Performing disaster recovery for older SmartLock directories
If you replicated a SmartLock compliance directory to a secondary cluster running OneFS 7.2.1 or earlier, you cannot fail back the SmartLock compliance directory to a primary cluster running OneFS 8.0.1 or later. However, you can recover the SmartLock compliance directory stored on the secondary cluster, and migrate it back to the primary cluster.
NOTE: Data failover and failback with earlier versions of OneFS are supported for SmartLock enterprise directories.
Recover SmartLock compliance directories on a target cluster
You can recover compliance SmartLock directories that you have replicated to a secondary cluster running OneFS 7.2.1 or earlier versions. Complete the following procedure for each SmartLock directory that you want to recover. 1. On the secondary cluster, enable writes to the SmartLock directories that you want to recover.
 If the last replication job completed successfully and a replication job is not currently running, run the isi sync recovery allow-write command on the secondary cluster. For example, the following command enables writes to the target directory of SmartLockSync:
isi sync recovery allow-write SmartLockSync  If a replication job is currently running, wait until the replication job completes, and then run the isi sync recovery
allow-write command.  If the primary cluster became unavailable while a replication job was running, run the isi sync target break
command. Note that you should only break the association if the primary cluster has been taken offline permanently. For example, the following command breaks the association for the target directory of SmartLockSync:
isi sync target break SmartLockSync
2. If you ran isi sync target break, restore any files that are left in an inconsistent state. a. Delete all files that are not committed to a WORM state from the target directory. b. Copy all files from the failover snapshot to the target directory. Failover snapshots are named according to the following naming pattern:
SIQ-Failover-<policy-name>-<year>-<month>-<day>_<hour>-<minute>-<second>
Snapshots are located under the hidden directory /ifs/.snapshot. 3. If any SmartLock directory configuration settings, such as an autocommit time period, were specified for the source
directory of the replication policy, apply those settings to the target directory.
254 Data replication with SyncIQ

Because autocommit information is not transferred to the target cluster, files that were scheduled to be committed to a WORM state on the source cluster would not be scheduled to be committed at the same time on the target cluster. To ensure that all files are retained for the appropriate time period, you can commit all files in target SmartLock directories to a WORM state. For example, the following command automatically commits all files in /ifs/data/smartlock to a WORM state after one minute.
isi worm domains modify --domain /ifs/data/smartlock --autocommit-offset 1m
Migrate SmartLock compliance directories
You can migrate SmartLock compliance directories from a recovery cluster, either by replicating the directories back to the original source cluster, or to a new cluster. Migration is necessary only when the recovery cluster is running OneFS 7.2.1 or earlier. These OneFS versions do not support failover and failback of SmartLock compliance directories. 1. On the recovery cluster, create a replication policy for each SmartLock compliance directory that you want to migrate to
another cluster (the original primary cluster or a new cluster). The policies must meet the following requirements:  The source directory on the recovery cluster is the SmartLock compliance directory that you are migrating.  The target directory is an empty SmartLock compliance directory on the cluster to which the data is to be migrated. The
source and target directories must both be SmartLock compliance directories. 2. Replicate recovery data to the target directory by running the policies that you created.
You can replicate data either by manually starting the policy or by specifying a policy schedule. 3. Optional: To ensure that SmartLock protection is enforced for all files, commit all migrated files in the SmartLock target
directory to a WORM state. Because autocommit information is not transferred from the recovery cluster, commit all migrated files in target SmartLock directories to a WORM state. For example, the following command automatically commits all files in /ifs/data/smartlock to a WORM state after one minute:
isi worm domains modify --domain /ifs/data/smartlock \ --autocommit-offset 1m
This step is necessary only if you have not configured an autocommit time period for the SmartLock directory. 4. On the target cluster, enable writes to the replication target directories by running the isi sync recovery allow-
writes command. For example, the following command enables writes to the SmartLockSync target directory:
isi sync recovery allow-writes SmartLockSync
5. If any SmartLock directory configuration settings, such as an autocommit time period, were specified for the source directories of the replication policies, apply those settings to the target directories.
6. Delete the copy of the SmartLock data on the recovery cluster. You cannot recover the space consumed by the source SmartLock directories until all files are released from a WORM state. If you want to free the space before files are released from a WORM state, contact PowerScale Technical Support for information about reformatting your recovery cluster.
Managing replication policies
You can modify, view, enable, and disable replication policies.
Modify a replication policy
You can modify the settings of a replication policy. If you modify any of the following policy settings after the policy runs, OneFS performs either a full or differential replication the next time the policy runs:
Data replication with SyncIQ 255

 Source directory  Included or excluded directories  File-criteria statement  Target cluster
This applies only if you target a different cluster. If you modify the IP or domain name of a target cluster, and then modify the replication policy on the source cluster to match the new IP or domain name, a full replication is not performed.  Target directory Run the isi sync policies modify command. Assuming that weeklySync has been reset and has not been run since it was reset, the following command causes a differential replication to be performed the next time weeklySync is run:
isi sync policies modify weeklySync --target-compare-initial-sync=true
Delete a replication policy
You can delete a replication policy. Once a policy is deleted, SyncIQ no longer creates replication jobs for the policy. Deleting a replication policy breaks the target association on the target cluster, and allows writes to the target directory. If you want to temporarily suspend a replication policy from creating replication jobs, you can disable the policy, and then enable the policy again later. Run the isi sync policies delete command. The following command deletes weeklySync from the source cluster and breaks the target association on the target cluster:
isi sync policies delete weeklySync
NOTE: The operation will not succeed until SyncIQ can communicate with the target cluster; until then, the policy will still appear in the output of the isi sync policies list command. After the connection between the source cluster and target cluster is reestablished, SyncIQ will delete the policy the next time that the job is scheduled to run; if the policy is configured to run only manually, you must manually run the policy again. If SyncIQ is permanently unable to communicate with the target cluster, run the isi sync policies delete command with the --local-only option. This will delete the policy from the local cluster only and not break the target association on the target cluster.
Enable or disable a replication policy
You can temporarily suspend a replication policy from creating replication jobs, and then enable it again later. NOTE: If you disable a replication policy while an associated replication job is running, the running replication job is not interrupted. However, the policy will not create another job until the policy is enabled.
Run either the isi sync policies enable or the isi sync policies disable command. The following command enables weeklySync:
isi sync policies enable weeklySync
The following command disables weeklySync:
isi sync policies disable weeklySync
View replication policies
You can view information about replication policies. 1. View information about all replication policies by running the following command:
isi sync policies list
256 Data replication with SyncIQ

2. Optional: To view detailed information about a specific replication policy, run the isi sync policies view command.
The following command displays detailed information about weeklySync:
isi sync policies view weeklySync
The system displays output similar to the following example:
ID: dd16d277ff995a78e9affbba6f6e2919 Name: weeklySync Path: /ifs/data/archive Action: sync Enabled: No Target: localhost Description: Check Integrity: Yes Source Include Directories: Source Exclude Directories: Source Subnet: Source Pool: Source Match Criteria: Target Path: /ifs/data/sometarget Target Snapshot Archive: No Target Snapshot Pattern: SIQ-%{SrcCluster}-%{PolicyName}-%Y-%m-%d_%H-%M-%S Target Snapshot Expiration: Never Target Snapshot Alias: SIQ-%{SrcCluster}-%{PolicyName}-latest Target Detect Modifications: Yes Source Snapshot Archive: No Source Snapshot Pattern: Source Snapshot Expiration: Never Schedule: Manually scheduled Log Level: notice Log Removed Files: No Workers Per Node: 3 Report Max Age: 2Y Report Max Count: 2000 Force Interface: No Restrict Target Network: No Target Compare Initial Sync: No Disable Stf: No Disable Fofb: No Resolve: Last Job State: finished Last Started: 2013-07-17T15:39:49 Last Success: 2013-07-17T15:39:49 Password Set: No Conflicted: No Has Sync State: Yes

Replication policy information

You can view information about replication policies through the output of the isi sync policies list command.

Name Path Action Enabled Target

The name of the policy. The path of the source directory on the source cluster. The type of replication policy. Whether the policy is enabled or disabled. The IP address or fully qualified domain name of the target cluster.

Data replication with SyncIQ 257

Managing replication to the local cluster
You can interrupt replication jobs that target the local cluster. You can cancel a currently running job that targets the local cluster, or you can break the association between a policy and its specified target. Breaking a source and target cluster association causes SyncIQ to perform a full replication the next time the policy is run.
Cancel replication to the local cluster
You can cancel a replication job that is targeting the local cluster. Run the isi sync target cancel command.  To cancel a job, specify a replication policy. For example, the following command cancels a replication job created according
to weeklySync:
isi sync target cancel weeklySync  To cancel all jobs targeting the local cluster, run the following command:
isi sync target cancel --all
Break local target association
You can break the association between a replication policy and the local cluster. Breaking this association requires you to reset the replication policy before you can run the policy again.
NOTE: After a replication policy is reset, SyncIQ performs a full or differential replication the next time the policy is run. Depending on the amount of data being replicated, a full or differential replication can take a very long time to complete. Run the isi sync target break command. The following command breaks the association between weeklySync and the local cluster:
isi sync target break weeklySync
View replication policies targeting the local cluster
You can view information about replication policies that are currently replicating data to the local cluster. 1. View information about all replication policies that are currently targeting the local cluster by running the following command:
isi sync target list
2. To view detailed information about a specific replication policy, run the isi sync target view command. The following command displays detailed information about weeklySync:
isi sync target view weeklySync
The system displays output similar to the following example:
Name: weeklySync Source: cluster Target Path: /ifs/data/sometarget Last Job State: finished FOFB State: writes_disabled Source Cluster GUID: 000c295159ae74fcde517c1b85adc03daff9 Last Source Coordinator IP: 127.0.0.1 Legacy Policy: No Last Update: 2013-07-17T15:39:51
258 Data replication with SyncIQ

Remote replication policy information

You can view information about replication policies that are currently targeting the local cluster through the output of the isi sync target list command.

Name Source Target Path Last Job State FOFB State

The name of the replication policy. The name of the source cluster. The path of the target directory on the target cluster. The state of the most recent replication job for the policy. The failover-failback state of the target directory.

Managing replication performance rules
You can manage the impact of replication on cluster performance by creating rules that limit the network traffic that is created and the rate at which files replication jobs send files.
NOTE: The SyncIQ performance rules apply only to SyncIQ processes.

Create a network traffic rule
You can create a network traffic rule that limits the amount of network traffic that replication policies are allowed to generate during a specified time period. Run the isi sync rules create command. The following command creates a network traffic rule that limits bandwidth consumption to 100 kilobits per second from 9:00 AM to 5:00 PM every weekday:
isi sync rules create bandwidth 9:00-17:00 M-F 100

Create a file operations rule
You can create a file-operations rule that limits the number of files that replication jobs can send per second. Run the isi sync rules create command. The following command creates a file-operations rule that limits the file-send rate to 3 files per second from 9:00 AM to 5:00 PM every weekday: :
isi sync rules create file_count 9:00-17:00 M-F 3

Modify a performance rule
You can modify a performance rule. 1. Optional: To identify the ID of the performance rule you want to modify, run the following command:
isi sync rules list 2. Modify a performance rule by running the isi sync rules modify command.
The following command causes a performance rule with an ID of bw-0 to be enforced only on Saturday and Sunday:
isi sync rules modify bw-0 --days X,S

Data replication with SyncIQ 259

Delete a performance rule
You can delete a performance rule. 1. Optional: To identify the ID of the performance rule you want to modify, run the following command:
isi sync rules list 2. Delete a performance rule by running the isi sync rules delete command.
The following command deletes a performance rule with an ID of bw-0: isi sync rules delete bw-0
Enable or disable a performance rule
You can disable a performance rule to temporarily prevent the rule from being enforced. You can also enable a performance rule after it has been disabled. 1. Optional: To identify the ID of the performance rule you want to enable or disable, run the following command:
isi sync rules list
2. Run the isi sync rules modify command. The following command enables a performance rule with an ID of bw-0: isi sync rules modify bw-0 --enabled true The following command disables a performance rule with an ID of bw-0: isi sync rules modify bw-0 --enabled false
View performance rules
You can view performance rules. 1. View information about all performance rules by running the following command:
isi sync rules list
2. Optional: To view detailed information about a specific performance rule, run the isi sync rules view command. The following command displays detailed information about a performance rule with an ID of bw-0: isi sync rules view --id bw-0 The system displays output similar to the following example: ID: bw-0 Enabled: Yes Type: bandwidth Limit: 100 kbps Days: Sun,Sat Schedule Begin : 09:00 End : 17:00 Description: Bandwidth rule for weekdays
260 Data replication with SyncIQ

Managing replication reports
In addition to viewing replication reports, you can configure how long reports are retained on the cluster. You can also delete any reports that have passed their expiration period.
Configure default replication report settings
You can configure the default amount of time that SyncIQ retains replication reports for. You can also configure the maximum number of reports that SyncIQ retains for each replication policy. Run the isi sync settings modify command. The following command causes OneFS to delete replication reports that are older than 2 years:
isi sync settings modify --report-max-age 2Y
Delete replication reports
Replication reports are routinely deleted by SyncIQ after the expiration date for the reports has passed. SyncIQ also deletes reports after the number of reports exceeds the specified limit. Excess reports are periodically deleted by SyncIQ; however, you can manually delete all excess replication reports at any time. This procedure is available only through the command-line interface (CLI). 1. Open a secure shell (SSH) connection to any node in the cluster, and log in. 2. Delete excess replication reports by running the following command:
isi sync reports rotate
View replication reports
You can view replication reports and subreports. 1. View a list of all replication reports by running the following command:
isi sync reports list 2. View a replication report by running the isi sync reports view command.
The following command displays a replication report for weeklySync:
isi sync reports view weeklySync 2 3. Optional: To view a list of subreports for a report, run the isi sync reports subreports list command.
The following command displays subreports for weeklySync:
isi sync reports subreports list weeklySync 1 4. Optional: To view a subreport, run the isi sync reports subreports view command.
The following command displays a subreport for weeklySync:
isi sync reports subreports view weeklySync 1 2
The system displays output similar to the following example:
Policy Name: weeklySync Job ID: 1
Subreport ID: 2 Start Time: 2013-07-17T21:59:10 End Time: 2013-07-17T21:59:15 Action: run State: finished Policy ID: a358db8b248bf432c71543e0f02df64e Sync Type: initial
Data replication with SyncIQ 261

Duration: 5s Errors: -
Source Directories Visited: 0 Source Directories Deleted: 0 Target Directories Deleted: 0 Source Directories Created: 0 Target Directories Created: 0
Source Directories Linked: 0 Target Directories Linked: 0 Source Directories Unlinked: 0 Target Directories Unlinked: 0
Num Retransmitted Files: 0 Retransmitted Files: Total Files: 0 Files New: 0
Source Files Deleted: 0 Files Changed: 0
Target Files Deleted: 0 Up To Date Files Skipped: 0 User Conflict Files Skipped: 0
Error Io Files Skipped: 0 Error Net Files Skipped: 0 Error Checksum Files Skipped: 0
Bytes Transferred: 245 Total Network Bytes: 245
Total Data Bytes: 20 File Data Bytes: 20
Sparse Data Bytes: 0 Target Snapshots: SIQ-Failover-newPol123-2013-07-17_21-59-15, newPol123-
Archive-cluster-17 Total Phases: 2 Phases Phase : STF_PHASE_IDMAP_SEND Start Time : 2013-07-17T21:59:11 End Time : 2013-07-17T21:59:13

Replication report information

You can view information about replication jobs through the Reports table.

Policy Name

The name of the associated policy for the job. You can view or edit settings for the policy by clicking the policy name.

Status

Displays the status of the job. The following job statuses are possible: Running
The job is currently running without error.

Paused

The job has been temporarily paused.

Finished

The job completed successfully.

Failed

The job failed to complete.

Started

Indicates when the job started.

Ended Duration Transferred

Indicates when the job ended. Indicates how long the job took to complete.
The total number of files that were transferred during the job run, and the total size of all transferred files. For assessed policies, Assessment appears.

Source Directory The path of the source directory on the source cluster.

Target Host

The IP address or fully qualified domain name of the target cluster.

262 Data replication with SyncIQ

Action

Displays any report-related actions that you can perform.

Managing failed replication jobs
If a replication job fails due to an error, SyncIQ might disable the corresponding replication policy. For example SyncIQ might disable a replication policy if the IP or hostname of the target cluster is modified. If a replication policy is disabled, the policy cannot be run. To resume replication for a disabled policy, you must either fix the error that caused the policy to be disabled, or reset the replication policy. It is recommended that you attempt to fix the issue rather than reset the policy. If you believe you have fixed the error, you can return the replication policy to an enabled state by resolving the policy. You can then run the policy again to test whether the issue was fixed. If you are unable to fix the issue, you can reset the replication policy. However, resetting the policy causes a full or differential replication to be performed the next time the policy is run.
NOTE: Depending on the amount of data being synchronized or copied, full and differential replications can take a very long time to complete.
Resolve a replication policy
If SyncIQ disables a replication policy due to a replication error, and you fix the issue that caused the error, you can resolve the replication policy. Resolving a replication policy enables you to run the policy again. If you cannot resolve the issue that caused the error, you can reset the replication policy. Run the isi sync policies resolve command. The following command resolves weeklySync:
isi sync policies resolve weeklySync

Reset a replication policy
If a replication job encounters an error that you cannot resolve, you can reset the corresponding replication policy. Resetting a policy causes OneFS to perform a full or differential replication the next time the policy is run. Resetting a replication policy deletes the source-cluster snapshot.
NOTE: Depending on the amount of data being replicated, a full or differential replication can take a very long time to complete. Reset a replication policy only if you cannot fix the issue that caused the replication error. If you fix the issue that caused the error, resolve the policy instead of resetting the policy.
Run the isi sync policies reset command. The following command resets weeklySync:
isi sync policies reset weeklySync

Perform a full or differential replication
After you reset a replication policy, you must perform either a full or differential replication. You can do this replication only from the CLI. Reset a replication policy.
NOTE: Files smaller than 32 KB are sent directly to the target cluster even if they are the same. Only files 32 KB and larger are compared to the target to determine if they should be sent.
1. Open a secure shell (SSH) connection to any node in the cluster and log in through the root or compliance administrator account.
2. Specify the type of replication you want to perform by running the isi sync policies modify command.  To perform a full replication, disable the --target-compare-initial-sync option.

Data replication with SyncIQ 263

For example, the following command disables differential synchronization for newPolicy: isi sync policies modify newPolicy \ --target-compare-initial-sync false
 To perform a differential replication, enable the --target-compare-initial-sync option. For example, the following command enables differential synchronization for newPolicy: isi sync policies modify newPolicy \ --target-compare-initial-sync true
3. Run the policy by running the isi sync jobs start command. For example, the following command runs newPolicy: isi sync jobs start newPolicy
Restrict SyncIQ to use the interfaces in the IP address pool
You can restrict SyncIQ to use only the interfaces in the IP address pool. To restrict SyncIQ to use only the interfaces in the IP address pool, modify the SyncIQ policy:
isi sync policies modify --policy <my_policy> --force_interface=on The policy is modified, and SyncIQ now uses only the interfaces in the IP address pool.
Modify the Restrict Target Network value
You can set the value to determine whether replication jobs connect only to nodes in a given SmartConnect zone. To modify the Restrict Target Network value, modify the SyncIQ policy:
isi sync policies modify <policy-name> --restrict-targetnetwork=true
264 Data replication with SyncIQ

18
Data Encryption with SyncIQ
This section contains the following topics:
Topics:
· SyncIQ data encryption overview · SyncIQ traffic encryption · Per-policy throttling overview · Troubleshooting SyncIQ encryption
SyncIQ data encryption overview
OneFS now enables you to encrypt SyncIQ data from one PowerScale cluster to another. You can use the integrated capabilities of SyncIQ to encrypt the data during transfer between PowerScale clusters and protect the data in flight during intercluster replications. SyncIQ policies now support end-to-end encryption for cross-cluster communications. You can manage certificates with the help of the new SyncIQ store. SyncIQ provides encryption by using X.509 certificates that are paired with TLS version 1.2 and OpenSSL version 1.0.2o. The certificates are stored and managed in the certificate stores of the source and target clusters . Encryption between clusters takes place with each cluster storing its own certificate and the certificate of its peer. The source cluster is required to store the certificate of the target cluster, and conversely. Storing the certificate of the peer essentially creates a white list of approved clusters for data replication. Certification revocation is supported through an external Online Certificate Status Protocol (OCSP) responder.
NOTE: Both the source and target cluster must be upgraded and committed to OneFS 8.2.x, before enabling SyncIQ encryption. PowerScale clusters may now require that all incoming and outgoing SyncIQ policies be encrypted through a simple change in the SyncIQ Global Settings.
SyncIQ traffic encryption
SyncIQ data that is transmitted between the source and target clusters is encrypted. SyncIQ provides additional protection from man-in-the-middle attacks and prevents unauthorized source or target relationships. The standard certificate configuration for SyncIQ policy encryption requires six files:  SourceClusterCert.pem - A single end-entity certificate that identifies the Source cluster  SourceClusterKey.pem - The associated private key file that goes with the Source cluster identity cert  SourceClusterCA.pem - A self-signed root CA file that issued the Source cluster identity cert  TargetClusterCert.pem - A single end-entity certificate that identifies the Target cluster  TargetClusterKey.pem - The associated private key file that goes with the Target cluster identity cert  TargetClusterCA.pem - A self-signed root CA file that issued the Target cluster identity cert (may be the same as 3) Because SyncIQ encryption requires mutual authentication SSL handshakes, each cluster must specify its own identity certificate and the CA certificate of the peer.
Data Encryption with SyncIQ 265

Configure certificates
You can configure certificates for SyncIQ policy encryption. 1. On the Source cluster, install the identity certificate and private key pair to the server certificate store.
isi sync cert server import SourceClusterCert.pem SourceClusterKey.pem --setcertificate-key-password
--name myClusterCertID You will be prompted to enter the certificate key password, and then to confirm that password. 2. On the Source cluster, set the newly installed ID from the server store as your SyncIQ cluster certificate. The full ID of the certificate is displayed when the -v option is used to the server store list command.
isi sync cert server list -v isi sync setting mod ­cluster-certificate-id=<fullID>
3. On the Source cluster, install the Source cluster CA to the global cluster CA store. This CA was used to issue TargetClusterCert.pem. isi cert auth import TargetClusterCA.pem --name SyncIQTargetCA
4. On the Source cluster, add the Target's certificate to the whitelist peer certificate store. isi sync cert peer import TargetClusterCert.pem --name SyncIQTargetClusterCert
NOTE: This step requires the end-entity certificate for each SyncIQ peer be shared with the peer. This action is not an SSL requirement. It is an implementation specific requirement to add a whitelist layer of security to SyncIQ encryption policies. The associated private key for peer certificates should not be shared when exchanging end-entity certificates with peers. 5. On the Target cluster, install the identity certificate and private key pair to the server certificate store. isi sync cert server import TargetClusterCert.pem TargetClusterKey.pem --certificatekey-password <string> --name myClusterCertID 6. On the Target cluster, set the newly installed ID from the server store as your SyncIQ cluster certificate. The full ID of the certificate is displayed when the -v option is used to the server store list command. isi sync cert server list -v isi sync setting mod ­cluster-certificate-id=<fullID> 7. On the Target cluster, install the Source cluster CA to the global cluster CA store. This CA that was used to issue SourceClusterCert.pem. isi cert auth import SourceClusterCA.pem --name SyncIQSourceCA 8. On the Target cluster, add the Source's certificate to the whitelist peer certificate store. isi sync cert peer import SourceClusterCert.pem --name SyncIQSourceClusterCert
NOTE: This step requires the end-entity certificate for each SyncIQ peer be shared with the peer. This action is not an SSL requirement. It is an implementation specific requirement to add a whitelist layer of security to SyncIQ encryption policies. The associated private key for peer certificates should not be shared when exchanging end-entity certificates with peers.
266 Data Encryption with SyncIQ

Create encrypted SyncIQ policies
You can create encrypted SyncIQ policies. 1. To enable encryption, associate the target certificate ID with the policy. The nominated certificate is used as the whitelist
check during the sync job. You can see the full ID for a certificate when the -v option is used in the certificate list command. isi sync cert peer list -v isi sync pol create foo sync /ifs/syncDir <targetIP> /ifs/syncDir ­target-
certificate-id=<targetFullID> 2. Optionally, force all SyncIQ policies to require encryption.
isi sync setting mod --encryption-required=True
Per-policy throttling overview
OneFS now enables you to set per-policy throttling rules. Existing versions of OneFS enables you to configure global bandwidth throttling rules that are applied evenly across running policies. Now, you can set bandwidth reservations per policy, instead of the global level.
Create a bandwidth rule
You can create a bandwidth rule for SyncIQ and configure policy-level reservation. 1. License SyncIQ and create one or more policies. 2. Create a bandwidth rule for SyncIQ.
isi sync rules create bandwidth --limit=1000 00:00-23:59 M-F 3. Configure policy-level reservation.
isi sync policies modify test --bandwidth-reservation=500
Troubleshooting SyncIQ encryption
If you are unable to configure the SyncIQ encryption, check the report of the SyncIQ policy in question and follow the troubleshooting tips given below to fix the issue.  If the failure is due to a Transport Layer Security (TLS) authentication failure, you can find the error message from TLS
library in the report.  If it is a TLS authentication failure, detailed information can be found at /var/log/messages on the source and target
clusters. The detailed information includes:  The ID of the certificate that caused the failure  The subject name of the certificate that caused the failure  The depth at which the failure occurred in the certification chain  The error code and the reason for the failure
Data Encryption with SyncIQ 267

19

Data Compression

This section contains the following topics:
Topics:
· Data compression · Data compression settings and monitoring · Enable or disable data compression · View compression statistics

Data compression

OneFS supports inline data compression on Isilon F810 and H5600 nodes, and on PowerScale F200 and F600 nodes.
The F810 node contains a Network Interface Card (NIC) that compresses and decompresses data.
Hardware compression and decompression are performed in parallel across the 40Gb Ethernet interfaces of supported nodes as clients read and write data to the cluster. This distributed interface model allows compression to scale linearly across the node pool as supported nodes are added to a cluster.
You can enable inline data compression on a cluster that:  Contains F810, H5600, F200, or F600 node pools  Offers a 40Gb Ethernet back-end network
The following table lists the nodes and OneFS release combinations that support inline data compression.

Nodes F810 H5600 F200 and F600

Required OneFS releases 8.1.3 or 8.2.1 and later 8.2.2 and later 9.0.0.0 and later

Mixed Clusters
In a mixed cluster environment, data is stored in a compressed form on F810, H5600, F200, and F600 node pools. Data that is written or tiered to storage pools of other node types is uncompressed when it moves between pools.
Data compression settings and monitoring
From the OneFS command line, you can enable and disable inline data compression on an Isilon or PowerScale cluster. You can also view statistics that are related to compression activity and efficiency across the cluster. Data compression is available only with node pools of Isilon F810 and H5600 nodes and PowerScale F200 and F600 nodes.
Enable or disable data compression
You can turn data compression on or off from the OneFS command line. This procedure is available only through the OneFS command-line interface (CLI). The following are the settings for data compression configuration:  Enabled: yes

268 Data Compression

 Enabled: no The default setting is Enabled: yes.
NOTE: This compression setting only applies to data stored on Isilon F810 and H5600 node pools, and on PowerScale F200 and F600 node pools. Data written to any other node types ignore this setting and are not compressed. If a cluster does not contain a supported node pool, this setting is ignored.
NOTE: When you enable compression, OneFS does not go back and compress the data that was written while compression was disabled.
1. To view the current compression setting, run the following command: isi compression settings view The system displays output similar to the following example:
Enabled: Yes
2. If compression is enabled and you want to disable it, run the following command: isi compression settings modify --enabled=False
3. If compression is disabled and you want to enable it, run the following command: isi compression settings modify --enabled=True
4. After you adjust settings, confirm that the setting is correct. Run the following command: isi compression settings view

View compression statistics

You can view reports about data compression that include current and historic compression ratios, as well as logical and physical data block totals.
This procedure is available only through the OneFS command-line interface (CLI).
1. To view a report that contains recent writes and total cluster data reduction, run the following command: isi statistics data-reduction The system displays output similar to the following example:

Recent Writes Cluster Data Reduction

(5 mins)

--------------------- ------------- ----------------------

Logical data

1.07M

49.57M

Zero-removal saved

0

-

Deduplication saved

0

0

Compression saved

0

0

Preprotected physical

1.07M

49.57M

Protection overhead

2.14M

99.14M

Protected physical

3.21M

160.61M

Zero removal ratio

1.00 : 1

-

Deduplication ratio

1.00 : 1

1.00 : 1

Compression ratio

1.00 : 1

1.00 : 1

Data reduction ratio

1.00 : 1

1.00 : 1

Efficiency ratio

0.33 : 1

0.31 : 1

--------------------- ------------- ----------------------

The Recent Writes column displays statistics for the previous five minutes. The Cluster Data Reduction column displays statistics for overall data efficiency across the entire cluster.
2. To view a report that contains statistics about compression ratios from the last five minutes, the percent of data that is not compressible, the total logical and physical data blocks that were processed, and writes where compression was not attempted, run the following command:
isi compression stats view

Data Compression 269

The system displays output similar to the following example: stats for 300 seconds at: 2021-02-25 19:20:36 (1614280836) compression ratio for compressed writes: 0.00 : 1 compression ratio for all writes: 1.00 : 1 incompressible data percent: 0.00% total logical blocks: 135 total physical blocks: 135 writes for which compression was not attempted: 100.00%
 If the incompressible data percentage is high, the data being written to the cluster might be a type that has already been compressed.
 If the number of writes for which compression was not attempted is high, you might be working with a cluster with multiple node types. If so, OneFS might be directing writes to a node pool that does not support data compression.
3. To view a report that contains the statistics that the isi compression stats view command provides, but also shows statistics from previous five minute intervals, run the following command: isi compression stats list The system displays output similar to the following example:
270 Data Compression

Data Compression 271

272 Data Compression

20
Data layout with FlexProtect
This section contains the following topics:
Topics:
· FlexProtect overview · File striping · Requested data protection · FlexProtect data recovery · Requesting data protection · Requested protection settings · Requested protection disk space usage
FlexProtect overview
A PowerScale cluster is designed to continuously serve data, even when one or more components simultaneously fail. OneFS ensures data availability by striping or mirroring data across the cluster. If a cluster component fails, data that is stored on the failed component is available on another component. After a component failure, lost data is restored on healthy components by the FlexProtect proprietary system. Data protection is specified at the file level, not the block level, enabling the system to recover data quickly. All data, metadata, and parity information is distributed across all nodes: the cluster does not require a dedicated parity node or drive. No single node limits the speed of the rebuild process.
File striping
OneFS uses a PowerScale cluster's internal network to distribute data automatically across individual nodes and disks in the cluster. OneFS protects files as the data is being written. No separate action is necessary to protect data. Before writing files to storage, OneFS breaks files into smaller logical chunks called stripes. The size of each file chunk is referred to as the stripe unit size. Each OneFS block is 8 KB, and a stripe unit consists of 16 blocks, for a total of 128 KB per stripe unit. During a write, OneFS breaks data into stripes and then logically places the data into a stripe unit. As OneFS writes data across the cluster, OneFS fills the stripe unit and protects the data according to the number of writable nodes and the specified protection policy. OneFS can continuously reallocate data and make storage space more usable and efficient. As the cluster size increases, OneFS stores large files more efficiently. To protect files that are 128KB or smaller, OneFS does not break these files into smaller logical chunks. Instead, OneFS uses mirroring with forward error correction (FEC). With mirroring, OneFS makes copies of each small file's data (N), adds an FEC parity chunk (M), and distributes multiple instances of the entire protection unit (N+M) across the cluster.
Requested data protection
The requested protection of data determines the amount of redundant data created on the cluster to ensure that data is protected against component failures. OneFS enables you to modify the requested protection in real time while clients are reading and writing data on the cluster. OneFS provides several data protection settings. You can modify these protection settings at any time without rebooting or taking the cluster or file system offline. When planning your storage solution, keep in mind that increasing the requested protection reduces write performance and requires additional storage space for the increased number of nodes.
Data layout with FlexProtect 273

OneFS uses the Reed Solomon algorithm for N+M protection. In the N+M data protection model, N represents the number of data-stripe units, and M represents the number of simultaneous node or drive failures--or a combination of node and drive failures--that the cluster can withstand without incurring data loss. N must be larger than M. In addition to N+M data protection, OneFS also supports data mirroring from 2x to 8x, allowing from two to eight mirrors of data. In terms of overall cluster performance and resource consumption, N+M protection is often more efficient than mirrored protection. However, because read and write performance is reduced for N+M protection, data mirroring might be faster for data that is updated often and is small in size. Data mirroring requires significant overhead and might not always be the best data-protection method. For example, if you enable 3x mirroring, the specified content is duplicated three times on the cluster; depending on the amount of content mirrored, this can consume a significant amount of storage space.
Related concepts Requesting data protection on page 275
Related references Requested protection settings on page 275 Requested protection disk space usage on page 276
FlexProtect data recovery
OneFS uses the FlexProtect proprietary system to detect and repair files and directories that are in a degraded state due to node or drive failures. OneFS protects data in the cluster based on the configured protection policy. OneFS rebuilds failed disks, uses free storage space across the entire cluster to further prevent data loss, monitors data, and migrates data off of at-risk components. OneFS distributes all data and error-correction information across the cluster and ensures that all data remains intact and accessible even in the event of simultaneous component failures. Under normal operating conditions, all data on the cluster is protected against one or more failures of a node or drive. However, if a node or drive fails, the cluster protection status is considered to be in a degraded state until the data is protected by OneFS again. OneFS reprotects data by rebuilding data in the free space of the cluster. While the protection status is in a degraded state, data is more vulnerable to data loss. Because data is rebuilt in the free space of the cluster, the cluster does not require a dedicated hot-spare node or drive in order to recover from a component failure. Because a certain amount of free space is required to rebuild data, it is recommended that you reserve adequate free space through the virtual hot spare feature. As you add more nodes, the cluster gains more CPU, memory, and disks to use during recovery operations. As a cluster grows larger, data restriping operations become faster.
Smartfail
OneFS protects data stored on failing nodes or drives through a process called smartfailing. During the smartfail process, OneFS places a device into quarantine. Data stored on quarantined devices is read only. While a device is quarantined, OneFS reprotects the data on the device by distributing the data to other devices. After all data migration is complete, OneFS logically removes the device from the cluster, the cluster logically changes its width to the new configuration, and the node or drive can be physically replaced. OneFS smartfails devices only as a last resort. Although you can manually smartfail nodes or drives, it is recommended that you first consult PowerScale Technical Support. Occasionally a device might fail before OneFS detects a problem. If a drive fails without being smartfailed, OneFS automatically starts rebuilding the data to available free space on the cluster. However, because a node might recover from a failure, if a node fails, OneFS does not start rebuilding data unless the node is logically removed from the cluster.
Node failures
Because node loss is often a temporary issue, OneFS does not automatically start reprotecting data when a node fails or goes offline. If a node reboots, the file system does not need to be rebuilt because it remains intact during the temporary failure. If you configure N+1 data protection on a cluster, and one node fails, all of the data is still accessible from every other node in the cluster. If the node comes back online, the node rejoins the cluster automatically without requiring a full rebuild.
274 Data layout with FlexProtect

To ensure that data remains protected, if you physically remove a node from the cluster, you must also logically remove the node from the cluster. After you logically remove a node, the node automatically reformats its own drives, and resets itself to the factory default settings. The reset occurs only after OneFS has confirmed that all data has been reprotected. You can logically remove a node using the smartfail process. It is important that you smartfail nodes only when you want to permanently remove a node from the cluster.
If you remove a failed node before adding a new node, data stored on the failed node must be rebuilt in the free space in the cluster. After the new node is added, OneFS distributes the data to the new node. It is more efficient to add a replacement node to the cluster before failing the old node because OneFS can immediately use the replacement node to rebuild the data stored on the failed node.

Requesting data protection
You can specify the protection of a file or directory by setting its requested protection. This flexibility enables you to protect distinct sets of data at higher than default levels. Requested protection of data is calculated by OneFS and set automatically on storage pools within your cluster. The default setting is referred to as suggested protection, and provides the optimal balance between data protection and storage efficiency. For example, a suggested protection of N+2:1 means that two drives or one node can fail without causing any data loss. For best results, we recommend that you accept at least the suggested protection for data on your cluster. You can always specify a higher protection level than suggested protection on critical files, directories, or node pools. OneFS allows you to request protection that the cluster is currently incapable of matching. If you request an unmatchable protection, the cluster will continue trying to match the requested protection until a match is possible. For example, in a four-node cluster, you might request a mirror protection of 5x. In this example, OneFS would mirror the data at 4x until you added a fifth node to the cluster, at which point OneFS would reprotect the data at 5x. If you set requested protection to a level below suggested protection, OneFS warns you of this condition.
NOTE:
For 4U Isilon IQ X-Series and NL-Series nodes, and IQ 12000X/EX 12000 combination platforms, the minimum cluster size of three nodes requires a minimum protection of N+2:1.
Related concepts Requested data protection on page 273

Requested protection settings

Requested protection settings determine the level of hardware failure that a cluster can recover from without suffering data loss.

Requested protection setting [+1n] [+2d:1n] [+2n] [+3d:1n] [+3d:1n1d]

Minimum number of nodes required 3 3
4
3
3

Definition
The cluster can recover from one drive or node failure without sustaining any data loss.
The cluster can recover from two simultaneous drive failures or one node failure without sustaining any data loss.
The cluster can recover from two simultaneous drive or node failures without sustaining any data loss.
The cluster can recover from three simultaneous drive failures or one node failure without sustaining any data loss.
The cluster can recover from three simultaneous drive failures or simultaneous failures of one node and one drive without sustaining any data loss.

Data layout with FlexProtect 275

Requested protection setting [+3n] [+4d:1n] [+4d:2n] [+4n] Nx (Data mirroring)

Minimum number of nodes required 6
3
4
8
N For example, 5x requires a minimum of five nodes.

Definition
The cluster can recover from three simultaneous drive or node failures without sustaining any data loss.
The cluster can recover from four simultaneous drive failures or one node failure without sustaining any data loss.
The cluster can recover from four simultaneous drive failures or two node failures without sustaining any data loss.
The cluster can recover from four simultaneous drive or node failures without sustaining any data loss.
The cluster can recover from N - 1 drive or node failures without sustaining data loss. For example, 5x protection means that the cluster can recover from four drive or node failures.

Related concepts Requested data protection on page 273

Requested protection disk space usage

Increasing the requested protection of data also increases the amount of space consumed by the data on the cluster.
The parity overhead for N + M protection depends on the file size and the number of nodes in the cluster. The percentage of parity overhead declines as the cluster gets larger.
The following table describes the estimated percentage of overhead depending on the requested protection and the size of the cluster or node pool. The table does not show recommended protection levels based on cluster size.

Number of nodes 3 4 5 6 7 8 9 10 12

[+1n]

[+2d:1n]

2 +1 (33%) 4 + 2 (33%)

3 +1 (25%)

6 + 2 (25%)

4 +1 (20%)

8 + 2 (20%)

5 +1 (17%) 10 + 2 (17%)

6 +1 (14%) 12 + 2 (14%)

7 +1 (13%) 14 + 2 (12.5%)

8 +1 (11%) 16 + 2 (11%)

9 +1 (10%) 16 + 2 (11%)

11 +1 (8%) 16 + 2 (11%)

[+2n]
--
2 + 2 (50%)
3 + 2 (40%)
4 + 2 (33%)
5 + 2 (29%)
6 + 2 (25%)
7 + 2 (22%)
8 + 2 (20%)
10 + 2 (17%)

[+3d:1n] [+3d:1n1d] [+3n] [+4d:1n] [+4d:2n] [+4n]

6 + 3 (33%)
9 + 3 (25%)
12 + 3 (20%)
15 + 3 (17%)
15 + 3 (17%)
15 + 3 (17%)
15 + 3 (17%)
15 + 3 (17%)
15 + 3 (17%)

3 + 3 (50%) --
5 + 3 (38%) --
7 + 3 (30%) --
9 + 3 (25%) 3 + 3 (50%)
11 + 3 (21%) 4 + 3 (43%)
13 + 3 (19%) 5 + 3 (38%)
15+3 (17%) 6 + 3 (33%)
15+3 (17%) 7 + 3 (30%)
15+3 (17%) 9 + 3 (25%)

8 + 4 (33%)
12 + 4 (25%)
16 + 4 (20%)
16 + 4 (20%)
16 + 4 (20%)
16 + 4 (20%)
16 + 4 (20%)
16 + 4 (20%)
16 + 4 (20%)

--
4 + 4 (50%)
6 + 4 (40%)
8 + 4 (33%)
10 + 4 (29%)
12 + 4 (25% )
14 + 4 (22%)
16 + 4 (20%)
16 + 4 (20%)

--
--
--
--
--
4 + 4 (50%) 5 + 4 (44%) 6 + 4 (40%) 8 + 4 (33%)

276 Data layout with FlexProtect

Number of nodes 14
16
18
20
30

[+1n]
13 + 1 (7%)
15 + 1 (6%)
16 + 1 (6%)
16 + 1 (6%)
16 + 1 (6%)

[+2d:1n] [+2n]

16 + 2 (11%)
16 + 2 (11%)
16 + 2 (11%)
16 + 2 (11%)
16 + 2 (11%)

12 + 2 (14%)
14 + 2 (13%)
16 + 2 (11%)
16 + 2 (11%)
16 + 2 (11%)

[+3d:1n] [+3d:1n1d] [+3n] [+4d:1n] [+4d:2n] [+4n]

15 + 3 (17%)
15 + 3 (17%)
15 + 3 (17%)
16 + 3 (16%)
16 + 3 (16%)

15+3 (17%) 11 + 3 (21%)
15+3 (17%) 13 + 3 (19%)
15+3 (17%) 15 + 3 (17%)
16 + 3 (16%) 16 + 3 (16%)
16 + 3 (16%) 16 + 3 (16%)

16 + 4 (20%)
16 + 4 (20%)
16 + 4 (20%)
16 + 4 (20%)
16 + 4 (20%)

16 + 4 (20%)
16 + 4 (20%)
16 + 4 (20%)
16 + 4 (20% )
16 + 4 (20%)

10 + 4 (29%)
12 + 4 (25%)
14 + 4 (22%)
16 + 4 (20%)
16 + 4 (20%)

The parity overhead for mirrored data protection is not affected by the number of nodes in the cluster. The following table describes the parity overhead for requested mirrored protection.

2x

3x

4x

5x

6x

7x

8x

50%

67%

75%

80%

83%

86%

88%

Related concepts Requested data protection on page 273

Data layout with FlexProtect 277

21
Large file size support
This section contains the following topics:
Topics:
· Large file support · Feature enablement requirements · Restrictions after enabling large file support · Enable large file support · Check SyncIQ and cluster disk space compatibility
Large file support
You can create files with a maximum size of 16 TiB. OneFS can support a maximum file size of 16 TiB on supported cluster configurations when SyncIQ partner clusters are also running version 8.2.2 or later on supported cluster configurations . You must ensure that your cluster(s) are one of the supported cluster configurations. To know more about the supported cluster configurations, see https://support.emc.com/kb/ 539758. The isi_large_file -c command is used to check if feature requirements are met on cluster. The isi_large_file -l command is used to show current large file settings. After the feature is enabled, files can be created up to a size of 16 TiB on the cluster by any of the supported protocols. First, the cluster(s) must be brought up to the feature release version and committed. Second, a feature enable script must be run. This enable script ensures that the cluster meets the requirements to run the feature.
Feature enablement requirements
The isi_large_file -c command assesses the cluster configuration and the existing SyncIQ policies to determine if they meet the requirements for the large file feature. Disk Pool Requirements Before you enable the feature, the isi_large_file -c command is used to check all disk pools to verify that your cluster is one of the supported configurations. SyncIQ Policy Requirements Before you enable the feature, all SyncIQ policies are checked to ensure that all remote clusters are running a release that supports this feature. In addition to all the remote clusters running a release that supports this feature, the clusters must be checked to ensure that they also meet the disk pool requirements. This activity is done by running and passing the checks that the isi_large_file -c command performs. No cluster in the SyncIQ chain of policies should enable the feature unless all those clusters can pass the isi_large_file -c command check. After the feature is enabled, you cannot disable it.
Restrictions after enabling large file support
After the isi_large_file -e command is enabled, there are certain restrictions that you may notice. SyncIQ policies after enabling The new SyncIQ policy partner clusters must have large file feature enabled. A new SyncIQ policy targeting a cluster without the large file feature enabled fails to sync.
278 Large file size support

Disk pool management commands after enabling
Disk pool commands that deviate from the protection settings that are described in the list of supported hardware configurations or in other ways reduce the size of disk pools. This situation may result in CELOG reports indicating that your disk pools do not meet requirements for the large file feature. As a result, you might see that the performance degrades with the use of large files.

Enable large file support

You can run isi_large_file -e on the cluster and all SyncIQ partners to enable the large file support Check if feature requirements are met on this cluster and view current large file settings by running the following command:

isi_large_file -c

The cluster is now ready for large file support, and the system displays output similar to the following example:

Checking cluster compatibility with large file support...

NOTE: PowerScale requires ALL clusters in your data-center that are part of any SyncIQ relationship to be running on versions of OneFS compatible with large file support before any of them can enable it. If any cluster requires upgrade to a compatible version, all SyncIQ policies in a SyncIQ relationship with the upgraded cluster will need to resync before you can successfully enable large file support.

* Checking SyncIQ compatibility... - SyncIQ compatibility check passed

* Checking cluster disk space compatibility... - The following disk pools do not have enough usable storage capacity to support large files:

Disk Pool Name Members

Usable Required Potential Capable Add Nodes

---------------------------------------------------------------------------

v200_25gb_6gb:2 1-3:bay2-7 32GB 240TB

212G

N

X

Disk Pool Name - Node pool name and this disk pool id Members - Current nodes and bays in this disk pool Usable - Current usable capacity of this disk pool Required - Usable capacity required for this disk pool to support large fi les Potential - The max usable capacity this disk pool could support at the ta rget node count Capable - Whether this disk pool has the size of disk and number per node to support large files Add Nodes - If this disk pool is capable, how many more nodes need to be a dded

The cluster is not compatible with large file support: - Incompatible disk pool(s)

Check SyncIQ and cluster disk space compatibility
You can check whether large file support is compatible with SyncIQ and get the list of disk pools that do not have the capacity to support large files. Check whether large file support is compatible with SyncIQ by running the following command:
isi_large_file -c
The system displays output similar to the following example along with the list of disk pools that do not support large files: Checking cluster compatibility with large file support... NOTE:

Large file size support 279

PowerScale requires ALL clusters in your data-center that are part of any SyncIQ relationship to be running on versions of OneFS compatible with large file support before any of them can enable it. If any cluster requires upgrade to a compatible version, all SyncIQ policies in a SyncIQ relationship with the upgraded cluster will need to resync before you can successfully enable large file support.

* Checking SyncIQ compatibility... - SyncIQ compatibility check passed

* Checking cluster disk space compatibility... - The following disk pools do not have enough usable storage capacity to support large files:

Disk Pool Name Members

Usable Required Potential Capable Add Nodes

---------------------------------------------------------------------------

v200_25gb_6gb:2 1-3:bay2-7 32GB 240TB

212G

N

X

Disk Pool Name - Node pool name and this disk pool id Members - Current nodes and bays in this disk pool Usable - Current usable capacity of this disk pool Required - Usable capacity required for this disk pool to support large files Potential - The max usable capacity this disk pool could support at the target node count Capable - Whether this disk pool has the size of disk and number per node to support large files Add Nodes - If this disk pool is capable, how many more nodes need to be added

The cluster is not compatible with large file support: - Incompatible disk pool(s)

280 Large file size support

22
Administering NDMP
This chapter contains the following topics:
Topics:
· NDMP backup and recovery overview · NDMP two-way backup · NDMP three-way backup · Support for NDMP sessions on Generation 6 hardware · Setting preferred IPs for NDMP three-way operations · NDMP multi-stream backup and recovery · Snapshot-based incremental backups · NDMP backup and restore of SmartLink files · NDMP protocol support · Supported DMAs · NDMP hardware support · NDMP backup limitations · NDMP performance recommendations · Excluding files and directories from NDMP backups · Configuring basic NDMP backup settings · Managing NDMP user accounts · Managing NDMP backup devices · Managing NDMP Fibre Channel ports · Managing NDMP preferred IP settings · Managing NDMP sessions · Managing NDMP restartable backups · NDMP restore operations · Managing default NDMP variables · Managing snapshot based incremental backups · Managing cluster performance for NDMP sessions · Managing CPU usage for NDMP sessions · View NDMP backup logs
NDMP backup and recovery overview
In OneFS, you can back up and recover file-system data through the Network Data Management Protocol (NDMP). From a backup server, you can direct backup and recovery processes between a PowerScale cluster and backup devices such as tape devices, media servers, and virtual tape libraries (VTLs). Some of the NDMP features are described below:  NDMP supports two-way and three-way backup models.  With certain data management applications, NDMP supports backup restartable extension (BRE). The NDMP BRE allows you
to resume a failed backup job from the last checkpoint taken prior to the failure. The failed job is restarted automatically and cannot be scheduled or started manually.  You do not need to activate a SnapshotIQ license on the cluster to perform NDMP backups. If you have activated a SnapshotIQ license on the cluster, you can generate a snapshot through the SnapshotIQ tool, and then back up the same snapshot. If you back up a SnapshotIQ snapshot, OneFS does not create another snapshot for the backup.  You can back up WORM domains through NDMP.
Administering NDMP 281

NDMP two-way backup
The NDMP two-way backup is also known as the local or direct NDMP backup. To perform NDMP two-way backups, you must connect your PowerScale cluster to a Backup Accelerator node which is synonymous with a Fibre Attached Storage node, and attach a tape device to that node. You must then use OneFS to detect the tape device before you can back up to that device. You can connect supported tape devices directly to the Fibre Channel ports of a Fibre Attached Storage node. Alternatively, you can connect Fibre Channel switches to the Fibre Channel ports on the Fibre Attached Storage node, and connect tape and media changer devices to the Fibre Channel switches. For more information, see your Fibre Channel switch documentation about zoning the switch to allow communication between the Fibre Attached Storage node and the connected tape and media changer devices. If you attach tape devices to a Fibre Attached Storage node, the cluster detects the devices when you start or restart the node or when you re-scan the Fibre Channel ports to discover devices. If a cluster detects tape devices, the cluster creates an entry for the path to each detected device. If you connect a device through a Fibre Channel switch, multiple paths can exist for a single device. For example, if you connect a tape device to a Fibre Channel switch, and then connect the Fibre Channel switch to two Fibre Channel ports, OneFS creates two entries for the device, one for each path.
NOTE: Generation 6 nodes added to an InfiniBand back end network are supported with the A100 Backup Accelerator as part of an NDMP 2-way backup solution. The A100 Backup Accelerator is not supported as part of an NDMP two-way backup solution with an all-Generation 6 cluster with an Ethernet back end.
NDMP three-way backup
The NDMP three-way backup is also known as the remote NDMP backup. During a three-way NDMP backup operation, a data management application (DMA) on a backup server instructs the cluster to start backing up data to a tape media server that is either attached to the LAN or directly attached to the DMA. The NDMP service runs on one NDMP Server and the NDMP tape service runs on a separate server. Both the servers are connected to each other across the network boundary.
Support for NDMP sessions on Generation 6 hardware
You can enable two-way NDMP sessions by configuring them with the optional 2x10GbE + 2x8GB Fibre Channel network interface card (NIC) on Generation 6 nodes. A 2x10GE + 2x8GB Fibre Channel NIC is a hybrid host bus adapter (HBA) that enables two-way NDMP sessions over the Fibre Channel port. Contact Dell EMC Professional Services to enable support for the 2x10GbE + 2x8GB Fibre Channel NIC.
Setting preferred IPs for NDMP three-way operations
If you are using Avamar as your data management application (DMA) for an NDMP three-way operation in an environment with multiple network interfaces, you can apply a preferred IP setting across a PowerScale cluster or to one or more subnets that are defined in OneFS. A preferred IP setting is a list of prioritized IP addresses to which a data server or tape server connects during an NDMP three-way operation. The IP address on the NDMP server that receives the incoming request from the DMA decides the scope and precedence for setting the preference. If the incoming IP address is within a subnet scope that has a preference, then the preference setting is applied. If a subnet-specific preference does not exist but a cluster-wide preference exists, the cluster-wide preference setting is applied. Subnet-specific preference always overrides the cluster-wide preference. If both the cluster-wide and subnet-specific preferences do not exist, the IP addresses within the subnet of the IP address receiving the incoming requests from the DMA are used as the preferred IP addresses. You can have one preferred IP setting per cluster or per network subnet. You can specify a list of NDMP preferred IPs through the isi ndmp settings preferred-ips command.
282 Administering NDMP

NDMP multi-stream backup and recovery
You can use the NDMP multi-stream backup feature, in conjunction with certain data management applications (DMAs), to speed up backups.
With multi-stream backup, you can use your DMA to specify multiple streams of data to back up concurrently. OneFS considers all streams in a specific multi-stream backup operation to be part of the same backup context. A multi-stream backup context is deleted if a multi-stream backup session is successful. If a specific stream fails, the backup context is retained for five minutes after the backup operation completes and you can retry the failed stream within that time period.
If you used the NDMP multi-stream backup feature to back data up to tape drives, you can also recover that data in multiple streams, depending on the DMA. In OneFS 8.0.0.0 and later releases, multi-stream backups are supported with CommVault Simpana version 11.0 Service Pack 3 and NetWorker version 9.0.1. If you back up data using CommVault Simpana, a multi-stream context is created, but data is recovered one stream at a time.

Snapshot-based incremental backups

You can implement snapshot-based incremental backups to increase the speed at which these backups are performed.
During a snapshot-based incremental backup, OneFS checks the snapshot taken for the previous NDMP backup operation and compares it to a new snapshot. OneFS then backs up all files that was modified since the last snapshot was made.
If the incremental backup does not involve snapshots, OneFS must scan the directory to discover which files were modified. OneFS can perform incremental backups significantly faster if the change rate is low.
You can perform incremental backups without activating a SnapshotIQ license on the cluster. Although SnapshotIQ offers a number of useful features, it does not enhance snapshot capabilities in NDMP backup and recovery.
Set the BACKUP_MODE environment variable to SNAPSHOT to enable snapshot-based incremental backups. If you enable snapshot-based incremental backups, OneFS retains each snapshot taken for NDMP backups until a new backup of the same or lower level is performed. However, if you do not enable snapshot-based incremental backups, OneFS automatically deletes each snapshot generated after the corresponding backup is completed or canceled.
NOTE: A snapshot-based incremental backup shares the dumpdates entries in dumpdates database along with the other level-based backups. Therefore, make sure that you do not run snapshot-based backups and regular level-based backups in the same backup paths. For example, make sure that you do not run a level 0 backup and snapshot-based incremental backup in the same backup path or vice versa.
After setting the BACKUP_MODE environment variable, snapshot-based incremental backup works with certain data management applications (DMAs) as listed in the next table.

Table 7. DMA support for snapshot-based incremental backups

DMA

DMA-integrated

Symantec NetBackup

Enabled only through an environment variable.

Networker

Enabled only through an environment variable.

Avamar

Yes

CommVault Simpana

Enabled only through a cluster-based environment variable.

Tivoli Storage Manager

Enabled only through a cluster-based environment variable.

Symantec Backup Exec

Enabled only through a cluster-based environment variable.

NetVault

Enabled only through a cluster-based environment variable.

ASG-Time Navigator

Enabled only through a cluster-based environment variable.

NDMP backup and restore of SmartLink files
You can perform NDMP backup and restore operations on data that has been archived to the cloud. Backup and restore capabilities with CloudPools data include:

Administering NDMP 283

 Archiving SmartLink files when backing up from a cluster  Restoring data, including SmartLink files, to the same cluster  Restoring data, including SmartLink files, to another cluster  Backing up version information with each SmartLink file, and restoring the Smartlink file after verifying the version
compatibility on the target cluster NOTE: SmartLink files that are backed up with OneFS 8.2.0 and later releases cannot be restored to releases earlier than 8.2.0.
You specify how files are backed up and restored by setting the NDMP environment variables BACKUP_OPTIONS and RESTORE_OPTIONS. See Administering NDMP in the PowerScale OneFS CLI Administration Guide for details about configuring the backup settings and managing NDMP environment variables.
NOTE: DeepCopy and ComboCopy backups recall file data from the cloud. The data is not stored on disks. Recall of file data may incur charges from cloud vendors.
With NDMP backup, by default, CloudPools supports the backup of SmartLink files that contain cloud metadata such as location of the object. Other details such as version information, account information, local cache state, and unsynchronized cache data that are associated with the SmartLink file are also backed up.
To prevent data loss when recovering SmartLink files with incompatible versions, use the NDMP combo copy backup option. Use this option to back up SmartLink files with full data. Full data includes metadata and user data. Use the NDMP combo copy option by setting the BACKUP_OPTIONS environment variable.
When the combo copy option is used for backup, you can use the combo copy, shallow copy, or deep copy restore options to recover SmartLink files. You can specify these options by setting appropriate values to the RESTORE_OPTIONS environment variable:
 The combo copy restore option restores SmartLink files from the backup stream only if their version is compatible with the OneFS version on the target cluster. If the SmartLink file version is incompatible with the OneFS version on the target cluster, a regular file is restored.
 If the version check operation on the target cluster is successful, the shallow copy restore operation restores the backed-up SmartLink file as a SmartLink file on the target cluster.
 If the version check operation on the target cluster fails, the deep copy restore operation forces the recovery of the SmartLink files as regular files on the target cluster .
 If you do not specify any restore operation, NDMP restores SmartLink files using the combo copy restore operation by default.
 When you specify multiple restore options, the combo copy restore operation has the highest priority. The shallow copy restore operation has the next highest priority. The deep copy restore operation has the lowest priority.
In CloudPools settings, you can set three retention periods that affect backed up SmartLink files and their associated cloud data:
 Full Backup Retention Period for NDMP takes effect when the SmartLink file is backed up as part of a full backup. The default is five years.
 Incremental Backup Retention Period for Incremental NDMP Backup and SyncIQ takes effect when a SmartLink file is backed up as part of an incremental backup. The default is five years.
 Cloud Data Retention Period defines the duration that data in the cloud is kept when its related SmartLink file is deleted. The default is one week.
CloudPools ensures the validity of a backed-up SmartLink file within the cloud data retention period. Set the retention periods appropriately to ensure that when the SmartLink file is restored from tape, it remains valid. CloudPools disallows restoring invalid SmartLink files.
CloudPools ensures that a backed-up SmartLink file is still valid by checking the retention periods that are stored for the file. If the retention time is past the restore time, CloudPools prevents NDMP from restoring the SmartLink file.
CloudPools ensures that the account under which the SmartLink files were originally created is not deleted. If it is deleted, both NDMP backup and restore of SmartLink files fail.
NDMP protocol support
You can back up the PowerScale cluster data through version 3 or 4 of the NDMP protocol.
OneFS supports the following features of NDMP versions 3 and 4:
 Full (level 0) NDMP backups
284 Administering NDMP

 Incremental (levels 1-9) NDMP backups and Incremental Forever (level 10) NOTE: In a level 10 NDMP backup, only data changed since the most recent incremental (level 1-9) backup or the last level 10 backup is copied. By repeating level 10 backups, you can be assured that the latest versions of files in your data set are backed up without having to run a full backup.
 Token-based NDMP backups  NDMP TAR backup type  Dump backup type  Path-based and dir/node file history format  Direct Access Restore (DAR)  Directory DAR (DDAR)  Including and excluding specific files and directories from backup  Backup of file attributes  Backup of Access Control Lists (ACLs)  Backup of Alternate Data Streams (ADSs)  Backup Restartable Extension (BRE)  Backup and restore of HDFS attributes
OneFS supports connecting to clusters through IPv4 or IPv6.
Supported DMAs
NDMP backups are coordinated by a data management application (DMA) that runs on a backup server.
NOTE: All supported DMAs can connect to a PowerScale cluster through the IPv4 protocol. However, only some of the DMAs support the IPv6 protocol for connecting to a PowerScale cluster.

NDMP hardware support

OneFS can back up data to and recover data from tape devices and virtual tape libraries (VTLs).

Supported tape For NDMP three-way backups, the data management application (DMA) determines the tape devices that

devices

are supported.

Supported tape For both the two-way and three-way NDMP backups, OneFS supports all of the tape libraries that are

libraries

supported by the DMA.

Supported virtual For three-way NDMP backups, the DMA determines the virtual tape libraries that will be supported. tape libraries

NDMP backup limitations
NDMP backups have the following limitations.
 Supports block sizes up to 512 KB.  Does not support more than 4 KB file path length.  Does not back up file system configuration data, such as file protection level policies and quotas.  Does not support recovering data from a file system other than OneFS.  Fibre Attached Storage nodes cannot interact with more than 4096 tape paths.  The maximum length of the FILESYSTEM environment variable supported for a backup operation is 1024.

Administering NDMP 285

NDMP performance recommendations

Consider the following recommendations to optimize OneFS NDMP backups.

General performance recommendations
 Install the latest patches for OneFS and your data management application (DMA).  Run a maximum of eight NDMP concurrent sessions per Fibre Attached Storage node and four NDMP concurrent sessions
per PowerScale IQ Backup Accelerator node to obtain optimal throughput per session.  NDMP backups result in very high Recovery Point Objectives (RPOs) and Recovery Time Objectives (RTOs). You can reduce
your RPO and RTO by attaching one or more Fibre Attached Storage nodes to the cluster and then running two-way NDMP backups.  The throughput for a PowerScale cluster during the backup and recovery operations is dependent on the dataset and is considerably reduced for small files.  If you are backing up large numbers of small files, set up a separate schedule for each directory.  If you are performing NDMP three-way backups, run multiple NDMP sessions on multiple nodes in your PowerScale cluster.  Recover files through Directory DAR (DDAR) if you recover large numbers of files frequently.  Use the largest tape record size available for your version of OneFS to increase throughput.  If possible, do not include or exclude files from backup. Including or excluding files can affect backup performance, due to filtering overhead.  Limit the depth of nested subdirectories in your file system.

SmartConnect recommendations
 A two-way NDMP backup session with SmartConnect requires Fibre Attached Storage node for backup and recovery operations. However, a three-way NDMP session with SmartConnect does not require Fibre Attached Storage nodes for these operations.
 For a NDMP two-way backup session with SmartConnect, connect to the NDMP session through a dedicated SmartConnect zone consisting of a pool of Network Interface Cards (NICs) on the Fibre Attached Storage nodes.
 For a two-way NDMP backup session without SmartConnect, initiate the backup session through a static IP address or fully qualified domain name of the Fibre Attached Storage node.
 For a three-way NDMP backup operation, the front-end Ethernet network or the interfaces of the nodes are used to serve the backup traffic. Therefore, it is recommended that you configure a DMA to initiate an NDMP session only using the nodes that are not already overburdened serving other workloads or connections.
 For a three-way NDMP backup operation with or without SmartConnect, initiate the backup session using the IP addresses of the nodes that are identified for running the NDMP sessions.

Fibre Attached Storage recommendations

 Assign static IP addresses to Fibre Attached Storage nodes.
 Attach more Fibre Attached Storage nodes to larger clusters. The recommended number of Fibre Attached Storage nodes is listed in the following table.

Table 8. Nodes per Fibre Attached Storage node

Node type

Recommended number of nodes per Fibre Attached Storage node

X-Series

3

NL-Series

3

S-Series

3

HD-Series

3

 Attach more Fibre Attached Storage nodes if you are backing up to more tape devices.

286 Administering NDMP

DMA-specific recommendations
 Enable parallelism for the DMA if the DMA supports this option. This allows OneFS to back up data to multiple tape devices at the same time.

Excluding files and directories from NDMP backups

You can exclude files and directories from NDMP backup operations by specifying NDMP environment variables through a data management application (DMA). If you include a file or directory, all other files and directories are automatically excluded from backup operations. If you exclude a file or directory, all files and directories except the excluded one are backed up.
You can include or exclude files and directories by specifying the following character patterns. The examples given in the table are valid only if the backup path is /ifs/data.

Table 9. NDMP file and directory matching wildcards

Character

Description

Example

*

Takes the place of any archive*

character or characters

Includes or excludes the following directories archive1 src/archive42_a/media

[]

Takes the place of

data_store_[a-f]

/ifs/data/data_store_a

a range of letters or numbers

data_store_[0-9]

/ifs/data/data_store_c

/ifs/data/data_store_8

?

Takes the place of any user_?

single character

/ifs/data/user_1 /ifs/data/user_2

\

Includes a blank space user\ 1

/ifs/data/user 1

//

Takes the place of a

ifs//data//archive

/ifs/data/archive

single slash (/)

***

Takes the place of a

single asterisk (*)

..

Ignores the pattern if it is ../home/john

home/john

at the beginning of a path

NOTE: " " are required for Symantec NetBackup when multiple patterns are specified. The patterns are not limited to directories.
Unanchored patterns such as home or user1 target a string of text that might belong to many files or directories. If a pattern contains '/', it is an anchored pattern. An anchored pattern is always matched from the beginning of a path. A pattern in the middle of a path is not matched. Anchored patterns target specific file pathnames, such as ifs/data/home. You can include or exclude either types of patterns.
If you specify both the include and exclude patterns, the include pattern is first processed followed by the exclude pattern.
If you specify both the include and exclude patterns, any excluded files or directories under the included directories would not be backed up. If the excluded directories are not found in any of the included directories, the exclude specification would have no effect.
NOTE: Specifying unanchored patterns can degrade the performance of backups. It is recommended that you avoid unanchored patterns whenever possible.

Administering NDMP 287

Configuring basic NDMP backup settings
You can configure NDMP backup settings to control how these backups are performed on the PowerScale cluster. You can also configure OneFS to interact with a specific data management application (DMA) for NDMP backups.
Configure and enable NDMP backup
OneFS prevents NDMP backups by default. Before you can perform NDMP backups, you must enable NDMP backups and configure NDMP settings. 1. Enable NDMP backup by running the following command:
isi ndmp settings global modify --service=true
2. Configure NDMP backup by running the isi ndmp settings set command. The following command configures OneFS to interact with NetWorker: isi ndmp settings global modify --dma=emc

Disable NDMP backup
You can disable NDMP backup if you no longer want to back up data through NDMP.  Run the following command:
isi ndmp settings global modify service=false

NDMP backup settings

You can configure settings that control how NDMP backups are performed on the cluster. The following information is displayed in the output of the isi ndmp settings global view command:

port dma

The number of the port through which data management applications (DMAs) can connect to the cluster. The DMA vendor that the cluster is configured to interact with.

View NDMP backup settings
You can view current NDMP backup settings, which indicate whether the service is enabled, the port through which data management applications (DMAs) connect to the cluster, and the DMA vendor that OneFS is configured to interact with.  Run the isi ndmp settings global view command:
The system displays the NDMP settings:
Service: True Port: 10000 Dma: generic
Bre Max Num Contexts: 64 Msb Context Retention Duration: 300 Msr Context Retention Duration: 600

288 Administering NDMP

Managing NDMP user accounts
You can create, delete, and modify the passwords of NDMP user accounts.
Create an NDMP user account
Before you can perform NDMP backups, you must create an NDMP user account through which a data management application (DMA) can access the cluster.  Run the isi ndmp users create command.
The following command creates an NDMP user account called NDMPuser with a password of 1234: isi ndmp users create --name=NDMPuser --password=1234
Modify the password of an NDMP user account
You can modify the password for an NDMP user account.  Run the isi ndmp users modify command.
The following command modifies the password of a user named NDMPuser to 5678: isi ndmp users modify --name=NDMPuser --password=5678
Delete an NDMP user account
You can delete an NDMP user account.  Run the isi ndmp users delete command.
The following command deletes a user named NDMPuser after a confirmation message: isi ndmp users delete --name=NDMPuser
View NDMP user accounts
You can view information about NDMP user accounts.  Run the isi ndmp users view command
The following command displays information about the account for a user named NDMPuser: isi ndmp users view --name=NDMPuser
Managing NDMP backup devices
After you attach a tape or media changer device to a Fibre Attached Storage node, you must configure OneFS to detect and establish a connection to the device. After the connection between the cluster and the backup device is established, you can modify the name that the cluster has assigned to the device, or disconnect the device from the cluster. In case the virtual tape library (VTL) device has multiple LUNs, you must configure LUN0 so that all the LUNs are detected properly.
Administering NDMP 289

Detect NDMP backup devices
If you connect devices to a Backup Accelerator node, you must configure OneFS to detect the devices before OneFS can back up data to and restore data from the devices. You can scan a specific node, a specific port, or all ports on all nodes.  Run the isi tape rescan command.
The following command detects devices on node 18: isi tape rescan --node=18
Modify an NDMP backup device entry name
You can modify the name of an NDMP device entry.  Run the isi tape modify command.
The following command renames tape003 to tape005: isi tape modify --name=tape003 --new-name=tape005
Delete a device entry for a disconnected NDMP backup device
If you physically remove an NDMP device from a cluster, OneFS retains the entry for the device. You can delete a device entry for a removed device. You can also remove the device entry for a device that is still physically attached to the cluster; this causes OneFS to disconnect from the device. If you remove a device entry for a device that is connected to the cluster, and you do not physically disconnect the device, OneFS will detect the device the next time it scans the ports. You cannot remove a device entry for a device that is currently being backed up to or restored from.  The following command disconnects tape001 from the cluster:
isi tape delete --name=tape001
View NDMP backup devices
You can view information about tape and media changer devices that are currently attached to the cluster through a Backup Accelerator node.  Run the following command to list tape devices on node 18:
isi tape list --node=18 --tape
Managing NDMP Fibre Channel ports
You can manage the Fibre Channel ports that connect tape and media changer devices to a Fibre Attached Storage node. You can also enable, disable, or modify the settings of an NDMP Fibre Channel port.
Modify NDMP backup port settings
You can modify the settings of an NDMP backup port.  Run the isi fc settings modify command.
The following command configures port 1 on node 5 to support a point-to-point Fibre Channel topology: isi fc settings modify --port=5.1 --topology=ptp
290 Administering NDMP

Enable or disable an NDMP backup port
You can enable or disable an NDMP backup port.  Run the isi fc settings modify command:
The following command disables port 1 on node 5:
isi fc settings modify --port=5.1 --state=disable
The following command enables port 1 on node 5:
isi fc settings modify --port=5.1 --state=enable

View NDMP backup ports
You can view information about Fibre Channel ports of Backup Accelerator nodes attached to a cluster.  Run the following command to view Fibre Channel port settings for port 1 on node 5:
isi fc settings view --port=5.1

NDMP backup port settings

OneFS assigns default settings to each port on each Backup Accelerator node attached to the cluster. These settings identify each port and specify how the port interacts with NDMP backup devices.
The following information is displayed in the output of the isi fc settings list command:

Port WWNN WWPN State Topology Rate
Firmware

The name of the Backup Accelerator node, and the number of the port. The world wide node name (WWNN) of the port. This name is the same for each port on a given node. The world wide port name (WWPN) of the port. This name is unique to the port. Whether the port is enabled or disabled. The type of Fibre Channel topology that the port is configured to support. The rate at which data is sent through the port. The rate can be set to 1 Gb/s, 2 Gb/s, 4 Gb/s, 8 Gb/s, and Auto. 8 Gb/s is available for A100 nodes only. If set to Auto, OneFS automatically negotiates with the DMA to determine the rate. Auto is the recommended setting. The firmware version for OCS ports. For Qlogic ports, the firmware version appears blank.

Managing NDMP preferred IP settings
If you are performing NDMP three-way operations using Avamar in an environment with multiple network interfaces, you can create, modify, delete, list, and view cluster-wide or subnet-specific NDMP preferred IP settings. You can manage NDMP preferred IP settings only through the OneFS command-line interface.
Create an NDMP preferred IP setting
If you are performing an NDMP three-way backup or restore operation using Avamar, you can create a cluster-wide or a subnet-specific NDMP preferred IP setting.  Create an NDMP preferred IP setting by running the isi ndmp settings preferred-ips create command.
For example, run the following command to apply a preferred IP setting for a cluster:
isi ndmp settings preferred-ips create cluster groupnet0.subnet0,10gnet.subnet0

Administering NDMP 291

Run the command as shown in the following example to apply a preferred IP setting for a subnet group: isi ndmp settings preferred-ips create 10gnet.subnet0 10gnet.subnet0,groupnet0.subnet0
Modify an NDMP preferred IP setting
If you are performing an NDMP three-way backup or restore operation using Avamar, you can modify an NDMP preferred IP setting by adding or deleting a subnet group.  Modify an NDMP preferred IP setting by running the isi ndmp settings preferred-ips modify command.
For example, run the following commands to modify the NDMP preferred IP setting for a cluster: isi ndmp settings preferred-ips modify 10gnet.subnet0 --add-data-subnets 10gnet.subnet0,groupnet0.subnet0
Run the command as shown in the following example to modify the NDMP preferred IP setting for a subnet: isi ndmp settings preferred-ips modify 10gnet.subnet0 --remove-data-subnets groupnet0.subnet0
List NDMP preferred IP settings
If you are performing an NDMP three-way backup or restore operation using Avamar, you can list all the NDMP preferred IP settings.  List the NDMP preferred IP settings by running the isi ndmp settings preferred-ips list command.
For example, run the following command to list the NDMP preferred IP settings: isi ndmp settings preferred-ips list
View NDMP preferred IP settings
If you are performing an NDMP three-way backup or restore operation using Avamar, you can view the NDMP preferred IP settings for a subnet or cluster.  View an NDMP preferred IP setting by running the isi ndmp settings preferred-ips view command.
For example, run the following command to view the NDMP preferred IP setting for a subnet: isi ndmp settings preferred-ips view --scope=10gnet.subnet0
Delete NDMP preferred IP settings
If you are performing an NDMP three-way backup or restore operation using Avamar, you can delete an NDMP preferred IP setting for a subnet or cluster.  Delete NDMP preferred IP settings by running the isi ndmp settings preferred-ips delete command.
For example, run the following command to delete the preferred IP setting for a subnet: isi ndmp settings preferred-ips delete --scope=10gnet.subnet0
292 Administering NDMP

Managing NDMP sessions
You can view the status of NDMP sessions or terminate a session that is in progress.
End an NDMP session
You can interrupt an NDMP backup or restore operation by ending an NDMP session. 1. To retrieve the ID of the NDMP session that you want to end, run the isi ndmp sessions list command. 2. Run the isi ndmp sessions delete command.
The following command ends an NDMP session with an ID of 4.36339 and skips the confirmation prompt: isi ndmp sessions delete --session=4.36339 --force
View NDMP sessions
You can view information about NDMP sessions that exist between the cluster and data management applications (DMAs).  Run the isi ndmp sessions view command. The following command displays information about session 4.36339.
isi ndmp sessions view --session=4.36339

NDMP session information

You can view information about active NDMP sessions. The following information is displayed in the output of the isi ndmp sessions list command:

Session Data Mover OP

Displays the unique identification number that OneFS assigned to the session. Specifies the current state of the data server. Specifies the current state of the data mover. Specifies the type of operation (backup or restore) that is currently in progress. If no operation is in progress, this field is blank. A backup operation could include the following details:

B({M} {F} [L[0-10] | T0 | Ti | S[0-10]] {r | R})

Where: [ a ]--a is required { a }--a is optional a | b--a or b but not at the same time M--Multi-stream backup F--File list L--Level-based T--Token-based S--Snapshot mode s--Snapshot mode and a full backup (when root dir is new) r--Restartable backup R--Restarted backup 0-10--Dump level

Administering NDMP 293

A restore operation could include the following details:

R ({M|s}[F | D | S]{h})

Where: M--Multi-stream restore s--Single-threaded restore (when RESTORE_OPTIONS=1) F--Full restore D--DAR S--Selective restore h--Restore hardlinks by table

Elapsed Time Bytes Moved Throughput

Specifies the time that has elapsed since the session started. Specifies the amount of data in bytes that was transferred during the session. Specifies the average throughput of the session over the past five minutes.

NDMP backup and restore operations Examples of active NDMP backup sessions indicated through the OP setting described previously are as follows:

B(T0): Token based full backup B(Ti): Token based incremental backup B(L0): Level based full backup B(L5): Level 5 incremental backup B(S0): Snapshot based full backup B(S3): Snapshot based level 3 backup B(FT0): Token based full filelist backup B(FL4): Level 4 incremental filelist backup B(L0r): Restartable level based full backup B(S4r): Restartable snapshot based level 4 incremental backup B(L7R): Restarted level 7 backup B(FT1R): Restarted token based incremental filelist backup B(ML0): Multi-stream full backup

Examples of active NDMP restore sessions indicated through the OP setting described previously are as follows:

R(F): Full restore R(D): DAR R(S): Selective restore R(MF): Multi-stream full restore R(sFh): single threaded full restore with restore hardlinks by table option

Managing NDMP restartable backups
An NDMP restartable backup also known as backup restartable extension (BRE) is a type of backup that you can enable in your data management application (DMA). If a restartable backup fails, for example, because of a power outage, you can restart the backup from a checkpoint close to the point of failure. In contrast, when a non-restartable backup fails, you must back up all data from the beginning, regardless of what was transferred during the initial backup process.
After you enable restartable backups from your DMA, you can manage restartable backup contexts from OneFS. These contexts are the data that OneFS stores to facilitate restartable backups. Each context represents a checkpoint that the restartable backup process can return to if a backup fails. There can be only one restartable backup context per restartable backup session. When your DMA restarts a failed backup session, the working files are reverted to the state corresponding to the NDMP restart backup request.
Restartable backups are supported for NetWorker 8.1 and later versions and CommVault Simpana DMAs.
NOTE: NDMP multi-stream backup does not support restartable backups.

294 Administering NDMP

Configure NDMP restartable backups for NetWorker
You must configure NetWorker to enable NDMP restartable backups and, optionally, define the checkpoint interval. If you do not specify a checkpoint interval, NetWorker uses the default interval of 5 GB. 1. Configure the client and the directory path that you want to back up as you would normally. 2. In the Client Properties dialog box, enable restartable backups.
a. On the General page, click the Checkpoint enabled checkbox. b. In the Checkpoint granularity drop-down list, select File. 3. In the Application information field, type any NDMP variables that you want to specify. The following variable setting specifies a checkpoint interval of 1 GB: CHECKPOINT_INTERVAL_IN_BYTES=1GB 4. Finish configuration and click OK in the Client Properties dialog box. 5. Start the backup. 6. If the backup is interrupted--for example, because of a power failure--restart it. a. On the Monitoring page, locate the backup process in the Groups list. b. Right-click the backup process and then, in the context menu, click Restart. NetWorker automatically restarts the backup from the last checkpoint.
View NDMP restartable backup contexts
You can view NDMP restartable backup contexts that have been configured. 1. List all the restartable backup contexts by running the following command:
isi ndmp contexts list --type=bre
2. To view detailed information about a specific restartable backup context, run the isi ndmp contexts view command. The following command displays detailed information about a backup context with an ID of 792eeb8a-8784-11e2aa70-0025904e91a4: isi ndmp contexts view bre_792eeb8a-8784-11e2-aa70-0025904e91a4
Delete an NDMP restartable backup context
After an NDMP restartable backup context is no longer needed, your data management application (DMA) automatically requests OneFS to delete the context. You can manually delete a restartable backup context before the DMA requests it.
NOTE: We recommend that you do not manually delete restartable backup contexts. Manually deleting a restartable backup context requires you to restart the corresponding NDMP backup from the beginning.  Run the isi ndmp contexts delete command. The following command deletes a restartable backup context with an ID of 792eeb8a-8784-11e2-aa70-0025904e91a4: isi ndmp contexts delete --id=bre_792eeb8a-8784-11e2-aa70-0025904e91a4
Configure NDMP restartable backup settings
You can specify the number of restartable backup contexts that OneFS can retain at a time, up to a maximum of 1024 contexts. The default number of restartable backup contexts is set to 64.  Run the isi ndmp settings global modify command.
The following command sets the maximum number of restartable backup contexts to 128: isi ndmp settings global modify --bre_max_num_contexts=128
Administering NDMP 295

View NDMP restartable backup settings
You can view the current limit of restartable backup contexts that OneFS retains at one time.  Run the following command:
isi ndmp settings global view
NDMP restore operations
NDMP supports the following types of restore operations:  NDMP parallel restore (multi-threaded process)  NDMP serial restore (single-threaded process)
NDMP parallel restore operation
Parallel (multi-threaded) restore enables faster full or partial restore operations by writing data to the cluster as fast as the data can be read from the tape. Parallel restore is the default restore mechanism in OneFS. The restore operation can restore multiple files concurrently through the parallel restore mechanism.
NDMP serial restore operation
For troubleshooting or for other purposes, you can run a serial restore operation which uses fewer system resources. The serial restore operation runs as a single-threaded process and restores one file at a time to the specified path.
Specify a NDMP serial restore operation
You can use the RESTORE_OPTIONS environment variable to specify a serial (single-threaded) restore operation. 1. In your data management application, configure a restore operation as you normally would. 2. Make sure that the RESTORE_OPTIONS environment variable is set to 1 on your data management application.
If the RESTORE_OPTIONS environment variable is not already set to 1, specify the isi ndmp settings variables modify command from the OneFS command line. The following command specifies serial restore for the /ifs/data/ projects directory:
isi ndmp settings variables modify /ifs/data/projects RESTORE_OPTIONS 1 The value of the path option must match the FILESYSTEM environment variable that is set during the backup operation. The value that you specify for the name option is case sensitive. 3. Start the restore operation.
Managing default NDMP variables
In OneFS, you can manage NDMP backup and restore operations by specifying default NDMP environment variables. You can specify NDMP environment variables for all the backup and restore operations or for a specific path. When you set the path to "/BACKUP", the environment variables are applied to all the backup operations. Similarly, when you set the path to "/RESTORE", the environment variables are applied to all the restore operations. You can override default NDMP environment variables through your data management application (DMA). For more information about specifying NDMP environment variables through your DMA, see the relevant DMA documentation.
296 Administering NDMP

Specify the default NDMP variable settings for a path
You can specify default NDMP variable settings for a path. 1. Open a secure shell (SSH) connection to any node in the PowerScale cluster and log in. 2. Set default NDMP variables by running the isi ndmp settings variables create command.
For example, the following command enables snapshot-based incremental backups for /ifs/data/media:
isi ndmp settings variables create /ifs/data/media BACKUP_MODE SNAPSHOT
Modify the default NDMP variable settings for a path
You can modify the default NDMP variable settings for a path. 1. Open a secure shell (SSH) connection to any node in the PowerScale cluster and log in. 2. Modify default NDMP variable settings by running the isi ndmp settings variables modify command.
For example, the following command sets the default file history format to path-based format for /ifs/data/media:
isi ndmp settings variables modify /ifs/data/media HIST F 3. Optional: To remove a default NDMP variable setting for a path, run the isi ndmp settings variables delete
command: For example, the following command removes the default file history format for /ifs/data/media:
isi ndmp settings variables delete /ifs/data/media --name=HIST
NOTE: If you do not specify the --name option, all the variables for the specified path are deleted after a confirmation.

View the default NDMP settings for a path
You can view the default NDMP settings for a path. 1. Open a secure shell (SSH) connection to any node in the PowerScale cluster and log in. 2. View the default NDMP settings by running the following command:
isi ndmp settings variables list

NDMP environment variables

You can specify default settings of NDMP backup and recovery operations through NDMP environment variables. You can also specify NDMP environment variables through your data management application (DMA).
Symantec NetBackup and NetWorker are the only two DMAs that allow you to directly set environment variables and propagate them to OneFS.

Table 10. NDMP environment variables

Environment variable

Valid values

BACKUP_FILE_LIST

<file-path>

Default None

Description
Triggers a file list backup.
Currently, only Networker and Symantec NetBackup can pass environment variables to OneFS.

BACKUP_MODE

TIMESTAMP SNAPSHOT

TIMESTAMP

Enables or disables snapshot-based incremental backups. To enable snapshot-based incremental backups, specify SNAPSHOT.

Administering NDMP 297

Table 10. NDMP environment variables (continued)

Environment variable

Valid values

Default

BACKUP_OPTIONS

0x00000400

0

0x00000200

0x00000100

0x00000001

0x00000002

0x00000004

Description

This environment variable controls the behavior of the backup operations.
The following settings are applicable only to datasets containing the CloudPools-driven SmartLink files:

0x00000400

Backs up SmartLink files with full data. This is the combo copy backup option.

0x00000200 0x00000100

Backs up all the cache data. This is the shallow copy backup option.
Reads SmartLink file data from the cloud and backs up the SmartLink files as regular files. This is the deep copy option.

0x00000001

Always adds DUMP_DATE into the list of environment variables at the end of a backup operation. The DUMP_DATE value is the time when the backup snapshot was taken. A DMA can use the DUMP_DATE value to set BASE_DATE for the next backup operation.

0x00000002

Retains the backup snapshot of a token-based backup in the dumpdates file. Since a token-based backup has no LEVEL, its level is set to 10 by default. The snapshot allows a faster-incremental backup as the next incremental backup after the token-based backup is done.

0x00000004

Retains the previous snapshot.
After a faster-incremental backup, the prior snapshot is saved at level 10. In order to avoid two snapshots at the same level, the prior snapshot is kept at a lower level in the dumpdates file. This allows the BASE_DATE and BACKUP_MODE=snapshot settings to trigger a faster-

298 Administering NDMP

Table 10. NDMP environment variables (continued)

Environment variable

Valid values

Default

BASE_DATE DIRECT
EXCLUDE FILES
HIST

Y

N

N

<file-matching-pattern> None <file-matching-pattern> None

<file-history-format>

Y

LEVEL

<integer>

0

Description

incremental backup instead of a token-based backup. The environment variable settings prompt the NDMP server to compare the BASE_DATE value against the timestamp in the dumpdates file to find the prior backup. Even though the DMA fails the latest faster-incremental backup, OneFS retains the prior snapshot. The DMA can then retry the fasterincremental backup in the next backup cycle using the BASE_DATE value of the prior backup.

Enables a token-based incremental backup. The dumpdates file will not be updated in this case.

Enables or disables Direct Access Restore (DAR) and Directory DAR (DDAR). The following values are valid:

Y

Enables DAR and DDAR.

N

Disables DAR and DDAR.

If you specify this option, OneFS does not back up files and directories that meet the specified pattern. Separate multiple patterns with a space.
If you specify this option, OneFS backs up only files and directories that meet the specified pattern. Separate multiple patterns with a space.
NOTE: As a rule, files are matched first and then the EXCLUDE pattern is applied.

Specifies the file history format. The following values are valid:

D

Specifies directory or node

file history.

F

Specifies path-based file

history.

Y

Specifies the default file

history format determined

by your NDMP backup

settings.

N

Disables file history.

Specifies the level of NDMP backup to perform. The following values are valid:

0

Performs a full NDMP

backup.

Administering NDMP 299

Table 10. NDMP environment variables (continued)

Environment variable

Valid values

Default

MSB_RETENTION_PERIOD MSR_RETENTION_PERIOD

Integer 0 through 60*60*24

300 sec 600 sec

RECURSIVE

Y

Y

N

RESTORE_BIRTHTIME

Y

N

N

RESTORE_HARDLINK

Y

N

_BY_TABLE

N

RESTORE_OPTIONS

0x00000001

0

0x00000002

0x00000004

0x00000100

0x00000200

Description

1 - 9 10

Performs an incremental backup at the specified level.
Performs Incremental Forever backups.

For a multi-stream backup session, specifies the backup context retention period.
For a multi-stream restore session, specifies the recovery context retention period within which a recovery session can be retried.
For restore sessions only. Specifies that the restore session should recover files or subdirectories under a directory automatically.
Specifies whether to recover the birth time for a recovery session.

For a single-threaded restore session, determines whether OneFS recovers hard links by building a hard-link table during recovery operations. Specify this option if hard links are incorrectly backed up and recovery operations are failing.
If a recovery operation fails because hard links were incorrectly backed up, the following message appears in the NDMP backup logs:

Bad hardlink path for <path>

NOTE: This variable is not effective for a parallel restore operation.

This environment variable controls the behavior of the restore operations.

0x00000001 0x00000002 0x00000004

Performs a singlethreaded restore operation.
Restores attributes to the existing directories.
Creates intermediate directories with default attributes. The default behavior is to get attributes from the first object under a given directory.

The following settings are applicable only to datasets backed up with the combo copy backup option:

0x00000100

Forces deep copy restoration of the SmartLink files. That is, restores the backed up

300 Administering NDMP

Table 10. NDMP environment variables (continued)

Environment variable

Valid values

Default

UPDATE

Y

Y

N

Description

0x00000200

SmartLink file as a regular file on the target cluster.
Forces shallow copy restoration of the SmartLink files. That is, restores the backed up SmartLink file as a SmartLink file on the target cluster.

Determines whether OneFS updates the dumpdates file. The default is to perform a combo copy restore.

Y

OneFS updates the

dumpdates file.

N

OneFS does not update

the dumpdates file.

Setting environment variables for backup and restore operations
You can set environment variables to support the backup and restore operations for your NDMP session.
You can set environment variables through a data management application (DMA) or the command-line interface. Alternatively, you can set global environment variables. The precedence to apply their settings for a backup or restore operation follows:
 The environment variables specified through a DMA have the highest precedence.  Path-specific environment variables specified by the isi ndmp settings variables take the next precedence.  Global environment variable settings of "/BACKUP" or "/RESTORE" take the lowest precedence.
You can set environment variables to support different types of backup operations as described in the following scenarios:
 If the BASE_DATE environment variable is set to any value and if you set the BACKUP_MODE environment variable to SNAPSHOT, the LEVEL environment variable is automatically set to 10 and an Incremental Forever backup is performed.
 If the BASE_DATE environment variable is set to 0, a full backup is performed.  If the BACKUP_MODE environment variable is set to snapshot and the BASE_DATE environment variable is not set to 0,
the entries in the dumpdates file are read and compared with the BASE_DATE environment variable. If an entry is found and a prior valid snapshot is found, a faster incremental backup is performed.  If the BACKUP_MODE environment variable is set to snapshot, the BASE_DATE environment variable is not set to 0, and if no entries are found in the dumpdates file and no prior valid snapshots are found, a token-based backup is performed using the value of the BASE_DATE environment variable.  If the BASE_DATE environment variable is set, the BACKUP_OPTIONS environment variable is set to 0x0001 by default.  If the BACKUP_MODE environment variable is set to snapshot, the BACKUP_OPTIONS environment variable is set to 0x0002 by default.  If the BACKUP_OPTIONS environment variable is set to 0x0004 , the snapshot is saved and maintained by the application used for the backup process.  In order to run an Incremental Forever backup with faster incremental backups, you must set the following environment variables:  BASE_DATE=<time>  BACKUP_MODE=snapshot  BACKUP_OPTIONS=7

Administering NDMP 301

Managing snapshot based incremental backups
After you enable snapshot-based incremental backups, you can view and delete the snapshots created for these backups.
Enable snapshot-based incremental backups for a directory
You can configure OneFS to perform snapshot-based incremental backups for a directory by default. You can also override the default setting in your data management application (DMA).  Run the isi ndmp settings variable create command.
The following command enables snapshot-based incremental backups for /ifs/data/media: isi ndmp settings variables create /ifs/data/media BACKUP_MODE SNAPSHOT
Delete snapshots for snapshot-based incremental backups
You can delete snapshots created for snapshot-based incremental backups. NOTE: It is recommended that you do not delete snapshots created for snapshot-based incremental backups. If all snapshots are deleted for a path, the next backup performed for the path is a full backup.
1. Click Data Protection > NDMP > Environment Settings. 2. In the Dumpdates table, click Delete against the entry that you want to delete. 3. In the Confirm Delete dialog box, click Delete.
View snapshots for snapshot-based incremental backups
You can view snapshots generated for snapshot-based incremental backups. 1. Click Data Protection > NDMP > Environment Settings. 2. In the Dumpdates table, view information about the snapshot-based incremental backups.
Managing cluster performance for NDMP sessions
NDMP Redirector distributes NDMP loads automatically over nodes by using the optional 2x10GbE + 2x8GB Fibre Channel NIC on Generation 6 nodes. You can enable NDMP Redirector to automatically distribute NDMP two-way sessions to nodes with lesser loads. The load-distribution capability results in improved cluster performance when multiple NDMP operations are initiated. NDMP Redirector checks for the following before redirecting the NDMP operation:  CPU usage  The number of running NDMP operations  The availability of tape devices
Enable NDMP Redirector to manage cluster performance
You must enable NDMP Redirector in order to automatically distribute NDMP two-way sessions to nodes with lesser loads. Make sure that the cluster is committed before enabling NDMP Redirector. 1. Run the following command through the command line interface to enable NDMP Redirector:
isi ndmp settings global modify --enable-redirector true
302 Administering NDMP

2. View the setting change by running the following command: isi ndmp settings global modify
A sample output of the previous command is shown: Service: False Port: 10000 DMA: generic Bre Max Num Contexts: 64 Context Retention Duration: 300 Smartlink File Open Timeout: 10 Enable Redirector: True
Managing CPU usage for NDMP sessions
NDMP Throttler manages the CPU usage during NDMP two-way sessions on 6th Generation nodes. The nodes are then available to adequately support other system activities.
Enable NDMP Throttler
You must enable NDMP Throttler in order to manage CPU usage of NDMP sessions on 6th Generation nodes. 1. Run the following command through the command line interface to enable NDMP Throttler:
isi ndmp settings global modify --enable-throttler true
2. View the setting change by running the following command: isi ndmp settings global modify
A sample output of the previous command is shown: Service: False Port: 10000 DMA: generic Bre Max Num Contexts: 64 Context Retention Duration: 600 Smartlink File Open Timeout: 10 Enable Throttler: True Throttler CPU Threshold: 50
3. If required, change the throttler CPU threshold as shown in the following example: isi ndmp settings global modify ­throttler-cpu-threshold 80
View NDMP backup logs
You can view information about NDMP backup and restore operations through NDMP backup logs. View the contents of the /var/log/isi_ndmp_d directory by running the following command:
more /var/log/isi_ndmp_d
Administering NDMP 303

23
File retention with SmartLock
This section contains the following topics:
Topics:
· SmartLock overview · Compliance mode · Enterprise mode · SmartLock directories · Replication and backup with SmartLock · SmartLock license functionality · SmartLock considerations · Delete WORM domain and directories · Set the compliance clock · View the compliance clock · Creating a SmartLock directory · Managing SmartLock directories · Managing files in SmartLock directories
SmartLock overview
With the SmartLock software module, you can protect files on a PowerScale cluster from being modified, overwritten, or deleted. To protect files in this manner, you must activate a SmartLock license. With SmartLock, you can identify a directory in OneFS as a WORM domain. WORM stands for write once, read many. All files within the WORM domain can be committed to a WORM state, meaning that those files cannot be overwritten, modified, or deleted. After a file is removed from a WORM state, you can delete the file. However, you can never modify a file that has been committed to a WORM state, even after it is removed from a WORM state. In OneFS, SmartLock can be deployed in one of two modes: compliance mode or enterprise mode.
Compliance mode
SmartLock compliance mode enables you to protect your data in compliance with U.S. Securities and Exchange Commission rule 17a-4. Rule 17a-4 is aimed at securities brokers and dealers, and specifies that records of all securities transactions must be archived in a nonrewritable, nonerasable manner.
NOTE: You can configure a PowerScale cluster for SmartLock compliance mode only during the initial cluster configuration process, before you activate a SmartLock license. A cluster cannot be converted to SmartLock compliance mode after the cluster is initially configured and put into production. Configuring a cluster for SmartLock compliance mode disables the root user. You cannot to log in to that cluster through the root user account. Instead, you can log in to the cluster through the compliance administrator account that is configured during initial SmartLock compliance mode configuration. When you are logged in to a SmartLock compliance mode cluster through the compliance administrator account, you can perform administrative tasks through the sudo command.
304 File retention with SmartLock

Enterprise mode
You can create SmartLock domains and apply WORM status to files by activating a SmartLock license on a cluster in standard configuration. This is referred to as SmartLock enterprise mode. SmartLock enterprise mode does not conform to SEC regulations, but does enable you to create SmartLock directories and apply SmartLock controls to protect files so that they cannot be rewritten or erased. In addition, the root user account remains on your system.
SmartLock directories
In a SmartLock directory, you can commit a file to a WORM state manually or you can configure SmartLock to commit the file automatically. Before you can create SmartLock directories, you must activate a SmartLock license on the cluster. You can create two types of SmartLock directories: enterprise and compliance. However, you can create compliance directories only if the PowerScale cluster has been set up in SmartLock compliance mode during initial configuration. Enterprise directories enable you to protect your data without restricting your cluster to comply with U.S. Securities and Exchange Commission rule 17a-4. Files that you commit to a WORM state in an enterprise directory cannot be modified and cannot be deleted until the retention period passes. However, there is a privileged delete feature that allows deleting a file before the retention period passes. You can delete a file in WORM state if:  You own the file and have been assigned the ISI_PRIV_IFS_WORM_DELETE privilege.  You are logged in through the root user account. The privileged delete feature is not available for compliance directories. Enterprise directories reference the system clock to facilitate time-dependent operations, including file retention. Compliance directories enable you to protect your data in compliance with the regulations that are defined by U.S. Securities and Exchange Commission rule 17a-4. If you commit a file to a WORM state in a compliance directory, the file cannot be modified or deleted before the specified retention period has expired. You cannot delete committed files, even if you are logged in to the compliance administrator account. Compliance directories reference the compliance clock to facilitate time-dependent operations, including file retention. You must set the compliance clock before you can create compliance directories. You can set the compliance clock only once, after which you cannot modify the compliance clock time. You can increase the retention time of WORM committed files on an individual basis, if necessary, but you cannot decrease the retention time. The compliance clock is controlled by the compliance clock daemon. Root and compliance administrator users could disable the compliance clock daemon, which would have the effect of increasing the retention period for all WORM committed files. Doing so is not recommended.
NOTE: Using WORM exclusions, files inside a WORM compliance or enterprise domain can be excluded from having a WORM state. All the files inside the excluded directory will behave as normal non-Smartlock protected files. For more information, see Exclude a SmartLock directory on page 309.
Replication and backup with SmartLock
OneFS enables both compliance and enterprise SmartLock directories to be replicated or backed up to a target cluster. If you are replicating SmartLock directories with SyncIQ, it is recommended that you configure all nodes on the source and target clusters with Network Time Protocol (NTP) peer mode to ensure that the node clocks are synchronized. For compliance clusters, it is recommended that you configure all nodes on the source and target clusters with NTP peer mode before you set the compliance clocks. Configuring all nodes with NTP peer mode sets the source and target clusters to the same time initially and helps to ensure compliance with U.S. Securities and Exchange Commission rule 17a-4.
NOTE: If you replicate data to a SmartLock directory, do not configure SmartLock settings for that directory until you are no longer replicating data to the directory. Configuring an autocommit time period for a SmartLock target directory, for example, can cause replication jobs to fail. If the target directory commits a file to a WORM state, and the file is modified on the source cluster, the next replication job will fail because it cannot overwrite the committed file.
If you back up data to an NDMP device, all SmartLock metadata relating to the retention date and commit status is transferred to the NDMP device. If you recover data to a SmartLock directory on the cluster, the metadata persists on the cluster.
File retention with SmartLock 305

However, if the directory that you recover data to is not a SmartLock directory, the metadata is lost. You can recover data to a SmartLock directory only if the directory is empty.
For information about the limitations of replicating and failing back SmartLock directories with SyncIQ, see SmartLock replication limitations on page 244.
SmartLock license functionality
You must activate a SmartLock license on a PowerScale cluster before you can create SmartLock directories and commit files to a WORM state.
If a SmartLock license becomes inactive, you will not be able to create new SmartLock directories on the cluster, modify SmartLock directory configuration settings, or delete files committed to a WORM state in enterprise directories before their expiration dates. However, you can still commit files within existing SmartLock directories to a WORM state.
If a SmartLock license becomes inactive on a cluster that is running in SmartLock compliance mode, root access to the cluster is not restored.
SmartLock considerations
 If a file is owned exclusively by the root user, and the file exists on a PowerScale cluster that is in SmartLock compliance mode, the file will be inaccessible: the root user account is disabled in compliance mode. For example, if a file is assigned root ownership on a cluster that has not been configured in compliance mode, and then the file is replicated to a cluster in compliance mode, the file becomes inaccessible. This can also occur if a root-owned file is restored onto a compliance cluster from a backup.
 It is recommended that you create files outside of SmartLock directories and then transfer them into a SmartLock directory after you are finished working with the files. If you are uploading files to a cluster, it is recommended that you upload the files to a non-SmartLock directory, and then later transfer the files to a SmartLock directory. If a file is committed to a WORM state while the file is being uploaded, the file will become trapped in an inconsistent state.
 Files can be committed to a WORM state while they are still open. If you specify an autocommit time period for a directory, the autocommit time period is calculated according to the length of time since the file was last modified, not when the file was closed. If you delay writing to an open file for more than the autocommit time period, the file is automatically committed to a WORM state, and you will not be able to write to the file.
 In a Microsoft Windows environment, if you commit a file to a WORM state, you can no longer modify the hidden or archive attributes of the file. Any attempt to modify the hidden or archive attributes of a WORM committed file generates an error. This can prevent third-party applications from modifying the hidden or archive attributes.
 You cannot rename a SmartLock compliance directory. You can rename a SmartLock enterprise directory only if it is empty.  You can only rename files in SmartLock compliance or enterprise directories if the files are uncommitted.  You cannot move:
 SmartLock directories within a WORM domain  SmartLock directories in a WORM domain into a directory in a non-WORM domain.  directories in a non-WORM domain into a SmartLock directory in a WORM domain.
Delete WORM domain and directories
You can set an attribute on a WORM domain using the CLI to enable you to delete the directories and files in the domain, and the domain itself. This is useful in the scenario where you created a WORM domain that is not needed, incorrectly named a SmartLock directory, or created a SmartLock directory in the wrong location.
In order to delete SmartLock directories and the corresponding WORM domain, you must set the pending delete flag on the domain using the isi worm domain modify <domain> --set-pending-delete CLI command. For more information, see Delete a SmartLock directory on page 309.
NOTE: You cannot set the pending delete flag in the Web UI.
Once a WORM domain is marked pending for delete:
 No new files may be created, renamed, or hard-linked into the domain.  Existing files may not be committed or have their retention dates extended.  SyncIQ will fail to sync to and from the domain.
306 File retention with SmartLock

Deleting a file allowed? Deleting a directory allowed?
Renaming a file allowed? Renaming a directory allowed? Creating a new file allowed?

Compliance WORM domain is marked pending for delete Yes, if file is uncommitted or expired Yes, if it doesn't contain committed and unexpired files Yes, if uncommitted No
No

Compliance WORM domain is not marked pending for delete Yes, if file is uncommitted or expired Yes, if it doesn't contain committed and unexpired files Yes, if uncommitted No
Yes

Set the compliance clock
Before you can create SmartLock compliance directories, you must set the compliance clock. Setting the compliance clock configures the clock to the same time as the cluster system clock. Before you set the compliance clock, ensure that the system clock is set to the correct time. If the compliance clock later becomes unsynchronized with the system clock, the compliance clock will slowly correct itself to match the system clock. The compliance clock corrects itself at a rate of approximately one week per year. 1. Open a secure shell (SSH) connection to any node in the cluster and log in through the compliance administrator account. 2. Set the compliance clock by running the following command:
isi worm cdate set

View the compliance clock
You can view the current time of the compliance clock. 1. Open a secure shell (SSH) connection to any node in the cluster and log in through the compliance administrator account. 2. View the compliance clock by running the following command:
isi worm cdate view
Creating a SmartLock directory
You can create a SmartLock directory and configure settings that control how long files are retained in a WORM state and when files are automatically committed to a WORM state. You cannot move or rename a directory that contains a SmartLock directory.
Retention periods
A retention period is the length of time that a file remains in a WORM state before being released from a WORM state. You can configure SmartLock directory settings that enforce default, maximum, and minimum retention periods for the directory. If you manually commit a file, you can optionally specify the date that the file is released from a WORM state. You can configure a minimum and a maximum retention period for a SmartLock directory to prevent files from being retained for too long or too short a time period. It is recommended that you specify a minimum retention period for all SmartLock directories. For example, assume that you have a SmartLock directory with a minimum retention period of two days. At 1:00 PM on Monday, you commit a file to a WORM state, and specify the file to be released from a WORM state on Tuesday at 3:00 PM. The file will be released from a WORM state two days later on Wednesday at 1:00 PM, because releasing the file earlier would violate the minimum retention period. You can also configure a default retention period that is assigned when you commit a file without specifying a date to release the file from a WORM state.

File retention with SmartLock 307

Autocommit time periods
You can configure an autocommit time period for SmartLock directories. An autocommit time period causes files that have been in a SmartLock directory for a period of time without being modified to be automatically committed to a WORM state. If you modify the autocommit time period of a SmartLock directory that contains uncommitted files, the new autocommit time period is immediately applied to the files that existed before the modification. For example, consider a SmartLock directory with an autocommit time period of 2 hours. If you modify a file in the SmartLock directory at 1:00 PM, and you decrease the autocommit time period to 1 hour at 2:15 PM, the file is instantly committed to a WORM state. If a file is manually committed to a WORM state, the read-write permissions of the file are modified. However, if a file is automatically committed to a WORM state, the read-write permissions of the file are not modified.
Create an enterprise directory for a non-empty directory
You can make a non-empty directory into a SmartLock enterprise directory. This procedure is available only through the command-line interface (CLI). Before creating a SmartLock directory, be aware of the following conditions and requirements:  You cannot create a SmartLock directory as a subdirectory of an existing SmartLock directory.  Hard links cannot cross SmartLock directory boundaries.  Creating a SmartLock directory causes a corresponding SmartLock domain to be created for that directory. Run the isi job jobs start command. The following command creates a SmartLock enterprise domain for /ifs/data/smartlock:
isi job jobs start DomainMark --root /ifs/data/smartlock --dm-type Worm
Create a SmartLock directory
You can create a SmartLock directory and commit files in that directory to a WORM state. Before creating a SmartLock directory, be aware of the following conditions and requirements:  You cannot create a SmartLock directory as a subdirectory of an existing SmartLock directory.  Hard links cannot cross SmartLock directory boundaries.  Creating a SmartLock directory causes a corresponding SmartLock domain to be created for that directory. Run the isi worm domains create command. If you specify the path of an existing directory, the directory must be empty. The following command creates a compliance directory with a default retention period of four years, a minimum retention period of three years, and an maximum retention period of five years:
isi worm domains create /ifs/data/SmartLock/directory1 \ --compliance --default-retention 4Y --min-retention 3Y \ --max-retention 5Y --mkdir
The following command creates an enterprise directory with an autocommit time period of thirty minutes and a minimum retention period of three months:
isi worm domains create /ifs/data/SmartLock/directory2 \ --autocommit-offset 30m --min-retention 3M --mkdir
308 File retention with SmartLock

Managing SmartLock directories
You can modify SmartLock directory settings, including the default, minimum, maximum retention period and the autocommit time period. A SmartLock enterprise directory can be renamed only if the directory is empty. A SmartLock compliance directory cannot be renamed.
Modify a SmartLock directory
You can modify the SmartLock configuration settings for a SmartLock directory. NOTE: You can modify SmartLock directory settings only 32 times per directory. It is recommended that you set SmartLock configuration settings only once and do not modify the settings after files are added to the SmartLock directory.
1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Modify SmartLock configuration settings by running the isi worm modify command.
The following command sets the default retention period to one year:
isi worm domains modify /ifs/data/SmartLock/directory1 \ --default-retention 1Y
Exclude a SmartLock directory
You can exclude a SmartLock enterprise mode or compliance mode directory in a WORM domain to exempt the directory and its files within it from WORM retention policies and protection. In order to do this, you must create a WORM exclusion domain on a directory. This procedure is available only through the command-line interface (CLI). To create a WORM exclusion domain on a directory, the directory must meet all of the following conditions:  is a member of a WORM domain.  is not the root directory of a WORM domain.  is not the virtual .snapshot directory.  is not within the compliance store of a WORM compliance domain.  is not within another WORM exclusion domain (nesting).  is empty.
NOTE: You cannot create WORM domains within WORM exclusion domains.
Run the isi worm domain modify /ifs/data/worm_domain --exclude /ifs/data/worm_domain/dir/ <excluded_dir> command. To remove an existing exclusion domain on a directory, you must remove the directory and all of its constituent files.
Delete a SmartLock directory
You can delete a SmartLock compliance mode directory and its corresponding compliance mode WORM domain (if needed). In order to do this, you must set the pending delete flag on the domain. You cannot set the pending delete flag on an enterprise mode WORM domain. This procedure is available only through the CLI. Before marking a compliance mode WORM domain as pending delete, be aware of the following conditions:  No new files may be created, renamed, or hard-linked into the domain.  Existing files may not be committed or have their retention dates extended.  SyncIQ will fail to sync to and from the domain. Run the isi worm domain modify <domain> --set-pending-delete command.
File retention with SmartLock 309

View SmartLock directory settings

You can view the SmartLock directory settings for SmartLock directories. 1. Open a secure shell (SSH) connection to any node in the PowerScale cluster and log in. 2. View all SmartLock domains by running the following command:

isi worm domains list

The system displays output similar to the following example:

ID Path

Type

-----------------------------------------------

65536 /ifs/data/SmartLock/directory1 enterprise

65537 /ifs/data/SmartLock/directory2 enterprise

65538 /ifs/data/SmartLock/directory3 enterprise

-----------------------------------------------

3. Optional: To view detailed information about a specific SmartLock directory, run the isi worm domains view command.
The following command displays detailed information about /ifs/data/SmartLock/directory2:
isi worm domains view /ifs/data/SmartLock/directory2
The system displays output similar to the following example:
ID: 65537 Path: /ifs/data/SmartLock/directory2 Type: enterprise
LIN: 4295426060 Autocommit Offset: 30m
Override Date: Privileged Delete: off Default Retention: 1Y
Min Retention: 3M Max Retention: Total Modifies: 3/32 Max

SmartLock directory configuration settings

You can configure SmartLock directory settings that determine when files are committed to and how long files are retained in a WORM state.

ID Path Type LIN Autocommit offset

The numerical ID of the corresponding SmartLock domain. The path of the directory. The type of SmartLock directory. The inode number of the directory. The autocommit time period for the directory. After a file exists in this SmartLock directory without being modified for the specified time period, the file is automatically committed to a WORM state. Times are expressed in the format "<integer> <time>", where <time> is one of the following values:

Y

Specifies years

M

Specifies months

W

Specifies weeks

D

Specifies days

H

Specifies hours

m

Specifies minutes

310 File retention with SmartLock

s

Specifies seconds

Override date

The override retention date for the directory. Files committed to a WORM state are not released from a WORM state until after the specified date, regardless of the maximum retention period for the directory or whether a user specifies an earlier date to release a file from a WORM state.

Privileged delete

Indicates whether files committed to a WORM state in the directory can be deleted through the privileged delete functionality. To access the privilege delete functionality, you must either be assigned the ISI_PRIV_IFS_WORM_DELETE privilege and own the file you are deleting. You can also access the privilege delete functionality for any file if you are logged in through the root or compadmin user account.

on

Files committed to a WORM state can be deleted through the isi worm files

delete command.

off

Files committed to a WORM state cannot be deleted, even through the isi worm

files delete command.

disabled

Files committed to a WORM state cannot be deleted, even through the isi worm files delete command. After this setting is applied, it cannot be modified.

Default retention The default retention period for the directory. If a user does not specify a date to release a file from a

period

WORM state, the default retention period is assigned.

Times are expressed in the format "<integer> <time>", where <time> is one of the following values:

Y

Specifies years

M

Specifies months

W

Specifies weeks

D

Specifies days

H

Specifies hours

m

Specifies minutes

s

Specifies seconds

Forever indicates that WORM committed files are retained permanently by default. Use Min indicates that the default retention period is equal to the minimum retention date. Use Max indicates that the default retention period is equal to the maximum retention date.

Minimum retention period

The minimum retention period for the directory. Files are retained in a WORM state for at least the specified amount of time, even if a user specifies an expiration date that results in a shorter retention period.
Times are expressed in the format "<integer> <time>", where <time> is one of the following values:

Y

Specifies years

M

Specifies months

W

Specifies weeks

D

Specifies days

H

Specifies hours

m

Specifies minutes

s

Specifies seconds

Forever indicates that all WORM committed files are retained permanently.

Maximum retention period

The maximum retention period for the directory. Files cannot be retained in a WORM state for more than the specified amount of time, even if a user specifies an expiration date that results in a longer retention period.
Times are expressed in the format "<integer> <time>", where <time> is one of the following values:

Y

Specifies years

File retention with SmartLock 311

M

Specifies months

W

Specifies weeks

D

Specifies days

H

Specifies hours

m

Specifies minutes

s

Specifies seconds

Forever indicates that there is no maximum retention period.

Managing files in SmartLock directories
You can commit files in SmartLock directories to a WORM state by removing the read-write privileges of the file. You can also set a specific date at which the retention period of the file expires. Once a file is committed to a WORM state, you can increase the retention period of the file, but you cannot decrease the retention period of the file. You cannot move a file that has been committed to a WORM state, even after the retention period for the file has expired. The retention period expiration date is set by modifying the access time of a file. In a UNIX command line, the access time can be modified through the touch command. Although there is no method of modifying the access time through Windows Explorer, you can modify the access time through Windows Powershell. Accessing a file does not set the retention period expiration date. If you run the touch command on a file in a SmartLock directory without specifying a date on which to release the file from a SmartLock state, and you commit the file, the retention period is automatically set to the default retention period specified for the SmartLock directory. If you have not specified a default retention period for the SmartLock directory, the file is assigned a retention period of zero seconds. It is recommended that you specify a minimum retention period for all SmartLock directories.
Set a retention period through a UNIX command line
You can specify when a file will be released from a WORM state through a UNIX command line. 1. Open a connection to any node in the PowerScale cluster through a UNIX command line and log in. 2. Set the retention period by modifying the access time of the file through the touch command.
The following command sets an expiration date of June 1, 2025 for /ifs/data/test.txt:
touch -at 202506010000 /ifs/data/test.txt
Other touch command input formats are also allowed to modify the access time of files. For example, the command:
touch -a MMDDhhmm[yy] [file]
can modify the access time in some versions of FreeBSD. Other commands that modify the access time have the same effect of modifying the retention period. For example, the command:
cp -p <source> <destination>
copies the contents of source to destination, and then updates the attributes of destination to match the attributes of source, including setting the same access time.
Set a retention period through Windows Powershell
You can specify when a file will be released from a WORM state through Microsoft Windows Powershell. 1. Open the Windows PowerShell command prompt. 2. Optional: Establish a connection to the PowerScale cluster by running the net use command.

312 File retention with SmartLock

The following command establishes a connection to the /ifs directory on cluster.ip.address.com:
net use "\\cluster.ip.address.com\ifs" /user:root password 3. Specify the name of the file you want to set a retention period for by creating an object.
The file must exist in a SmartLock directory. The following command creates an object for /smartlock/file.txt:
$file = Get-Item "\\cluster.ip.address.com\ifs\smartlock\file.txt" 4. Specify the retention period by setting the last access time for the file.
The following command sets an expiration date of July 1, 2015 at 1:00 PM:
$file.LastAccessTime = Get-Date "2015/7/1 1:00 pm"
Commit a file to a WORM state through a UNIX command line
You can commit a file to a WORM state through a UNIX command line. To commit a file to a WORM state, you must remove all write privileges from the file. If a file is already set to a read-only state, you must first add write privileges to the file, and then return the file to a read-only state. 1. Open a connection to the PowerScale cluster through a UNIX command line interface and log in. 2. Remove write privileges from a file by running the chmod command.
The following command removes write privileges of /ifs/data/smartlock/file.txt:
chmod ugo-w /ifs/data/smartlock/file.txt
Commit a file to a WORM state through Windows Explorer
You can commit a file to a WORM state through Microsoft Windows Explorer. This procedure describes how to commit a file through Windows 7. To commit a file to a WORM state, you must apply the read-only setting. If a file is already set to a read-only state, you must first remove the file from a read-only state and then return it to a read-only state. 1. In Windows Explorer, navigate to the file you want to commit to a WORM state. 2. Right-click the folder and then click Properties. 3. In the Properties window, click the General tab. 4. Select the Read-only check box, and then click OK.
Override the retention period for all files in a SmartLock directory
You can override the retention period for files in a SmartLock directory. All files committed to a WORM state within the directory will remain in a WORM state until after the specified day. If files are committed to a WORM state after the retention period is overridden, the override date functions as a minimum retention date. All files committed to a WORM state do not expire until at least the given day, regardless of user specifications. 1. Open a secure shell (SSH) connection to any node in the PowerScale cluster and log in. 2. Override the retention period expiration date for all WORM committed files in a SmartLock directory by running the isi
worm modify command. For example, the following command overrides the retention period expiration date of /ifs/data/SmartLock/ directory1 to June 1, 2024:
isi worm domains modify /ifs/data/SmartLock/directory1 \ --override-date 2024-06-01
File retention with SmartLock 313

Delete a file committed to a WORM state
You can delete a WORM committed file in an enterprise WORM domain before the expiration date through the privileged delete functionality. This procedure is available only through the CLI.  Privileged delete functionality must not be permanently disabled for the SmartLock directory that contains the file.  You must either be the owner of the file and have the ISI_PRIV_IFS_WORM_DELETE and ISI_PRIV_NS_IFS_ACCESS
privileges, or be logged in through the root user account. 1. Open a connection to the PowerScale cluster through a UNIX command line and log in. 2. If privileged delete functionality was disabled for the SmartLock directory, modify the directory by running the isi worm
domains modify command with the --privileged-delete option. The following command enables privileged delete for /ifs/data/SmartLock/directory1:
isi worm domains modify /ifs/data/SmartLock/directory1 \ --privileged-delete true 3. Delete the WORM committed file by running the isi worm files delete command. The following command deletes /ifs/data/SmartLock/directory1/file: isi worm files delete /ifs/data/SmartLock/directory1/file The system displays output similar to the following: Are you sure? (yes, [no]): 4. Type yes and then press ENTER.
View WORM status of a file
You can view the WORM status of an individual file. This procedure is available only through the command-line interface (CLI). 1. Open a connection to the PowerScale cluster through a UNIX command line. 2. View the WORM status of a file by running the isi worm files view command.
For example, the following command displays the WORM status of a file: isi worm files view /ifs/data/SmartLock/directory1/file
The system displays output similar to the following: WORM Domains ID Root Path -----------------------------------65539 /ifs/data/SmartLock/directory1 WORM State: COMMITTED Expires: 2015-06-01T00:00:00
314 File retention with SmartLock

24
Data Removal with Instant Secure Erase (ISE)
This section contains the following topics:
Topics:
· Instant Secure Erase · ISE during drive smartfail · Enable Instant Secure Erase (ISE) · View current ISE configuration · Disable Instant Secure Erase (ISE)
Instant Secure Erase
You can use the Instant Secure Erase (ISE) functionality to remove confidential data out of a drive before returning the equipment.
OneFS now enables you to use the Instant Secure Erase (ISE) feature. This is a Data Security Standard (DSS) feature that is coupled with isi_drive_d. ISE adds the ability to use the cryptographic sanitize command (SANITIZE-cryptographic for SAS, and CRYPTO SCRAMBLE EXT for ATA). This command helps you to jumble-up readable data on supported drives and securely erase confidential data out of a drive . The following drives now have ISE support:  SAS HDD and SSD:
 Seagate Skybolt (300GB/600GB/900GB/1.2TB) - 2.5" HDD  Toshiba PM5 - 2.5" SSD
 3WPD: 400GB/800GB/1.6TB/3.2TB  1WPD: 3.84TB/7.68TB.15.36TB  Samsung PM1645 (RFX) - 2.5" SSD  3WPD: 400GB/800GB/1.6TB/3.2TB  1WPD: 3.84TB/7.68TB.15.36TB  Bear Cove Plus - 2.5" SSD  3WPD: 200GB/400GB/800GB/1.6TB/3.2TB  1WPD: 3.84TB/7.68TB/15.36TB  SATA HDD:  HGST Vela:  Vela-A: 2TB/4TB/6TB  Vela-AP: 8TB  HGST Leo-A (12TB)
ISE during drive smartfail
ISE acts automatically during drive smartfail.
After ISE is enabled, the data on the supported drive is erased upon smartfail. The results are logged into isi_drive_d or isi_drive_history files. Some logs also go to /var/log/messages. ISE failures and errors do not block the normal smartfail process.
Data Removal with Instant Secure Erase (ISE) 315

Enable Instant Secure Erase (ISE)
Enable ISE from the OneFS command line. You can configure ISE with drive subsystem configuration. You must have the ISI_PRIV_DEVICES privilege to enable ISE. This procedure is available only through the OneFS command-line interface (CLI). To enable ISE on a drive, enter the following command: isi devices drive config modify --instant-secureerase yes ISE support is enabled on the cluster.
View current ISE configuration
You can view the ISE configuration details from the OneFS command line. You must have the ISI_PRIV_DEVICES privilege view ISE configuration details. This procedure is available only through the OneFS command-line interface (CLI). To view the current ISE settings on a drive, enter the following command: isi device drive config view An example similar to the following appears.

isi device drive config view Lnn: 1
Instant Secure Erase: Enabled : True

Stall:

Max Total Stall Time : 10800

Max Slow Frequency : 0

Max Error Frequency : 0

Diskscrub Stripes : 128

Clear Time

: 2592000

Scan Size

: 16777216

Scan Max Ecc Delay : 60

Sleep

: 30

Max Slow Access

: 0

Log: Drive Stats : True

Reboot: None Present : True Chassis Loss : True

Automatic Replacement Recognition: Enabled : True

Allow: Format Unknown Firmware : True Format Unknown Model : True

Spin Wait:

Stagger

: 5

Check Drive : 45

Alert: Unknown Model : True Unknown Firmware : True

316 Data Removal with Instant Secure Erase (ISE)

Disable Instant Secure Erase (ISE)
Disable ISE from the OneFS command line. You must have the ISI_PRIV_DEVICES privilege to disable ISE. This procedure is available only through the OneFS command-line interface (CLI). To disable ISE support on the drive, enter the following command: isi devices drive config modify --instantsecure-erase no ISE support is disabled on the cluster.
Data Removal with Instant Secure Erase (ISE) 317

25
Protection domains
This section contains the following topics:
Topics:
· Protection domains overview · Protection domain considerations · Create a protection domain · Delete a protection domain
Protection domains overview
Protection domains are markers that prevent modifications to files and directories. If a domain is applied to a directory, the domain is also applied to all of the files and subdirectories under the directory. You can specify domains manually; however, OneFS usually creates domains automatically. For information about Protection domains, see the OneFS Administration Guide and the OneFS Command Line Interface Administration Guide.
Protection domain considerations
You can manually create protection domains before they are required by OneFS to perform certain actions. However, manually creating protection domains can limit your ability to interact with the data marked by the domain.  Copying a large number of files into a protection domain might take a very long time because each file must be marked
individually as belonging to the protection domain.  You cannot move directories in or out of protection domains. However, you can move a directory contained in a protection
domain to another location within the same protection domain.  Creating a protection domain for a directory that contains a large number of files will take more time than creating a
protection domain for a directory with fewer files. Because of this, it is recommended that you create protection domains for directories while the directories are empty, and then add files to the directory.  If a domain is currently preventing the modification or deletion of a file, you cannot create a protection domain for a directory that contains that file. For example, if /ifs/data/smartlock/file.txt is set to a WORM state by a SmartLock domain, you cannot create a SnapRevert domain for /ifs/data/. NOTE: If you use SyncIQ to create a replication policy for a SmartLock compliance directory, the SyncIQ and SmartLock compliance domains must be configured at the same root directory level. A SmartLock compliance domain cannot be nested inside a SyncIQ domain.
Create a protection domain
You can create replication or snapshot revert domains to facilitate snapshot revert and failover operations. You cannot create a SmartLock domain. OneFS automatically creates a SmartLock domain when you create a SmartLock directory. Run the isi job jobs start command. The following command creates a SyncIQ domain for /ifs/data/source:
isi job jobs start domainmark --root /ifs/data/media \ --dm-type SyncIQ
318 Protection domains

Delete a protection domain
You can delete a replication or snapshot revert domain if you want to move directories out of the domain. You cannot delete a SmartLock domain. OneFS automatically deletes a SmartLock domain when you delete a SmartLock directory. Run the isi job jobs start command. The following command deletes a SyncIQ domain for /ifs/data/source:
isi job jobs start domainmark --root /ifs/data/media \ --dm-type SyncIQ --delete
Protection domains 319

26
Data-at-rest-encryption
Topics:
· Data-at-rest encryption overview · Self-encrypting drives · Data security on self-encrypting drives · Data migrations and upgrades to a cluster with self-encrypting drives · Enabling external key management · Migrate nodes and SEDs to external key management · Chassis and drive states · Smartfailed drive REPLACE state · Smartfailed drive ERASE state
Data-at-rest encryption overview
You can enhance data security on a cluster that contains only self-encrypting-drive nodes, providing data-at-rest encryption (DARE) protection. For more information about data-at-rest encryption, see the OneFS Data-at-Rest Encryption whitepaper.
Self-encrypting drives
Self-encrypting drives store data on a cluster that is specially designed for data-at-rest encryption. Data-at-rest encryption on self-encrypting drives occurs when data that is stored on a device is encrypted to prevent unauthorized data access. All data that is written to the storage device is encrypted when it is stored, and all data that is read from the storage device is decrypted when it is read. The stored data is encrypted with a 256-bit data AES encryption key and decrypted in the same manner. OneFS controls data access by combining the drive authentication key with data-encryption keys.
NOTE: All nodes in a cluster must be of the self-encrypting drive type. Mixed nodes are not supported.
Data security on self-encrypting drives
Self-encrypting drives guarantee data security with the use of encryption keys. Data on self-encrypting drives is protected from unauthorized access by authenticating with encryption keys. Encryption keys can be hosted on the local drive or on an external key management server. Successful authentication with encryption keys unlocks the drive for data access. For specific information about supported external key management servers, refer to the PowerScale Supportability and Compatibility Guide. The data on self-encrypting drives is rendered inaccessible in the following conditions:  When a self-encrypting drive is smartfailed, drive authentication keys are deleted, making the drive unreadable. When you
smartfail and then remove a drive, it is cryptographically erased. NOTE: Smartfailing a drive is the preferred method for removing a self-encrypting drive.
 When a self-encrypting drive loses power, the drive locks to prevent unauthorized access. When power is restored, data is again accessible when the appropriate drive authentication key is provided.
 When a cluster using external key management loses network connection to the external key management server, the drives are locked until the network connection is restored.
320 Data-at-rest-encryption

Data migrations and upgrades to a cluster with selfencrypting drives
You can have data from your existing cluster migrated or upgraded to a cluster of nodes made up of self-encrypting drives (SEDs). As a result, all migrated and future data on the new cluster are encrypted. Upgrading from a cluster with SEDs using on-disk keys to a cluster with SEDs using an external key management server retains the on-disk keys. After the upgrade, you must migrate your drives to the external key management server.
NOTE: Data migration and upgrades to a cluster with SEDs must be performed by PowerScale Professional Services. For more information, contact your Dell Technologies representative.
Enabling external key management
You can enable external key management for self-encrypting drives (SED). Prerequisites:  A OneFS cluster of nodes made up of self-encrypting drives (SEDs)  A KMIP 1.2 compatible external key management server
 Dell Technologies CloudLink Center 6.0  Gemalto KeySecure 8.7 k150v  KeySecure k170v  IBM Secure Key Lifecycle Manager (SKLM) v2.6.0.2; v2.7.0.0; v3.0.0  Thales e-Security keyAuthority 4.0  Certificates using X.509 PKI for TLS mutual authentication  Network connectivity between the OneFS cluster and the external key management server For more information about external key management, see the OneFS Data-at-Rest Encryption whitepaper. To enable external key management, follow these steps: 1. Run the isi keymanager kmip servers create command to enable an external key management server. The following command enables an external key management server with ID of 1, with the hostname of key.management.onefs.com, with a server certificate at /ifs/certificates/onefs_kmip_ca.pem, and a client certificate at /ifs/certificates/onefs_client_bundle.pem.
isi keymanager kmip servers create 1 key.management.onefs.com /ifs/certificates/ onefs_kmip_ca.pem /ifs/certificates/onefs_client_bundle.pem 2. (Optional) To view configuration information about the external key management server, run the following command where <ID> is the id of the server you want to view.
isi keymanager kmip servers view 1
OneFS confirms the connectivity between the OneFS server and the external key management server. Once confirmed, the external key management server is ready for SEDs to be migrated.
Migrate nodes and SEDs to external key management
Once the external key management server is enabled, you can migrate the key authentication for each of your OneFS nodes. External key management enabled. To migrate the key authentication for your nodes and SEDs, follow these steps: 1. To begin migrating the key authentication for the nodes on your cluster, run the following command:
isi keymanager sed migrate server
OneFS begins migrating the key authentication for each node.
Data-at-rest-encryption 321

2. To track migration progress, run the following command: isi keymanager sed status

Chassis and drive states

You can view chassis and drive state details.
In a cluster, the combination of nodes in different degraded states determines whether read requests, write requests, or both work. A cluster can lose write quorum but keep read quorum. OneFS provides details about the status of chassis and drives in your cluster. The following table describes all the possible states that you may encounter in your cluster.

State HEALTHY L3 SMARTFAIL or Smartfail or restripe in progress NOT AVAILABLE
SUSPENDED
NOT IN USE REPLACE STALLED
NEW USED

Description

Interface

Error state

All drives in the node are functioning correctly.

Command-line interface, web administration interface

A solid state drive (SSD) was deployed as level 3 Command-line interface (L3) cache to increase the size of cache memory and improve throughput speeds.

The drive is in the process of being removed safely from the file system, either because of an I/O error or by user request. Nodes or drives in a smartfail or read-only state affect only write quorum.

Command-line interface, web administration interface

A drive is unavailable for a variety of reasons.

Command-line

X

You can click the bay to view detailed information interface, web

about this condition.

administration interface

NOTE: In the web administration interface,

this state includes the ERASE and

SED_ERROR command-line interface states.

This state indicates that drive activity is temporarily suspended and the drive is not in use. The state is manually initiated and does not occur during normal cluster activity.

Command-line interface, web administration interface

A node in an offline state affects both read and write quorum.

Command-line interface, web administration interface

The drive was smartfailed successfully and is ready to be replaced.

Command-line interface only

The drive is stalled and undergoing stall evaluation. Stall evaluation is the process of checking drives that are slow or having other issues. Depending on the outcome of the evaluation, the drive may return to service or be smartfailed. This is a transient state.

Command-line interface only

The drive is new and blank. This is the state that a drive is in when you run the isi dev command with the -a add option.

Command-line interface only

The drive was added and contained a PowerScaleGUID but the drive is not from this node. This drive likely will be formatted into the cluster.

Command-line interface only

322 Data-at-rest-encryption

State PREPARING EMPTY WRONG_TYPE BOOT_DRIVE SED_ERROR
ERASE
INSECURE
UNENCRYPTED

Description

Interface

Error state

The drive is undergoing a format operation. The drive state changes to HEALTHY when the format is successful.

Command-line interface only

No drive is in this bay.

Command-line interface only

The drive type is wrong for this node. For example, a non-SED drive in a SED node, SAS instead of the expected SATA drive type.

Command-line interface only

Unique to the A100 drive, which has boot drives Command-line interface

in its bays.

only

The drive cannot be acknowledged by the OneFS Command-line

X

system.

interface, web

NOTE: In the web administration interface, administration interface

this state is included in Not available.

The drive is ready for removal but needs your attention because the data has not been erased. You can erase the drive manually to guarantee that data is removed.
NOTE: In the web administration interface,
this state is included in Not available.

Command-line interface only

Data on the self-encrypted drive is accessible

Command-line interface X

by unauthorized personnel. Self-encrypting drives only

should never be used for non-encrypted data

purposes.

NOTE: In the web administration interface,

this state is labeled Unencrypted SED.

Data on the self-encrypted drive is accessible

Web administration

X

by unauthorized personnel. Self-encrypting drives interface only

should never be used for non-encrypted data

purposes.

NOTE: In the command-line interface, this

state is labeled INSECURE.

Smartfailed drive REPLACE state

You can see different drive states during the smartfail process.
If you run the isi dev list command while the drive in bay 1 is being smartfailed, the system displays output similar to the following example:

Node 1, [ATTN]

Bay 1

Lnum 11

Bay 2

Lnum 10

Bay 3

Lnum 9

Bay 4

Lnum 8

Bay 5

Lnum 7

Bay 6

Lnum 6

Bay 7

Lnum 5

Bay 8

Lnum 4

Bay 9

Lnum 3

Bay 10

Lnum 2

Bay 11

Lnum 1

Bay 12

Lnum 0

[SMARTFAIL] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY]

SN:Z296M8HK SN:Z296M8N5 SN:Z296LBP4 SN:Z296LCJW SN:Z296M8XB SN:Z295LXT7 SN:Z296M8ZF SN:Z296M8SD SN:Z296M8QA SN:Z296M8Q7 SN:Z296M8SP SN:Z296M8QZ

000093172YE04 00009330EYE03 00009330EYE03 00009327BYE03 00009330KYE03 000093172YE03 00009330KYE03 00009330EYE03 00009330EYE03 00009330EYE03 00009330EYE04 00009330JYE03

/dev/da1 /dev/da2 /dev/da3 /dev/da4 /dev/da5 /dev/da6 /dev/da7 /dev/da8 /dev/da9 /dev/da10 /dev/da11 /dev/da12

If you run the isi dev list command after the smartfail completes successfully, the system displays output similar to the following example, showing the drive state as REPLACE:

Data-at-rest-encryption 323

Node 1, [ATTN]

Bay 1

Lnum 11

Bay 2

Lnum 10

Bay 3

Lnum 9

Bay 4

Lnum 8

Bay 5

Lnum 7

Bay 6

Lnum 6

Bay 7

Lnum 5

Bay 8

Lnum 4

Bay 9

Lnum 3

Bay 10

Lnum 2

Bay 11

Lnum 1

Bay 12

Lnum 0

[REPLACE] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY]

SN:Z296M8HK SN:Z296M8N5 SN:Z296LBP4 SN:Z296LCJW SN:Z296M8XB SN:Z295LXT7 SN:Z296M8ZF SN:Z296M8SD SN:Z296M8QA SN:Z296M8Q7 SN:Z296M8SP SN:Z296M8QZ

000093172YE04 00009330EYE03 00009330EYE03 00009327BYE03 00009330KYE03 000093172YE03 00009330KYE03 00009330EYE03 00009330EYE03 00009330EYE03 00009330EYE04 00009330JYE03

/dev/da1 /dev/da2 /dev/da3 /dev/da4 /dev/da5 /dev/da6 /dev/da7 /dev/da8 /dev/da9 /dev/da10 /dev/da11 /dev/da12

If you run the isi dev list command while the drive in bay 3 is being smartfailed, the system displays output similar to the following example:

Node 1, [ATTN]

Bay 1

Lnum 11

Bay 2

Lnum 10

Bay 3

Lnum 9

Bay 4

Lnum 8

Bay 5

Lnum 7

Bay 6

Lnum 6

Bay 7

Lnum 5

Bay 8

Lnum 4

Bay 9

Lnum 3

Bay 10

Lnum 2

Bay 11

Lnum 1

Bay 12

Lnum 0

[REPLACE] [HEALTHY] [SMARTFAIL] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY] [HEALTHY]

SN:Z296M8HK SN:Z296M8N5 SN:Z296LBP4 SN:Z296LCJW SN:Z296M8XB SN:Z295LXT7 SN:Z296M8ZF SN:Z296M8SD SN:Z296M8QA SN:Z296M8Q7 SN:Z296M8SP SN:Z296M8QZ

000093172YE04 00009330EYE03 00009330EYE03 00009327BYE03 00009330KYE03 000093172YE03 00009330KYE03 00009330EYE03 00009330EYE03 00009330EYE03 00009330EYE04 00009330JYE03

/dev/da1 /dev/da2 N/A /dev/da4 /dev/da5 /dev/da6 /dev/da7 /dev/da8 /dev/da9 /dev/da10 /dev/da11 /dev/da12

Smartfailed drive ERASE state

At the end of a smartfail process, OneFS attempts to delete the authentication key on a drive if it is unable to reset the key.
NOTE:  To securely delete the authentication key on a single drive, smartfail the individual drive.  To securely delete the authentication key on a single node, smartfail the node.  To securely delete the authentication keys on an entire cluster, smartfail each node and run the isi_reformat_node
command on the last node.
Upon running the isi dev list command, the system displays output similar to the following example, showing the drive state as ERASE:

Node 1, [ATTN]

Bay 1

Lnum 11

Bay 2

Lnum 10

Bay 3

Lnum 9

[REPLACE] [HEALTHY] [ERASE]

SN:Z296M8HK SN:Z296M8N5 SN:Z296LBP4

000093172YE04 /dev/da1 00009330EYE03 /dev/da2 00009330EYE03 /dev/da3

Drives showing the ERASE state can be safely retired, reused, or returned.
Any further access to a drive showing the ERASE state requires the authentication key of the drive to be set to its default manufactured security ID (MSID). This action erases the data encryption key (DEK) on the drive and renders any existing data on the drive permanently unreadable.

324 Data-at-rest-encryption

27

SmartQuotas

This section contains the following topics:
Topics:
· SmartQuotas overview · Quota types · Default quota type · Usage accounting and limits · Disk-usage calculations · Quota notifications · Quota notification rules · Quota reports · Creating quotas · Managing quotas

SmartQuotas overview
The SmartQuotas module is an optional quota-management tool that monitors and enforces administrator-defined storage limits. Using accounting and enforcement quota limits, reporting capabilities, and automated notifications, SmartQuotas manages storage use, monitors disk storage, and issues alerts when disk-storage limits are exceeded.
Quotas help you manage storage usage according to criteria that you define. Quotas are used for tracking--and sometimes limiting--the amount of storage that a user, group, or directory consumes. Quotas help ensure that a user or department does not infringe on the storage that is allocated to other users or departments. In some quota implementations, writes beyond the defined space are denied, and in other cases, a simple notification is sent.
NOTE: Do not apply quotas to /ifs/.ifsvar/ or its subdirectories. If you limit the size of the /ifs/.ifsvar/ directory through a quota, and the directory reaches its limit, jobs such as File-System Analytics fail. A quota blocks older job reports from being deleted from the /ifs/.ifsvar/ subdirectories to make room for newer reports.
The SmartQuotas module requires a separate license. For more information about the SmartQuotas module or to activate the module, contact your Dell Technologies sales representative.

Quota types

OneFS uses the concept of quota types as the fundamental organizational unit of storage quotas. Storage quotas comprise a set of resources and an accounting of each resource type for that set. Storage quotas are also called storage domains.
Storage quotas creation requires three identifiers:
 The directory to monitor  Whether snapshots are tracked against the quota limit  The quota type (directory, user, or group)
NOTE: Do not create quotas of any type on the OneFS root (/ifs). A root-level quota may significantly degrade performance.

You can choose a quota type from the following entities:

Directory

A specific directory and its subdirectories. NOTE: You cannot choose a default directory quota type using the Web UI. You can only create a default directory quota using the CLI. However, you can manage default directory quotas using the UI (modify the quota settings, link, and unlink subdirectories). All immediate subdirectories in a

SmartQuotas 325

default directory quota inherit the parent directory quota settings unless otherwise modified. Specific directory quotas that you configure take precedence over a default directory.

User Group

Either a specific user or default user (every user). Specific-user quotas that you configure take precedence over a default user quota.
All members of a specific group or all members of a default group (every group). Any specific-group quotas that you configure take precedence over a default group quota. Associating a group quota with a default group quota creates a linked quota.

You can create multiple quota types on the same directory, but they must be of a different type or have a different snapshot option. You can specify quota types for any directory in OneFS and nest them within each other to create a hierarchy of complex storage-use policies.
Nested storage quotas can overlap. For example, the following quota settings ensure that the finance directory never exceeds 5 TB, while limiting the users in the finance department to 1 TB each:
 Set a 5 TB hard quota on /ifs/data/finance.  Set 1 TB soft quotas on each user in the finance department.

Default quota type
Default quotas automatically create other quotas for users, groups, or immediate subdirectories in a specified directory. A default quota specifies a policy for new entities that match a trigger. The default-user@/ifs/cs becomes specificuser@/ifs/cs for each specific-user that is not otherwise defined.

Default user quota type example

For example, you can create a default-user quota on the /ifs/dir-1 directory, where that directory is owned by the root user. The default-user type automatically creates a domain on that directory for root and adds the usage there:

my-OneFS-1# mkdir /ifs/dir-1

my-OneFS-1# isi quota quotas create /ifs/dir-1 default-user

my-OneFS-1# isi quota quotas ls --path=/ifs/dir-1

Type

AppliesTo Path

Snap Hard Soft Adv Used

---------------------------------------------------------------

default-user DEFAULT

/ifs/dir-1 No

-

-

- 0b

user

root

/ifs/dir-1 No -

-

- 0b

---------------------------------------------------------------

Total: 2

Now add a file owned by a different user (admin):

my-OneFS-1# touch /ifs/dir-1/somefile

my-OneFS-1# chown admin /ifs/dir-1/somefile

my-OneFS-1# isi quota quotas ls --path=/ifs/dir-1

Type

AppliesTo Path

Snap Hard Soft Adv Used

---------------------------------------------------------------

default-user DEFAULT

/ifs/dir-1 No

-

-

- 0b

user

root

/ifs/dir-1 No -

-

- 26b

user

admin

/ifs/dir-1 No -

-

- 0b

---------------------------------------------------------------

Total: 3

In this example, the default-user type created a specific-user type automatically (user:admin) and added the new usage to it. Default-user does not have any usage because it is used only to generate new quotas automatically. Default-user enforcement is copied to a specific-user (user:admin), and the inherited quota is called a linked quota. In this way, each user account gets its own usage accounting.
Defaults can overlap. For example, default-user@/ifs/dir-1 and default-user@/ifs/cs both may be defined. If the default enforcement changes, OneFS storage quotas propagate the changes to the linked quotas asynchronously. Because the update is asynchronous, there is some delay before updates are in effect. If a default type, such as every user or every group, is deleted, OneFS deletes all children that are marked as inherited. As an option, you can delete the default without deleting the children, but it is important to note that this action breaks inheritance on all inherited children.

326 SmartQuotas

Continuing with the example, add another file owned by the root user. Because the root type exists, the new usage is added to it.
Files: 0 Ph: 0.00b W/O Overhead: 0.00b
my-OneFS-1# touch /ifs/dir-1/anotherfile my-OneFS-1# isi quota ls -v --path=/ifs/dir-1 --format=list
Type: default-user AppliesTo: DEFAULT
Path: /ifs/dir-1 Snap: No Thresholds
Hard: Soft: -
Adv: Grace: Usage
Files: 0 Physical: 0.00b FSLogical: 0.00b AppLogical: 0.00b Over: Enforced: No Container: No Linked: ---------------------------------------------------------------------Type: user AppliesTo: root Path: /ifs/dir-1 Snap: No Thresholds Hard: Soft: Adv: Grace: Usage
Files: 2 Physical: 3.50k FSLogical: 55.00b AppLogical: 0.00b Over: Enforced: No Container: No Linked: Yes ----------------------------------------------------------------------Type: user AppliesTo: admin Path: /ifs/dir-1 Snap: No Thresholds Hard: Soft: Adv: Grace: Usage
Files: 1 Physical: 1.50k FSLogical: 0.00b AppLogical: 0.00b Over: Enforced: No Container: No Linked: Yes
The enforcement on default-user is copied to the specific-user when the specific-user allocates within the type, and the new inherited quota type is also a linked quota.
SmartQuotas 327

NOTE: Configuration changes for linked quotas must be made on the parent quota that the linked quota is inheriting from. Changes to the parent quota are propagated to all children. To override configuration from the parent quota, unlink the quota first.

Default directory quota example type
If a default directory quota is configured on the /ifs/parent folder, then any immediate subdirectory created within that folder automatically inherits quota configuration information from the default domain. Only immediate subdirectories inherit default directory quotas; a subdirectory within an immediate subdirectory (a second-level or deeper subdirectory) will not inherit the default directory quota. For example, you create a default-directory quota type on the /ifs/parent directory. Then you create the /ifs/parent/child subdirectory. This subdirectory inherits the default directory quota settings. Then you create the second-level /ifs/parent/child/grandchild subdirectory. This subdirectory does not inherit the default directory quota settings.

Usage accounting and limits

Storage quotas can perform two functions: they monitor storage space through usage accounting and they manage storage space through enforcement limits.
You can configure OneFS quotas by usage type to track or limit storage use. The accounting option, which monitors diskstorage use, is useful for auditing, planning, and billing. Enforcement limits set storage limits for users, groups, or directories.

Track storage consumption without specifying a storage limit

The accounting option tracks but does not limit disk-storage use. Using the accounting option for a quota, you can monitor inode count and physical and logical space resources. Physical space refers to all of the space that is used to store files and directories, including data, metadata, and data protection overhead in the domain. There are two types of logical space:
 File system logical size: Logical size of files as per file system. Sum of all files sizes, excluding file metadata and data protection overhead.
 Application logical size : Logical size of file apparent to the application. Used file capacity from the application point of view, which is usually equal to or less than the file system logical size. However, in the case of a sparse file, application logical size can be greater than file system logical size. Application logical size includes capacity consumption on the cluster as well as data tiered to the cloud.
Storage consumption is tracked using file system logical size by default, which does not include protection overhead. As an example, by using the accounting option, you can do the following:
 Track the amount of disk space that is used by various users or groups to bill each user, group, or directory for only the disk space used.
 Review and analyze reports that help you identify storage usage patterns and define storage policies.
 Plan for capacity and other storage needs.

Specify storage limits

Enforcement limits include all of the functionality of the accounting option, plus the ability to limit disk storage and send notifications. Using enforcement limits, you can logically partition a cluster to control or restrict how much storage that a user, group, or directory can use. For example, you can set hardor soft-capacity limits to ensure that adequate space is always available for key projects and critical applications and to ensure that users of the cluster do not exceed their allotted storage capacity. Optionally, you can deliver real-time email quota notifications to users, group managers, or administrators when they are approaching or have exceeded a quota limit.

NOTE:
If a quota type uses the accounting-only option, enforcement limits cannot be used for that quota.
The actions of an administrator who is logged in as root may push a domain over a quota threshold. For example, changing the protection level or taking a snapshot has the potential to exceed quota parameters. System actions such as repairs also may push a quota domain over the limit. The system provides three types of administrator-defined enforcement thresholds.

328 SmartQuotas

Threshold type Hard
Soft Advisory

Description
Limits disk usage to a size that cannot be exceeded. If an operation, such as a file write, causes a quota target to exceed a hard quota, the following events occur:  The operation fails  An alert is logged to the cluster  A notification is issued to specified recipients.
Writes resume when the usage falls below the threshold.
Allows a limit with a grace period that can be exceeded until the grace period expires. When a soft quota is exceeded, an alert is logged to the cluster and a notification is issued to specified recipients; however, data writes are permitted during the grace period. If the soft threshold is still exceeded when the grace period expires, data writes fail, and a notification is issued to the recipients you have specified. Writes resume when the usage falls below the threshold.
An informational limit that can be exceeded. When an advisory quota threshold is exceeded, an alert is logged to the cluster and a notification is issued to specified recipients. Advisory thresholds do not prevent data writes.

Disk-usage calculations

For each quota that you configure, you can specify whether physical or logical space is included in future disk usage calculations. You can configure quotas to include the following types of physical or logical space:

Type of physical or logical space to include in quota Physical size
File system logical size
Application logical size

Description
Total on-disk space consumed to store files in OneFS. Apart from file data, this counts user metadata (for example, ACLs and user-specified extended attributes) and data protection overhead. Accounts for on-premise capacity consumption with data protection.
Approximation of disk usage on other systems by ignoring protection overhead. The space consumed to store files with 1x protection. Accounts for on-premise capacity consumption without data protection.
Apparent size of file that a user/application observes. How an application sees space available for storage regardless of whether files are cloud-tiered, sparse, deduped, or compressed. It is the offset of the file's last byte (end-of-file). Application logical size is unaffected by the physical location of the data, on or off cluster, and therefore includes CloudPools capacity across multiple locations. Accounts for on-premise and offpremise capacity consumption without data protection.

File data blocks (non-sparse regions) + IFS metadata (ACLs, ExAttr, inode) + data protection overhead
File data blocks (non-sparse regions) + IFS metadata (Acls, ExAttr, inode)
The physical size and file system logical size quota metrics count the number of blocks required to store file data (block-aligned). The application logical size quota metric is not blockaligned. In general, the application logical size is smaller than either the physical size or file system logical size, as the file system logical size counts the full size of the last block of the file, whereas application logical size considers the data present in the last block. However, application logical size will be higher for sparse files.

Most quota configurations do not need to include data protection overhead calculations, and therefore do not need to include physical space, but instead can include logical space (either file system logical size, or application logical size). If you do not include data protection overhead in usage calculations for a quota, future disk usage calculations for the quota include only the

SmartQuotas 329

logical space that is required to store files and directories. Space that is required for the data protection setting of the cluster is not included.
Consider an example user who is restricted by a 40 GB quota that does not include data protection overhead in its disk usage calculations. (The 40 GB quota includes file system logical size or application logical size.) If your cluster is configured with a 2x data protection level and the user writes a 10 GB file to the cluster, that file consumes 20 GB of space but the 10GB for the data protection overhead is not counted in the quota calculation. In this example, the user has reached 25 percent of the 40 GB quota by writing a 10 GB file to the cluster. This method of disk usage calculation is recommended for most quota configurations.
If you include data protection overhead in usage calculations for a quota, future disk usage calculations for the quota include the total amount of space that is required to store files and directories, in addition to any space that is required to accommodate your data protection settings, such as parity or mirroring. For example, consider a user who is restricted by a 40 GB quota that includes data protection overhead in its disk usage calculations. (The 40 GB quota includes physical size.) If your cluster is configured with a 2x data protection level (mirrored) and the user writes a 10 GB file to the cluster, that file actually consumes 20 GB of space: 10 GB for the file and 10 GB for the data protection overhead. In this example, the user has reached 50 percent of the 40 GB quota by writing a 10 GB file to the cluster.
NOTE: Cloned and deduplicated files are treated as ordinary files by quotas. If the quota includes data protection overhead, the data protection overhead for shared data is not included in the usage calculation.
You can configure quotas to include the space that is consumed by snapshots. A single path can have two quotas applied to it: one without snapshot usage, which is the default, and one with snapshot usage. If you include snapshots in the quota, more files are included in the calculation than are in the current directory. The actual disk usage is the sum of the current directory and any snapshots of that directory. You can see which snapshots are included in the calculation by examining the .snapshot directory for the quota path.
NOTE: Only snapshots created after the QuotaScan job finishes are included in the calculation.

Quota notifications

Quota notifications are generated for enforcement quotas, providing users with information when a quota violation occurs. Reminders are sent periodically while the condition persists.
Each notification rule defines the condition that is to be enforced and the action that is to be executed when the condition is true. An enforcement quota can define multiple notification rules. When thresholds are exceeded, automatic email notifications can be sent to specified users, or you can monitor notifications as system alerts or receive emails for these events.
Notifications can be configured globally, to apply to all quota domains, or be configured for specific quota domains.
Enforcement quotas support the following notification settings. A given quota can use only one of these settings.

Limit notification settings Disable quota notifications Use the system settings for quota notifications
Create custom notifications rules

Description
Disables all notifications for the quota.
Uses the global default notification for the specified type of quota.
Enables the creation of advanced, custom notifications that apply to the specific quota. Custom notifications can be configured for any or all of the threshold types (hard, soft, or advisory) for the specified quota.

Quota notification rules
You can write quota notification rules to generate alerts that are triggered by event thresholds. When an event occurs, a notification is triggered according to your notification rule. For example, you can create a notification rule that sends an email when a disk-space allocation threshold is exceeded by a group. You can configure notification rules to trigger an action according to event thresholds (a notification condition). A rule can specify a schedule, such as "every day at 1:00 AM," for executing an action or immediate notification of certain state transitions.

330 SmartQuotas

When an event occurs, a notification trigger may execute one or more actions, such as sending an email or sending a cluster alert to the interface. The following examples demonstrate the types of criteria that you can use to configure notification rules.
 Notify when a threshold is exceeded; at most, once every 5 minutes  Notify when allocation is denied; at most, once an hour  Notify while over threshold, daily at 2 AM  Notify while grace period expired weekly, on Sundays at 2 AM
Notifications are triggered for events grouped by the following categories:

Instant notifications

Includes the write-denied notification, triggered when a hard threshold denies a write, and the thresholdexceeded notification, triggered at the moment a hard, soft, or advisory threshold is exceeded. These are one-time notifications because they represent a discrete event in time.

Ongoing notifications

Generated on a scheduled basis to indicate a persisting condition, such as a hard, soft, or advisory threshold being over a limit or a soft threshold's grace period being expired for a prolonged period.

Quota reports
The OneFS SmartQuotas module provides reporting options that enable administrators to manage cluster resources and analyze usage statistics.
Storage quota reports provide a summarized view of the past or present state of the quota domains. After raw reporting data is collected by OneFS, you can produce data summaries by using a set of filtering parameters and sort types. Storage-quota reports include information about violators, grouped by threshold types. You can generate reports from a historical data sample or from current data. In either case, the reports are views of usage data at a given time. OneFS does not provide reports on data aggregated over time, such as trending reports, but you can use raw data to analyze trends. There is no configuration limit on the number of reports other than the space needed to store them.
OneFS provides the following data-collection and reporting methods:
 Scheduled reports are generated and saved on a regular interval.  Ad hoc reports are generated and saved at the request of the user.  Live reports are generated for immediate and temporary viewing. Scheduled reports are placed by default in the /ifs/.isilon/smartquotas/reports directory, but the location is configurable to any directory under /ifs. Each generated report includes quota domain definition, state, usage, and global configuration settings. By default, ten reports are kept at a time, and older reports are purged. You can create ad hoc reports at any time to view the current state of the storage quotas system. These live reports can be saved manually. Ad hoc reports are saved to a location that is separate from scheduled reports to avoid skewing the timed-report sets.
Creating quotas
You can create two types of storage quotas to monitor data: accounting quotas and enforcement quotas. Storage quota limits and restrictions can apply to specific users, groups, or directories.
The type of quota that you create depends on your goal.
 Enforcement quotas monitor and limit disk usage. You can create enforcement quotas that use any combination of hard limits, soft limits, and advisory limits. NOTE: Enforcement quotas are not recommended for snapshot-tracking quota domains.
 Accounting quotas monitor, but do not limit, disk usage. NOTE: Before using quota data for analysis or other purposes, verify that no QuotaScan jobs are running.

Create an accounting quota
You can create an accounting quota to monitor but not limit disk usage. Optionally, you can include snapshot data, data-protection overhead, or both in the accounting quota. For information about the parameters and options that you can use for this procedure, run the isi quota quotas create --help command.

SmartQuotas 331

Run the isi quota quotas create command to create an accounting quota. The following command creates a quota for the /quota_test_1 directory. The quota sets an advisory threshold that is informative rather than enforced.
isi quota quotas create /ifs/data/quota_test_1 directory \ --advisory-threshold=10M --enforced=false
Before using quota data for analysis or other purposes, verify that no QuotaScan jobs are in progress by running the isi job events list --job-type quotascan command.
Create an enforcement quota
You can create an enforcement quota to monitor and limit disk usage. You can create enforcement quotas that set hard, soft, and advisory limits. For information about the parameters and options that you can use for this procedure, run the isi quota quotas create --help command. Run the isi quota quotas create command and set the --enforced parameter to true. The following command creates a quota for the /quota_test_2 directory. The quota sets an advisory threshold that is enforced when the threshold specified is exceeded.
isi quota quotas create /ifs/data/quota_test_2 directory \ --advisory-threshold=100M --enforced=true
Before using quota data for analysis or other purposes, verify that no QuotaScan jobs are in progress by running the isi job events list --job-type quotascan command.
Managing quotas
You can modify the configured values of a storage quota, and you can enable or disable a quota. You can also create quota limits and restrictions that apply to specific users, groups, or directories. Quota management in OneFS is simplified by the quota search feature, which helps you locate a quota or quotas by using filters. You can unlink quotas that are associated with a parent quota, and configure custom notifications for quotas. You can also disable a quota temporarily and then enable it when needed.
NOTE: Moving quota directories across quota domains is not supported.
Search for quotas
You can search for a quota using a variety of search parameters. For information about the parameters and options that you can use for this procedure, run the isi quota quotas list --help command. Run the isi quota quotas list command to search for quotas. The following command finds all quotas that monitor the /ifs/data/quota_test_1 directory:
isi quota quotas list --path=/ifs/data/quota_test_1
Manage quotas
Quotas help you monitor and analyze the current or historic use of disk storage. You can search for quotas, and modify, delete, and unlink quotas. You must run an initial QuotaScan job for the default or scheduled quotas to prevent displaying incomplete data. Before you modify a quota, consider how the changes will affect the file system and end users.
332 SmartQuotas

For information about the parameters and options that you can use for this procedure, run the isi quota quotas list --help command.
NOTE:  You can edit or delete a quota report only when the quota is not linked to a default quota.  You can unlink a quota only when the quota is linked to a default quota. 1. To monitor and analyze current disk storage, run the isi quota quotas view command. The following example provides current usage information for the root user on the specified directory and includes snapshot data. For more information about the parameters for this command, run the isi quota quotas list --help command.
isi quota quotas list -v --path=/ifs/data/quota_test_2 \ --include-snapshots="yes"
2. To view all information in the quota report, run the isi quota reports list command. To view specific information in a quota report, run the isi quota quotas list --help command to view the filter parameters. The following command lists all information in the quota report:
isi quota reports list -v 3. Optional: To delete a quota, run the isi quota quotas delete command.
The following command deletes the specified directory-type quota. For information about parameters for this command, run the isi quota quotas delete --help command:
isi quota quotas delete /ifs/data/quota_test_2 directory 4. To unlink a quota, run the isi quota quotas modify command.
The following command example unlinks a user quota:
isi quota quotas modify /ifs/dir-1 user --linked=false --user=admin
NOTE: Configuration changes for linked quotas must be made on the parent (default) quota that the linked quota is inheriting from. Changes to the parent quota are propagated to all children. If you want to override configuration from the parent quota, you must first unlink the quota.
Export a quota configuration file
You can export quota settings as a configuration file, which can then be imported for reuse to another PowerScale cluster. You can also store the exported quota configurations in a location outside of the cluster. This task may only be performed from the OneFS command line interface. You can pipe the XML report to a file or directory. The file can then be imported to another cluster. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. At the command prompt, run the following command:
isi_classic quota list --export
The quota configuration file displays as raw XML.
Import a quota configuration file
You can import quota settings in the form of a configuration file that has been exported from another PowerScale cluster. This task can only be performed from the OneFS command-line interface. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Navigate to the location of the exported quota configuration file. 3. At the command prompt, run the following command, where <filename> is the name of an exported configuration file:
isi_classic quota import --from-file=<filename>
SmartQuotas 333

The system parses the file and imports the quota settings from the configuration file. Quota settings that you configured before importing the quota configuration file are retained, and the imported quota settings are effective immediately.
Managing quota notifications
Quota notifications can be enabled or disabled, modified, and deleted. By default, a global quota notification is already configured and applied to all quotas. You can continue to use the global quota notification settings, modify the global notification settings, or disable or set a custom notification for a quota. Enforcement quotas support four types of notifications and reminders:  Threshold exceeded  Over-quota reminder  Grace period expired  Write access denied
If a directory service is used to authenticate users, you can configure notification mappings that control how email addresses are resolved when the cluster sends a quota notification. If necessary, you can remap the domain that is used for quota email notifications and you can remap Active Directory domains, local UNIX domains, or both.
Configure default quota notification settings
You can configure default global quota notification settings that apply to all quotas of a specified threshold type. The custom notification settings that you configure for a quota take precedence over the default global notification settings. For information about the parameters and options that you can use for this procedure, run the isi quota settings notifications modify --help command. Run the isi quota settings notifications modify command. The following command configures the default quota notification settings to generate an alert when the advisory threshold is exceeded:
isi quota settings notifications modify advisory exceeded \ --action-alert=true
Before using quota data for analysis or other purposes, verify that no QuotaScan jobs are in progress by running the isi job events list --job-type quotascan command.
Configure custom quota notification rules
You can configure custom quota notification rules that apply only to a specified quota. An enforcement quota must exist or be in the process of being created. To configure notifications for an existing enforcement quota, follow the procedure to modify a quota and then use these steps. Quota-specific custom notification rules must be configured for that quota. If notification rules are not configured for a quota, the default event notification configuration is used. For information about the parameters and options that you can use for this procedure, run the isi quota quotas notifications create --help command. To configure custom quota notification rules, run the isi quota quotas notifications create command. The following command creates an advisory quota notification rule for the /ifs/data/quota_test_2 directory that uses the --holdoff parameter to specify the length of time to wait before generating a notification:
isi quota quotas notifications create /ifs/data/quota_test_2 \ directory advisory exceeded --holdoff=10W
Before using quota data for analysis or other purposes, verify that no QuotaScan jobs are in progress by running the isi job events list --job-type quotascan command.
334 SmartQuotas

Map an email notification rule for a quota
Email notification mapping rules control how email addresses are resolved when the cluster sends a quota notification.
If required, you can remap the domain that is used for SmartQuotas email notifications. You can remap Active Directory Windows domains, local UNIX domains, or NIS domains.
NOTE: You must be logged in to the web administration interface to perform this task.
1. Click File System > SmartQuotas > Settings. 2. Optional: In the Email Mapping area, click Add a Mapping Rule. 3. From the Type list, select the authentication provider type for this notification rule. The default is Local. To determine
which authentication providers are available on the cluster, browse to Access > Authentication Providers. 4. From the Current domain list, select the domain that you want to use for the mapping rule. If the list is blank, browse to
Cluster Management > Network Configuration, and then specify the domains that you want to use for mapping. 5. In the Map to domain field, type the name of the domain that you want to map email notifications to. This can be the same
domain name that you selected from the Current domain list. To specify multiple domains, separate the domain names with commas. 6. Click Create Rule.

Email quota notification messages

If email notifications for exceeded quotas are enabled, you can customize PowerScale templates for email notifications or create your own.
There are three email notification templates provided with OneFS. The templates are located in /etc/ifs and are described in the following table:

Template quota_email_template.txt quota_email_grace_template.txt
quota_email_test_template.txt

Description
A notification that disk quota has been exceeded.
A notification that disk quota has been exceeded (also includes a parameter to define a grace period in number of days).
A notification test message you can use to verify that a user is receiving email notifications.

If the default email notification templates do not meet your needs, you can configure your own custom email notification templates by using a combination of text and SmartQuotas variables. Whether you choose to create your own templates or modify the existing ones, make sure that the first line of the template file is a Subject: line. For example:
Subject: Disk quota exceeded
If you want to include information about the message sender, include a From: line immediately under the subject line. If you use an email address, include the full domain name for the address. For example:
From: [email protected]
In this example of the quota_email_template.txt file, a From: line is included. Additionally, the default text "Contact your system administrator for details" at the end of the template is changed to name the administrator:
Subject: Disk quota exceeded From: [email protected]
The <ISI_QUOTA_DOMAIN_TYPE> quota on path <ISI_QUOTA_PATH> owned by <ISI_QUOTA_OWNER> has exceeded the <ISI_QUOTA_TYPE> limit. The quota limit is <ISI_QUOTA_THRESHOLD>, and <ISI_QUOTA_USAGE> is currently in use. You may be able to free some disk space by deleting unnecessary files. If your quota includes snapshot usage, your administrator may be able to free some disk space by deleting one or more snapshots. Contact Jane Anderson ([email protected]) for details.

SmartQuotas 335

This is an example of a what a user will see as an emailed notification (note that the SmartQuotas variables are resolved):
Subject: Disk quota exceeded From: [email protected]
The advisory disk quota on directory /ifs/data/sales_tools/collateral owned by jsmith on production-Boris was exceeded.
The quota limit is 10 GB, and 11 GB is in use. You may be able to free some disk space by deleting unnecessary files. If your quota includes snapshot usage, your administrator may be able to free some disk space by deleting one or more snapshots. Contact Jane Anderson ([email protected]) for details.

Custom email notification template variable descriptions

An email template contains text, and, optionally, variables that represent values. You can use any of the SmartQuotas variables in your templates.

Variable ISI_QUOTA_DOMAIN_TYPE
ISI_QUOTA_EXPIRATION ISI_QUOTA_GRACE ISI_QUOTA_HARD_LIMIT
ISI_QUOTA_NODE
ISI_QUOTA_OWNER ISI_QUOTA_PATH ISI_QUOTA_THRESHOLD ISI_QUOTA_TYPE ISI_QUOTA_USAGE

Description Quota type. Valid values are: directory, user, group, default-directory, default-user, default-group
Expiration date of grace period
Grace period, in days
Includes the hard limit information of the quota to make advisory/soft email notifications more informational.
Hostname of the node on which the quota event occurred
Name of quota domain owner
Path of quota domain
Threshold value
Threshold type
Disk space in use

Example default-directory
Fri May 22 14:23:19 PST 2015 5 days You have 30 MB left until you hit the hard quota limit of 50 MB.
someHost-prod-wf-1
jsmith /ifs/data 20 GB Advisory 10.5 GB

Customize email quota notification templates
You can customize PowerScale templates for email notifications. Customizing templates can be performed only from the OneFS command line interface. This procedure assumes that you are using the PowerScale templates, which are located in the /etc/ifs directory.
NOTE: It is recommend that you do not edit the templates directly. Instead, copy them to another directory to edit and deploy them. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Copy one of the default templates to a directory in which you can edit the file and later access it through the OneFS web administration interface. For example:
cp /etc/ifs/quota_email_template.txt /ifs/data/quotanotifiers/ quota_email_template_copy.txt
3. Open the template file in a text editor. For example:
edit /ifs/data/quotanotifiers/quota_email_template_copy.txt
The template appears in the editor.

336 SmartQuotas

4. Edit the template. If you are using or creating a customized template, ensure the template has a Subject: line. 5. Save the changes. Template files must be saved as .txt files. 6. In the web administration interface, browse to File System > SmartQuotas > Settings. 7. In the Notification Rules area, click Add a Notification Rule.
The Create a Notification Rule dialog box appears. 8. From the Rule type list, select the notification rule type that you want to use with the template. 9. In the Rule Settings area, select a notification type option. 10. Depending on the rule type that was selected, a schedule form might appear. Select the scheduling options that you want to
use. 11. In the Message template field, type the path for the message template, or click Browse to locate the template. 12. Optional: Click Create Rule
Managing quota reports
You can configure and schedule reports to help you monitor, track, and analyze storage use on a PowerScale cluster. You can view and schedule reports and customize report settings to track, monitor, and analyze disk storage use. Quota reports are managed by configuring settings that give you control over when reports are scheduled, how they are generated, where and how many are stored, and how they are viewed. The maximum number of scheduled reports that are available for viewing in the web-administration interface can be configured for each report type. When the maximum number of reports are stored, the system deletes the oldest reports to make space for new reports as they are generated.
Create a quota report schedule
You can configure quota report settings to generate the quota report on a specified schedule. Quota report settings determine whether and when scheduled reports are generated, and where and how the reports are stored. If you disable a scheduled report, you can still run unscheduled reports at any time. For information about the parameters and options that you can use for this procedure, run the isi quota reports list --help command. To configure a quota report schedule, run the isi quota settings reports modify command. The following command creates a quota report schedule that runs every two days. For more information about date pattern or other schedule parameters, see man isi-schedule.
isi quota settings reports modify --schedule="Every 2 days"
Reports are generated according to the criteria and can be viewed by running the isi quota reports list command.
Generate a quota report
In addition to scheduled quota reports, you can generate a report to capture usage statistics at any time. Before you can generate a quota report, quotas must exist and no QuotaScan jobs can be running. For information about the parameters and options that you can use for this procedure, run the isi quota reports create --help command. To generate a quota report, run the isi quota reports create command. The following command creates an ad hoc quota report.
isi quota reports create -v
You can view the quota report by running the isi quota reports list -v command.
SmartQuotas 337

Locate a quota report
You can locate quota reports, which are stored as XML files, and use your own tools and transforms to view them. This task can only be performed from the OneFS command-line interface. 1. Open a secure shell (SSH) connection to any node in the cluster and log in. 2. Navigate to the directory where quota reports are stored. The following path is the default quota report location:
/ifs/.isilon/smartquotas/reports NOTE: If quota reports are not in the default directory, you can run the isi quota settings command to find the directory where they are stored.
3. At the command prompt, run the ls command.  To view a list of all quota reports in the directory, run the following command:
ls -a *.xml
 To view a specific quota report in the directory, run the following command:
ls <filename>.xml

Basic quota settings

When you create a storage quota, the following attributes must be defined, at a minimum. When you specify usage limits, additional options are available for defining the quota.

Option

Description

Path

The directory that the quota is on.

Directory Quota

Set storage limits on a directory.

User Quota

Create a quota for every current or future user that stores data in the specified directory.

Group Quota

Create a quota for every current or future group that stores data in the specified directory.

Include snapshots in the storage quota

Count all snapshot data in usage limits. This option cannot be changed after the quota is created.

Enforce the limits for this quota based on physical size

Base quota enforcement on storage usage which includes metadata and data protection.

Enforce the limits for this quota based on file system logical size Base quota enforcement on storage usage which does not include metadata and data protection.

Enforce the limits for this quota based on application logical size Base quota enforcement on storage usage which includes capacity consumption on the cluster as well as data tiered to the cloud.

Track storage without specifying a storage limit

Account for usage only.

Specify storage limits

Set and enforce advisory, soft, or absolute limits.

338 SmartQuotas

Advisory limit quota notification rules settings

You can configure custom quota notification rules for advisory limits for a quota. These settings are available when you select the option to use custom notification rules.

Option Notify owner
Notify other contact(s)

Description

Exceeded

Select to send an email

Yes

notification to the owner of

the entity.

Select to send email

Yes

notifications to other

recipient(s) and type the

recipient's email address(es).

NOTE: You can only enter

one email address before

the cluster is committed.

After the cluster is

committed, you can enter

multiple comma-separated

email addresses. Duplicate

email addresses are

identified and only unique

addresses are stored. You

can enter a maximum

of 1,024 characters of

comma-separated email

addresses.

Message template

Type the path for the custom Yes template, or click Browse to locate the custom template.
Leave the field blank to use the default template.

Create cluster event

Select to generate an event Yes notification for the quota when exceeded.

Minimum notification interval

Specify the time interval

Yes

to wait (in hours, days, or

weeks) before generating the

notification. This minimizes

duplicate notifications.

Schedule

Specify the notification and No alert frequency: daily, weekly, monthly, yearly. Depending on the selection, specify intervals, day to send, time of day, multiple email messages per rule.

Remains exceeded Yes Yes
Yes Yes No Yes

SmartQuotas 339

Soft limit quota notification rules settings

You can configure custom soft limit notification rules for a quota. These settings are available when you select the option to use custom notification rules.

Option

Description

Exceeded

Notify owner
Notify other contact(s)

Select to send an Yes email notification to the owner of the entity.

Select to

Yes

send email

notifications to

other recipient(s)

and type the

recipient's email

address(es).

NOTE: You can

only enter one

email address

before the

cluster is

committed.

After the

cluster is

committed, you

can enter

multiple

comma-

separated email

addresses.

Duplicate email

addresses are

identified and

only unique

addresses are

stored. You can

enter a

maximum of

1,024

characters of

comma-

separated email

addresses.

Message template Type the path

Yes

for the custom

template, or

click Browse to

locate the custom

template.

Leave the field blank to use the default template.

Create cluster event

Select to

Yes

generate an event

Remains exceeded Yes Yes
Yes Yes

Grace period expired
Yes

Write access denied
Yes

Yes

Yes

Yes

Yes

Yes

Yes

340 SmartQuotas

Option

Description

Exceeded

notification for the quota.

Minimum

Specify the time Yes

notification interval interval to wait

(in hours, days,

or weeks) before

generating the

notification. This

minimizes duplicate

notifications.

Schedule

Specify the

No

notification and

alert frequency:

daily, weekly,

monthly, yearly.

Depending on the

selection, specify

intervals, day to

send, time of

day, multiple email

messages per rule.

Remains exceeded No
Yes

Grace period expired

Write access denied

No

Yes

Yes

No

Hard limit quota notification rules settings

You can configure custom quota notification rules for hard limits for a quota. These settings are available when you select the option to use custom notification rules.

Option Notify owner Notify other contact(s)
Message template

Description

Write access denied

Select to send an email

Yes

notification to the owner of

the entity.

Select to send email

Yes

notifications to other

recipient(s) and type the

recipient's email address(es).

NOTE: You can only enter

one email address before

the cluster is committed.

After the cluster is

committed, you can enter

multiple comma-separated

email addresses. Duplicate

email addresses are

identified and only unique

addresses are stored. You

can enter a maximum

of 1,024 characters of

comma-separated email

addresses.

Type the path for the custom Yes template, or click Browse to locate the custom template.
Leave the field blank to use the default template.

Exceeded Yes Yes
Yes

SmartQuotas 341

Option Create cluster event Minimum notification interval
Schedule

Description

Write access denied

Select to generate an event Yes notification for the quota.

Specify the time interval

Yes

to wait (in hours, days, or

weeks) before generating the

notification. This minimizes

duplicate notifications.

Specify the notification and No alert frequency: daily, weekly, monthly, yearly. Depending on the selection, specify intervals, day to send, time of day, multiple email messages per rule.

Exceeded Yes No
Yes

Limit notification settings

Enforcement quotas support the following notification settings for each threshold type. A quota can use only one of these settings.

Notification setting Disable quota notifications Use the system settings for quota notifications
Create custom notification rules

Description
Disable all notifications for the quota.
Use the default notification rules that you configured for the specified threshold type.
Provide settings to create basic custom notifications that apply only to this quota.

Quota report settings

You can configure quota report settings that track disk usage. These settings determine whether and when scheduled reports are generated, and where and how reports are stored. When the maximum number of reports are stored, the system deletes the oldest reports to make space for new reports as they are generated.

Setting Scheduled reporting
Report frequency

Description
Enables or disables the scheduled reporting feature.
 Off. Manually generated on-demand reports can be run at any time.
 On. Reports run automatically according to the schedule that you specify.
Specifies the interval for this report to run: daily, weekly, monthly, or yearly. You can use the following options to further refine the report schedule.
Generate report every. Specify the numeric value for the selected report frequency; for example, every 2 months.
Generate reports on. Select the day or multiple days to generate reports.
Select report day by. Specify date or day of the week to generate the report.
Generate one report per specified by. Set the time of day to generate this report.

342 SmartQuotas

Setting Scheduled report archiving
Manual report archiving

Description
Generate multiple reports per specified day. Set the intervals and times of day to generate the report for that day.
Determines the maximum number of scheduled reports that are available for viewing on the SmartQuotas Reports page.
Limit archive size for scheduled reports to a specified number of reports. Type the integer to specify the maximum number of reports to keep.
Archive Directory. Browse to the directory where you want to store quota reports for archiving.
Determines the maximum number of manually generated (on-demand) reports that are available for viewing on the SmartQuotas Reports page.
Limit archive size for live reports to a specified number of reports. Type the integer to specify the maximum number of reports to keep.
Archive Directory. Browse to the directory where you want to store quota reports for archiving.

SmartQuotas 343

28

Storage Pools

This section contains the following topics:
Topics:
· Storage pools overview · Storage pool functions · Autoprovisioning · Node pools · Virtual hot spare · Spillover · Suggested protection · Protection policies · SSD strategies · Other SSD mirror settings · Global namespace acceleration · L3 cache overview · Tiers · File pool policies · Managing node pools through the command-line interface · Managing L3 cache from the command-line interface · Managing tiers · Creating file pool policies · Managing file pool policies · Monitoring storage pools

Storage pools overview

OneFS organizes different node types into separate node pools. You can configure node pool membership to include node types that you specify. You can also add node types to, and remove node types from, existing node pools. You can organize these node pools into logical tiers of storage. By activating a SmartPools license, you can create file pool policies that store files in these tiers automatically, based on criteria that you specify.
Without an active SmartPools license, OneFS manages all node pools as a single pool of storage. File data and metadata are striped across the entire cluster so that data is protected, secure, and readily accessible. All files belong to the default file pool and are governed by the default file pool policy. In this mode, OneFS provides functions such as autoprovisioning, virtual hot spare (VHS), global namespace acceleration (GNA), L3 cache, and storage tiers.
When you activate a SmartPools license, additional functions become available, including custom file pool policies and spillover management. With a SmartPools license, you can manage your dataset with more granularity to improve the performance of your cluster.
The following table summarizes storage pool functions based on whether a SmartPools license is active.

Function Automatic storage pool provisioning Virtual hot spare SSD strategies L3 cache Tiers

Inactive SmartPools license Yes Yes Yes Yes Yes

Active SmartPools license Yes Yes Yes Yes Yes

344 Storage Pools

Function GNA File pool policies Spillover management

Inactive SmartPools license Yes No No

Active SmartPools license Yes Yes Yes

Storage pool functions

When a cluster is installed, and whenever nodes are added to the cluster, OneFS automatically groups nodes into node pools. Autoprovisioning of nodes into node pools enables OneFS to optimize performance, reliability, and data protection on the cluster.
OneFS uses specific criteria to determine how, or whether, to group nodes into node pools. Nodes are compatible if there are no restrictions between them, or if existing restrictions can be overridden ("soft restrictions"). Nodes are equivalent if no restrictions exist between them. Nodes are automatically provisioned together only if they are equivalent. If nodes are compatible but not equivalent, you can manually move them into the same node pool.
Without an active SmartPools license, OneFS applies a default file pool policy to organize all data into a single file pool. With this policy, OneFS distributes data across the entire cluster so that data is protected and readily accessible. When you activate a SmartPools license, additional functions become available.
OneFS provides the following functions, with or without an active SmartPools license:

Autoprovisioning Automatically groups equivalent nodes into node pools for optimal storage efficiency and protection. At

of node pools

least three equivalent nodes are required for autoprovisioning to work.

Tiers

Groups node pools into logical tiers of storage. If you activate a SmartPools license for this feature, you can create custom file pool policies and direct different file pools to appropriate storage tiers.

Default file pool Governs all file types and can store files anywhere on the cluster. Custom file pool policies, which require

policy

a SmartPools license, take precedence over the default file pool policy.

Requested protection

Specifies a requested protection setting for the default file pool, per node pool, or even on individual files. You can leave the default setting in place, or choose the suggested protection calculated by OneFS for optimal data protection.

Virtual hot spare Reserves a portion of available storage space for data repair in the event of a disk failure.

SSD strategies

Defines the type of data that is stored on SSDs in the cluster. For example, storing metadata for read/ write acceleration.

L3 cache

Specifies that SSDs in nodes are used to increase cache memory and speed up file system performance across larger working file sets.

Global namespace acceleration

Activates global namespace acceleration (GNA), which enables data stored on node pools without SSDs to access SSDs elsewhere in the cluster to store extra metadata mirrors. Extra metadata mirrors accelerate metadata read operations.

When you activate a SmartPools license, OneFS provides the following additional functions:

Custom file pool policies
Storage pool spillover

Creates custom file pool policies to identify different classes of files, and stores these file pools in logical storage tiers. For example, you can define a high-performance tier of node pools and an archival tier of high-capacity node pools. Then, with custom file pool policies, you can identify file pools based on matching criteria, and you can define actions to perform on these pools. For example, one file pool policy can identify all JPEG files older than a year and store them in an archival tier. Another policy can move all files that were created or modified within the last three months to a performance tier.
Enables automated capacity overflow management for storage pools. Spillover defines how to handle write operations when a storage pool is not writable. If spillover is enabled, data is redirected to a specified storage pool. If spillover is disabled, new data writes fail and an error message is sent to the client that is attempting the write operation.

Storage Pools 345

Autoprovisioning
When you add a node to a cluster, OneFS attempts to assign the node to a node pool. This process is known as autoprovisioning, which helps OneFS to provide optimal performance, load balancing, and file system integrity across a cluster. A node is not autoprovisioned to a node pool and made writable until at least three equivalent nodes are added to the cluster. If you add only two equivalent nodes, no data is stored on these nodes until a third equivalent node is added. If a node fails or is removed from the cluster so that fewer than three nodes remain, the node pool becomes underprovisioned. In this case, the two remaining nodes are still writable. If only one node remains, the node is not writable, but remains readable. New nodes added to your cluster are likely to be different from the nodes in existing node pools. Unless you add three new equivalent nodes each time you upgrade your cluster, the new nodes are not autoprovisioned. However, you can add new node types to existing node pools. You can add nodes one at a time to your cluster, and the new nodes can become fully functioning peers within existing node pools. See Compatibilities for more information.
Node pools
A node pool is a group of three or more nodes that forms a single pool of storage. As you add nodes to the cluster, OneFS attempts to automatically provision the new nodes into node pools. To autoprovision a node, OneFS requires that the new node be equivalent to the other nodes in the node pool. If the new node is equivalent, OneFS provisions the new node to the node pool. All nodes in a node pool are peers, and data is distributed across nodes in the pool. Each provisioned node increases the aggregate disk, cache, CPU, and network capacity of the cluster. We strongly recommend that you let OneFS handle node provisioning. However, if you have a special requirement or use case, you can move nodes from an autoprovisioned node pool into a node pool that you define manually. The capability to create manually-defined node pools is available only through the OneFS command-line interface, and should be deployed only after consulting with Dell EMC PowerScale Technical Support. If you try to remove a node from a node pool for the purpose of adding it to a manual node pool, and the result would leave fewer than three nodes in the original node pool, the removal fails. When you remove a node from a manually-defined node pool, OneFS attempts to autoprovision the node back into an equivalent node pool. If you add fewer than three equivalent nodes to your cluster, OneFS cannot autoprovision these nodes. In these cases, you can add new node types to existing node pools. Adding the new node types can enable OneFS to provision the newly added nodes to a compatible node pool. Node pools can use SSDs either as storage or as L3 cache, but not both, with the following exception. PowerScale F200 and F600 nodes are full SSD nodes and can only be used as storage. Enabling L3 cache on F200 and F600 nodes is not an option.
NOTE: Do not use NL nodes in node pools used for NFS or SMB. It is recommended that you use high performance nodes to handle NFS and SMB workloads.
Compatibilities
OneFS cannot autoprovision new nodes if there are compatibility restrictions between the new nodes and the existing nodes in a node pool. To enable new nodes to join a compatible node pool, you can add a new node type to the existing node pool. You modify node pool compatibilities using the command line interface. For example, if your cluster already has an X410 node pool and you add a new X410 node, OneFS would attempt to autoprovision the new node to the X410 node pool. However, if the new X410 node has different RAM than the older X410 nodes, then OneFS cannot autoprovision the new node. To enable the new node to be provisioned into the existing X410 node pool, you must add the new X410 node type to the existing X410 node pool. Use the isi storagepool nodetypes list command to view the node types and their IDs, then isi storagepool nodepools modify <nodepool_name> --add-nodetype-ids=<new_nodetype_id> to add the new node type to the existing node pool. For example, suppose that your x410 nodepool name is x410_nodepool and isi storagepool nodetypes list shows the new node type ID as 12:
isi storagepool nodepools modify x410_nodepool --add-node-type-ids=12
You can also remove a node type from a node pool, for example, if you want to move that node type into its own pool. Using the above example, to remove the X410 nodes with a different RAM capacity and node type ID 12 from the X410 node pool:
isi storagepool nodepools modify x410_nodepool --remove-node-type-ids=12
346 Storage Pools

Some compatibility restrictions apply because there may be performance impacts if you add a particular node type to an existing node pool, or because a node type is incompatible with the nodes in an existing node pool. In that case, OneFS generates a message describing the compatibility issue.
NOTE: SSD compatibilities require that all nodes have L3 cache enabled. If you attempt to move nodes with SSDs into a node pool that does not have L3 cache enabled, the process will fail with an error message. Ensure that the existing node pool has L3 cache enabled and try again. L3 cache can only be enabled on nodes that have fewer than 16 SSDs and at least a 2:1 ratio of HDDs to SSDs. On Generation 6 nodes that support SSD compatibilities, SSD count is ignored. If SSDs are used for storage, then SSD counts must be identical on all nodes in a node pool. If SSD counts are left unbalanced, node pool efficiency and performance will be less than optimal.
For example, the PowerScale F200 and F600 node types are incompatible with each other and with previous node types. You cannot add F200 or F600 nodes to a node pool containing any other node types (for example, S210 or F800 nodes). They are not hybrid nodes, so enabling L3 cache is not an option. They can be used as storage only.
See Compatibility restrictions for more information.
Compatibility restrictions
OneFS enforces pool and node type restrictions for cluster configuration and node compatibility. Restrictions represent the rules governing cluster configuration and node compatibility. They prevent performance degradation of the node types within a node pool.
OneFS supports the following restriction types.  Hard node type restriction: A rule that is not allowed. If you try to modify a cluster configuration in a way that generates a
hard restriction, the modification fails. OneFS presents a message that describes the restrictions that result in denying the modification request.  Soft node type restriction: A rule that is allowed but requires confirmation before being implemented. If you try to modify a cluster configuration in a way that generates a soft restriction, OneFS presents an advisory notice. To continue, you must confirm the modification.
NOTE: If the modification request results in both hard and soft restrictions, OneFS reports only the hard restrictions.
 Pool restriction: A rule that exists for a node pool.  Hard pool restriction: A rule that represents an invalid change to a node group. For example, you cannot modify a manual node pool or modify a pool in a way that results in that pool being underprovisioned.  Soft pool restriction: A rule that represents a change to a node group that requires confirmation. Requesting a modification that results in a soft pool restriction generates an advisory notice. To continue, you must confirm the modification.
Some examples of hard and soft restrictions are as follows.  There are hard node type restrictions for the PowerScale F200 and F600 node types.
 F200 and F600 node types are incompatible with each other and with previous node types.  F200 node types can form node pools only with other compatible F200 nodes.  F600 node types can form node pools only with other compatible F600 nodes.  F200 and F600 nodes are storage only nodes and cannot be used as L3 cache.  F200 nodes must have the same SSD size to be considered compatible. If you try to add F200 or F600 nodes to an incompatible node pool, the modification fails.  There is a soft node type restriction for different RAM capacities. Any difference in RAM is allowed and there are no RAM ranges for compatibilities. If you add a node to a node pool that has different RAM than existing nodes in that pool, OneFS displays an advisory notice. Confirm the operation to add the node to the node pool.
Manual node pools
If the node pools automatically provisioned by OneFS do not meet your needs, you can configure node pools manually. You do this by moving nodes from an existing node pool into the manual node pool.
This capability enables you to store data on specific nodes according to your purposes, and is available only through the OneFS command-line interface.
CAUTION: It is recommended that you enable OneFS to provision nodes automatically. Manually created node pools might not provide the same performance and efficiency as automatically managed node pools, particularly if your changes result in fewer than 20 nodes in the manual node pool.
Storage Pools 347

Virtual hot spare
Virtual hot spare (VHS) settings enable you to reserve disk space to rebuild the data in the event that a drive fails. You can specify both a number of virtual drives to reserve and a percentage of total storage space. For example, if you specify two virtual drives and 15 percent, each node pool reserves virtual drive space equivalent to two drives or 15 percent of their total capacity (whichever is larger). You can reserve space in node pools across the cluster for this purpose by specifying the following options:  At least 1­4 virtual drives.  At least 0­20% of total storage. OneFS calculates the larger number of the two factors to determine the space that is allocated. When configuring VHS settings, be sure to consider the following information:  If you deselect the option to Ignore reserved space when calculating available free space (the default), free-space
calculations include the space reserved for VHS.  If you deselect the option to Deny data writes to reserved disk space (the default), OneFS can use VHS for normal data
writes. We recommend that you leave this option selected, or data repair can be compromised.  If Ignore reserved space when calculating available free space is enabled while Deny data writes to reserved disk
space is disabled, it is possible for the file system to report utilization as more than 100 percent. NOTE: VHS settings affect spillover. If the VHS option Deny data writes to reserved disk space is enabled while Ignore reserved space when calculating available free space is disabled, spillover occurs before the file system reports 100% utilization.
Spillover
When you activate a SmartPools license, you can designate a node pool or tier to receive spillover data when the hardware specified by a file pool policy is full or otherwise not writable. If you do not want data to spill over to a different location because the specified node pool or tier is full or not writable, you can disable this feature.
NOTE: Virtual hot spare reservations affect spillover. If the setting Deny data writes to reserved disk space is enabled, while Ignore reserved space when calculating available free space is disabled, spillover occurs before the file system reports 100% utilization.
Suggested protection
Based on the configuration of your PowerScale cluster, OneFS automatically calculates the amount of protection that is recommended to maintain Dell Technologies PowerScale stringent data protection requirements. OneFS includes a function to calculate the suggested protection for data to maintain a theoretical mean-time to data loss (MTTDL) of 5000 years. Suggested protection provides the optimal balance between data protection and storage efficiency on your cluster. By configuring file pool policies, you can specify one of multiple requested protection settings for a single file, for subsets of files called file pools, or for all files on the cluster. It is recommended that you do not specify a setting below suggested protection. OneFSperiodically checks the protection level on the cluster, and alerts you if data falls below the recommended protection.
Protection policies
OneFS provides a number of protection policies to choose from when protecting a file or specifying a file pool policy. The more nodes you have in your cluster, up to 20 nodes, the more efficiently OneFS can store and protect data, and the higher levels of requested protection the operating system can achieve. Depending on the configuration of your cluster and how much data is stored, OneFS might not be able to achieve the level of protection that you request. For example, if you
348 Storage Pools

have a three-node cluster that is approaching capacity, and you request +2n protection, OneFS might not be able to deliver the requested protection.
The following table describes the available protection policies in OneFS.

Protection policy +1n +2d:1n +2n +3d:1n +3d:1n1d +3n +4d:1n +4d:2n +4n Mirrors: 2x 3x 4x 5x 6x 7x 8x

Summary Tolerate the failure of 1 drive or the failure of 1 node
Tolerate the failure of 2 drives or the failure of 1 node
Tolerate the failure of 2 drives or the failure of 2 nodes
Tolerate the failure of 3 drives or the failure of 1 node
Tolerate the failure of 3 drives or the failure of 1 node and 1 drive
Tolerate the failure of 3 drives or the failure of 3 nodes
Tolerate the failure of 4 drives or the failure of 1 node
Tolerate the failure of 4 drives or the failure of 2 nodes
Tolerate the failure of 4 drives or the failure of 4 nodes
Duplicates, or mirrors, data over the specified number of nodes. For example, 2x results in two copies of each data block.
NOTE: Mirrors can use more data than the other protection policies, but might be an effective way to protect files that are written non-sequentially or to provide faster access to important files.

SSD strategies

OneFS clusters can contain nodes that include solid-state drives (SSD). OneFS autoprovisions nodes with SSDs into one or more node pools. The SSD strategy defined in the default file pool policy determines how SSDs are used within the cluster, and can be set to increase performance across a wide range of workflows. SSD strategies apply only to SSD storage.
You can configure file pool policies to apply specific SSD strategies as needed. When you select SSD options during the creation of a file pool policy, you can identify the files in the OneFS cluster that require faster or slower performance. When the SmartPools job runs, OneFS uses file pool policies to move this data to the appropriate storage pool and drive type.
The following SSD strategy options that you can set in a file pool policy are listed in order of slowest to fastest choices:

Avoid SSDs

Writes all associated file data and metadata to HDDs only. CAUTION: Use this option to free SSD space only after consulting with PowerScale Technical Support. Using this strategy can negatively affect performance.

Metadata read acceleration
Metadata read/write acceleration Data on SSDs

Writes both file data and metadata to HDDs. This is the default setting. An extra mirror of the file metadata is written to SSDs, if available. The extra SSD mirror is included in the number of mirrors, if any, required to satisfy the requested protection.
Writes file data to HDDs and metadata to SSDs, when available. This strategy accelerates metadata writes in addition to reads but requires about four to five times more SSD storage than the Metadata read acceleration setting. Enabling GNA does not affect read/write acceleration.
Uses SSD node pools for both data and metadata, regardless of whether global namespace acceleration is enabled. This SSD strategy does not result in the creation of additional mirrors beyond the normal requested protection but requires significantly increased storage requirements compared with the other SSD strategy options.

Note the following considerations for setting and applying SSD strategies.

Storage Pools 349

 To use an SSD strategy that stores metadata and/or data on SSDs, you must have SSD storage in the node pool or tier, otherwise the strategy is ignored.
 If you specify an SSD strategy but there is no storage of the type that you specified, the strategy is ignored.
 If you specify an SSD strategy that stores metadata and/or data on SSDs but the SSD storage is full, OneFS attempts to spill data to HDD. If HDD storage is full, OneFS raises an out of space error.

Other SSD mirror settings
OneFS creates multiple mirrors for file system structures and, by default, stores one mirror for each of these structures on SSD. You can specify that all mirrors for these file system structures be stored on SSD.
OneFS creates mirrors for the following file system structures:
 system B-tree  system delta  QAB (quota accounting block) For each structure, OneFS creates multiple mirrors across the file system and stores at least one mirror on an SSD. Because SSDs provide faster I/O than HDDs, OneFS can more quickly locate and access a mirror for each structure when needed.
Alternatively, you can specify that all mirrors created for those file system structures are stored on SSDs.
NOTE: The capability to change the default mirror setting for system B-tree, system delta, and QAB is available only in the OneFS CLI, specifically in the isi storagepool settings command.

Global namespace acceleration
Global namespace acceleration (GNA) enables data on node pools without SSDs to have additional metadata mirrors on SSDs elsewhere in the cluster. Metadata mirrors on SSDs can improve file system performance by accelerating metadata read operations.
You can enable GNA only if 20 percent or more of the nodes in the cluster contain at least one SSD and 1.5 percent or more of total cluster storage is SSD-based. For best results, before enabling GNA, make sure that at least 2.0 percent of total cluster storage is SSD-based.
Even when enabled, GNA becomes inactive if the ratio of SSDs to HDDs falls below the 1.5 percent threshold, or if the percentage of nodes containing at least one SSD falls below 20 percent. GNA is reactivated when those requirements are met again. While GNA is inactive in such cases, existing SSD mirrors are readable, but newly written metadata does not get the extra SSD mirror.
NOTE: Node pools with L3 cache enabled are effectively invisible for GNA purposes. All ratio calculations for GNA are done exclusively for node pools without L3 cache enabled. So, for example, if you have six node pools on your cluster, and three of them have L3 cache enabled, GNA is applied only to the three remaining node pools without L3 cache enabled. On node pools with L3 cache enabled, metadata does not need an additional GNA mirror, because metadata access is already accelerated by L3 cache.

L3 cache overview

You can configure nodes with solid-state drives (SSDs) to increase cache memory and speed up file system performance across larger working file sets.
OneFS caches file data and metadata at multiple levels. The following table describes the types of file system cache available on a PowerScale cluster.

Name L1 cache
L2 cache

Type RAM
RAM

Profile Volatile
Volatile

Scope Local node
Global

Description
Also known as front-end cache, holds copies of file system metadata and data requested by the front-end network through NFS, SMB, HTTP, and so on.
Also known as back-end cache, holds copies of file system metadata and data on the node that owns the data.

350 Storage Pools

Name SmartCache
L3 cache

Type Variable
SSD

Profile Nonvolatile
Nonvolatile

Scope Local node
Global

Description
Holds any pending changes to front-end files waiting to be written to storage. This type of cache protects write-back data through a combination of RAM and stable storage.
Holds file data and metadata released from L2 cache, effectively increasing L2 cache capacity.

NOTE: L3 cache can only be enabled on nodes that have fewer than 16 SSDs and at least a 2:1 ratio of HDDs to SSDs.
OneFS caches frequently accessed file and metadata in available random access memory (RAM). Caching enables OneFS to optimize data protection and file system performance. When RAM cache reaches capacity, OneFS normally discards the oldest cached data and processes new data requests by accessing the storage drives. This cycle is repeated each time RAM cache fills up.
You can deploy SSDs as L3 cache to reduce the cache cycling issue and further improve file system performance. L3 cache adds significantly to the available cache memory and provides faster access to data than hard disk drives (HDD).
As L2 cache reaches capacity, OneFS evaluates data to be released and, depending on your workflow, moves the data to L3 cache. In this way, much more of the most frequently accessed data is held in cache, and overall file system performance is improved.
For example, consider a cluster with 128GB of RAM. Typically the amount of RAM available for cache fluctuates, depending on other active processes. If 50 percent of RAM is available for cache, the cache size would be approximately 64GB. If this same cluster had three nodes, each with two 200GB SSDs, the amount of L3 cache would be 1.2TB, approximately 18 times the amount of available L2 cache.
L3 cache is enabled by default for new node pools. A node pool is a collection of nodes that are all of the same equivalence class, or for which compatibilities have been created. L3 cache applies only to the nodes where the SSDs reside. For the HD400 node, which is primarily for archival purposes, L3 cache is on by default and cannot be turned off. On the HD400, L3 cache is used only for metadata.
If you enable L3 cache on a node pool, OneFS manages all cache levels to provide optimal data protection, availability, and performance. In addition, in case of a power failure, the data on L3 cache is retained and still available after power is restored.
NOTE: Although some benefit from L3 cache is found in workflows with streaming and concurrent file access, L3 cache provides the most benefit in workflows that involve random file access.

Migration to L3 cache
L3 cache is enabled by default on new nodes. You can enable L3 cache as the default for all new node pools or manually for a specific node pool, either through the command line or from the web administration interface. L3 cache can be enabled only on node pools with nodes that contain SSDs. When you enable L3 cache, OneFS migrates data that is stored on the SSDs to HDD storage disks and then begins using the SSDs as cache. When you enable L3 cache, OneFS displays the following message:
WARNING: Changes to L3 cache configuration can have a long completion time. If this is a concern, please contact PowerScale Technical Support for more information.
You must confirm whether OneFS should proceed with the migration. After you confirm the migration, OneFS handles the migration as a background process, and, depending on the amount of data stored on your SSDs, the process of migrating data from the SSDs to the HDDs might take a long time.
NOTE: You can continue to administer your cluster while the data is being migrated.

Storage Pools 351

L3 cache on archive-class node pools

Some PowerScale nodes are high capacity units designed primarily for archival work flows, which involve a higher percentage of data writes compared to data reads. On node pools made up of these archive-class nodes, SSDs are deployed for L3 cache, which significantly improves the speed of file system traversal activities such as directory lookup.
L3 cache with metadata only stored in SSDs provides the best performance for archiving data on these high-capacity nodes. L3 cache is on by default, as described in the following table.

Nodes HD-series
Generation 6 A-series

Comments
For all node pools made up of HD-series nodes, L3 cache stores metadata only in SSDs and cannot be disabled.
For all node pools made up of Generation 6 A-series nodes, L3 cache stores metadata only in SSDs and cannot be disabled.

Tiers
A tier is a user-defined collection of node pools that you can specify as a storage pool for files. A node pool can belong to only one tier.
You can create tiers to assign your data to any of the node pools in the tier. For example, you can assign a collection of node pools to a tier specifically created to store data that requires high availability and fast access. In a three-tier system, this classification may be Tier 1. You can classify data that is used less frequently or that is accessed by fewer users as Tier-2 data. Tier 3 usually comprises data that is seldom used and can be archived for historical or regulatory purposes.
File pool policies
File pool policies define sets of files--file pools--and where and how they are stored on your cluster. You can configure multiple file pool policies with filtering rules that identify specific file pools and the requested protection and I/O optimization settings for these file pools. Creating custom file pool policies requires an active SmartPools license.
The initial installation of OneFS places all files into a single file pool, which is subject to the default file pool policy. Without an active SmartPools license, you can configure only the default file pool policy, which controls all files and stores them anywhere on the cluster.
With an active SmartPools license, OneFS augments basic storage functions by enabling you to create custom file pool policies that identify, protect, and control multiple file pools. With a custom file pool policy, for example, you can define and store a file pool on a specific node pool or tier for fast access or archival purposes.
When you create a file pool policy, flexible filtering criteria enable you to specify time-based attributes for the dates that files were last accessed, modified, or created. You can also define relative time attributes, such as 30 days before the current date. Other filtering criteria include file type, name, size, and custom attributes. The following examples demonstrate a few ways you can configure file pool policies:
 A file pool policy to set stronger protection on a specific set of important files.  A file pool policy to store frequently accessed files in a node pool that provides the fastest reads or read/writes.  A file pool policy to evaluate the last time files were accessed, so that older files are stored in a node pool best suited for
regulatory archival purposes.
When the SmartPools job runs, typically once a day, it processes file pool policies in priority order. You can edit, reorder, or remove custom file pool policies at any time. The default file pool policy, however, is always last in priority order. Although you can edit the default file pool policy, you cannot reorder or remove it. When custom file pool policies are in place, the settings in the default file pool policy apply only to files that are not covered by another file pool policy.
When a new file is created, OneFS chooses a storage pool based on the default file pool policy, or, if it exists, a higher-priority custom file pool policy that matches the file. If a new file was originally matched by the default file pool policy, and you later create a custom file pool policy that matches the file, the file will be controlled by the new custom policy. As a result, the file could be placed in a different storage pool the next time the SmartPools job runs.

352 Storage Pools

FilePolicy job
You can use the FilePolicy job to apply file pool policies. The FilePolicy job supplements the SmartPools job by scanning the file system index that the File System Analytics (FSA) job uses. You can use this job if you are already using snapshots (or FSA) and file pool policies to manage data on the cluster. The FilePolicy job is an efficient way to keep inactive data away from the fastest tiers. The scan is done on the index, which does not require many locks. In this way, you can vastly reduce the number of times a file is visited before it is tiered down. You need to keep down-tiering data in ways they already have, such as file pool policies that move data based on a fixed age. Adjust the data based on the fullness of their tiers. To ensure that the cluster is correctly laid out and adequately protected, run the SmartPools job. You may use the SmartPools job after modifying the cluster, such as adding or removing nodes. You can also use the job for modifying the SmartPools settings (such as default protection settings), and if a node is down. To use this feature, you must schedule the FilePolicy job daily and continue running the SmartPools job at a lower frequency. You can run the SmartPools job after events that may affect node pool membership. You can use the following options when running the FilePolicy job:  --directory-only: This option helps you to process directories and is done to redirect new file ingest.  --policy-only: This option helps you to set policies. Make sure not to restripe.  --ingest: This option helps you to is a use -directory-only and -policy-only in combination.  --nop: This option helps you to calculate and report the work that you have done.
Managing node pools through the command-line interface
You can manage node pools through the command-line interface. You can work with node pools that are automatically provisioned, create and manage manual node pools, and create SSD compatibilities for new nodes. A node pool, whether automatically provisioned or manually created, must contain a minimum of three compatible nodes. Nodes are provisioned when at least three compatible nodes are added to the cluster. If you add only two compatible nodes to a cluster, you cannot store data on the nodes until you add a third node. OneFS provides SSD compatibilities, which you can create to enable compatible nodes to become members of an existing node pool. After you create a compatibility, any time a new compatible node is added to the cluster, OneFS provisions the new node to the appropriate node pool. OneFS supports SSD size and count compatibilities on Generation 5 nodes. OneFS supports only SSD size compatibilities on Generation 6 nodes. You can create a node pool manually only by selecting a subset of compatible nodes from a single autoprovisioned node pool. You cannot create a manual node pool that takes some nodes from one node pool and some nodes from another. You must have the ISI_PRIV_SMARTPOOLS or greater administrative privilege to manage node pools.
Delete an SSD compatibility
You can delete an SSD compatibility. If you do this, any nodes that are part of a node pool because of this compatibility are removed from the node pool.
CAUTION: Deleting an SSD compatibility could result in unintended consequences. For example, if you delete an SSD compatibility, and fewer than three compatible nodes are removed from a node pool as a result, these nodes are removed from your cluster's available pool of storage. The next time the SmartPools job runs, data on those nodes is restriped elsewhere on the cluster, which could be a time-consuming process. If three or more compatible nodes are removed from the node pool, these nodes form their own node pool, but data is restriped. Any file pool policy pointing to the original node pool points instead to the node pool's tier, if one existed, or, otherwise, to a new tier created by OneFS.
1. Run the isi storagepool compatibilities ssd active delete command.
Storage Pools 353

You can run the isi storagepool compatibilities ssd active list command to determine the ID number of active compatibilities. The following command deletes an SSD compatibility with an ID number of 1:
isi storagepool compatibilities ssd active delete 1
The following command deletes an ssd compatibility between two different PowerScale models:
isi storagepool compatibilities ssd active delete 1 --id-2 2
Before executing your command, OneFS provides a summary of the results and requires you to confirm the operation. 2. To proceed, type yes, and then press ENTER. To cancel, type no, and then press ENTER. If you proceed with the operation, OneFS splits any merged node pools, or unprovisions any previously compatible nodes fewer than three in number.
Create a node pool manually
You can create node pools manually if autoprovisioning does not meet your requirements. When you add new nodes to your cluster, OneFS places these nodes into node pools. This process is called autoprovisioning. For some workflows, you might prefer to create node pools manually. A manually created node pool must have at least three nodes, identified by the logical node numbers (LNNs).
CAUTION: It is recommended that you enable OneFS to provision nodes automatically. Manually created node pools might not provide the same performance and efficiency as automatically managed node pools, particularly if your changes result in fewer than 20 nodes in the manual node pool. Run the isi storagepool nodepools create command. You can specify the nodes to be added to a nodepool by a comma-delimited list of LNNs (for example, --lnns 1,2,5) or by using ranges (for example, --lnns 5-8). The following command creates a node pool by specifying the LNNs of three nodes to be included.
isi storagepool nodepools create PROJECT-1 --lnns 1,2,5
Add a node to a manually managed node pool
You can add a node to a manually managed node pool. If you specify a node that is already part of another node pool, OneFS removes the node from the original node pool and adds it to the manually managed node pool. Run the isi storagepool nodepools modify command. The following command adds nodes with the LNNs (logical node numbers) of 3, 4, and 10 to an existing node pool:
isi storagepool nodepools modify PROJECT-1 --lnns 3-4, 10
Add a new node type to an existing node pool
You can add a new node type to an existing node pool. OneFS cannot autoprovision new nodes when there are compatibility restrictions between the new nodes and the nodes in an existing node pool. To allow a new compatible node to join an existing, compatible node pool, add the new node's type ID to the existing compatible node pool. 1. Run the isi storagepool nodetypes list command to view the node types and their IDs.
Note the ID of the node type to be added to the existing compatible node pool. 2. Run the isi storagepool nodepools modify <nodepool_name> --add-node-type-
ids=<new_nodetype_id> The new node type is added to the existing compatible node pool.
354 Storage Pools

For example, suppose that your cluster has an X410 node pool named x410_nodepool and that you want to add a new X410 node to that node pool. However, the new node has different RAM than the older nodes. Running the isi storagepool nodetypes list command shows that the new X410 node type ID is 12. You can then add the new node type to the existing node pool as follows:
isi storagepool nodepools modify x410_nodepool --add-node-type-ids=12
Remove a node type from a node pool
You can remove a node type from a node pool. You can remove a node type from a node pool, for example, if you want to move that node type into its own pool. 1. Run the isi storagepool nodetypes list command to view the node types and their IDs.
Note the ID of the node type to remove from the existing node pool. 2. Run the isi storagepool nodepools modify <nodepool_name> --remove-node-type-
ids=<nodetype_id> The node type is removed from the node pool. Suppose that you have an X410 node pool named x410_nodepool that includes X410 nodes with different RAM capacities, and you now want to remove the X410's with greater RAM capacity from that pool. Running the isi storagepool nodetypes list command shows that the node type ID of the X410 nodes with greater RAM capacity is 12. You remove the node type from the node pool as follows:
isi storagepool nodepools modify x410_nodepool --remove-node-type-ids=12
Change the name or protection policy of a node pool
You can change the name or protection policy of a node pool. Run the isi storagepool nodepools modify command. The following command changes the name and protection policy of a node pool:
isi storagepool nodepools modify PROJECT-1 --set-name PROJECT-A \ --protection-policy +2:1
Remove a node from a manually managed node pool
You can remove a node from a manually managed node pool. If you attempt to remove nodes from either a manually managed or automatically managed node pool so that the removal leaves only one or two nodes in the pool, the removal fails. You can, however, move all nodes from an autoprovisioned node pool into one that is manually managed. When you remove a node from the manually managed node pool, OneFS autoprovisions the node into another node pool with compatible nodes. Run the isi storagepool nodepools modify command. The following command removes two nodes, identified by its LNNs (logical node numbers) from a node pool.
isi storagepool nodepools modify ARCHIVE_1 --remove-lnns 3,6 LNN values can be specified as a range, for example, --lnns=1-3, or in a comma-separated list, for example, -lnns=1,2,5,9.
Storage Pools 355

View node pool settings

You can view details for a node pool.
View node pool settings by running the isi storagepool nodepools view <name> command, where <name> is the name of the node pool you want to view.
The following sample shows output for a PowerScale F600 node pool named v600_nodepool. Note that the L3 Enabled field is always No for F600 and F200 nodes.

#isi storagepool nodepools view v600_50gb_8gb

ID:

1912

Name:

v600_nodepool

Nodes:

1, 2, 3, 4, 5, 6

Node Type IDs:

4

Protection Policy: +2d:1n

Manual:

No

L3 Enabled:

No

L3 Migration Status: storage

Tier:

-

Usage

Avail Bytes:

141.05G

Avail SSD Bytes:

0.00

Balanced:

No

Free Bytes:

217.90G

Free SSD Bytes:

0.00

Total Bytes:

223.84G

Total SSD Bytes:

0.00

Virtual Hot Spare Bytes: 76.84G

Modify default storage pool settings
You can modify default storage pool settings for requested protection, I/O optimization, global namespace acceleration, virtual hot spare, and spillover.
Run the isi storagepool settings modify command. The following command specifies automatic file protection and I/O optimization, disables global namespace acceleration, specifies a percentage of storage for a virtual hot spare, enables L3 cache for node pools with SSDs, and changes the mirror settings for the QAB, system B-tree, and system delta file system structures:
isi storagepool settings modify --automatically-manage-protection files_at_default --automatically-manage-io-optimization files_at_default --global-namespace-acceleration-enabled no --virtual-hot-spare-limit-percent 5 --ssd-l3-cache-default-enabled yes --ssd-qab-mirrors all --ssd-system-btree-mirrors all --ssd-system-delta-mirrors all
OneFS applies your changes to any files managed by the default file pool policy the next time the SmartPools job runs.

SmartPools settings

SmartPools settings include directory protection, global namespace acceleration, L3 cache, virtual hot spare, spillover, requested protection management, and I/O optimization management.

Settings in Web Admin
Increase directory protection to a higher level than its contents

Settings in CLI

Description

--protect-directories-onelevel-higher

Increases the amount of protection for directories at a higher level than the directories and files that they contain, so

Notes
This setting should be enabled (the default).

356 Storage Pools

Settings in Web Admin Settings in CLI

Enable global

--global-namespace-

namespace acceleration acceleration-enabled

Use SSDs as L3 Cache --ssd-l3-cache-defaultby default for new node enabled pools

Description
that data that is not lost can still be accessed.
When device failures result in data loss (for example, three drives or two nodes in a +2:1 policy), enabling this setting ensures that intact data is still accessible.

Notes
When this setting is disabled, the directory that contains a file pool is protected according to your protection-level settings, but the devices used to store the directory and the file may not be the same. There is potential to lose nodes with file data intact but not be able to access the data because those nodes contained the directory.
As an example, consider a cluster that has a +2 default file pool protection setting and no additional file pool policies. OneFS directories are always mirrored, so they are stored at 3x, which is the mirrored equivalent of the +2 default.
This configuration can sustain a failure of two nodes before data loss or inaccessibility. If this setting is enabled, all directories are protected at 4x. If the cluster experiences three node failures, although individual files may be inaccessible, the directory tree is available and provides access to files that are still accessible.
In addition, if another file pool policy protects some files at a higher level, these too are accessible in the event of a three-node failure.

Specifies whether to allow perfile metadata to use SSDs in the node pool.
 When disabled, restricts perfile metadata to the storage pool policy of the file, except in the case of spillover. This is the default setting.
 When enabled, allows per-file metadata to use the SSDs in any node pool.

This setting is available only if 20 percent or more of the nodes in the cluster contain SSDs and at least 1.5 percent of the total cluster storage is SSD-based.
If nodes are added to or removed from a cluster, and the SSD thresholds are no longer satisfied, GNA becomes inactive. GNA remains enabled, so that when the SSD thresholds are met again, GNA is reactivated.

NOTE: Node pools with L3 cache enabled are effectively invisible for GNA purposes. All ratio calculations for GNA are done exclusively for node pools without L3 cache enabled.

For node pools that include solid-state drives, deploy the SSDs as L3 cache. L3 cache extends L2 cache and speeds up

L3 cache is enabled by default on new node pools. When you enable L3 cache on an existing node pool, OneFS performs a

Storage Pools 357

Settings in Web Admin Virtual Hot Spare

Settings in CLI
--virtual-hot-spare-denywrites --virtual-hot-spare-hidespare --virtual-hot-spare-limitdrives --virtual-hot-spare-limitpercent

Description
file system performance across larger working file sets.

Notes
migration, moving any existing data on the SSDs to other locations on the cluster.
OneFS manages all cache levels to provide optimal data protection, availability, and performance. In case of a power failure, the data on L3 cache is retained and still available after power is restored.

Reserves a minimum amount of space in the node pool that can be used for data repair in the event of a drive failure.
To reserve disk space for use as a virtual hot spare, select from the following options:
 Ignore reserved disk space when calculating available free space. Subtracts the space reserved for the virtual hot spare when calculating available free space.

If you configure both the minimum number of virtual drives and a minimum percentage of total disk space when you configure reserved VHS space, the enforced minimum value satisfies both requirements.
If this setting is enabled and Deny new data writes is disabled, it is possible for the file system utilization to be reported at more than 100%.

 Deny data writes to reserved disk space. Prevents write operations from using reserved disk space.

 VHS Space Reserved. You can reserve a minimum number of virtual drives (1-4), as well as a minimum percentage of total disk space (0-20%).

Enable global spillover
Spillover Data Target Manage protection settings

--spillover-enabled
--spillover-target --spillover-anywhere --automatically-manageprotection

Specifies how to handle write operations to a node pool that is not writable.

 When enabled, redirects write operations from a node pool that is not writable either to another node pool or anywhere on the cluster (the default).
 When disabled, returns a disk space error for write operations to a node pool that is not writable.

Specifies another storage pool to target when a storage pool is not writable.

When spillover is enabled, but it is important that data writes do not fail, select anywhere for the Spillover Data Target setting, even if file pool policies send data to specific pools.

When this setting is enabled, SmartPools manages requested protection levels automatically.

When Apply to files with
manually-managed protection is enabled, overwrites any protection settings that were configured through File System

358 Storage Pools

Settings in Web Admin Manage I/O optimization settings
None None
None

Settings in CLI
--automatically-manageio-optimization
--ssd-qab-mirrors
--ssd-system-btreemirrors
--ssd-system-deltamirrors

Description

Notes

Explorer or the command-line interface.

When enabled, uses SmartPools technology to manage I/O optimization.

When Apply to files
with manually-managed I/O optimization settings is enabled, overwrites any I/O optimization settings that were configured through File System Explorer or the command-line interface

Either one mirror or all mirrors for the quota account block (QAB) are stored on SSDs

Improve quota accounting performance by placing all QAB mirrors on SSDs for faster I/O. By default, only one QAB mirror is stored on SSD.

Either one mirror or all mirrors for the system B-tree are stored on SSDs

Increase file system performance by placing all system B-tree mirrors on SSDs for faster access. Otherwise only one system B-tree mirror is stored on SSD.

Either one mirror or all mirrors for the system delta are stored on SSDs

Increase file system performance by placing all system delta mirrors on SSDs for faster access. Otherwise only one system delta mirror is stored on SSD.

Managing L3 cache from the command-line interface
L3 cache can be administered globally or on specific node pools. If you choose to, you can also revert SSDs back to storage drives. In Isilon HD400 node pools, SSDs are exclusively for L3 cache purposes. On these nodes, L3 cache is turned on by default and cannot be turned off.
Set L3 cache as the default for new node pools
You can set L3 cache as the default, so that when new node pools are created, L3 cache is enabled automatically. L3 cache is effective only on nodes that include SSDs. If none of your nodes has SSD storage, there is no need to enable L3 cache as the default. 1. Run the isi storagepool settings modify command.
The following command sets L3 cache enabled as the default for new node pools that are added.
isi storagepool settings modify --ssd-l3-cache-default-enabled yes
2. Run the isi storagepool settings view command to confirm that the SSD L3 Cache Default Enabled attribute is set to Yes.
Enable L3 cache on a specific node pool
You can enable L3 cache for a specific node pool. This is useful when only some of your node pools are equipped with SSDs. 1. Run the isi storagepool nodepools modify command on a specific node pool.

Storage Pools 359

The following command enables L3 cache on a node pool named hq_datastore: isi storagepool nodepools modify hq_datastore --l3 true
If the SSDs on the specified node pool previously were used as storage drives, a message appears asking you to confirm the change. 2. If prompted, type yes, and then press ENTER.
Restore SSDs to storage drives for a node pool
You can disable L3 cache for SSDs on a specific node pool and restore those SSDs to storage drives. NOTE: On HD400, A200, and A2000 node pools, SSDs are used only for L3 cache, which is turned on by default and cannot be turned off. If you attempt to turn off L3 cache on an HD400, A200, or A2000 node pool through the command-line interface, OneFS generates this error message: Disabling L3 not supported for the given node type.
1. Run the isi storagepool nodepools modify command on a specific node pool. The following command disables L3 cache on a node pool named hq_datastore: isi storagepool nodepools modify hq_datastore --l3 false
2. At the confirmation prompt, type yes, and then press ENTER.
Managing tiers
You can move node pools into tiers to optimize file and storage management. Managing tiers requires ISI_PRIV_SMARTPOOLS or higher administrative privileges.
Create a tier
You can create a tier to group together one or more node pools for specific storage purposes. Depending on the types of nodes in your cluster, you can create tiers for different categories of storage, for example, an archive tier, performance tier, or general-use tier. After creating a tier, you need to add the appropriate node pools to the tier. Run the isi storagepool tiers create command. The following command creates a tier named ARCHIVE_1, and adds node pools named hq_datastore1 and hq_datastore2 to the tier.
isi storagepool tiers create ARCHIVE_1 --children hq_datastore1 --children hq_datastore2
Add or move node pools in a tier
You can group node pools into tiers and move node pools from one tier to another. Run the isi storagepool nodepools modify command. The following example adds a node pool named PROJECT-A to a tier named ARCHIVE_1.
isi storagepool nodepools modify PROJECT-A --tier ARCHIVE_1 If the node pool, PROJECT-A, happened to be in another tier, the node pool would be moved to the ARCHIVE_1 tier.
Rename a tier
A tier name can contain alphanumeric characters and underscores but cannot begin with a number. Run the isi storagepool tiers modify command.
360 Storage Pools

The following command renames a tier from ARCHIVE_1 to ARCHIVE_A:
isi storagepool tiers modify ARCHIVE_1 --set-name ARCHIVE_A
Delete a tier
When you delete a tier, its node pools remain available and can be added to other tiers. Run the isi storagepool tiers delete command. The following command deletes a tier named ARCHIVE_A:
isi storagepool tiers delete ARCHIVE_A
Creating file pool policies
You can configure file pool policies to identify logical groups of files called file pools, and you can specify storage operations for these files. Before you can create file pool policies, you must activate a SmartPools license, and you must have the SmartPools or higher administrative privilege. File pool policies have two parts: file-matching criteria that define a file pool, and the actions to be applied to the file pool. You can define file pools based on characteristics, such as file type, size, path, birth, change, and access timestamps, and combine these criteria with Boolean operators (AND, OR). In addition to file-matching criteria, you can identify a variety of actions to apply to the file pool. These actions include:  Setting requested protection and data-access optimization parameters  Identifying data and snapshot storage targets  Defining data and snapshot SSD strategies  Enabling or disabling SmartCache For example, to free up disk space on your performance tier (S-series node pools), you could create a file pool policy to match all files greater than 25 MB in size, which have not been accessed or modified for more than a month, and move them to your archive tier (NL-series node pools). You can configure and prioritize multiple file pool policies to optimize file storage for your particular work flows and cluster configuration. When the SmartPools job runs, by default once a day, it applies file pool policies in priority order. When a file pool matches the criteria defined in a policy, the actions in that policy are applied, and lower-priority custom policies are ignored for the file pool. After the list of custom file pool policies is traversed, if any of the actions are not applied to a file, the actions in the default file pool policy are applied. In this way, the default file pool policy ensures that all actions apply to every file.
NOTE: You can reorder the file pool policy list at any time, but the default file pool policy is always last in the list of file pool policies. OneFS also provides customizable template policies that you can copy to make your own policies. These templates, however, are only available from the OneFS web administration interface.
Create a file pool policy
You can create a file pool policy to match specific files and apply SmartPools actions to the matched file pool. SmartPools actions include moving files to certain storage tiers, changing the requested protection levels, and optimizing write performance and data access.
CAUTION: If existing file pool policies direct data to a specific storage pool, do not configure other file pool policies that match this data with anywhere for the --data-storage-target setting. Because the specified storage pool is included when you use anywhere, you should target specific storage pools to avoid unintentional file storage locations. Run the isi filepool policies create command.
Storage Pools 361

The following command creates a file pool policy that archives older files to a specific storage tier:
isi filepool policies create ARCHIVE_OLD --description "Move older files to archive storage" --data-storage-target ARCHIVE_TIER --data-ssd-strategy metadata --begin-filter --file-type=file --and --birth-time=2013-09-01 --operator=lt --and --accessed-time=2013-12-01 --operator=lt --end-filter
The file pool policy is applied when the next scheduled SmartPools job runs. By default, the SmartPools job runs once a day; however, you can also start the SmartPools job manually.

Valid wildcard characters

You can combine wildcard characters with file-matching options to define a file pool policy.
OneFS supports UNIX shell-style (glob) pattern matching for file name attributes and paths.
The following table lists the valid wildcard characters that you can combine with file-matching options to define a file pool policy.

Wildcard * [a-z]
?

Description
Matches any string in place of the asterisk.
For example, m* matches movies and m123.
Matches any characters contained in the brackets, or a range of characters separated by a hyphen. For example, b[aei]t matches bat, bet, and bit, and 1[4-7]2 matches 142, 152, 162, and 172.
You can exclude characters within brackets by following the first bracket with an exclamation mark. For example, b[!ie] matches bat but not bit or bet.
You can match a bracket within a bracket if it is either the first or last character. For example, [[c]at matches cat and [at.
You can match a hyphen within a bracket if it is either the first or last character. For example, car[-s] matches cars and car-.
Matches any character in place of the question mark. For example, t?p matches tap, tip, and top.

Default file pool requested protection settings

Default protection settings include specifying the data storage target, snapshot storage target, requested protection, and SSD strategy for files that are filtered by the default file pool policy.

Settings (Web Admin)
Storage Target

Settings (CLI)

Description

Notes

--data-storagetarget
--data-ssd-strategy

Specifies the storage pool (node pool or tier) that you want to target with this file pool policy.
CAUTION:
If existing file pool policies direct data to a specific storage pool, do not configure other file pool policies with anywhere for the Data storage target option. Because the specified storage pool is included when you use anywhere, target specific storage pools to avoid unintentional file storage locations.

NOTE: If GNA is not enabled and the storage pool that you choose to target does not contain SSDs, you cannot define an SSD strategy.
Use SSDs for metadata read acceleration writes both file data and metadata to HDD storage pools but adds an additional SSD mirror if possible to accelerate read

362 Storage Pools

Settings (Web Admin)

Settings (CLI)

Description

Notes

Select one of the following options to define your SSD strategy:

Use SSDs for metadata read acceleration

Default. Write both file data and metadata to HDDs and metadata to SSDs. Accelerates metadata reads only. Uses less SSD space than the Metadata read/ write acceleration setting.

Use SSDs for metadata read/write acceleration

Write metadata to SSD pools. Uses significantly more SSD space than Metadata read acceleration, but accelerates metadata reads and writes.

Use SSDs for data & metadata

Use SSDs for both data and metadata. Regardless of whether global namespace acceleration is enabled, any SSD blocks reside on the storage target if there is room.

Avoid SSDs

Write all associated file data and metadata to HDDs only.
CAUTION:

Use this to free SSD space only after consulting with PowerScale Technical Support; the setting can negatively affect performance.

performance. Uses HDDs to provide reliability and an extra metadata mirror to SSDs, if available, to improve read performance. Recommended for most uses.
When you select Use SSDs for metadata read/write acceleration , the strategy uses SSDs, if available in the storage target, for performance and reliability. The extra mirror can be from a different storage pool using GNA enabled or from the same node pool.
Neither the Use SSDs for data & metadata strategy nor the Use SSDs for data & metadata strategy result in the creation of additional mirrors beyond the normal requested protection. Both file data and metadata are stored on SSDs if available within the file pool policy. This option requires a significant amount of SSD storage.

Snapshot storage target
Requested protection

--snapshot-storagetarget
--snapshot-ssdstrategy

Specifies the storage pool that you want to target for snapshot storage with this file pool policy. The settings are the same as those for data storage target, but apply to snapshot data.

Notes for data storage target apply to snapshot storage target

--set-requestedprotection

Default of storage pool. Assign the default requested protection of the storage pool to the filtered files.
Specific level. Assign a specified requested protection to the filtered files.

To change the requested protection , select a new value from the list.

Default file pool I/O optimization settings
You can manage the I/O optimization settings that are used in the default file pool policy, which can include files with manually managed attributes. To allow SmartPools to overwrite optimization settings that were configured using File System Explorer or the isi set command, select the Including files with manually-managed I/O optimization settings option in the Default Protection

Storage Pools 363

Settings group. In the CLI, use the --automatically-manage-io-optimization option with the isi storagepool settings modify command.

Setting (Web Admin)

Setting (CLI)

Write Performance --enable-coalescer

Data Access Pattern

--data-accesspattern

Description

Notes

Enables or disables SmartCache (also referred to as the coalescer).

Enable SmartCache is the recommended setting for optimal write performance. With asynchronous writes, the PowerScale server buffers writes in memory. However, if you want to disable this buffering, we recommend that you configure your applications to use synchronous writes. If that is not possible, disable SmartCache.

Defines the optimization settings for accessing concurrent, streaming, or random data types.

Files and directories use a concurrent access pattern by default. To optimize performance, select the pattern dictated by your workflow. For example, a workflow heavy in video editing should be set to Optimize for streaming access. That workflow would suffer if the data access pattern was set to Optimize for random access.

Managing file pool policies
You can perform a number of file pool policy management tasks. File pool policy management tasks include:  Modifying file pool policies  Modifying the default file pool policy  Creating a file pool policy from a template  Reordering file pool policies  Deleting file pool policies
NOTE: You can create a file pool policy from a template only in the OneFS web administration interface.

Modify a file pool policy
You can modify the name, description, filter criteria, and the protection and I/O optimization settings applied by a file pool policy.
CAUTION:
If existing file pool policies direct data to a specific storage pool, do not configure other file pool policies with anywhere for the Data storage target option. Because the specified storage pool is included when you use anywhere, target specific storage pools to avoid unintentional file storage locations.
1. Run the isi filepool policies list command to view a list of available file pool policies. A tabular list of policies and their descriptions appears.
2. Run the isi filepool policies view command to view the current settings of a file pool policy. The following example displays the settings of a file pool policy named ARCHIVE_OLD.
isi filepool policies view ARCHIVE_OLD
3. Run the isi filepool policies modify command to change a file pool policy. The following example modifies the settings of a file pool policy named ARCHIVE_OLD.
isi filepool policies modify ARCHIVE_OLD --description "Move older files to archive storage" --data-storage-target TIER_A --data-ssd-strategy metadata-write --begin-filter --file-type=file

364 Storage Pools

--and --birth-time=2013-01-01 --operator=lt --and --accessed-time= 2013-09-01 --operator=lt --end-filter
Changes to the file pool policy are applied when the next SmartPools job runs. However, you can also manually run the SmartPools job immediately.

Configure default file pool policy settings
Files that are not managed by custom file pool policies are managed by the default file pool policy. You can configure the default file pool policy settings. 1. Run the isi filepool default-policy view command to display the current default file pool policy settings.
Output similar to the following example appears:
Set Requested Protection: default Data Access Pattern: random Enable Coalescer: True Data Storage Target: anywhere Data SSD Strategy: metadata
Snapshot Storage Target: anywhere Snapshot SSD Strategy: metadata
2. Run the isi filepool default-policy modify command to change default settings. The following command modifies all default settings:
isi filepool default-policy modify --set-requested-protection +2 \ --data-access-pattern concurrency --enable-coalescer false \ --data-storage-target ARCHIVE_A --data-ssd-strategy avoid \ --snapshot-storage-target ARCHIVE_A --snapshot-ssd-strategy avoid
3. Run the isi filepool default-policy view command again to ensure that default file pool policy settings reflect your intentions.
OneFS implements the new default file pool policy settings when the next scheduled SmartPools job runs and applies these settings to any files that are not managed by a custom file pool policy.

Prioritize a file pool policy

You can change the priority order of a file pool policy.
File pool policies are evaluated in descending order according to their position in the file pool policies list. By default, when you create a new policy, it is inserted immediately above the default file pool policy. You can assign a policy a different priority by moving it up or down in the list. The default policy is always the last in priority, and applies to all files that are not matched by any other file pool policy.
1. Run the isi filepool policies list command to view the list of available file pool policies and their priority order. Output similar to the following appears:

Name

Description

-----------------------------------------------

ARCHIVE_1 Move older files to archive tier

MOVE-LARGE Move large files to archive tier

PERFORM_1 Move recent files to perf. tier

-----------------------------------------------

Total: 3

2. Run the isi filepool policies modify command to change the priority of a file pool policy. The following example changes the priority of a file pool policy named PERFORM_1.

isi filepool policies modify PERFORM_1 --apply-order 1 3. Run the isi filepool policies list command again to ensure that the policy list displays the correct priority order.

Storage Pools 365

Delete a file pool policy
You can delete a file pool policy. Delete a file pool policy only if you are aware of, or unconcerned with, the consequences. 1. Run the isi filepool policies delete command.
The following example deletes a file pool policy named ARCHIVE_1.
isi filepool policies delete ARCHIVE_1
The system asks you to confirm the deletion. 2. Type yes, then press ENTER. The file pool policy is removed. When you delete a policy, its file pool will be controlled either by another policy or by the default file pool policy the next time the SmartPools job runs.

Monitoring storage pools
You can access information on storage pool health and usage. The following information is available:  File pool policy health  SmartPools health, including tiers, node pools, and subpools  For each storage pool, percentage of HDD and SSD disk space usage  SmartPools job status
Monitor storage pools
You can view storage pool status and details. Details include the names of tiers and associated node pools, requested protection, HDD and SSD capacities and usage. Run the isi storagepool list command. Output similar to the following example appears:

Name

Nodes Protect HDD Total %

SSD Total %

------------------------------------------------------------------

PERF_TIER 1-3

-

12.94T 17.019T 26.99% 0.4T 1.2T 33.00%

- s-series 1-3

+2:1

12.94T 17.019T 26.99% 0.4T 1.2T 33.00%

HOME_TIER 4-6

-

16.59T 19.940T 77.73% 0b 0b

0.00%

- x-series 4-6

+2:1

16.59T 19.940T 77.73% 0b 0b

0.00%

ARCHIVE_1 7-9

-

100.8T 200.60T 49.88% 0b 0b

0.00%

- nl-serie 7-9

+2:1

100.8T 200.60T 49.88% 0b 0b

0.00%

------------------------------------------------------------------

Total: 6

200.5G 17.019G 26.99% 0b 0b

0.00%

View the health of storage pools
You can view the health of storage pools. Run the isi storagepool health command. The following command, using the verbose option, displays a tabular description of storage pool health:
isi storagepool health --verbose

366 Storage Pools

View results of a SmartPools job
You can review detailed results from the last time the SmartPools job ran. The SmartPools job, by default, runs once a day. It processes the file pool policies that you have created to manage storage on your cluster. 1. Run the isi job events list command.
A tabular listing of the most recent system jobs appears. The listing for the SmartPools job is similar to the following example:
2014-04-28T02:00:29 SmartPools [105] Succeeded 2. Locate the SmartPools job in the listing, and make note of the number in square brackets.
This is the job ID number. 3. Run the isi job reports view command, using the job ID number.
The following example displays the report for a SmartPools job with the job ID of 105. isi job reports view 105
The SmartPools report shows the outcome of all of the file pool policies that were run, including summaries for each policy, and overall job information such as elapsed time, LINs traversed, files and directories processed, and memory and I/O statistics.
Storage Pools 367

29
Pool-based tree reporting in FSAnalyze (FSA)
This section contains the following topics:
Topics:
· FSAnalyze (FSA) · Pool-based tree reporting in FSAnalyze (FSA) · Enable pool-based tree reporting in FSA · Disable pool-based tree reporting in FSA
FSAnalyze (FSA)
The FSAnalyze job in OneFS gathers file system analytic information. The FSAnalyze (FSA) job is used for namespace analysis. You can run FSA on demand or on a schedule through the command line interface and PAPI. A successful FSA job produces analysis result. You may run FSA at least once a day. Disk usage is a component of FSA that adds up the overall usage for any given directory. It gathers disk usage efficiently and stores the results in a results database table, one row for every directory. FSA PAPI directory information endpoint exposes stored result. The result for a directory may be compared with the result at a different time using the same PAPI endpoint. You can run FSA on demand or on a schedule through the command line interface and PAPI. A successful FSA job produces analysis result. You may run FSA at least once a day. The FSA job runs in two modes. The SCAN mode scans the OneFS file system entirely. The INDEX mode is the default mode and is more efficient. It relies on snapshots and change lists, updates, and walks a global metadata index. FSA or IndexUpdate job is used to build the metadata index. The FSA job running in INDEX mode generates FSA Index. IndexUpdate job generates Cluster Index. The disk usage component walks the metadata index.
Pool-based tree reporting in FSAnalyze (FSA)
You can store a subset of files remotely. A storage policy can lead to a subset of files that are stored remotely. The metadata or namespace is stored on a PowerScale cluster and the data is stored on a remote machine, hosted by a cloud storage provider. This mechanism helps you to monitor the usage of cloud and on-premises resources through the cluster. The pool-based tree reporting feature is used to classify directory usage by storage level. The disk usage component is extended to add up information by node pool. A file that is stored in cloud is tagged with a special node pool. The database table is extended to have a node pool column, and a given directory has usage that the node pool records. The report for directory pool usage can be accessed using the Job ID and PAPI endpoints. The feature has storage impact from the running FSA and IndexUpdate jobs, which are otherwise optional. Snapshots are taken and kept for the duration while the FSA job runs. FSA and IndexUpdate jobs creates and manages metadata indexes to track file system metadata. A new FSA database is created for every FSA result that is generated. The disk usage table, the biggest table within FSA database grows linearly with the number of node pools.
Enable pool-based tree reporting in FSA
You can enable the pool-based tree reporting feature in FSA using the command line interface. This feature can be configured, enabled, or disabled only by cluster administrators. IndexUpdate job and FSA job are scheduled. FSA job must run in default (INDEX) mode.
368 Pool-based tree reporting in FSAnalyze (FSA)

Enable default (INDEX) mode by running the following command on any node: isi_gconfig -t job-config ­R jobs.fsa.snap_based_mode
The IndexUpdate job must be run at least every six hours, or as often as FSA, whichever is more frequent. 1. Set up a schedule to run IndexUpdate job. 2. Run the following command on any node:
isi_gconfig fsa.du_has_dpid=true 3. Set up a schedule to run FSA job.
The pool-based tree reporting feature in FSA is enabled.
Disable pool-based tree reporting in FSA
You can disable the pool-based tree reporting feature in FSA using the command line interface. Remove IndexUpdate schedule if not required by the FilePolicy job. Remove the FSA schedule if not required by InsightIQ or LogIQ. The IndexUpdate job must be run at least every six hours, or as often as FSA, whichever is more frequent. Run the following command on any node:
isi_gconfig fsa.du_has_dpid=false The pool-based tree reporting feature in FSA is disabled.
Pool-based tree reporting in FSAnalyze (FSA) 369

30
System jobs
This section contains the following topics:
Topics:
· System jobs overview · System jobs library · Job operation · Job performance impact · Job priorities · Managing system jobs · Modify job type settings · Managing impact policies · Viewing job reports and statistics
System jobs overview
The most critical function of OneFS is maintaining the integrity of data on your PowerScale cluster. Other important system maintenance functions include monitoring and optimizing performance, detecting and mitigating drive and node failures, and freeing up available space. Maintenance functions use system resources and can take hours to run. System components manage some maintenance functions. For example, SyncIQ creates jobs to synchronize source and target content, and CloudPools creates jobs to upload and download data to and from the cloud. This section describes the system maintenance functions, called jobs, that are managed by the Job Engine. Jobs run in the background. The time that it takes for a job to run can vary depending on several factors, including:  Other system jobs that are running.  Other processes that are taking up CPU and I/O cycles while the job is running.  The configuration of your cluster  The size of your dataset  How long since the last iteration of the job was run. It is recommended that you have no more than three jobs running simultaneously. Job Engine evaluates jobs to enable higher priority jobs and administrator tasks to proceed. Job evaluation includes:  Ensuring that jobs that have the same exclusion sets do not run simultaneously.  Running jobs at different priority and impact levels  Temporarily suspending jobs (with no loss of progress) If a power failure occurs, the Job Engine checkpoint system resumes jobs as close as possible to the point at which they were interrupted. The checkpoint system monitors the tasks in the current job phase until they are completed. When the cluster is back up and available, Job Engine resumes the job phase from the last checkpoint. As system administrator, through the Job Engine service, you can monitor, schedule, run, terminate, and apply other controls to system maintenance jobs. The Job Engine provides statistics and reporting tools that you can use to determine how long different system jobs take to run in your OneFS environment.
NOTE: To initiate any Job Engine tasks, you must have the role of SystemAdmin in the OneFS system.
370 System jobs

System jobs library

OneFS contains a library of system jobs that run in the background to help maintain your PowerScale cluster.
By default, system jobs are categorized as either manual or scheduled. You can run any job manually, and you can create a schedule for most jobs according to your workflow. Some jobs do not accept a schedule. Typically such jobs have mandatory input arguments, such as the Treedelete job. In addition, OneFS starts some jobs automatically when particular system conditions arise--for example, FlexProtect or FlexProtectLin, which start when a drive is smartfailed.
The lower the priority value, the higher the job priority. For example, a job with priority value 1 has higher priority than a job with priority value 2 or higher.
The default protection, +2:+1, enables all jobs to run during a scan if there is no more than one failed device in each disk pool. (FlexProtect ad FlexProtectLin continue to run even if there are failed devices.) OneFS does not check file protection. If you have files with no protection setting, the job can fail.

Job name AutoBalance
AutoBalanceLin
AVScan ChangelistCreate CloudPoolsLin CloudPoolsTreewalk

Description
Rebalances disk space usage in a disk pool. JobEngine starts a rebalance job if there is an imbalance of 5% of more between any two drives. If MultiScan is enabled, Job Engine runs the AutoBalance part of the MultiScan job. Otherwise, if Job Engine determines that rebalancing should be LINbased, it tries to start AutoBalance or AutoBalanceLin. If none of these jobs are enabled, no rebalancing is done.
If AutoBalance is enabled, the system runs it automatically when a device joins (or rejoins) the cluster.
AutoBalance is most efficient in clusters that contain only hard disk drives (HDDs).

Exclusion Set
Restripe

Balances free space in a cluster. Job Engine starts a rebalance job when there is an imbalance of 5% or more between any two drives, and when Job Engine determines that rebalancing should be LINbased. AutoBalanceLin is most efficient in clusters when file system metadata is stored on solid state drives (SSDs). Run as part of MultiScan, or automatically by the system when a device joins (or rejoins) the cluster.

Restripe

Performs an antivirus scan on all files using an external antivirus server, such as a CAVA antivirus server. Scans are scheduled independently by the AV system or run manually.

None

Creates a list of changes between two snapshots with matching root paths. You can specify these snapshots from the CLI.

None

Performs a LIN-based scan for files to be None managed by CloudPools. If a CloudPools policy matches a given LIN, it either archives or recalls the cloud files.

Performs a treewalk scan on a given file path to identify files to be managed by CloudPools.

None

Impact Policy Low
Low
Low Low Low Low

Priority Operation

4

Manual

4

Manual

6

Manual

5

Manual

6

Manual

6

Manual

System jobs 371

Job name

Description

Exclusion Set

Collect

Reclaims free space from previously unavailable nodes or drives. Collect is a "mark and sweep" garbage collector: it marks valid blocks in the first two phases of its run, then reclaims all blocks that are flagged in-use but not marked. Runs as part of MultiScan, or automatically by the system when a device joins (or rejoins) the cluster.

Mark

ComplianceStoreDelete Scan for, and unlink, expired files in compliance stores. By default, runs on the second Saturday of each month at 12am. Can also be run manually.

None

Dedupe*

Scans a directory for redundant data blocks Dedupe

and deduplicates all redundant data stored in the directory. Available only if you activate a

Restripe

SmartDedupe license.

DedupeAssessment

Scans a directory for redundant data blocks and reports an estimate of the amount of space that could be saved by deduplicating the directory.

Dedupe Restripe

DomainMark

Associates a path, and the contents of that None path, with a domain.

DomainTag

Performs policy domain updates.

Restripe

FlexProtect FlexProtectLin

Scan the file system after a device failure to Restripe ensure that all files remain protected.
OneFS checks the jobs.common.lin_based_jobs setting to determine whether to run FlexProtect or FlexProtectLin.
FlexProtect and FlexProtectLin continue to run even if there are failed devices.
Depending on the size of your data set, this process can last for an extended period. The cluster is said to be in a degraded state until FlexProtect (or FlexProtectLin) finishes its work. If you notice that other system jobs cannot be started or have been paused, you can use the isi job status command to see if a "Cluster Is Degraded" message appears.
FlexProtect is most efficient on clusters that contain only HDDs. FlexProtectLin is most efficient when file system metadata is stored on SSDs.

NOTE: Unlike HDDs and SSDs that are used for storage, when an SSD used for L3 cache fails, the drive state should immediately change to REPLACE without a FlexProtect job running. An SSD drive used for L3 cache contains only cache data that does not have to be protected by FlexProtect. After the drive state changes to REPLACE, you can pull and replace the failed SSD.

Impact Policy Low
Low
Low
Low
Low Low Medium

Priority Operation

4

Manual

6

Scheduled

4

Manual

6

Manual

5

Manual

6

Manual

1

Manual

372 System jobs

Job name FSAnalyze*
IntegrityScan MediaScan
MultiScan
PermissionRepair QuotaScan* SetProtectPlus ShadowStoreDelete ShadowStoreProtect SmartPools*
SmartPoolsTree*

Description

Exclusion Set

Gathers and reports information about all None files and directories beneath the /ifs path. This job requires you to activate an InsightIQ license. Reports from this job are used by InsightIQ users for system analysis purposes. For more information, see the PowerScale InsightIQ User Guide.

Verifies file system integrity.

Mark

Locates and clears media-level errors from disks to ensure that all data remains protected.

Restripe

This job is only useful on HDD drives. If the cluster is all flash, you can disable this job.

This job is scheduled to run every 1st Saturday of every month at 12 a.m.

Performs the work of the AutoBalanceLin and Collect jobs. Runs automatically on group changes, including storage changes. Multiscan runs only if there is any unbalanced diskpool or if it determines that a drive has been down for a long enough period that running the Collect process to reclaim free space is worthwhile.

Restripe Mark

Uses a template file or directory as the basis for permissions to set on a target file or directory. The target directory must always be subordinate to the /ifs path. This job must be manually started.

None

Updates quota accounting for domains created on an existing file tree. Available only if you activate a SmartQuotas license. This job should be run manually in off-hours after setting up all quotas, and whenever setting up new quotas.

None

Applies a default file policy across the cluster. Runs only if a SmartPools license is not active.

Restripe

Frees up space that is associated with shadow stores. Shadow stores are hidden files that are referenced by cloned and deduplicated files.

None

Protects shadow stores that are referenced Restripe by a logical i-node (LIN) with a higher level of protection.

Enforces SmartPools file pool policies. Available only if you activate a SmartPools license. This job runs on a regularly scheduled basis, and can also be started by the system when a change is made (for example, creating a compatibility that merges node pools).

Restripe

Enforce SmartPools file policies on a subtree. Available only if you activate a SmartPools license.

Restripe

Impact Policy Low
Medium Low
Low
Low Low
Low Low Low Low
Medium

Priority Operation

1

Scheduled

1

Manual

8

Scheduled

4

Manual

5

Manual

6

Manual

6

Manual

2

Scheduled

6

Scheduled

6

Scheduled

5

Manual

System jobs 373

Job name SnapRevert SnapshotDelete
TreeDelete
Undedupe
Upgrade

Description

Exclusion Set

Reverts an entire snapshot back to head. None

Creates free space associated with deleted snapshots. Triggered by the system when you mark snapshots for deletion.

None

Deletes a specified file path in the /ifs directory. TreeDelete is equivalent to rm -rf but scales cluster-wide.

None

Undedupe undoes the work that the dedupe Dedupe

job performed, potentially increasing disk space usage.

Restripe

Upgrades the file system after a software version upgrade.
NOTE: The Upgrade job should be run only when you are updating your cluster with a major software version. For complete information, see the
IPowerScale OneFS Upgrade Planning and Process Guide.

Restripe

WormQueue

Processes the WORM queue, which tracks the commit times for WORM files. After a file is committed to WORM state, it is removed from the queue.

* Available only if you activate an additional license

None

Impact Policy Low Medium Medium Medium Medium
Low

Priority Operation

5

Manual

2

Manual

4

Manual

6

Manual

3

Manual

6

Scheduled

Job operation
OneFS includes system maintenance jobs that run to ensure that your PowerScale cluster performs at peak health.
Through the Job Engine, OneFS runs a subset of these jobs automatically, as needed, to:  Ensure file and data integrity.  Check for and mitigate drive and node failures.  Optimize free space. For other jobs, such as Dedupe, you can use Job Engine to start them manually or schedule them to run automatically at regular intervals.
The Job Engine runs system maintenance jobs in the background and prevents jobs within the same classification (exclusion set) from running simultaneously. Two exclusion sets are enforced: restripe and mark.
Restripe job types are:
 AutoBalance  AutoBalanceLin  FlexProtect  FlexProtectLin  MediaScan  MultiScan  SetProtectPlus  SmartPools
Mark job types are:
 Collect  IntegrityScan  MultiScan

374 System jobs

MultiScan is a member of both the restripe and mark exclusion sets. You cannot change the exclusion set parameter for a job type.
The Job Engine is sensitive to job priority. It is recommended that you have no more than three jobs of any priority running simultaneously. Job priority is denoted as 1 through 10, with 1 being the highest and 10 being the lowest. The system uses job priority when there is a conflict among running or queued jobs. For example, suppose that you manually start a job that has a higher priority than three other jobs that are already running. Job Engine pauses the lowest-priority active job, runs the new job, then restarts the older job at the point at which it was paused. Or, suppose that you start a job within the restripe exclusion set, and another restripe job is already running. The system uses priority to determine which job should run (or remain running) and which job should be paused (or remain paused).
Other job parameters determine whether jobs are enabled, their performance impact, and schedule. As system administrator, you can accept the job defaults or adjust these parameters (except for exclusion set) based on your requirements.
When a job starts, the Job Engine distributes its tasks across the nodes of your cluster. At any given time, one task belongs to one node. Multiple nodes do not share the work of one task. One node acts as job coordinator. The job coordinator tracks the tasks in progress. It works with the other nodes to load-balance the work according to the impact of the current job as determined by its policy: requested or configured. This distribution ensures that the tasks run in parallel and the load is distributed throughout the cluster.
A task is complete when there are no more items to perform for that task. Each node reports its task status to the job engine coordinator after the task is complete. The job engine coordinator merges the task results into the job results, updates the progress of the job, and stops tracking this task. When there are no more tasks to track, the phase is complete.
Checkpoints are taken periodically: when a job sends results or when the job is paused. The default checkpoint interval is 30 seconds. Some jobs request checkpoints at points of significant progress as well. If there is a power outage or if a job is paused, the job can be restarted from the point at which it was interrupted. This is important because some jobs can take hours or even days to run and can use considerable system resources.

Job performance impact

The Job Engine service uses impact policies to monitor the impact of maintenance jobs on system performance. You can manage the impact policies to determine when a job can run and the system resources that it consumes.
Job Engine monitors maintenance jobs to ensure that they do not interfere with regular cluster I/O activity or system administration tasks. Job Engine provides default impact policies that you can use but not modify, as shown in the following table.

Impact policy LOW MEDIUM HIGH OFF_HOURS

Allowed to run Any time of day.

Resource consumption Low

Any time of day.

Medium

Any time of day.

High

Outside of business hours.

Low

Business hours are defined as 9AM

to 5PM, Monday through Friday.

OFF_HOURS is paused during

business hours.

If your workflow requires an impact policy different from the defaults, you can create a custom policy with new settings.
Jobs with a low impact policy have the least impact on available CPU and disk I/O resources. Jobs with a high impact policy have a significant impact. Job Engine limits the number of tasks that can process in parallel according to the impact policy. However, if the impact of a job is under the impact limits, Job Engine can increase the number of tasks that are in process.
CAUTION: Job Engine can use more CPU and I/O even if doing so delays other system activities. Requesting a HIGH impact for a job can be disruptive to the cluster and can affect the client connection.

System jobs 375

Job priorities
Job priorities determine the precedence of a job when more than the maximum number of jobs attempt to run simultaneously. The Job Engine assigns a priority value from 1 to 10 to every job, with 1 the most important and 10 the least important. You can configure the maximum number of jobs that can run simultaneously. It is recommended that you run no more than three jobs simultaneously. If more than the configured maximum number of jobs of different exclusion sets attempt to run simultaneously, Job Engine manages priorities as follows:  Suppose that the maximum number of jobs are running and a higher-priority job is queued. Job Engine pauses the job with
the lowest priority to allow the higher-priority job to run. Job Engine automatically resumes the paused job when one of the other jobs completes.  Suppose that fewer than the maximum number of jobs are running when a higher-priority job is queued. If a lower-priority job has an exclusion set that overlaps the exclusion set of the new, higher-priority job, Job Engine pauses the lower-priority job to allow the higher-priority job to run.  In the case of a tie, Job Engine will continue running the job that is already running.
Managing system jobs
The Job Engine enables you to control periodic system maintenance tasks that ensure OneFS file system stability and integrity. As maintenance jobs run, Job Engine ensures that jobs remain within specified impact settings for resource usage and impact to other processes. However, it is possible to configure impact settings that enable the job to use resources to the point of affecting performance. As system administrator, you can tailor these jobs to the specific workflow of your PowerScale cluster. You can view active jobs and job history, modify job settings, and start, pause, resume, cancel, and update job instances. The Job Engine CLI commands are available as subcommands of the isi job CLI command. Use the --help option to view details and syntax for CLI commands. Job Engine parameter values are not case sensitive. For details about all available Job Engine CLI commands, see the OneFS CLI Command Reference.  To list all available isi job subcommands:
isi job --help
For more details about each subcommand, including a summary and syntax: isi job <subcommand> --help
 To view a list of the available subcommands for managing active jobs: isi job jobs --help
 To list available reports for finished jobs: isi job reports list
 To view a report for a specific job: isi job reports view <job ID>
 To view a summary of Job Engine activity and status: isi job status
 To view jobs statistics: isi job statistics
376 System jobs

Start a job
Although OneFS runs several critical system maintenance jobs automatically when necessary, you can also manually start any job at any time. The Collect job, used here as an example, reclaims free space that previously could not be freed because the node or drive was unavailable. Run the isi job jobs start command. The following command runs the Collect job with the specified parameters. It sets a stronger impact policy and a higher priority than the default values.
isi job jobs start Collect --policy MEDIUM --priority 2
When the job starts, a message such as Started job [7] appears. In this example, 7 is the job ID number, which you can use to run other commands on the job.
Pause a job
To free up system resources, you can pause a job temporarily. To pause a job, you need to know the job ID number. If you are unsure of the job ID number, you can use the isi job jobs list command to see a list of running jobs. Run the isi job jobs pause command. The following command pauses a job with an ID of 7.
isi job jobs pause 7 If there is only one instance of a job type currently active, you can specify the job type instead of the job ID.
isi job jobs pause Collect In all instructions that include the isi job jobs command, you can omit the jobs entry.
isi job pause Collect
Modify a job
You can change the priority and impact policy of an active, waiting, or paused job. To modify a job, you need to know the job ID number. If you are unsure of the job ID number, you can use the isi job jobs list command to see a list of running jobs. When you modify a job, only the current instance of the job runs with the updated settings. The next instance of the job returns to the default settings for that job type. Run the isi job jobs modify command. The following command updates the priority and impact policy of an active job (job ID number 7).
isi job jobs modify 7 --priority 3 --policy medium If there is only one instance of a job type currently active, you can specify the job type instead of the job ID.
isi job jobs modify Collect --priority 3 --policy medium
System jobs 377

Resume a job
You can resume a paused job. To resume a job, you need to know the job ID number. If you are unsure of the job ID number, you can use the isi job jobs list command. Run the isi job jobs resume command. The following command resumes a job with the ID number 7.
isi job jobs resume 7 If there is only one instance of a job type currently active, you can specify the job type instead of the job ID.
isi job jobs resume Collect
Cancel a job
If you want to free up system resources, or for any reason, you can cancel a running, paused, or waiting job. To cancel a job, you need to know the job ID number. If you are unsure of the job ID number, you can use the isi job jobs list command. Run the isi job jobs cancel command. The following command cancels a job with the ID number 7.
isi job jobs cancel 7 If there is only one instance of a job type currently active, you can specify the job type instead of the job ID.
isi job jobs cancel Collect
View active jobs
You can view information about jobs that are running on your PowerScale cluster. You can check active jobs or view details about a job. For example, if you notice slower system response or want to see what jobs are active before starting a new job. 1. To check active jobs, run the isi job jobs list command. 2. To view details about a job, run the isi job jobs view command.
View job history
You can view recent activity for system maintenance jobs. You might want to check the last time a critical job ran, view all job history within a recent time period, or output job history for a certain time period into a comma-delimited format file. 1. Run the isi job events list command for a specific job type.
The following command displays the activity of the MultiScan job type. isi job events list --job-type multiscan
2. View all jobs within a specific time frame. The following command displays all jobs that ran since September 16, 2013. isi job events list --begin 2013-09-16
378 System jobs

3. For reporting purposes, redirect output to a comma-delimited file. The following command outputs job history for a specific two-week period to a specified path name.
isi job events list --begin 2013-09-15 --end 2013-09-16 > /ifs/data/report1.txt

Time

Message

---------------------------------------------------------------

2013-09-15T12:55:55 MultiScan[4] Phase 1: end lin scan and mark

2013-09-15T12:55:57 MultiScan[4] Phase 2: begin lin repair scan

2013-09-15T12:56:10 MultiScan[4] Phase 2: end lin repair scan

2013-09-16T01:47:12 SetProtectPlus[3] System Cancelled

2013-09-16T07:00:00 SmartPools[5] Waiting

Modify job type settings
You can customize system maintenance jobs for your administrative workflow by modifying the default priority level, impact level, and schedule for a job type. The job type ID is the job name, for example, MediaScan. 1. Run the isi job types modify command.
The following command modifies the default priority level and impact level for the MediaScan job type.
isi job types modify mediascan --priority 2 --policy medium
When you run this command, the system prompts you to confirm the change. Type yes or no, and then press ENTER. 2. Establish a regular schedule for a job type.
The following command schedules the MediaScan job to run every Saturday morning at 9 AM. The --force option overrides the confirmation step.
isi job types modify mediascan --schedule 'every Saturday at 09:00' --force
3. Remove a regular schedule for a job type. The following command removes the schedule for a job type that is scheduled.
isi job types modify mediascan --clear-schedule --force
All subsequent iterations of the MediaScan job type run with the new settings. If a MediaScan job is in progress, it continues to use the old settings.
Managing impact policies
For system maintenance jobs that run through the Job Engine service, you can create and assign policies that help control how jobs affect system performance. As system administrator, you can create, copy, modify, and delete impact policies, and view their settings.
Create an impact policy
The Job Engine includes four default impact policies, which you cannot modify or delete. However, you can create new impact policies. You can create custom impact policies to define the best times for system maintenance jobs to run and mitigate their impact on system resources. 1. Run the isi job policies create command.

System jobs 379

The following command creates a custom policy defining a specific time frame and impact level. You can apply the custom policy to any job instance to enable the job to run at a higher impact over the weekend.
isi job policies create MY_POLICY --impact medium --begin 'Saturday 00:00' --end 'Sunday 23:59'
2. View available impact policies to see if your custom policy was created successfully. The following command displays a list of impact policies.
isi job policies list
The displayed list appears as follows.

ID

Description

---------------------------------------------------------------

HIGH

Isilon template: high impact at all times

LOW

Isilon template: high impact at all times

MEDIUM Isilon template: high impact at all times

OFF-HOURS Isilon template: Paused M-F 9-5, low impact otherwise

MY_POLICY

---------------------------------------------------------------

3. Add a description to the custom policy. The following command adds a description to the custom policy.

isi job policies modify MY_POLICY --description 'Custom policy: medium impact when run on weekends'

View impact policy settings
You can view the settings of any impact policy. If you intend to modify an impact policy, you can view the current policy settings. In addition, after you have modified an impact policy, you can view the policy settings to ensure that they are correct. Run the isi job policies view command. The following command displays the impact policy settings of the custom impact policy MY_POLICY.
isi job policies view MY_POLICY

Modify an impact policy
You can change the description and policy intervals of a custom impact policy. You cannot modify the default impact policies, HIGH, MEDIUM, LOW, and OFF_HOURS. You can only modify policies that you create. 1. Run the isi job policies modify command to reset current settings to base defaults.
Policy settings are cumulative, so defining a new impact level and time interval adds to any existing impact level and interval already set on the custom policy. The following command resets the policy interval settings to the base defaults: low impact and anytime operation.
isi job policies modify MY_POLICY --reset-intervals
2. Run the isi job policies modify command to establish new impact level and interval settings for the custom policy. The following command defines the new impact level and interval of a custom policy named MY_POLICY.
isi job policies modify MY_POLICY --impact high --begin 'Saturday 09:00' --end 'Sunday 11:59'

380 System jobs

3. Verify that the custom policy has the settings that you intended. The following command displays the current settings for the custom policy.
isi job policies view MY_POLICY
Delete an impact policy
You can delete impact policies that you have created. You cannot delete default impact policies, HIGH, MEDIUM, LOW, and OFF_HOURS. 1. Run the isi job policies delete command.
The following command deletes a custom impact policy named MY_POLICY.
isi job policies delete MY_POLICY
OneFS displays a message asking you to confirm the deletion of your custom policy. 2. Type yes and press ENTER.
Viewing job reports and statistics
You can generate reports for system jobs and view statistics to better determine the amounts of system resources being used. Most system jobs controlled by the Job Engine run at a low priority and with a low impact policy, and generally do not have a noticeable impact on cluster performance. A few jobs, because of the critical functions they perform, run at a higher priority and with a medium impact policy. These jobs include FlexProtect and FlexProtect Lin, IntegrityScan, SmartPoolsTree, SnapshotDelete, TreeDelete, Undedupe, and Upgrade. As a system administrator, if you are concerned about the impact a system job might have on cluster performance, you can view job statistics and reports. These tools enable you to view detailed information about job load, including CPU and memory usage and I/O operations.
View statistics for a job in progress
You can view statistics for a job in progress. You specify the job ID to view statistics for a job in progress. Run isi job jobs list to get a list of active jobs, including job IDs. Run the isi job statistics view command with a specific job ID. The following command displays statistics for a Collect job with the ID of 857:
isi job statistics view --job-id 857
The system displays output similar to the following example:
Job ID: 857 Phase: 1 Nodes Node: 1 PID: 26224 CPU: 7.96% (0.00% min, 28.96% max, 4.60% avg) Virtual: 187.23M (187.23M min, 187.23M max, 187.23M avg) Physical: 19.01M (18.52M min, 19.33M max, 18.96M avg) Read: 931043 ops, 7.099G Write: 1610213 ops, 12.269G Workers: 1 (0.00 STW avg.)
System jobs 381

View a report for a completed job

After a job finishes, you can view a report about the job with isi job reports view.
The isi job reports view command shows the results for each phase of a job when each phase completes, as well as the final status of the job. If a phase is not yet complete, it is possible to view results of past phases for a current running job. Use the --verbose option to see Job Engine statistics such as average CPU and bytes read and written.
You must specify the job ID to view the report for a completed job. Use the isi job reports list command to get the list of recent jobs, including job IDs. By default, isi job reports list lists the last 30 jobs. Use the --limit option to specify the number of jobs to list. For example, the following command displays the last 10 completed jobs:

isi job reports list --limit 10

Run the isi job reports view command with a specific job ID. The following command displays the report of a ShadowStoreProtect job with an ID of 35:

isi job reports view 35

The system displays output similar to the following example:

ShadowStoreProtect[35] phase 1 (2020-09-22T20:00:42)

----------------------------------------------------

Elapsed time 1 second

Working time 1 second

Errors

0

LINs

16

Zombies

34794251264

ShadowStoreProtect[35] Job Summary ---------------------------------Final Job State Succeeded Phase Executed 1

382 System jobs

31
S3 support
This section contains the following topics:
Topics:
· S3 · Server Configuration · Bucket handling · Object handling · Authentication · Access key management
S3
OneFS supports the Amazon Web Services Simple Storage Service (AWS S3) protocol for reading data from and writing data to the OneFS platform. The S3-on-OneFS technology enables the usage of Amazon Web Services Simple Storage Service (AWS S3) protocol to store data in the form of objects on top of the OneFS file system storage. The data resides under a single namespace. The AWS S3 protocol becomes a primary resident of the OneFS protocol stack, along with NFS, SMB, and HDFS. The technology allows multiprotocol access to objects and files. The S3 protocol supports bucket and object creation, retrieving, updating, and deletion. Object retrievals and updates are atomic. Bucket properties can be updated. Objects are accessible using NFS and SMB as normal files, providing cross-protocol support. To use S3, administrators generate access IDs and secret keys to authenticated users for access. Etag consistency is now implemented for S3 on OneFS protocol.
S3 concepts
This section describes some of the key concepts related to the S3 protocol. Buckets: A bucket is a container for objects stored in S3. Every object is contained in a bucket. Buckets organize the S3 namespace at the highest level, identify the account responsible for storage and data transfer charges, and play a role in access control. Objects: Objects are the fundamental entities stored in S3. An object is uniquely identified within a bucket by a key name and version ID. Objects consist of object data, metadata and others. Key is the object name, value is the data portion that is not visible by users, and metadata is the data about the data and is a set of name-value pairs that describe the object for example, content-type, size, last modified. Custom metadata can also be specified at the time the object is stored. Keys: A key is the unique identifier for an object within a bucket. Every object in a bucket has a key and a value. An Account ID and a secret key are used to authenticate a user. The Account ID and secret key are created by the Administrator and mapped to users (such as UNIX, AD, LDAP, and so on).
Server Configuration
The S3 settings are defined in the registry. The server configuration settings for the S3 protocol are separated into global service configuration and per-zone configuration.
S3 support 383

Global S3 settings
You can enable and disable the S3 protocol on the OneFS cluster and set ports for HTTP and HTTPS across the cluster. You can view or modify the global S3 settings for service related parameters using the command-line interface.
Enable S3 service
You can enable the s3 protocol on the OneFS cluster through the command-line interface. Run the isi services s3 enable command. The S3 protocol is enabled on the cluster, and the system displays the following message:
The service 's3' has been enabled.
Disable S3 service
You can disable the s3 protocol on the OneFS cluster through the command-line interface. Run the isi services s3 disable command. The S3 protocol is disabled on the cluster, and the system displays the following message:
The service 's3' has been disabled.
View global settings
You can view the global s3 options through the OneFS command-line interface. The S3 protocol should be enabled on the OneFS cluster. Run the isi s3 settings global view command. The details related to global settings are displayed.
HTTP Port: 9020 HTTPS Port: 4354 HTTPS only: Yes S3 Service Enabled: Yes
Modify global settings
You can modify the global s3 options through the OneFS command-line interface. The S3 protocol should be enabled on the OneFS cluster. 1. Run the isi s3 settings global modify command.
In this example, the HTTPS port is changed from 4354 to 9021. isi s3 settings global modify --https-port 9021
The HTTPS port is changed. 2. Run the isi s3 settings global view command to check your modifications.
The follow details appear: HTTP Port: 9020
HTTPS Port: 9021 HTTPS only: Yes S3 Service Enabled: Yes
384 S3 support

S3 zone settings
Access zones provide default locations for creating buckets. You can view or modify specific S3 settings of an access zone from the OneFS command-line interface. If you are creating a bucket, and a zone ID or name is not provided, the creation of the bucket defaults to the System zone.
View zone settings
You can view s3 server settings per zone through the OneFS command-line interface. The S3 protocol should be enabled on the OneFS cluster. Run the isi s3 settings zone view command. The details related to zone settings are displayed.
Root Path: /ifs Base Domain: Object ACL Policy: replace Bucket Directory Create Mode: 0777 Use Md5 For Etag: No Validate Content Md5: No
Modify zone settings
You can modify s3 server settings per zone through the OneFS command-line interface. The S3 protocol should be enabled on the OneFS cluster. 1. Run the isi s3 settings zone modify command.
In this example, the usage of Etag for the zone is changed from "No" to "Yes." isi s3 settings zone modify --use-md5-for-etag yes
The Etag option is enabled for the zone. 2. Run the isi s3 settings zone view command to check your modifications.
The follow details appear: Root Path: /ifs/home
Base Domain: Object ACL Policy: replace Bucket Directory Create Mode: 0777
Use Md5 For Etag: Yes Validate Content Md5: No
Similarly you can use isi s3 settings zone modify --validate-content-md5 yes command to enable the Validate Content Md5 option.
Certificates
Server certificates are a requirement for the server to set up a TLS handshake. On a OneFS cluster, the certificate manager manages all the certificates. The certificate manager is designed to provide a generic programmatic way for accessing and configuring certificates on the cluster. The HTTPS certificates used by S3 are handled by the isi certificate manager. The Apache instance uses the same store.
S3 support 385

Bucket handling
Buckets are the containers for objects. You can have one or more buckets. For each bucket, you can control access to it (who can create, delete, and list objects in the bucket).
Buckets are a similar concept to exports in NFS and shares in SMB. A major difference between buckets and NFS export is that any user with valid credentials can create a bucket on the server, and the bucket is owned by that user.
OneFS now supports these bucket and account operations:
 PUT bucket  GET bucket (list objects in a bucket)  GET bucket location  DELETE bucket  GET Bucket acl  PUT Bucket acl  HEAD Bucket  List Multipart Uploads  GET Service

Managing buckets
You can manage S3 buckets from the OneFS command-line interface. You can now view, create, modify, and delete buckets.

View bucket list

You can view a list of existing S3 buckets on the cluster through the OneFS command-line interface.
The S3 protocol should be enabled on the OneFS cluster.
Run the isi s3 buckets list command. The list of existing buckets on the cluster appears.

Bucket Name Path

Owner Object ACL Policy Description

-------------------------------------------------------------

default

/ifs/data root replace

-------------------------------------------------------------

.

View bucket details
You can view properties of an existing S3 bucket through the OneFS command-line interface. The S3 protocol should be enabled on the OneFS cluster. Run the isi s3 buckets view command. The following command allows you to view the details of the bucked named "default"
isi s3 buckets view default
The properties of the "default" bucket is displayed:
Bucket Name: default Path: /ifs/data
Owner: root Object ACL Policy: replace
Description:

386 S3 support

Create a bucket
You can create a s3 bucket through the OneFS command-line interface. The S3 protocol should be enabled on the OneFS cluster. Run the isi s3 buckets create command. The following command specifies the name of the bucket and the path of the S3 bucket. The path must be within /ifs.
isi s3 buckets create default /ifs/data
The bucket called "default" is created in the "data" path of the /ifs file system.
Modify bucket details
You can modify the properties of an existing S3 bucket through the OneFS command-line interface. The S3 protocol should be enabled on the OneFS cluster. Run the isi s3 buckets modify command. The following command allows you to modify the details of the bucked named "default." In this example, the modification is made to the description of the bucket.
isi s3 buckets modify default --description <root>
The description of the "default" bucket is changed. Run the isi s3 buckets view command to verify the modifications.
Bucket Name: default Path: /ifs/data
Owner: root Object ACL Policy: replace
Description: root
Delete a bucket
You can delete a bucket through the OneFS command-line interface. The S3 protocol should be enabled on the OneFS cluster. 1. Run the isi s3 buckets delete command.
isi s3 buckets delete default
The following message appears:
Are you sure you want to delete S3 bucket default? (yes/[no]): 2. Type "yes".
The "default" bucket is deleted.
Object handling
An object consists of a file and optionally any metadata that describes that file. To store an object in S3, you upload the file that you want to store to a bucket. You can set permissions on the object and any metadata. S3 stores data in the form of objects, which are key-value pairs. An object is identified using the key. The data is stored inside the object as a value. In OneFS, you use files to represent objects. An object key points out to a path name and an object value to the contents of the file. An object can have associated metadata with size limits. There can be system metadata, which are generated for every object. Also, there can be user metadata that applications create for selected objects. Objects reside within buckets. The life cycle and access of an object depends on the policies and ACLs enforced on the bucket. Also, each object can have its own ACLs.
S3 support 387

An object key can have prefixes or delimiters, which are used to organize them efficiently.
Object key
The object key is the path of the file from the root of the bucket directory. For OneFS the object key is treated as a file path from root. "/" is treated as the path for directories. The limitations on object keys are listed below:  Cannot use " / " (It is treated as a delimiter) .  Cannot use ". " and ".." as a key or as a part of prefix.  If snapshot is already present, .snapshot cannot be created.  Maximum key length including prefix and delimiter is 1023 bytes.  Key length or each prefix split by / is 255 bytes.  Can use ASCII or UTF-8.  Other OneFS data services may have a problem if path length as a file exceeds 1024 bytes.  Cannot place object under the .isi_s3 directory.  Cannot place file object if a directory with the same name already exists.
Object Metadata
An object can have two types of metadata, system metadata and user-defined metadata. Both system and user-defined metadata are defined as a set of name-value pairs. In OneFS, system metadata gets stored as an inode attribute and the user-defined metadata gets stored as an extended attribute of the file.
Multipart upload
The S3 protocol allows you to upload a large file as multiple parts rather than as a single request. The client initiates a multipart upload with a POST request with the uploads the query parameter and the object key. On the cluster, a unique userUploadId string is generated by concatenating the bucket ID and upload ID and returned to the client. The pair of bucket ID and upload ID is also stored in a per-zone SBT. A directory, .s3_parts_userUploadID is created in the target directory to store the parts. After getting created, the directory and kvstore entry persists until the multipart operation is either completed or stopped. Parts are uploaded with a part number and stored in the temporary directory. A part has a maximum size of 5 GB and the last part a minimum size of 5 MB. Complete multipart upload is handled by concatenating the parts to a temporary file under the .isi_s3 directory. Once the concatenation succeeds, the temporary file is copied to the target, the .s3_parts_userUploadID is deleted, and the SBT entry is removed.
Etag
S3 may use an MD5 checksum as an ETag. This value is specified in the HTTP header "Content-MD5." The Etag consistency is implemented to S3 on OneFS protocol to calculate an MD5 hash for a single PUT object operation when no Etag is provided. This feature provides compatibly with AWS S3 behavior and helps the S3 service to calculate and return the MD5 hash for single object PUT operations when an Etag is not supplied by the client. Two new configuration options, use-md5-for-etag and validate-content-md5 are added to control when the MD5 hash is calculated. The options are in the S3 zone settings and can be configured on a per-zone basis. By default, both options are disabled and the MD5 hash is not calculated. If the validate-content-md5 is set to true, then the MD5 hash is calculated provided the PUT object request has a Content-MD5 to check the content integrity. If use-md5-for-etag is set to true, then the MD5 hash is calculated on request if no Content-MD5 is provided to store as the Etag . If both options are set to true , the MD5 hash is always calculated.
388 S3 support

PUT object
The PUT object operation allows you to add an object to a bucket. You must have the WRITE permission on a bucket to add an object to it. To emulate the atomicity guarantee of an S3 PUT object, objects are written to a temporary directory, .isi_s3 before getting moved to the target path. On PUT, directories are implicitly created from writing the object. Implicitly created directories are owned by the object owner and have permissions that are inherited from the parent directory.
Cross protocol locking
S3 object operations only operate in the shared mode lock domain. The S3 protocol ignores byte range locks and other advisory locks. The PUT object operation takes a share mode lock DENY_NONE on the target file with the delete access bit only for the rename operation on the target file which is released upon completion. The GET operation takes a shared mode lock DENY_WRITE on the file with the read access bit to maintain the atomicity during the read. This process ensures that the data is not modified from other clients or protocols during the GET object operation.
Authentication
S3 uses its own method of authentication which relies on access keys that are generated for the user. The access ID is sent in the HTTP request and is used to identify the user. The secret key is used in the signing algorithm. There are two signing algorithms, Version 2 (v2) and Version 4 (v4). S3 requests can either be signed or unsigned. A signed request contains an access ID and a signature. The access ID indicates who the user is. The included signature value is the result of hashing several header values in the request with a secret key. The server must use the access ID to retrieve a copy of the secret key, recompute the expected hash value of the request, and compare against the signature sent. If they match, then the requester is authenticated, and any header value that was used in the signature is now verified to be unchanged as well. An S3 operation is only performed after the following criteria are met:  Verify signatures that use AWS Signature Version 4 or AWS Signature Version 2 and validate it against the S3 request.  Get user credential using access ID, once verification is complete.  Perform authorization of user credential against bucket ACL.  Perform traversal check of user credential against object path.  Perform access check of user credential against object ACL.
Access keys
On OneFS, user keys are created using PAPI and stored in the kvstore. The entry format in the kvstore is access_id:secret_key. The secret key is the randomly generated base64 string. The access key is formatted as ZoneId_username_accid. In the S3 protocol, on receiving an authenticated request, the access key is used to retrieve the secret key from the keystore. The signature is then generated on the server side, using the header fields from the request and the user's secret key. If the signature matches, the request is successfully authenticated. The username and zone information encoded in the access ID is used to generate the user security context and the request is performed. By default, when a new key is created, the previous user key remains valid for 10 minutes. If you want, you can change it up to 1440 minutes (24 hrs) by using the --existing-key-expiry-time command.
Access control
In S3, permissions on objects and buckets are defined by an ACL. S3 supports five grant permission types: READ, WRITE, READ_ACP, WRITE_ACP, and FULL_CONTROL. The FULL_CONTROL grant is a shorthand for all grants. Each ACE consists of one grantee and one grant. The grantee can either be a user or one of the defined groups that OneFS S3 supports, Everyone and Authenticated Users. S3 ACLs are limited to a maximum of 100 entries.
S3 support 389

ACL concepts
In S3, you must understand some concepts that are related to an ACL.
Grantee: S3 ACL grantees can be specified as either an ID or an email address to an AWS account. The ID is a randomly generated value for each user. For the OneFS S3, only ID is supported and the ID is set to be the username or group of the grantee.
S3 Groups: S3 has two predefined groups, Everyone and Authenticated Users. On OneFS, Everyone is translated to the integrated World group SID S-1-1-0 and Authenticated Users is translated to the integrated group Authenticated User SID S-1-5-11.
Canned ACL: When specifying ACLs in S3, the user can either specify the ACL as a list of grants or use a canned ACL. The canned ACL is a predefined ACL list which is added to the file. The supported canned ACLS are private, public-read, public-read-write, authenticated-read, bucket-owner-read, and bucket-owner-full-control.
Default ACL: When objects and buckets are created in S3 by a PUT operation, the user has the option of setting the ACL. If no ACL is specified, then the private canned ACL is used by default, granting full control to the creator.

Object ACL

S3 ACLs are a legacy access control mechanism that predates Identity and Access Management (IAM).
On OneFSobjects, ACLs are translated to NTFS ACLs and stored on-disk. The table below lists the mapping of S3 grants to NTFS grants. The difference in the OneFSS3 implementation is the WRITE grant is allowed on object ACLs. In S3, the WRITE grant has no meaning as the S3 protocol does not allow modifying objects.
The WRITE grant instead allows an object to be modified through other access protocols. For translating S3 ACLs to NTFS ACLs for operations PUT object ACL, the translation of each entry happens as shown in the table. The translation of NTFS ACL to S3 ACL, as needed in the GET object ACL some entries may not be shown. As NTFS ACLs have a richer set of grants, permissions that are not in the table are omitted. Deny ACEs are also omitted as S3 ACLs do not support a deny entry.

Table 11. Mapping S3 grants to NTFS grants

S3 ACL

NTFS Permissions

READ

SYNCHRONIZE | READ_DATA | READ_ATTR | READ_EA

WRITE

SYNCHRONIZE | WRITE_DATA | WRITE_ATTR | WRITE_EA | APPEND_DATA

READ_ACP

READ_CONTROL

WRITE_ACP

WRITE_DAC

FULL_CONTROL

FILE_ALL_ACCESS

An S3 ACL can also have one of the following pre-defined groups as a grantee:
 Authenticated Users: Any signed request is included in this group.  All Users: Any request, signed or unsigned, is included in this group.  Log Delivery Group: This group represents the log server that writes server access logs in the bucket.
Object ACLs translate to the following S3 permissions:

Table 12. Equivalent S3 Permissions - Object ACLs

ACL

S3 Permissions

READ

s3:GetObject, s3:GetObjectVersion, s3:GetObjectTorrent

WRITE

Not Applicable

READ_ACP

s3:GetObjectAcl, s3:GetObjectVersionAcl

WRITE_ACP

s3:PutObjectAcl, s3:PutObjectVersionAcl

FULL_CONTROL

All of the above

A difference in the OneFS implementation is the implicit owner ACE permission. In S3 the object owner is implicitly granted FULL_CONTROL, regardless of the ACL on the file. On OneFS to emulate this behavior, an ace entry granting FULL_CONTROL to the object owner is appended to the end of any ACL set by S3 which does not grant the owner FULL_CONTROL privilege.

390 S3 support

Bucket ACL

S3 ACLs are a legacy access control mechanism that predates Identity and Access Management (IAM).
ACLs set on the bucket are written as part of the bucket configuration in Tardis. The ACLs define which S3 bucket operations are allowed by which user.

Table 13. Grants for S3 operation Operation PUT Object DELETE Object Multipart Upload (Initiate, upload, complete, and abort) List Multipart Upload List Parts HEAD Bucket GET BUCKET (List Ob jets) GET BUCKET ACL PUT BUCKET ACL

Grant Required WRITE WRITE WRITE
READ READ READ READ READ_ACP WRITE_ACP

Bucket ACLs translate to the following S3 permissions:

Table 14. Equivalent S3 Permissions - Bucket ACLs

ACL

S3 Permissions

READ

s3:ListBucket, s3:ListBucketVersions, s3:ListBucketMultipartUploads

WRITE

s3:PutObject, s3:DeleteObject

READ_ACP

s3:GetBucketAcl

WRITE_ACP

s3:PutBucketAcl

FULL_CONTROL

All of the above

Directory permissions
In S3, directories may be implicitly related on a PUT object for keys with delimiters.
For directories related this way, the user issuing the PUT object request becomes the owner of the directory and the directory mode gets copied from the parent.
S3 Permissions
The following is a list of S3 permissions which OneFS supports.  AbortMultipartUpload  DeleteObject  DeleteObjectVersion  GetObject  GetObjectAcl  GetObjectVersion  GetObjectVersionAcl  ListMultipartUploadParts  PutObject  PutObjectAcl

S3 support 391

 PutObjectVersionAcl  CreateBucket  DeleteBucket  ListBucket  ListBucketVersions  ListAllMyBuckets  ListBucketMultipartUploads  GetBucketAcl  PutBucketAcl
Some of these permissions require special handling. The following permissions are handled outside of the bucket, and may be handled in PAPI:

Table 15. S3 Permissions Permissions ListAllMyBuckets
CreateBucket

Effect
This permission gives an IAM user the ability to list all their buckets. However, it is only applied in user policies, which OneFS does not support. OneFS users are automatically given the ability to list their own buckets without must set this permission. Also, a user with ISI_PRIV_S3 privilege can list buckets using PAPI.
This permission gives the users the ability to create a bucket. This can only be used in S3 user policies. Users are allowed or denied this permission using PAPI bucket configuration.

The following permissions interact with file system ACLs and require extra handling:

Table 16. S3 Permissions Permissions DeleteObject CreateBucket
ListBucket

Effect S3 gives a user permission to delete a particular object.
S3 gives a user permission to create or update a particular object.
S3 gives a user permission to list objects in the bucket.

You cannot bypass file system permissions. If a user has the ListBucket permission, but does not have read permission on a directory, then the user cannot list the files in that directory.

Anonymous authentication
Requests sent without an authentication header in S3 are run as the anonymous user. An anonymous user is mapped to the user 'nobody'.

Access key management
Access keys are used to sign the requests you send to the S3 protocol.
Access keys consist of two parts, an access key ID and a secret access key. Like a username and password, you must use both the access key ID and secret access key together to authenticate your requests.
You must generate access key ID and secret access key for an authenticated user upon request. When the user makes an S3 request, the access key ID in request is used to look up the secret access key, and then the signing of request is verified. A PAPI interface is provided for generating this pair for each identity and persists the pair to a cluster-wide store. Each request looks up its credentials in this cluster-wide store, with a possible in-memory cache in the S3 protocol head.
To generate the access key ID and secret access key for an authenticated user upon request, the following rules apply:
 The access key ID can be a 16 to 128-byte string.

392 S3 support

 A secret key of size 28 bytes is randomly generated, and the user cannot set it.  You must store the access key ID and secret access key on disk, for high availability.  There is a username (1 to 64-byte string) associated with the access key ID. Users with the Administrator role are only authorized to generate access keys. Users have only one access key ID. However, users may have at most two secret keys when the old key has an expiry date set. If an Administrator creates a new secret key for a user and forgets to set the expiry time, the administrator cannot go back and set the expiry time again. The new key is created and the old key is set to expire after 10 minutes, by default.
Managing keys
You can manage keys from the OneFS command-line interface. You can now view, create, and delete the keys that you have created. The OneFS command-line interface does not provide a mechanism to view the secret keys of other users and the data that are related to the keys.
Create keys
You can create a secret key for a user through the OneFS command-line interface. The S3 service should be enabled on the OneFS cluster. 1. Run the isi s3 keys create command.
The following command allows you to create a secret key for the user, "guest": isi s3 keys create guest
S3 key is created successfully for the user "guest", and you are asked if you want to reveal the key information. 2. Enter "yes"
The secret key for the user is displayed: Access ID: 1_Guest_accid Secret Key: xShccR_XRWNuXkcKFspGIEJj_WB3 Timestamp: 2021-01-21T10:12:51 Old Secret Key: -
Old Key Timestamp: Old Key Expiry: -
Delete a key
You can delete an existing secret key for a user through the OneFS command-line interface. The S3 service should be enabled on the OneFS cluster. 1. Run the isi s3 buckets delete command.
The following command allows you to delete a secret key for the user, "guest": isi s3 keys delete guest
The following message appears: Are you sure you want to delete the S3 key for user guest? (yes/[no]):
2. Enter "yes." The secret key for the "guest" user is deleted.
S3 support 393

Create your own keys
You can create your own secret keys through the OneFS command-line interface. The S3 service should be enabled on the OneFS cluster. 1. Run the isi s3 mykeys create command.
Your S3 key is created successfully, and you are asked if you want to reveal the key information. 2. Enter "yes".
The secret key details are displayed. Access ID: 1_root_accid Secret Key: UHmlQXDhaO6m168fT5Xu8xv4QYGh Timestamp: 2021-01-21T10:30:26 Old Secret Key: Old Key Timestamp: Old Key Expiry: -
View your own keys
You can view the secret keys that you have created through the OneFS command-line interface. The S3 service should be enabled on the OneFS cluster. 1. Run the isi s3 mykeys view command.
You are asked if you want to reveal the key information. 2. Enter "yes".
The secret key details are displayed. Access ID: 1_root_accid Secret Key: UHmlQXDhaO6m168fT5Xu8xv4QYGh Timestamp: 2021-01-21T10:30:26 Old Secret Key: Old Key Timestamp: Old Key Expiry: -
Delete your own keys
You can delete the secret keys that you have created through the OneFS command-line interface. The S3 service should be enabled on the OneFS cluster. 1. Run the isi s3 mykeys delete command.
The following message appears: Are you sure you want to delete the S3 key? (yes/[no]):
2. Enter" yes." The secret key is deleted.
394 S3 support

32
Small Files Storage Efficiency for archive workloads
Small Files Storage Efficiency for archive workloads improves the overall storage efficiency of clusters in which small files consume most of the logical space.
Topics:
· Overview · Requirements · Upgrades and rollbacks · Interoperability · Managing Small Files Storage Efficiency · Reporting features · File system structure · Defragmenter overview · Managing the defragmenter · CLI commands for Small Files Storage Efficiency · Troubleshooting Small Files Storage Efficiency
Overview
The Small Files Storage Efficiency feature improves storage efficiency for small file archive workloads.
Archive workloads are large numbers of small files that are rarely modified but must be stored long term and available for retrieval. Small files are defined as 1 MB or less in size. Storage efficiency of these data sets is improved by consolidating file data and reducing overall protection overhead. Files that meet specified criteria are packed (containerized) in a special container called a ShadowStore. FilePools policies provide the selection criteria.
NOTE: There is a trade-off between storage efficiency and performance. The goal of Small Files Storage Efficiency is to improve storage efficiency, which can affect performance. Small Files Storage Efficiency is enabled using the isi_packing utility. After enabling the feature, you configure FilePools policies to specify the selection criteria for files that should be packed. The SmartPools job packs and containerizes the selected files in the background. The job handles packing and unpacking according to a FilePools policy packing flag. The SmartPools job can take significant time to complete its initial run on the data set. Subsequent runs are faster because only new and modified files are packed.
Defragmenter tool
After files are packed, overwrites and file deletions can cause fragmentation of the ShadowStore. Fragmentation affects storage efficiency. To accommodate archive workloads with moderate levels of overwrites and deletions, Small Files Storage Efficiency provides a ShadowStore defragmenter. See Defragmenter overview on page 404 for more information. The ShadowStoreDelete job runs periodically to reclaim unused blocks from ShadowStores. This job also runs the defragmenter.
Small Files Storage Efficiency for archive workloads 395

Assessment tool
An assessment tool is available to estimate the raw space that could be reclaimed by enabling Small Files Storage Efficiency. You can use the assessment tool without enabling Small Files Storage Efficiency.

Requirements
Small Files Storage Efficiency is designed for the following conditions.  A significant portion of the space used in a defined data set is for small files. Small file is defined as less than 1 MB.  The files are used in an archive workflow, meaning that the files are not modified often. Moderate levels of modifications are
accommodated by running the defragmentation tool.  There must be an active SmartPools license and a SmartPools policy enabled on the cluster.
A File System Analytics (FSA) license is highly recommended. That license permits you to use the FSAnalyze job and isi_packing utility to monitor storage efficiency.

Upgrades and rollbacks
There is a minor difference in upgrade procedures related to Small Files Storage Efficiency depending on whether you are upgrading from OneFS versions earlier than 8.0.1 or 8.0.1 and later.
Upgrading from OneFS versions earlier than 8.0.1
If you are upgrading from OneFS versions earlier than 8.0.1, Small Files Storage Efficiency is available only after the upgrade is committed. The feature cannot be rolled back, although you may disable it if needed. Small Files Storage Efficiency implements new on-disk structures and fields. Because rollback of the feature is not possible, you are not permitted to enable the feature until the installation or upgrade is committed. The sequence of steps for obtaining and enabling Small Files Storage Efficiency is: 1. Perform the upgrade. 2. Test the upgrade. At this point, you cannot enable or test Small Files Storage Efficiency. 3. Commit the upgrade. 4. Enable Small Files Storage Efficiency. 5. Test Small Files Storage Efficiency. 6. You may disable Small Files Storage Efficiency if needed.
Upgrading from OneFS 8.0.1 and later
If all nodes are running OneFS 8.0.1 or later, and all are committed, then you may follow the normal upgrade, test, and commit procedures. You may enable Small Files Storage Efficiency before committing the upgrade, and the upgrade can be rolled back if needed.

Interoperability

This section describes how Small Files Storage Efficiency interoperates with OneFS components.

OneFS component SyncIQ

Description
 Packed files are treated as normal files during failover and failback operations. Packed files are packed on the target cluster if Small Files Storage Efficiency is enabled on the target cluster and the correct file pools policy can be applied.
 SyncIQ does not synchronize the file pools policies. You must manually create the correct file pools policies on the target cluster.

396 Small Files Storage Efficiency for archive workloads

OneFS component

Description
 Best practice is to enable Small Files Storage Efficiency on the source cluster and on the target cluster so that you retain the benefits of storage efficiency on both clusters.
 If Small Files Storage Efficiency is enabled on only the source cluster, there is risk that the target cluster may run out of space. Running out of space blocks data replication. NOTE: The benefits of Small Files Storage Efficiency are not realized on the target cluster until a SmartPools job runs on the replicated data.

File clones and deduplication InsightIQ CloudPools

Interoperability is limited.  Cloned files are not optimized.  Deduplication skips packed files.  Packing skips deduplicated files.
 Adding shadow references to files does not change the logical file size. It does change the physical block usage of files.
 InsightIQ cluster summary figures accurately display the used and free space.  The figures for per-directory and per-file usage may not be accurate.
Interoperability is limited.
 The CloudPools SmartLink files are not packed.  Packed files can be unpacked first and included in SmartLink files.  Recalled files are not packed immediately.

SmartLock

The packing process handles write-once/read-many (WORM) files as regular files. WORM files are good candidates for packing. WORM files are unlikely to cause fragmentation due to writing changed files back to disk, so will not degrade storage efficiency.

Managing Small Files Storage Efficiency
You enable and configure Small Files Storage Efficiency using the OneFS command-line interface (CLI). You can also run reports and disable the feature using the CLI. You must have an active SmartPools license and a SmartPools policy enabled before you configure Small Files Storage Efficiency. We also recommend that you have a File System Analytics (FSA) license.

Implementation overview

Use the following steps to implement Small Files Storage Efficiency.

Task

Instructions

1

Optionally generate the Storage Efficiency report to get a

baseline of your cluster's storage efficiency.

See Monitor storage efficiency with FSAnalyze on page 402.

Save the job number so you can compare the before-packing and after-packing storage efficiency results.

2 Enable Small Files Storage Efficiency.

See Enable Small Files Storage Efficiency on page 398.

3 Configure the global options that control file packing behavior. See View and configure global settings on page 398.

4 Create FilePools policies that define selection criteria for the See Specify selection criteria for files to pack on page

files to pack.

399.

When all of the above tasks are completed, the SmartPools job runs in the background, selecting files based on FilePools policies and packing them.

Small Files Storage Efficiency for archive workloads 397

Enable Small Files Storage Efficiency
Small Files Storage Efficiency is disabled by default. Use this procedure to enable it. You must have an active SmartPools license and at least one SmartPools policy enabled. 1. Run the isi_packing --enabled=true command. 2. Type yes to display the license agreement.
The license agreement displays. 3. Enter q, and then accept the license agreement. Small Files Storage Efficiency is enabled.

View and configure global settings

Use the isi_packing command to configure the behavior of Small Files Storage Efficiency. 1. To view the current configuration of Small Files Storage Efficiency, enter the following command:

# isi_packing --ls

For example:

# isi_packing --ls

Enabled:

No

Enable ADS:

No

Enable snapshots:

No

Enable mirror containers:

No

Enable mirror translation:

No

Unpack recently modified:

No

Unpack snapshots:

No

Avoid deduped files:

Yes

Maximum file size:

1016.00k

SIN cache cutoff size:

8.00M

Minimum age before packing:

1D

Directory hint maximum entries: 16

Container minimum size:

1016.00k

Container maximum size:

1.00G

2. To view configuration options and syntax, enter: # isi_packing -I --help

The following usage displays.

usage: isi_packing [--ls] [--fsa] [--enabled true|false] [--enable-ads true|false] [--enable-snaps true|false] [--enable-mirror-containers true|false] [--enable-mirror-translation true|false] [--unpack-recent true|false] [--unpack-snaps true|false] [--avoid-dedupe true|false] [--max-size bytes] [--sin-cache-cutoff-size bytes] [--min-age seconds] [--dir-hint-entries entries] [--container-min-size bytes] [--container-max-size bytes] [-v]
Additional help information also appears below the syntax. 3. To change any configuration setting, use the following command:
# isi_packing <option>=<value>
For example:
isi_packing --enabled=false

398 Small Files Storage Efficiency for archive workloads

For guidance, use the additional help that appears after the usage display or see the reference page for isi_packing on page 414 .

Specify selection criteria for files to pack

To define which files to select for packing, create FilePools policies. The policies you create are applied when the SmartPools background job runs.
To create FilePools policies, you must activate a SmartPools license and have the SmartPools or higher administrative privilege.
You should be familiar with the isi filepool policies create command.
See Storage Pools on page 344 for information about SmartPools and FilePools policies. 1. Create FilePools policies that define the data sets to pack.
The syntax is:
isi filepool policies create <policy_name> --enable_packing=true / --begin_filter --path=<path_name> --changed-time=<time> --operator=<operator> --endfilter

where:

<policy_name> Identifies this FilePool policy.

--

Enables packing on the data sets in this policy. See Disable packing on page 399 for disable options.

enable_packing=t

rue

--

Identifies a data set.

path=<path_nam

e>

--changed-

These two parameters set the amount of time that must pass since a file was modified before that

time=<time>

file is selected for packing. The default amount of time is the global value set by isi packing

--

--min-age. Use this policy-specific --changed-time parameter to increase the amount of time

operator=<operat to wait since changes were made to a value greater than the global setting.

or>

NOTE: The policy-specific --changed-time parameter can not decrease the amount of time

to wait before selecting the file for packing to less than the global setting. Any value less than

the global --min-age setting is ignored. If you want a setting that is less than the current global

default, you must first change the global --min-age parameter in the isi packing command.

The following example enables packing on the /ifs/data/pacs data set.
isi filepool policies create pacs --enable_packing=true / --begin_filter --path=/ifs/data/pacs --end-filter
The following example specifies that a file in /ifs/data/pacs is selected for packing only if 1 week has passed since the file was last modified.
isi filepool policies create pacs --enable_packing=true / --begin_filter --path=/ifs/data/pacs --changed-time=1W / --operator=gt --end-filter
2. Wait for the SmartPools job to run. The data sets identified in the FilePools policies are packed by the SmartPools job.

Disable packing
You can disable packing either globally or by editing the individual FilePools policies. 1. To globally disable packing, run the following command:
# isi_packing --enable=false

Small Files Storage Efficiency for archive workloads 399

Packed files remain packed, but no additional files will be packed. 2. To disable packing by policy, use the following steps:
a. Check the available space on the cluster to ensure that there is sufficient free space to handle unpacked files. b. Run the command isi filepool policies modify policy-name. There are two choices.
 Include the --enable-packing=false parameter. With --enable-packing= set to false, the packed files that match the policy are unpacked. For example, the following command disables packing on the myfiles policy and unpacks all existing packed files that match the policy criteria:
# isi filepool policies modify myfiles --enable-packing=false
 Exclude the --enable-packing parameter. If the parameter does not exist in the policy, then no packing or unpacking activity occurs. Files remain packed or unpacked depending on their state. For example, the following command disables packing on the myfiles policy but does not change the state of existing packed files:
# isi filepool policies modify myfiles

Reporting features

Small Files Storage Efficiency includes the following reporting features.

Activity

Description

More information

Estimate possible storage savings with the isi_sfse_assess command.

The isi_sfse_assess command scans a set of files and simulates the work that Small Files Storage Efficiency would do. The results are an estimate of the savings that could be achieved by packing files.

See Estimate possible storage savings on page 400.

View packing and unpacking

The SmartPools job report shows the number See View packing and unpacking activity

results from the SmartPools job of files that were packed, re-packed, or

by SmartPools jobs on page 401.

report.

unpacked during the run.

Monitor storage efficiency with the FSAnalyze job.

The FSAnalyze job generates detailed statistics which can be further analyzed. The isi_packing --fsa command uses data from an FSAnalyze job to generate overall storage efficiency numbers.

See Monitor storage efficiency with FSAnalyze on page 402.

View ShadowStore details with the isi_sstore command.

The isi_sstore command shows statistics for each ShadowStore.

See View ShadowStore information on page 403.

Monitor storage efficiency on a small data set.

The isi_storage_efficiency command is a debugging script that calculates storage efficiency on small, sample data sets.

See Monitor storage efficiency on a small data set on page 403.

Estimate possible storage savings
Use the isi_sfse_assess command to generate an estimate of possible storage savings that could be achieved by packing files.
This command scans a set of files and simulates the work that Small Files Storage Efficiency would do. It generates an estimation of the savings that could be achieved with packing, without moving any data.
You can run this command against the entire file system or against a specific directory. If you have an idea of a directory that might benefit from file packing, you can confirm the potential savings with this command, and then create an appropriate FilePool policy.
For information about the options and example output, see the reference page for isi_sfse_assess on page 408.

400 Small Files Storage Efficiency for archive workloads

1. To start the assessment, use the isi_sfse_assess command, as follows:

Usage: isi_sfse_assess <assess mode> [process options] [sysctl options]

Assess Modes: -a | --all -p <path> | --path=<path> -r | --resume

: assess all files on OneFS : assess <path> and sub-dirs : resume previous assessment

Process Options: -q | --quick -f <fails> | --max-fails=<fails>
1000) -v | --verbose

: slow mode (better accuracy) : max failures before aborting (default:
: verbose mode

Sysctl Options:

--max-size=<bytes>

: max file size to pack

--avoid-bsin[=on|off]

: avoid cloned/dudped files

--mirror-translation-enabled[=on|off] : convert mirrored to FEC

--mirror-containers-enabled[=on|off] : process mirrored files

--snaps-enabled[=on|off]

: process snapshots

--ads-enabled[=on|off]

: process ADS files

For example:
root# isi_sfse_assess -a -q -v --mirror-translation-enabled --mirror-containersenabled

2. To interrupt processing, use the Ctrl-C keys. A summary of the estimation progress displays. In the background, the context of the current run-time status is saved.
3. To resume processing, use the following command:
# isi_sfse_assess -r [-v]
The assessment processing continues where it left off, using the same options that were provided in the original command. The -v option, for more verbose output, is the only additional option permitted with the resume option. If other options are included with the resume option, they are ignored.

View packing and unpacking activity by SmartPools jobs
The SmartPools job report shows the number of files that were packed, re-packed, or unpacked during the job run. 1. View the job report from a SmartPool job.
# isi job reports view -v 12
Alternatively, to view a job report using the Web UI:
a. Select Cluster management > Job operations > Job reports > Type=SmartPools, Phase=1. b. Click View details. 2. Review the job output. For each policy, output similar to the following displays:
'pol1': {'Policy Number': 0, 'Files matched': {'head':500, 'snapshot': 0}, 'Directories matched': {'head':1, 'snapshot': 0}, 'ADS containers matched': {'head':0, 'snapshot': 0}, 'ADS streams matched': {'head':0, 'snapshot': 0}, 'Access changes skipped': 0, 'Protection changes skipped': 0, 'Packing changes skipped': 0, 'File creation templates matched': 1, 'Skipped packing non-regular files': 1, 'Skipped packing regular files': 0,

Small Files Storage Efficiency for archive workloads 401

'Skipped files already in containers': 0, 'Files packed': 500,
'Files repacked': 0, 'Files unpacked': 0, },

Monitor storage efficiency with FSAnalyze
To monitor storage efficiency, run an FSAnalyze job and then the isi_packing --fsa command. An active File System Analytics (FSA) license is required. This is a two-step procedure. First, the FSAnalyze job scans the file system and records the total of logical and physical blocks used. A CLI command uses that data to calculate a global storage efficiency. The storage efficiency should improve when packing is enabled. We recommend using this procedure before and after you enable packing, to observe the storage efficiency improvements. Thereafter, use this procedure periodically to monitor the current global state of the file system's storage efficiency. If storage efficiency degrades, you might want to use FilePool policies that match more data. You can also use the isi_sfse_assess command to help identify additional directory trees that could benefit from packing. 1. Run the FSAnalyze job or obtain the jobid of a previously run FSAnalyze job.
You can run the FSAnalyze job manually or on a schedule, or both. To run it manually, enter the following:
# isi job start FSAnalyze
2. Generate the storage efficiency report using the following command: # isi_packing --fsa [--fsa-jobid $jobid]
Where: --fsa-jobid $jobid Specifies an FSAnalyze job run to use for generating the report. If this option is not included,
isi_packing --fsa defaults to using the most recently run FSAnalyze job.
For example:
# isi_packing --fsa --fsa-jobid 100

The isi_packing --fsa command produces the FSA storage efficiency report. The report is similar to the following.

# isi_packing --fsa

FSAnalyze job: 83 (Wed Jul 27 01:57:41 2016)

Logical size: 1.8357T

Physical size: 3.7843T

Efficiency:

48.51%

Where: FSAnalyze job Logical size Physical size Efficiency

Shows the FSA job ID. In the example, this is 83.
Shows the logical size of all files scanned during the FSA job run.
Shows the physical size of all files scanned during the FSA job run.
Shows the file storage efficiency, calculated as logical size / physical size. The isi_packing --fsa command does not count the files in the /ifs/.ifsvar directory as logical data. Because of that, if you run the FSA job on an empty cluster, the space used by /ifs/.ifsvar can cause FSA to report a low storage efficiency. As you store more data, the effect of /ifs/.ifsvar dissipates.

402 Small Files Storage Efficiency for archive workloads

View ShadowStore information
The isi_sstore command displays information about ShadowStores. For example command output and explanations of each field, see the reference page for isi_sstore on page 416. 1. To list ShadowStores, use the following command:
# isi_sstore list -l
2. For more information about each ShadowStore, including fragmentation and efficiency scores, use the verbose option. # isi_sstore list -v
3. To view statistics about ShadowStores, use the following command: # isi_sstore stats

Monitor storage efficiency on a small data set
The isi_storage_efficiency debugging script calculates storage efficiency on small, sample data sets. The script runs out of memory if you run it on large data sets. This script recursively scans through a directory of files and calculates the storage efficiency of files in the sample data set, taking into account the use of shadow stores. The Unix du command does not show accurate usage for files with shadow references, including packed files. To obtain storage efficiency for a small data set, enter the following command:
isi_storage_efficiency <filepath>
For example:
isi_storage_efficiency /ifs/data/my_small_files
For sample output, see the reference page for isi_storage_efficiency on page 422.

File system structure

Small Files Storage Efficiency uses a specific class of container ShadowStore to contain packed data. File attributes indicate the pack state and pack policy type.
You can determine the types of data that reside in a ShadowStore (SIN) by checking the SIN prefix.
 Base shadow stores (BSINs) with the prefix 0x40 contain clone or deduplicated data.  Container shadow stores (CSINs) with the prefix 0x41 contain packed data. The following file attributes indicate a file's pack state.

File attribute packing_policy
packing_target

Description

Original default before any packing or unpacking

Indicates whether the file meets the criteria set by your file pool policies and is eligible for packing. The value is updated by the SmartPools job. Values are:
 container--The file is eligible to be packed.  native--The file is not eligible to be packed.

native

Describes the file's current state. Values are:

native

 container--The file is packed.

 native--The file is explicitly unpacked, or never packed.

Small Files Storage Efficiency for archive workloads 403

File attribute packing_complete

Description

Original default before any packing or unpacking

Indicates whether the target is satisfied. Values are:

complete, indicating that the

 complete--The target is satisfied, meaning that packing packing target state is intact. or unpacking is complete.

 incomplete--The packing state of the file is not specifically known.

NOTE: To ensure that there is sufficient space to handle unpacking or expanding packed or deduplicated files, monitor physical space on a regular basis. In particular, SyncIQ operations unpack and expand deduplicated files on the target cluster. Those files are then repacked on the target cluster.

Viewing file attributes

Use the isi get command to view file attributes. 1. Enter the following command:
# isi get -D <file name>

2. Scan the output for packing attributes. For example:

# isi get -D /ifs/data/pol1/file.001

POLICY W LEVEL PERFORMANCE COAL ENCODING

FILE

IADDRS

+2d:1n 18 4+2/2 concurrency on UTF-8

file.001

<1,1,2411008:512>, <2,4,1406976:512>, <3,5,1359360:512> ct: 1554959873 rt: 0

*************************************************

* IFS inode: [ 1,1,2411008:512, 2,4,1406976:512, 3,5,1359360:512 ]

*************************************************

*

* Inode Version:

6

.

. <output intentionally deleted>

.

* Packing policy: container

* Packing target: container

* Packing status: complete

.

. <output intentionally deleted>

.

Defragmenter overview
The Small Files Storage Efficiency defragmenter reclaims space in the ShadowStore containers by moving data into a more optimal layout.
The defragmenter divides each SIN into logical chunks and assesses each chunk for fragmentation. If the current storage efficiency of each chunk is below a target efficiency then the chunk is processed by moving all of the data out of it and to another location where it is stored more efficiently.
The default target efficiency is 90% of the maximum storage efficiency available with the protection level used by the shadow store. Larger protection group sizes can tolerate a higher level of fragmentation before the storage efficiency drops below this threshold.
Attributes such as the chunk size, target efficiency, and the types of SINs to examine are configurable. In addition, you can configure the defragmenter to reduce the number of protection groups, when possible.
The defragmenter is implemented in the ShadowStoreDelete job. This job runs periodically to reclaim unused blocks from shadow stores. There is also a CLI command that runs the defragmenter.
The feature includes the following methods for obtaining statistics about fragmentation and storage efficiency. These aids can help you decide when and how often to run the defragmenter.

404 Small Files Storage Efficiency for archive workloads

 Running the defragmenter in assessment mode generates estimates of the amount of space that could be saved by defragmentation.
 The isi_sstore list -v commmand generates fragmentation and storage efficiency scores.
Managing the defragmenter
The defragmenter is disabled by default. You use command line commands to enable and configure it. When it is enabled, the defragmenter runs as part of the ShadowStoreDelete job. You can also run it on the command line. An assessment mode lets you preview space savings before running the defragmenter.
Enable the defragmenter
The defragmenter is an optional feature of the Small Files Storage Efficiency and does not require a separate license. It must be explicitly enabled. 1. Log in to any node
You do not need to be root but you need PRIV_ROOT privilege. Using sudo is typically enough. 2. Ensure that Small Files Storage Efficiency is enabled using the following command:
# isi_packing --ls
3. Enable the defragmenter using the following command:
# isi_gconfig -t defrag-config defrag_enabled=true
Configure the defragmenter
Use the isi_gconfig -t defrag-config command to configure global values for the defragmenter options. The defragmenter runs as part of the ShadowStoreDelete job. It uses global configuration values that are set in a global configuration. The following list describes the global options. defrag_enabled={true | false}
Controls whether the shadow store defragmenter is enabled. The installed value is false. access_mode={true | false} Controls whether the defragmenter runs in assessment mode. Assessment mode generates an estimate of the disk space savings that could occur with defragmentation without actually performing the defragmentation. This mode does not move any data or make any other on-disk changes. This is a quick operation that can be used to determine if the defragmentation feature should be fully enabled. The assessment mode must be turned off for the defragmentation process to do any actual work. The installed value is false. bsins_enabled={true | false} Controls whether the defragmenter examines BSINs. BSINs are block-based shadow stores, which are stores used by clone and dedupe operations. The defragmentation process on BSINs can be intensive and may take some time. The installed value is false. csins_enabled={true | false} Controls whether the defragmenter examines CSINs. CSINs are small file storage efficiency containers. The installed value is true. pg_efficiency={true | false}
Small Files Storage Efficiency for archive workloads 405

Enables or disables protection group efficiency. This is a compaction feature. When enabled, this option attempts to reduce the number of protection groups needed by the shadow stores, which in turn reduces restripe time. The installed value is true. snapshots_enabled={true | false} Determines whether the defragmenter examines snapshot files for references to the shadow store being defragged. Consider the following:  When this option is disabled, if snapshot files contain references to shadow store blocks that need
to be defragmented, the defragmenter can not move those blocks and the shadow store may remain fragmented.  When this option is enabled, it can add significant processing overhead for clusters with many snapshots.
Depending on your workflow, it may be preferable to run the defragmenter most frequently without examining files from snapshots, with occasional runs that include the snapshot files.
The installed value is false. target_efficiency=<efficiency-percent>
Sets the target efficiency percentage. The target_efficiency determines the minimum acceptable storage efficiency relative to the maximum storage efficiency achievable by the shadow store based on its current protection level.
A target of 90% is relatively easy to achieve with a large cluster. The value can be set even higher. Smaller clusters, such as a 3-node cluster, may perform better with a lower target, such as 80%.
The percent is a whole number. If a fraction is specified, the digits after the decimal point are ignored.
The installed global configuration value is 90.
chunk_size=<bytes> Sets the defragmentation chunk size, in bytes. The chunk size is the size of each region in the shadow store that is independently evaluated for defragmentation. The optimal size depends on your workflow.  Setting a value greater than the size of the shadow store (for example, 2GB), forces the entire shadow store to be defragmented only when the efficiency of the entire store is degraded.  Setting a small value (for example, 1MB) achieves more aggressive gains. The installed global configuration value is 33554432 which is 32MB. This setting works well in most scenarios.
log_level=<defrag_log_level> This parameter is currently not used.
The following procedure describes how to view the current settings and how to change them.
1. To view the current global settings, enter this command:
# isi_gconfig -t defrag-config
The output looks similar to the following:
# isi_gconfig -t defrag-config [root] {version:1} defrag_enabled (bool) = true assess_mode (bool) = false bsins_enabled (bool) = true csins_enabled (bool) = true pg_efficiency (bool) = true snapshots_enabled (bool) = true target_efficiency (uint8) = 90 chunk_size (uint64) = 33554432 log_level (enum defrag_log_level) = notice
2. Run the following command to change the value of a global setting.
# isi_gconfig -t defrag-config <option>=<value>
406 Small Files Storage Efficiency for archive workloads

For example: # isi_gconfig -t defrag-config target_efficiency=95

Run the defragmenter

The ShadowStoreDelete job runs the defragmenter. There is also a CLI command that runs the defragmenter. The following table describes the two methods for running defragmentation.

Method

Explanation

Automatically in the ShadowStoreDelete job.

When the defragmentation feature is enabled in the global configuration, defragmentation runs automatically as part of the ShadowStoreDelete job. In this case, the option settings in the global configuration control the behavior of the defragmenter.

On the command line.

You can run the defragmenter on the command line using the isi_sstore defrag command. In this case, the options are set on the command line. See isi_sstore defrag on page 419 for more information.

The following procedure describes how to prepare for running the defragmenter automatically as part of the ShadowStoreDelete job..
1. Make sure that defragmentation is enabled and review current global configuration settings as described in previous procedures.
2. Optionally, run the isi_sstore list -v command to check the fragmentation and storage efficiency scores before defragmentation. The output is similar to the following:

# isi_sstore list -v

SIN

lsize

psize

type underfull frag score efficiency

4100:0001:0001:0000 66584576 129504K

container

no

0.49 0.50

...

3. Run the ShadowStoreDelete job.

refs filesize

date

sin

16256 128128K Sep 20 22:55

# isi job jobs start ShadowStoreDelete [-o HIGH]

This job reclaims unused blocks from shadow stores and runs the defragmenter if it is enabled.

4. Optionally rerun the isi_sstore list -v command to check the fragmentation and storage efficiency scores after defragmentation. The following results are expected:  The fragmentation score should be lower after defragmentation.
If the fragmentation score is not reduced, check if the shadow store underfull flag is true. When the shadow store contains only a small amount of data, the defragmenter may not move the data, and the fragmentation score remains unchanged.
 The efficiency score should be higher after defragmentation.
The storage efficiency may be reported as low if the shadow store is underfull due to an insufficient sample of containerized data.

View estimated storage savings before defragmenting
Run the defragmenter in assessment mode to generate statistics about the amount of disk space that could be reclaimed by the defragmentation process. Assessment mode does not move any data or make any other on-disk changes. It is a quick operation, useful to help determine if the defragmenter should be run. It requires existing ShadowStores with previously containerized data.

Small Files Storage Efficiency for archive workloads 407

To run the assessment and view results, use the isi_sstore defrag command with appropriate options on the command line .
NOTE: Although you may set up the global configuration so that the ShadowStoreDelete job runs the defragmenter in assessment mode, the job output does not display the statistics.
1. Log onto any node in the cluster. You do not need to be root but you need PRIV_ROOT privilege. Using sudo is typically enough.
2. Run the isi_sstore defrag command using at least the -d and -a options. The -d option enables the defragmenter for this command run. The defragmenter does not need to be globally enabled. The -a option runs the defragmenter in assessment mode. For descriptions of all available options, see isi_sstore defrag on page 419. As an example, the following command displays potential storage savings if the small files storage efficiency containers are defragmented and protection groups are reduced.
# isi_sstore defrag -d -a -c -p -v
The example above displays statistics similar to the following:
# isi_sstore defrag -d -a -c -p -v ... Processed 1 of 1 (100.00%) shadow stores, space reclaimed 31M Summary: Shadows stores total: 1 Shadows stores processed: 1 Shadows stores skipped: 0 Shadows stores with error: 0 Chunks needing defrag: 4 Estimated space savings: 31M
CLI commands for Small Files Storage Efficiency
The CLI commands in this section configure, monitor, and manage Small Files Storage Efficiency and the related defragmenter.
isi_sfse_assess
Generates an estimate of possible storage savings that could be achieved by packing files with Small Files Storage Efficiency.
Usage
The isi_sfse_assess command scans a set of files and simulates the work that Small Files Storage Efficiency would do. This command generates an estimate of disk space savings without moving any data. It does not require a license and does not require that Small Files Storage Efficiency be enabled. Use this tool before enabling Small Files Storage Efficiency to see possible storage savings. Use the tool after some file packing has occurred to identify additional possible savings given the current state of the file system. The assessment is based on calculating the blocks saved when small files are packed into containers at the same protection level. A file is categorized as small if its size is less than the value of the max_size option. The default is about 1MB. Many of the options in the isi_sfse_assess command mirror options available during actual packing. These are system level control options (sysctl options) with preset default values. For packing to achieve the results predicted during assessment, you must use the same settings for packing and assessment.  You can change the default settings for sysctl options used during packing with the isi_packing command.  You can change the default settings for sysctl options used during assessment with this isi_sfse_assess command. The assessment skips the following types of files.  Non-regular files (not recorded).  Unlinked files (not recorded).
408 Small Files Storage Efficiency for archive workloads

 ADS files, if ads_enabled is false.  Stubbed (CloudPools) files.  Empty files (not recorded).  Zero-sized files, where all blocks have no physical content, such as shadow references, ditto, etc.  Oversized files, where the file size is greater than the max_size value.  Mirror protected files, if mirror_containers_enabled is false.  Clone/deduped files, if avoid_bsin is true.
The command reports progress as it runs by displaying the following information:
 % complete.  Estimated possible space savings on the files scanned so far. This number should continually increases as the program
progresses.  Estimated time remaining.
You can temporarily interrupt processing at any time using CTRL-C. The command saves its progress, allowing you to restart processing at a later time. Use the --resume (or -r) option to restart the processing. For details, see Example: Stop and restart processing on page 411 below.

Syntax

Usage: isi_sfse_assess <assess-mode> [process options] [sysctl options]

Assess Modes: -a | --all -p <path> | --path=<path> -r | --resume

: assess all files on OneFS : assess <path> and sub-dirs : resume previous assessment

Process Options: -q | --quick -f <fails> | --max-fails=<fails> -v | --verbose

: slow mode (better accuracy) : max failures before aborting (default: 1000) : verbose mode

Sysctl Options:

--max-size=<bytes>

: max file size to pack

--avoid-bsin[=on|off]

: avoid cloned/deduped files

--mirror-translation-enabled[=on|off] : convert mirrored to FEC

--mirror-containers-enabled[=on|off] : process mirrored files

--snaps-enabled[=on|off]

: process snapshots

--ads-enabled[=on|off]

: process ADS files

Options

-a | --all

Scans all files across the cluster for possible storage savings. The scan includes snapshots if both of the following are true:
 The --snaps-enabled option is set to on.
 The default (slow) process option is selected. If slow is not the process option, the scan is adjusted for faster processing, and snapshots are not included.

-p <path> | --path=<path>
Scans files in the named path for possible storage savings across the named directory path. This option performs a tree walk across all files and subdirectories within the named path. Because snapshots are invisible to the directory tree structure, the tree walk does not process any snapshots. Both absolute and relative path names are acceptable.

-r | --resume

Users can interrupt a running assessment using theCTRL-C keys simultaneously. This option resumes the assessment processing at the point where it was interrupted. The resumed process uses all of the same options that were specified on the original command.

-q | --quick

Small Files Storage Efficiency for archive workloads 409

Slow mode is the default. Use this option to override the default and run in quick mode. The differences are:  Quick mode makes some assumptions during processing based on file and block size, as opposed to
gathering actual data block information. If your OneFS system stores only regular files (no snapshots, cloned or deduped files, etc.), the results of quick mode can be very close to the accuracy achieved in slow mode.  Slow mode is more accurate but is very time-consuming. This mode collects actual data block information, including overhead blocks, and the results are precise. -f <fails> | --max-fails=<fails> The maximum number of failures allowed before aborting the assessment process. The default is 1000. The command first collects a list of files to process and then proceeds with actual processing. A failure occurs when it attempts to process a file that was modified or deleted after being added to the list.These failures are more likely to occur on a busy cluster with a very large number of files. -v | --verbose Turns on verbose output. --max-size=<bytes> Sets the maximum size of files to select for processing. The default is 1040384 bytes, which is 8192 bytes less than 1MB, or 127 fs blocks. This value makes files less than 1MB available for packing. --avoid-bsin[=on | off] Controls whether to avoid cloned and deduped files. The default is on, or true, meaning that deduped files are not processed. We recommend not to pack deduped files. Packing them has the effect of undoing the benefits of dedupe. Also, packing deduped files may affect performance when reading the packed file. NOTE: The dedupe functionality does not dedupe packed files.
--mirror-translation-enabled[=on | off] Controls whether to pack mirrored files into FEC containers with equivalent protection. The default is off, or false.  The off setting ensures that a mirrored file remains a true mirror. This is an important quality for some users.  The on setting allows packing of files with mirror protection polices into containers with equivalent FEC protection policies. The on setting can increase space savings.
--mirror-containers-enabled[=on | off] Controls whether to process mirrored files. The default is off, or false.  The off setting does not process mirrored files.  The on setting allows creation of containers with mirrored protection policies. Mirrored files remain mirrored, so there is no space saving. However, this setting can reduce the total protection group count and potentially reduce rebuild times.
--snaps-enabled[=on | off] Controls whether to process snapshots. The default is off, or false.  The off setting does not process snapshot files. Use this setting if processing time is an issue.  The on setting processes snapshot files. This processing can significantly increase the time it takes to pack a data set if there are many snapshots with data. The advantage to using the on setting is the storage savings that may be gained. Snapshot files are often sporadically allocated, which typically results in poor storage efficiency. Packing can improve the storage efficiency.
--ads-enabled[=on | off] Controls whether to process ADS files. The default is off, or false.  The off setting does not process ADS files. Typically, these stream files are too large to be considered for packing. In addition, it is more efficient to process directories of streams files, but not efficient to process them singly from various locations.  The on setting processes ADS files. Use this setting if you have small ADS files located together in a directory.
410 Small Files Storage Efficiency for archive workloads

Example: Start assessment in slow mode on all files
The following command scans all files in slow mode.
# isi_sfse_assess -a
Example: Start assessment in quick mode on a directory
The following command uses quick mode to generate precise space saving estimates on the /ifs/my-data directory.
# isi_sfse_assess -q -p /ifs/my-data
Example: Stop and restart processing
# isi_sfse_assess -a --snaps-enabled --mirror-containers-enabled # <CTRL-C> # isi_sfse_assess -r
The process resumes using all of the same options that were originally entered.
Example: Verbose output
root# isi_sfse_assess -a -s -v --mirror-translation-enabled --mirror-containers-enabled -----------------------------------------------SFSE simulation options:
Slow mode: on Max fails: 1000 Verbose output: on Sysctls:
efs.sfm.pack.max_size: 1040384 efs.sfm.pack.avoid_bsin: 1 efs.sfm.pack.mirror_translation_enabled: 1 (5 nodes involved in mirror translation) efs.sfm.pack.mirror_containers_enabled: 1 efs.sfm.pack.snaps_enabled: 0 efs.sfm.pack.ads_enabled: 0 ----------------------------------------------->> Starting LIN scan assessment... >> 1632 files (13.27%) scanned...[ETC: 1 minute, 5 seconds] >> 3014 files (19.51%) scanned...[ETC: 1 minute, 22 seconds] >> 4423 files (22.95%) scanned...[ETC: 1 minute, 40 seconds] >> 5685 files (25.75%) scanned...[ETC: 1 minute, 55 seconds] >> 6960 files (28.87%) scanned...[ETC: 2 minutes, 3 seconds] >> 8367 files (35.34%) scanned...[ETC: 1 minute, 49 seconds] >> 9708 files (38.96%) scanned...[ETC: 1 minute, 49 seconds] >> 11019 files (46.11%) scanned...[ETC: 1 minute, 33 seconds] >> 12307 files (49.25%) scanned...[ETC: 1 minute, 32 seconds] >> 13565 files (52.68%) scanned...[ETC: 1 minute, 29 seconds] >> 14892 files (56.11%) scanned...[ETC: 1 minute, 26 seconds] >> 16289 files (63.05%) scanned...[ETC: 1 minute, 10 seconds] >> 17738 files (66.40%) scanned...[ETC: 1 minute, 5 seconds] >> 19135 files (73.03%) scanned...[ETC: 51 seconds] >> 20455 files (76.86%) scanned...[ETC: 45 seconds] >> 21779 files (80.93%) scanned...[ETC: 37 seconds] >> 22989 files (82.21%) scanned...[ETC: 36 seconds] >> 24309 files (88.52%) scanned...[ETC: 23 seconds] >> 25635 files (100.00%) scanned...[ETC: 0 seconds] >> 25938 files (100.00%) scanned...[ETC: 0 seconds] -----------------------------------------------25938 files scanned: * Packable: 23978 * Non-packable: 14 - Oversized: 14
Small Files Storage Efficiency for archive workloads 411

- Cloned/deduled: 0 - Snapshots: 0 - ADS: 0 - Stubbed: 0 - Zero-sized: 0 * Failed: 0 * Skipped: 1946
SFSE estimation summary: * Raw space saving: 1.5 GB * PG reduction: 22298
SFSE estimation details: * prot level: 3x, files: 3995, size: 314857823, data blks: 41090 - effective prot level: 3+2 - prot overhead: 82180 -> 27396 - prot groups: 5666 -> 857 * prot level: 4x, files: 3996, size: 315210935, data blks: 41134 - effective prot level: 4x - prot overhead: 123402 -> 123402 - prot groups: 5669 -> 2571 * prot level: 5x, files: 3995, size: 314857823, data blks: 41090 - effective prot level: 5x - prot overhead: 164360 -> 164360 - prot groups: 5666 -> 2569 * prot level: 8+2/2, files: 4002, size: 314867566, data blks: 41097 - prot overhead: 38374 -> 10290 - prot groups: 4002 -> 322 * prot level: 12+3/3, files: 3995, size: 314857823, data blks: 41090 - prot overhead: 57540 -> 10278 - prot groups: 3995 -> 215 * prot level: 16+4/4, files: 3995, size: 314857823, data blks: 41090 - prot overhead: 76720 -> 10304 - prot groups: 3995 -> 161
------------------------------------------------
isi_gconfig -t defrag-config
Enables or disables ShadowStore defragmentation and changes the global configuration settings for the defragmenter.
Usage
When the defragmenter runs in the ShadowStoreDelete job, it is controlled by settings in the global configuration.
When the defragmenter is initiated on the command line using the isi_sstore defrag command, that command uses the global settings for target_efficiency, chunk_size, and log_level unless you override the values with command line options. The isi_sstore defrag command resets all of the boolean values in the global configuration to false, allowing you to set them using command options specific to each command line execution.
Syntax
isi_gconfig -t defrag-config [defrag_enabled={true | false}] [access_mode={true | false}] [bsins_enabled={true | false}] [csins_enabled={true | false}] [pg_efficiency={true | false}] [snapshots_enabled={true | false}] [target_efficiency=<efficiency-percent>] [chunk_size=<bytes>][log_level=<defrag_log_level>]
Options
defrag_enabled={true | false}
412 Small Files Storage Efficiency for archive workloads

Controls whether the shadow store defragmenter is enabled. The installed value is false.
access_mode={true | false} Controls whether the defragmenter runs in assessment mode. Assessment mode generates an estimate of the disk space savings that could occur with defragmentation without actually performing the defragmentation. This mode does not move any data or make any other on-disk changes. This is a quick operation that can be used to determine if the defragmentation feature should be fully enabled. The assessment mode must be turned off for the defragmentation process to do any actual work. The installed value is false.
bsins_enabled={true | false} Controls whether the defragmenter examines BSINs. BSINs are block-based shadow stores, which are stores used by clone and dedupe operations. The defragmentation process on BSINs can be intensive and may take some time. The installed value is false.
csins_enabled={true | false} Controls whether the defragmenter examines CSINs. CSINs are small file storage efficiency containers. The installed value is true.
pg_efficiency={true | false} Enables or disables protection group efficiency. This is a compaction feature. When enabled, this option attempts to reduce the number of protection groups needed by the shadow stores, which in turn reduces restripe time. The installed value is true.
snapshots_enabled={true | false} Determines whether the defragmenter examines snapshot files for references to the shadow store being defragged. Consider the following:  When this option is disabled, if snapshot files contain references to shadow store blocks that need to be defragmented, the defragmenter can not move those blocks and the shadow store may remain fragmented.  When this option is enabled, it can add significant processing overhead for clusters with many snapshots.
Depending on your workflow, it may be preferable to run the defragmenter most frequently without examining files from snapshots, with occasional runs that include the snapshot files.
The installed value is false.
target_efficiency=<efficiency-percent> Sets the target efficiency percentage. The target_efficiency determines the minimum acceptable storage efficiency relative to the maximum storage efficiency achievable by the shadow store based on its current protection level.
A target of 90% is relatively easy to achieve with a large cluster. The value can be set even higher. Smaller clusters, such as a 3-node cluster, may perform better with a lower target, such as 80%.
The percent is a whole number. If a fraction is specified, the digits after the decimal point are ignored.
The installed global configuration value is 90.
chunk_size=<bytes> Sets the defragmentation chunk size, in bytes. The chunk size is the size of each region in the shadow store that is independently evaluated for defragmentation. The optimal size depends on your workflow.  Setting a value greater than the size of the shadow store (for example, 2GB), forces the entire shadow store to be defragmented only when the efficiency of the entire store is degraded.  Setting a small value (for example, 1MB) achieves more aggressive gains. The installed global configuration value is 33554432 which is 32MB. This setting works well in most scenarios.
Small Files Storage Efficiency for archive workloads 413

log_level=<defrag_log_level> This parameter is currently not used.
Examples
Enable and disable the defragmentation tool
The defragmenter is disabled by default after installation. The following example enables the defragmentation tool in the global configuration.
# isi_gconfig -t defrag-config defrag_enabled=true
Display the global configuration for the defragmentation tool
The isi_gconfig -t defrag-config command displays the current settings for the defragmentation tool in the global configuration. The following example shows the command and typical settings.
# isi_gconfig -t defrag-config [root] {version:1} defrag_enabled (bool) = true assess_mode (bool) = false bsins_enabled (bool) = true csins_enabled (bool) = true pg_efficiency (bool) = true snapshots_enabled (bool) = true target_efficiency (uint8) = 90 chunk_size (uint64) = 33554432 log_level (enum defrag_log_level) = notice
Change a default value in the global configuration for the defragmentation tool
The following example changes the default value for the target_efficiency setting to 95%.
# isi_gconfig -t defrag-config target_efficiency=95
isi_packing
Enables or disables file packing and controls the behavior of pack operations. This command sets global options that apply to the packing operation regardless of FilePool policy.
Usage
FilePool policies control files that are selected for packing. In addition, many of the options in the isi_packing command set system level control options (sysctl options) that are also applied to file selection. The system level control options are preset with default values. You may change the default settings with this command. The packing operation skips the following types of files.  Non-regular files (not recorded).  Unlinked files (not recorded).  ADS files, if ads_enabled is false.  Stubbed (CloudPools) files.  Empty files (not recorded).  Zero-sized files, where all blocks have no physical content, such as shadow references, ditto, etc.  Oversized files, where the file size is greater than the max_size value.  Mirror protected files, if mirror_containers_enabled is false.  Clone/deduped files, if avoid_dedupe is true.
414 Small Files Storage Efficiency for archive workloads

Syntax
isi_packing [--ls] [--fsa] [--enabled true|false] [--enable-ads true|false] [--enable-snaps true|false] [--enable-mirror-containers true|false] [--enable-mirror-translation true|false] [--unpack-recent true|false] [--unpack-snaps true|false] [--avoid-dedupe true|false] [--max-size bytes] [--sin-cache-cutoff-size bytes] [--min-age seconds] [--dir-hint-entries entries] [--container-min-size bytes] [--container-max-size bytes] [-v]
Options
--ls List current settings.
--fsa [ --fsa-jobid <job id> ] Reports FSAnalyze job results. If you do not specify an FSAnalyze job ID, the request uses the results from the last FSAnalyze job run.
--enabled {true | false} Enable or disable packing. Set to true to enable packing. Set to false to disable packing.
--enable-ads {true | false} Controls whether to process ADS files. The default is false.  The false setting does not process ADS files. Typically, these stream files are too large to be considered for packing. In addition, it is more efficient to process directories of streams files, but not efficient to process them singly from various locations.  The true setting processes ADS files. Use this setting if you have small ADS files located together in a directory.
--enable-snaps {true | false} Controls whether to process snapshots. The default is false.  The false setting does not process snapshot files. Use this setting if processing time is an issue.  The true setting processes snapshot files. This processing can significantly increase the time it takes to pack a data set if there are many snapshots with data. The advantage to using the true setting is the storage savings that may be gained. Snapshot files are often sporadically allocated, which typically results in poor storage efficiency. Packing can improve the storage efficiency.
--enable-mirror-containers {true | false} Controls whether to process mirrored files. The default is false.  The false setting does not process mirrored files.  The true setting allows creation of containers with mirrored protection policies. Mirrored files remain mirrored, so there is no space saving. However, this setting can reduce the total protection group count and potentially reduce rebuild times.
--enable-mirror-translation {true | false} Controls whether to pack mirrored files into FEC containers with equivalent protection. The default is false.  The false setting ensures that a mirrored file remains a true mirror. This is an important quality for some users.  The true setting allows packing of files with mirror protection polices into containers with equivalent FEC protection policies. This setting can increase space savings.
--unpack-recent {true | false}
Small Files Storage Efficiency for archive workloads 415

Unpack recently modified files. --unpack-snaps {true | false}
Unpack packed snapshot version files. --avoid-dedupe {true | false}
Controls whether to avoid cloned and deduped files. The default is true, meaning that deduped files are not processed. We recommend not to pack deduped files. Packing them has the effect of undoing the benefits of dedupe. Also, packing deduped files may affect performance when reading the packed file.
NOTE: The dedupe functionality does not dedupe packed files. --max-size <bytes>
Maximum size of files to select for processing. The default is 1040384 bytes, which is 8192 bytes less than 1MB, or 127 fs blocks. This value makes files less than 1MB available for packing. --sin-cache-cutoff-size <bytes> Maximum size of a container ShadowStore in cache. --min-age <seconds> A global setting for the minimum amount of time that must pass since a file was modified before the file is selected for packing, in seconds. The installed default minimum age is one day. A FilePool policy may set a higher minimum age using its --changed-time parameter. --dir-hint-entries <entries> Number of entries in the directory container hint. --container-min-size <bytes> Minimum size of a container shadow file. The default is 1040384 bytes. Any containers whose size is less than this are considered to be underfull and not able to provide decent savings. A container shadow file is a ShadowStore used exclusively by packing. --container-max-size <bytes> Maximum size of a container shadow file.
Examples
FSAnalyze job results
To look at the FSAnalyze results for job 2069:
# isi_packing --fsa --fsa-jobid 2069
Show current configuration settings
# isi_packing --ls
List help
# isi_packing -I --help
isi_sstore
Displays information about ShadowStores. This section describes the isi_sstore parameters related to Small Files Storage Efficiency: list and stats.
416 Small Files Storage Efficiency for archive workloads

Syntax

isi_sstore [list { -l| -v}] [stats]

Options

list { -l | -v }

Lists all shadow stores. The -l option displays a summary. The -v option displays more details and can take some time to run depending on the number of shadow stores.

stats

Displays statistics for each shadow store.

Examples

Example: isi_sstore list -l
The following is example summary output from isi_sstore list -l.

# isi_sstore list -l

SIN 4000:0001:0000:0001 4000:0001:0000:0002 4000:0001:0000:0003 4000:0001:0000:0004 4000:0001:0000:0005 4000:0001:0000:0006 4000:0001:0000:0007 4000:0001:0000:0008 4000:0001:0000:0009 4000:0001:0000:000a 4100:0001:0000:0000

lsize 516096 368640
0 139264 401408
8192 360448 450560 294912 516096 2654208

psize 828928 681472
26112 452096 714240
50688 673280 763392 607744 828928 4081152

refs filesize

date

126 32632K Jul 11 12:22

90 32632K Jul 11 12:22

0 32632K Jul 11 12:22

34 32632K Jul 11 12:22

98 32632K Jul 11 12:22

2 32632K Jul 11 12:22

88 32632K Jul 11 12:22

110 32632K Jul 11 12:22

72 32632K Jul 11 12:22

126 32632K Jul 11 12:22

648 32632K Jul 11 12:24

The output includes the following information.

SIN
lsize psize refs

Identifies the ShadowStore. The SIN number prefix identifies the type of ShadowStore.  The prefix 0x40 identifies a Container ShadowStore with clone and deduplicated data.  The prefix 0x41 identifies a Container ShadowStore with packed data.
Logical size of the ShadowStore, indicating the amount of data contained within.
Physical size.
Total references in the ShadowStore. Includes the number of incoming references to blocks stored in the ShadowStore and references from the ShadowStore.

filesize

The filesize of the ShadowStore. Because ShadowStores often have sparse regions, this metric does not indicate the amount of data contained within. See lsize, above.
 For BSINs, the filesize is set to 2GB when the ShadowStore is created. The space is filled as needed and is never extended.
 For CSINs, the filesize increases as data is added until the size reaches a threshold. Then a new CSIN is created.

date

Creation date of the ShadowStore .

Small Files Storage Efficiency for archive workloads 417

Example: isi_sstore list -v

The following is example verbose output from the isi_sstore list -v command:

# isi_sstore list -v

SIN

lsize

psize

underfull frag score efficiency

4000:0001:0000:0001 1163264 5136384

no

0.00

0.23

4000:0001:0001:0000 2777088 6012928

no

0.75

0.46

4000:0001:0002:0000 1433600 5947392

no

0.00

0.24

4000:0001:0003:0000 163840 2138112

yes

0.00

0.08

4100:0001:0000:0001

0

24576

yes

0.00

0.00

4100:0001:0000:0002

0

32768

yes

0.00

0.00

4100:0001:0000:0003

0

40960

yes

0.00

0.00

4100:0001:0000:0004

0

32768

yes

0.00

0.00

4100:0001:0001:0000

0

24576

yes

0.00

0.00

4100:0001:0001:0001

0

32768

yes

0.00

0.00

4100:0001:0001:0002

0

40960

yes

0.00

0.00

4100:0001:0001:0003

0

40960

yes

0.00

0.00

4100:0001:0001:0004

0

24576

yes

0.00

0.00

4100:0001:0002:0000

0

24576

yes

0.00

0.00

4100:0001:0002:0001

0

32768

yes

0.00

0.00

4100:0001:0002:0002

0

40960

yes

0.00

0.00

4100:0001:0002:0003

0

32768

yes

0.00

0.00

4100:0001:0002:0004

0

24576

yes

0.00

0.00

4100:0001:0003:0000

0

32768

yes

0.00

0.00

4100:0001:0004:0000

0

40960

yes

0.00

0.00

4100:0001:0005:0000

0

24576

yes

0.00

0.00

4100:0001:0005:0001

0

40960

yes

0.00

0.00

refs filesize

date sin type

157 2097152K Apr 11 03:22 block

372 2097152K Apr 11 03:23

block

1055 2097152K Apr 11 03:24 block

20 2097152K Apr 11 03:24 block

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

0 131072 Apr 11 05:18 container

All shadow store summary: Block SINs:
4 shadow stores 5537792 (5408 KB) logical bytes 19234816 (18 MB) physical bytes (including metadata) 928 (928) incoming refs Container SINs: 18 shadow stores 0 (0 B) logical bytes 589824 (576 KB) physical bytes (including metadata) 0 (0) incoming refs

SStores in CStat summary: Block SINs:
4 shadow stores 5537792 (5408 KB) logical bytes

The output includes the following information.

418 Small Files Storage Efficiency for archive workloads

SIN type
underfull flag frag score
efficiency score

One of the following:  block--the shadow store is used for dedupe or clone operations.  container--the shadow store is used for packing operations.
If yes, the container is too small to provide storage savings benefits.
A measure of the level of fragmentation in the shadow store. Higher numbers mean more fragmentation.
The value is the ratio of the sparse blocks in partially allocated stripes to the total size of all stripes containing data.
A ratio of the logical size of the shadow store versus the physical size required to store it (including protection overhead). Higher numbers are better but there will be a limit based on the protection level in use by the shadow stores.

Example: isi_sstore stats

The following is example output from the isi_sstore stats command.

# isi_sstore stats Block SIN stats: 6 MB user data takes 3 MB in shadow stores, using 6 MB physical space. 280K physical average per shadow store.
2 refs per block. Reference efficiency 50%. Storage efficiency 200%
Container SIN stats: 3 MB user data takes 3 MB in shadow stores, using 4 MB physical space. 3984K physical average per shadow store.
1 refs per block. Reference efficiency 0%. Storage efficiency 100% Raw counts={ type 0 num_ss=20 lsize=3055616 pblk=715 refs=1119 } { type 1 num_ss=1 lsize=2654208 pblk=498 refs=648 }

The first set of statistics is for a shadow store that contains cloned data. The Storage efficiency of 200% means that every two files consume one file space after cloning.
The second set of statistics is for a shadow store that contains packed data. Because packed data is single-referenced, its Storage efficiency is 100% and its Reference efficiency is 0% (that is, no block sharing).
The Raw counts field at the end of the output contains the following statistics:

num_ss lsize pblk refs

The number of ShadowStores. Logical size of data contained in the ShadowStores. Number of physical blocks. Number of incoming block references for the shadow store.

isi_sstore defrag
Runs ShadowStore defragmentation on a node.
Usage
This command runs on a single node and iterates through all shadow stores serially. The command starts by retrieving the defragmentation global configuration from gconfig and then resets all of the boolean options to false. Use command line options to enable (re-enable) each of the options as needed.

Small Files Storage Efficiency for archive workloads 419

Syntax

isi_sstore defrag [-a] [-b] [-c] [-d] [-e percent] [-h] [-l level] [-p] [-s] [-v] [-z size] [sins]

Options
-a
-b -c -d -e percent
-h -l level -p -s

Runs the shadow store defragmenter in assessment mode. Assessment mode generates an estimate of the disk space savings that could occur with defragmentation without actually performing the defragmentation. This mode does not move any data or make any other on-disk changes. This is a quick operation that can be used to determine if the defragmentation feature should be fully enabled. The assessment mode must be turned off for the defragmentation process to do any actual work.
Runs the defragmenter against block-based stores. BSINs are block-based shadow stores, which are stores used by clone and dedupe operations. The defragmentation process on BSINs can be intensive and may take some time.
Runs the defragmenter against containers. CSINs are small file storage efficiency containers.
Enables defragmentation. This option is always required, even when running the defragmenter in assessment mode.
Sets the target efficiency percentage. The target_efficiency determines the minimum acceptable storage efficiency relative to the maximum storage efficiency achievable by the shadow store based on its current protection level. A target of 90% is relatively easy to achieve with a large cluster. The value can be set even higher. Smaller clusters, such as a 3-node cluster, may perform better with a lower target, such as 80%. The percent is a whole number. If a fraction is specified, the digits after the decimal point are ignored. The installed global configuration value is 90. For example: -e 95
Displays the command help.
This parameter is currently not used.
Enables protection group efficiency. This is a compaction feature. When enabled, this option attempts to reduce the number of protection groups needed by the shadow stores, which in turn reduces restripe time.
Determines whether the defragmenter examines snapshot files for references to the shadow store being defragged. Consider the following:  When this option is disabled, if snapshot files contain references to shadow store blocks that need
to be defragmented, the defragmenter can not move those blocks and the shadow store may remain fragmented.  When this option is enabled, it can add significant processing overhead for clusters with many snapshots.

420 Small Files Storage Efficiency for archive workloads

-v -z size
sins

Depending on your workflow, it may be preferable to run the defragmenter most frequently without examining files from snapshots, with occasional runs that include the snapshot files.
Sets the output to verbose.
Sets the defragmentation chunk size, in bytes. The chunk size is the size of each region in the shadow store that is independently evaluated for defragmentation. The optimal size depends on your workflow.  Setting a value greater than the size of the shadow store (for example, 2GB), forces the entire
shadow store to be defragmented only when the efficiency of the entire store is degraded.  Setting a small value (for example, 1MB) achieves more aggressive gains. The installed global configuration value is 33554432 which is 32MB. This setting works well in most scenarios.
Provide an optional list of SINs, separated with spaces, on which to apply this command. The default is to include all SINs.

Examples

Example of isi_sstore defrag -d -b
The following is sample output for isi_sstore defrag -d -b . The command runs the defragmenter on all BSINs without moving files in snapshots.
# isi_sstore defrag -d -b Summary:
Shadows stores total: 1 Shadows stores processed: 1 Shadows stores with error: 0 Chunks needing defrag: 1 Estimated space savings: 8192K Files moved: 2 Files repacked: 0 Files missing: 0 Files skipped: 0 Blocks needed: 3072 Blocks rehydrated: 4096 Blocks deduped: 2048 Blocks freed: 4096 Shadows stores removed: 1

Example of isi_sstore defrag -d -a -b -v
The following is sample output for isi_sstore defrag -d -a -b -v . The command runs the shadow store defragmenter in assessment mode. The output shows the disk space that would be reclaimed by defragmenting all block-based shadow stores.
# isi_sstore defrag -d -a -b -v Configuration:
Defrag enabled: 1 BSINs enabled: 1 CSINs enabled: 0 Chunk size: 33554432 Target efficiency: 90 PG efficiency: 0 Snapshots enabled: 0 Log level: 5 Summary: Shadows stores total: 1 Shadows stores processed: 1 Shadows stores skipped: 0 Shadows stores with error: 0

Small Files Storage Efficiency for archive workloads 421

Chunks needing defrag: 1 Estimated space savings: 8192K
Example with a SIN list
The following command requests defragmentation on a list of SINs.
# isi_sstore defrag -v -d -a -c -p 4000:0001:0000:0001 4000:0001:0000:0002 4000:0001:0000:0003 4000:0001:0000:0004 4000:0001:0000:0005

isi_storage_efficiency
Calculates storage efficiency on small, sample data sets.
Usage
The isi_storage_efficiency debugging script recursively scans through a directory and calculates the storage efficiency of files in the sample data set, taking into account the use of ShadowStores. This script runs out of memory if you run it on large data sets.
NOTE: The Unix du command does not show accurate usage for files with shadow references, including packed files.
Syntax

isi_storage_efficiency <path>

Options
path

Path name of directory to scan.

Storage efficiency example

The following example shows the storage efficiency of a small file data set, /ifs/data/my_small_files, before and after packing. Before packing, the storage efficiency is 33%. After packing, storage efficiency is 65.5%.

# isi_storage_efficiency /ifs/data/my_small_files

Mode

Count Logical data size

Size

DIR

1

0

48045

REG

2048

134217728

134217728

Storage efficiency

File data

0.330749354005

File logical data 0.330749354005

Overall

0.330465808769

Overall logical 0.330347556519

Blocks 964
792576

Now, assume that the following activities have occurred:
 A FilePools policy that selects /ifs/data/my_small_files is enabled  A SmartPools job has run.

422 Small Files Storage Efficiency for archive workloads

Rerunning the report shows improved storage efficiency.

# isi_storage_efficiency /ifs/data/my_small_files

Mode

Count Logical data size

Size

Blocks

DIR

1

0

48045

964

REG

2048

134217728

134217728

6144

SIN

1

134217728

167075840

394227

Shadow store usage may be affected by the ShadowStoreDelete job.

Storage efficiency

File data

0.654752716855

File logical data 0.654752716855

Overall

0.653413826082

Overall logical 0.653180011711

Troubleshooting Small Files Storage Efficiency

Possible issues generally fall into the following categories.

Performance Space

I/O efficiency, fragmentation, and packing can affect performance. Unpacking or expanding clones, packed files, and deduplicated files require sufficient available space.

Following are suggestions for investigating these issues.

Log files

To locate container ShadowStores that might be affected by storage efficiency problems, look for the following types of information in the logs.

Module name
SIN ID prefix
File pack state
Packing policy and packing target

Look for the value SFM. Container shadow stores (CSINs) that contain packed data have the prefix 0x41. Values are complete or incomplete. Values are native or container.

For information about file attributes, see File system structure on page 403.

Fragmentation issues
Fragmentation affects space usage and is the most common storage efficiency issue.
If files are overwritten or deleted repeatedly, there can be fragmentation in the container ShadowStores.
The following commands provide information about fragmentation:
 isi_sstore defrag -v -d -a -c -p shows the fragmentation space that defragmentation can reclaim.  isi_packing --ls shows the current packing configuration.  SmartPools job reports in verbose mode shows statistics about files that were packed.  isi_sstore list and isi_sstore stats show the degree of fragmentation. Use isi_sstore list -v to see
the fragmentation score and other verbose attributes for each SIN entry.  isi_storage_efficiency scans through a directory or files and calculates the storage efficiency of files in the sample
data set.  isi get shows file attributes, including packing attributes.  isi_cpr
See the PowerScale OneFS CLI Command Reference for information about isi get. The reference pages for the commands related to Small Files Storage Efficiency are here.

Small Files Storage Efficiency for archive workloads 423

The ShadowStoreDelete job frees up blocks for the shadow store and runs the defragmenter if it is enabled. Be aware of how much space you have packed. Clones, packed files, and deduplicated files are unpacked or expanded on the target cluster during SyncIQ operations and have to be re-packed or re-deduplicated on the target cluster.
424 Small Files Storage Efficiency for archive workloads

33

Networking

This section contains the following topics:
Topics:
· Networking overview · About the internal network · About the external network · Managing internal network settings · Managing groupnets · Managing external network subnets · Managing Multi-SSIP · Managing IP address pools · Managing SmartConnect Settings · Managing connection rebalancing · Managing network interface members · Managing node provisioning rules · Managing routing options · Managing DNS cache settings

Networking overview

After you determine the topology of your network, you can set up and manage your internal and external networks. There are two types of networks on a cluster:

Internal External

Generation 5 nodes communicate with each other using a high-speed, low latency InfiniBand network. Generation 6 nodes support using InfiniBand or Ethernet for the internal network. PowerScale F200 and F600 nodes support only Ethernet as the backend network. You can optionally configure a second InfiniBand network to enable failover for redundancy.
Clients connect to the cluster through the external network with Ethernet. The PowerScale cluster supports standard network communication protocols, including NFS, SMB, HDFS, HTTP, and FTP. The cluster includes various external Ethernet connections, providing flexibility for a wide variety of network configurations.

About the internal network
A cluster must connect to at least one high-speed, low-latency InfiniBand switch (Generation 5 and Generation 6 nodes) or Ethernet (Generation 6 and PowerScale F200 and F600 nodes) for internal communications and data transfer. The connection is also referred to as an internal network. The internal network is separate from the external network (Ethernet) by which users access the cluster.
Upon initial configuration of your cluster, OneFS creates an initial internal network. The interface to the default internal network is int-a. You can add a second internal network for redundancy and failover. Failover allows continuous connectivity during path failures. The interface to the secondary internal network is int-b, which is referred to as int-b/failover in the web administration interface.
CAUTION: Only PowerScale nodes should be connected to your internal network. Information exchanged on the back-end network is not encrypted. Connecting anything other than PowerScale nodes to the internal network creates a security risk.

Networking 425

Internal IP address ranges
The number of IP addresses assigned to the internal network determines how many nodes can be joined to the cluster. When you initially configure the cluster, you specify one or more IP address ranges for the primary InfiniBand switch or Ethernet. This range of addresses is used by the nodes to communicate with each other. It is recommended that you create a range of addresses large enough to accommodate adding additional nodes to your cluster. While all clusters will have, at minimum, one internal InfiniBand or Ethernet network (int-a), you can enable a second internal network to support network failover (int-b/failover). You must assign at least one IP address range for the secondary network and one range for failover. If any IP address ranges defined during the initial configuration are too restrictive for the size of the internal network, you can add ranges to the int-a network or int-b/failover networks, which might require a cluster restart. Other configuration changes, such as deleting an IP address assigned to a node, might also require that the cluster be restarted.
NOTE: Generation 5 nodes support InfiniBand for the internal network. Generation 6 nodes support both InfiniBand and Ethernet for the internal network. PowerScale F200 and F600 nodes support Ethernet for the internal network.
Internal network failover
You can configure an internal switch as a failover network to provide redundancy for intra-cluster communications. In order to support an internal failover network, the int-a port on each node in the cluster must be physically connected to the primary internal network switch, and the int-b port on each node must be connected to the other internal network switch. After the ports are connected, you must configure two IP address ranges; an address range to support the int-b internal interfaces, and an address range to support failover. The failover addresses enable seamless failover in the event that either the int-a or int-b switches fail.
About the external network
You connect a client computer to the cluster through the external network. External network configuration is composed of groupnets, subnets, IP address pools, and features node provisioning rules. Groupnets are the configuration level for managing multiple tenants on your external network. DNS client settings, such as nameservers and a DNS search list, are properties of the groupnet. Groupnets reside at the top tier of the networking hierarchy. You can create one or more subnets within a groupnet. Subnets simplify external (front-end) network management and provide flexibility in implementing and maintaining the cluster network. You can create IP address pools within subnets to partition your network interfaces according to workflow or node type. The IP address pool of a subnet consists of one or more IP address ranges. IP address pools can be associated with network interfaces on cluster nodes. Client connection settings are configured at the IP address pool level. An initial external network subnet is created during the setup of your cluster with the following configuration:  An initial groupnet called groupnet0 with the specified global, outbound DNS settings to the domain name server list and
DNS search list, if provided.  An initial subnet called subnet0 with the specified netmask, gateway, and SmartConnect service address.  An initial IP address pool called pool0 with the specified IP address range, the SmartConnect zone name, and the network
interface of the first node in the cluster as the only pool member.  An initial node provisioning rule called rule0 that automatically assigns the first network interface for all newly added nodes
to pool0.  Adds subnet0 to groupnet0.  Adds pool0 to subnet0 and configures pool0 to use the virtual IP of subnet0 as its SmartConnect service address.
Groupnets
Groupnets reside at the top tier of the networking hierarchy and are the configuration level for managing multiple tenants on your external network. DNS client settings, such as nameservers and a DNS search list, are properties of the groupnet. You can
426 Networking

create a separate groupnet for each DNS namespace that you want to use to enable portions of the PowerScale cluster to have different networking properties for name resolution. Each groupnet maintains its own DNS cache, which is enabled by default.
A groupnet is a container that includes subnets, IP address pools, and provisioning rules. Groupnets can contain one or more subnets, and every subnet is assigned to a single groupnet. Each cluster contains a default groupnet named groupnet0 that contains an initial subnet named subnet0, an initial IP address pool named pool0, and an initial provisioning rule named rule0.
Each groupnet is referenced by one or more access zones. When you create an access zone, you can specify a groupnet. If a groupnet is not specified, the access zone will reference the default groupnet. The default System access zone is automatically associated with the default groupnet. Authentication providers that communicate with an external server, such as Active Directory and LDAP, must also reference a groupnet. You can specify the authentication provider with a specific groupnet; otherwise, the provider will reference the default groupnet. You can only add an authentication provider to an access zone if they are associated with the same groupnet. Client protocols such as SMB, NFS, HDFS, and Swift, are supported by groupnets through their associated access zones.

DNS name resolution
You can designate up to three DNS servers per groupnet to handle DNS name resolution. DNS servers must be configured as an IPv4 or IPv6 address. You can specify up to six DNS search suffixes per groupnet; the suffixes settings are appended to domain names that are not fully qualified. Additional DNS server settings at the groupnet level include enabling a DNS cache, enabling server-side search, and enabling DNS resolution on a rotating basis.

Subnets
Subnets are networking containers that enable you to sub-divide your network into smaller, logical IP networks.
On a cluster, subnets are created under a groupnet and each subnet contains one or more IP address pools. Both IPv4 and IPv6 addresses are supported on OneFS; however, a subnet cannot contain a combination of both. When you create a subnet, you specify whether it supports IPv4 or IPv6 addresses.
You can configure the following options when you create a subnet:
 Gateway servers that route outgoing packets and gateway priority.  Maximum transmission unit (MTU) that network interfaces in the subnet will use for network communications.  SmartConnect service address, which is the IP address on which the SmartConnect module listens for DNS requests on this
subnet.  VLAN tagging to allow the cluster to participate in multiple virtual networks.  Direct Server Return (DSR) address, if your cluster contains an external hardware load balancing switch that uses DSR.
How you set up your external network subnets depends on your network topology. For example, in a basic network topology where all client-node communication occurs through direct connections, only a single external subnet is required. In another example, if you want clients to connect through both IPv4 and IPv6 addresses, you must configure multiple subnets.

IPv6 support

OneFS supports both IPv4 and IPv6 address formats on a cluster.
IPv6 is the next generation of internet protocol addresses and was designed with the growing demand for IP addresses in mind. OneFS supports the USGv6r1 standard of IPv6 used by the US Government.
The following table describes distinctions between IPv4 and IPv6.

IPv4 32-bit addresses Address Resolution Protocol (ARP)

IPv6 128-bit addresses
Neighbor Discovery Protocol (NDP); Duplicate Address Detection (DAD)
Router Advertisement

You can configure the PowerScale cluster for IPv4, IPv6, or both (dual-stack) in OneFS. You set the IP family when creating a subnet, and all IP address pools assigned to the subnet must use the selected format.

Networking 427

VLANs
Virtual LAN (VLAN) tagging is an optional setting that enables a cluster to participate in multiple virtual networks. You can partition a physical network into multiple broadcast domains, or virtual local area networks (VLANs). You can enable a cluster to participate in a VLAN which allows multiple cluster subnet support without multiple network switches; one physical switch enables multiple virtual subnets. VLAN tagging inserts an ID into packet headers. The switch refers to the ID to identify from which VLAN the packet originated and to which network interface a packet should be sent.
IP address pools
IP address pools are assigned to a subnet and consist of one or more IP address ranges. You can partition nodes and network interfaces into logical IP address pools. IP address pools are also utilized when configuring SmartConnect DNS zones and client connection management. Each IP address pool belongs to a single subnet. Multiple pools for a single subnet are available only if you activate a SmartConnect Advanced license. The IP address ranges assigned to a pool must be unique and belong to the IP address family (IPv4 or IPv6) specified by the subnet that contains the pool. You can add network interfaces to IP address pools to associate address ranges with a node or a group of nodes. For example, based on the network traffic that you expect, you might decide to establish one IP address pool for storage nodes and another for accelerator nodes. SmartConnect settings that manage DNS query responses and client connections are configured at the IP address pool level.
Link aggregation
Link aggregation, also known as network interface card (NIC) aggregation, combines the network interfaces on a physical node into a single, logical connection to provide improved network throughput. You can add network interfaces to an IP address pool singly or as an aggregate. A link aggregation mode is selected on a per-pool basis and applies to all aggregated network interfaces in the IP address pool. The link aggregation mode determines how traffic is balanced and routed among aggregated network interfaces.
SmartConnect module
The SmartConnect module specifies how the cluster DNS server handles connection requests from clients and the policies that assign IP addresses to network interfaces, including failover and rebalancing. You can think of SmartConnect as a limited implementation of a custom DNS server. SmartConnect answers only for the SmartConnect zone names or aliases that are configured on it. Settings and policies that are configured for SmartConnect are applied per IP address pool.
NOTE: Enable gratuitous Address Resolution Protocol (gratuitous ARP, or GARP) on the network switch to ensure consistent connectivity. You can configure basic and advanced SmartConnect settings.
SmartConnect Basic
SmartConnect Basic is included with OneFS as a standard feature and does not require a license. SmartConnect Basic supports the following settings:  Specifying the DNS zone.  Round-robin connection balancing method only  Service subnet to answer DNS requests. SmartConnect Basic enables you to add two SmartConnect Service IP addresses to a subnet. SmartConnect Basic has the following limitations to IP address pool configuration:
428 Networking

 You may only specify a static IP address allocation policy.  You cannot specify an IP address failover policy.  You cannot specify an IP address rebalance policy.  You may assign two IP address pools per external network subnet.
SmartConnect Advanced
SmartConnect Advanced extends the settings available from SmartConnect Basic. It requires an active license. SmartConnect Advanced supports the following settings:  Round-robin, CPU utilization, connection counting, and throughput balancing methods  Static and dynamic IP address allocation SmartConnect Advance enables you to add a maximum of six SmartConnect Service IP addresses per subnet. SmartConnect Advanced enables you to specify the following IP address pool configuration options:  You can define an IP address failover policy for the IP address pool.  You can define an IP address rebalance policy for the IP address pool.  SmartConnect Advanced supports multiple IP address pools per external subnet to enable multiple DNS zones within a single
subnet.
SmartConnect Multi-SSIP
OneFS supports defining more than one SmartConnect Service IP (SSIP) per subnet. Support for multiple SmartConnect Service IPs (Multi-SSIP) ensures that client connections continue uninterrupted if an SSIP becomes unavailable. The additional SSIPs provide fault tolerance and a failover mechanism to ensure continued load balancing of clients according to the selected policy. Though the additional SSIPs are in place for failover, they are active and respond to DNS server requests. The SmartConnect Basic license allows defining 2 SSIPs per subnet. The SmartConnect Advanced license allows defining up to 6 SSIPs per subnet.
NOTE: SmartConnect Multi-SSIP is not an additional layer of load balancing for client connections: additional SSIPs only provide redundancy and reduce failure points in the client connection sequence. Do not configure the site DNS server to perform load balancing for the SSIPs. Allow OneFS to perform load balancing through the selected SmartConnect policy to ensure effective load balancing.
Configure DNS servers for SSIP failover to ensure that the next SSIP is contacted only if the first SSIP connection times out. If the SSIPs are not configured in a failover sequence, the SSIP load balancing policy resets each time a new SSIP is contacted. The SSIPs function independently: they do not track the current distribution status of the other SSIPs. Configuring IP addresses as failover-only addresses is not supported on all DNS servers. To support Multi-SSIP as a failover only option, it is recommended that you use a DNS server that supports failover addresses. If a DNS server does not support failover addresses, Multi-SSIP still provides advantages over a single SSIP. However, increasing the number of SSIPs may affect SmartConnect's ability to load balance.
NOTE: If the DNS server does not support failover addresses, test Multi-SSIP in a lab environment that mimics the production environment to confirm the impact on SmartConnect's load balancing for a specific workflow. Only after confirming workflow impacts in a lab environment should you update a production cluster.
SmartConnect zones and aliases
Clients can connect to the cluster through a specific IP address or though a domain that represents an IP address pool. SmartConnect zone aliases enable you to view all the DNS names that a cluster answers for. You create Service Principal Name (SPN) records in Active Directory or in MIT Kerberos for the SmartConnect zone names, as a component of the machine account of the cluster. To create the SPN records, use the CLI isi auth command after you add the zone alias, similar to the following:
isi auth ads spn check --domain=<domain.com> --repair
You can configure a SmartConnect DNS zone name for each IP address pool. The zone name must be a fully qualified domain name. Add a new name server (NS) record that references the SmartConnect service IP address in the existing authoritative
Networking 429

DNS zone that contains the cluster. Provide a zone delegation to the fully qualified domain name (FQDN) of the SmartConnect zone in your DNS infrastructure.
If you have a SmartConnect Advanced license, you can also specify a list of alternate SmartConnect DNS zone names for the IP address pool.
When a client connects to the cluster through a SmartConnect DNS zone:  SmartConnect handles the incoming DNS requests on behalf of the IP address pool.  The service subnet distributes incoming DNS requests according to the connection balancing policy of the pool.
NOTE: Using SmartConnect zone aliases is recommended for making clusters accessible using multiple domain names. Use of CNAMES is not recommended.
DNS request handling
SmartConnect handles all incoming DNS requests on behalf of an IP address pool if a SmartConnect service subnet has been associated with the pool.
The SmartConnect service subnet is an IP address pool setting. You can specify any subnet that has been configured with a SmartConnect service IP address and references the same groupnet as the pool. You must have at least one subnet configured with a SmartConnect service IP address in order to handle client DNS requests. You can configure only one service IP address per subnet.
A SmartConnect service IP address should be used exclusively for answering DNS requests and cannot be an IP address that is in any pool's IP address range. Client connections through the SmartConnect service IP address result in unexpected behavior or disconnection.
Once a SmartConnect service subnet has been associated with an IP address pool, the service subnet distributes incoming DNS requests according to the pool's connection balancing policy. If a pool does not have a designated service subnet, incoming DNS requests are answered by the subnet that contains the pool, provided that the subnet is configured with a SmartConnect service IP address. Otherwise, the DNS requests are excluded.
NOTE: SmartConnect requires that you add a new name server (NS) record that references the SmartConnect service IP address in the existing authoritative DNS zone that contains the cluster. You must also provide a zone delegation to the fully qualified domain name (FQDN) of the SmartConnect zone.

IP address allocation

The IP address allocation policy specifies how IP addresses in the pool are assigned to an available network interface. You can specify whether to use static or dynamic allocation.

Static Dynamic

Assigns one IP address to each network interface added to the IP address pool, but does not guarantee that all IP addresses are assigned.
Once assigned, the network interface keeps the IP address indefinitely, even if the network interface becomes unavailable. To release the IP address, remove the network interface from the pool or remove it from the node.
Without a license for SmartConnect Advanced, static is the only method available for IP address allocation.
Assigns IP addresses to each network interface added to the IP address pool until all IP addresses are assigned. This guarantees a response when clients connect to any IP address in the pool.
If a network interface becomes unavailable, its IP addresses are automatically moved to other available network interfaces in the pool as determined by the IP address failover policy.
This method is only available with a license for SmartConnect Advanced.

IP address failover
When a network interface becomes unavailable, the IP address failover policy specifies how to handle the IP addresses that were assigned to the network interface.
To define an IP address failover policy, you must have a license for SmartConnect Advanced, and the IP address allocation policy must be set to dynamic. Dynamic IP allocation ensures that all of the IP addresses in the pool are assigned to available network interfaces.

430 Networking

When a network interface becomes unavailable, the IP addresses that were assigned to it are redistributed to available network interfaces according to the IP address failover policy. Subsequent client connections are directed to the new network interfaces.
You can select one of the following connection balancing methods to determine how the IP address failover policy selects which network interface receives a redistributed IP address:
 Round-robin  Connection count  Network throughput  CPU usage

Connection balancing

The connection balancing policy determines how the DNS server handles client connections to the cluster. You can specify one of the following balancing methods:

Round-robin

Selects the next available network interface on a rotating basis. This is the default method. Without a SmartConnect license for advanced settings, this is the only method available for load balancing.

Connection count Determines the number of open TCP connections on each available network interface and selects the network interface with the fewest client connections.

Network throughput

Determines the average throughput on each available network interface and selects the network interface with the lowest network interface load.

CPU usage

Determines the average CPU utilization on each available network interface and selects the network interface with lightest processor usage.

IP address rebalancing

The IP address rebalance policy specifies when to redistribute IP addresses if one or more previously unavailable network interfaces becomes available again.
To define an IP address rebalance policy, you must have a license for SmartConnect Advanced, and the IP address allocation policy must be set to dynamic. Dynamic IP addresses allocation ensures that all of the IP addresses in the pool are assigned to available network interfaces.
You can set rebalancing to occur manually or automatically:

Manual Automatic

Does not redistribute IP addresses until you manually start the rebalancing process.
Upon rebalancing, IP addresses will be redistributed according to the connection balancing method specified by the IP address failover policy defined for the IP address pool.
Automatically redistributes IP addresses according to the connection balancing method specified by the IP address failover policy defined for the IP address pool.
Automatic rebalancing may also be triggered by changes to cluster nodes, network interfaces, or the configuration of the external network.
NOTE: Rebalancing can disrupt client connections. Ensure the client workflow on the IP address pool is appropriate for automatic rebalancing.

Node provisioning rules
Node provisioning rules specify how new nodes are configured when they are added to a cluster. If the new node type matches the type defined in a rule, the network interfaces on the node are added to the subnet and the IP address pool specified in the rule. For example, you can create a node provisioning rule that configures new PowerScale storage nodes, and another rule that configures new accelerator nodes. OneFS automatically checks for multiple provisioning rules when new rules are added to ensure there are no conflicts.

Networking 431

Routing options
OneFS supports source-based routing and static routes which allow for more granular control of the direction of outgoing client traffic on the cluster. If no routing options are defined, by default, outgoing client traffic on the cluster is routed through the default gateway, which is the gateway with the lowest priority setting on the node. If traffic is being routed to a local subnet and does not need to route through a gateway, the traffic will go directly out through an interface on that subnet.
Source-based routing
Source-based routing selects which gateway to direct outgoing client traffic through based on the source IP address in each packet header. When enabled, source-based routing automatically scans your network configuration to create client traffic rules. If you modify your network configuration, for example, changing the IP address of a gateway server, source-based routing adjusts the rules. Source-based routing is applied across the entire cluster and does not support the IPv6 protocol. In the following example, you enable source-based routing on a PowerScale cluster that is connected to SubnetA and SubnetB. Each subnet is configured with a SmartConnect zone and a gateway, also labeled A and B. When a client on SubnetA makes a request to SmartConnect ZoneB, the response originates from ZoneB. The result is a ZoneB address as the source IP in the packet header, and the response is routed through GatewayB. Without source-based routing, the default route is destinationbased, so the response is routed through GatewayA. In another example, a client on SubnetC, which is not connected to the PowerScale cluster, makes a request to SmartConnect ZoneA and ZoneB. The response from ZoneA is routed through GatewayA, and the response from ZoneB is routed through GatewayB. In other words, the traffic is split between gateways. Without source-based routing, both responses are routed through the same gateway. Source-based routing is disabled by default. Enabling or disabling source-based routing goes into effect immediately. Packets in transit continue on their original courses, and subsequent traffic is routed based on the status change. If the status of source-based routing changes during transmission, transactions that are composed of multiple packets might be disrupted or delayed. Source-based routing can conflict with static routes. If a routing conflict occurs, source-based routing rules are prioritized over the static route. Consider enabling source-based routing if you have a large network with a complex topology. For example, if your network is a multitenant environment with several gateways, traffic is more efficiently distributed with source-based routing.
Static routing
A static route directs outgoing client traffic to a specified gateway based on the IP address of the client connection. You configure static routes by IP address pool, and each route applies to all nodes that have network interfaces as IP address pool members. You might configure static routing in order to connect to networks that are unavailable through the default routes or if you have a small network that only requires one or two routes.
Managing internal network settings
You can modify internal IP address ranges and configure an internal network switch for failover.
Add or remove an internal IP address range
You can configure IP address ranges for the int-a, int-b, and failover networks. Each internal Infiniband switch requires an IP address range. The ranges should have a sufficient number of IP addresses for present operating conditions as well as future expansion and addition of nodes. 1. Run the isi config command.
The command-line prompt changes to indicate that you are in the isi config subsystem.
432 Networking

2. Modify the internal IP address ranges by running the iprange command. The following command adds an IP range to the int-a internal network: iprange int-a 192.168.206.10-192.168.206.20
The following command deletes an existing IP address range from the int-a internal network: deliprange int-a 192.168.206.15-192.168.206.20
3. Run the commit command to complete the configuration changes and exit isi config.
Modify an internal network netmask
You can modify the subnet mask, or netmask, value for the int-a and int-b internal network interfaces. If the netmask is too restrictive for the size of the internal network, you must modify the netmask settings. It is recommended that you specify a class C netmask, such as 255.255.255.0, for the internal netmask, that is large enough to accommodate future growth of your PowerScale clusters. It is recommended that the netmask values you specify for int-a and int-b/failover are the same. If you modify the netmask value of one, modify the other.
NOTE: You must reboot the cluster to apply modifications to the netmask. 1. Run the isi config command.
The command-line prompt changes to indicate that you are in the isi config subsystem. 2. Modify the internal network netmask by running the netmask command.
The following command changes the int-a internal network netmask: netmask int-a 255.255.255.0
The system displays output similar to the following example: !! WARNING: The new netmask will not take effect until the nodes are rebooted.
3. Run the commit command to complete the configuration changes and exit isi config.
Configure and enable internal network failover
You can configure the int-b internal interfaces to provide backup in the event of an int-a network failure. Failover configuration involves enabling the int-b interface, specifying a valid netmask, and adding IP address ranges for the int-b interface and the failover network. By default, the int-b interface and failover network are disabled.
NOTE: You must reboot the cluster to apply modifications to internal network failover. 1. Run the isi config command.
The command-line prompt changes to indicate that you are in the isi config subsystem. 2. Set a netmask for the second interface by running the netmask command.
The following command changes the int-b internal network netmask: netmask int-b 255.255.255.0
The system displays output similar to the following example: !! WARNING: The new netmask will not take effect until the nodes are rebooted.
3. Set an IP address range for the second interface by running the iprange command. The following command adds an IP range to the int-b internal network: iprange int-b 192.168.206.21-192.168.206.30
4. Set an IP address range for the failover interface by running the iprange command.
Networking 433

The following command adds an IP range to the internal failover network: iprange failover 192.168.206.31-192.168.206.40
5. Enable a second interface by running the interface command. The following command specifies the interface name as int-b and enables it: interface int-b enable
6. Run the commit command to complete the configuration changes and exit isi config. 7. Restart the cluster to apply netmask modifications.
Disable internal network failover
You can disable internal network failover by disabling the int-b interface. You must reboot the cluster to apply modifications to internal network failover. 1. Run the isi config command.
The command-line prompt changes to indicate that you are in the isi config subsystem. 2. Disable the int-b interface by running the interface command.
The following command specifies the int-b interface and disables it: interface int-b disable
3. Run the commit command to complete the configuration changes and exit isi config. 4. Restart the cluster to apply failover modifications.
Managing groupnets
You can create and manage groupnets on a cluster.
Create a groupnet
You can create a groupnet and configure DNS client settings. Run the isi network groupnet create command. The following command creates a groupnet named groupnet1 that supports two DNS servers, which are specified by IPv6 addresses:
isi network groupnet create groupnet1 \ --dns-servers=2001:DB8:170:9904::be06,2001:DB8:170:9904::be07 The following command creates a groupnet named groupnet1 that supports one DNS server, which is specified by an IPv4 address, and enables DNS caching: isi network groupnet create groupnet1 \ --dns-servers=192.0.2.0 --dns-cache-enabled=true
Modify a groupnet
You can modify groupnet attributes including the name, supported DNS servers, and DNS configuration settings. Run the isi network groupnet modify command. The following command modifies groupnet1 to enable DNS search on three suffixes:
isi network groupnet modify groupnet1 \ --dns-search=data.company.com,storage.company.com
434 Networking

The following command modifies groupnet1 to support a second DNS server and to enable rotation through the configured DNS resolvers:
isi network groupnet modify groupnet1 \ --add-dns-servers=192.0.2.1 --dns-options=rotate

Delete a groupnet
You can delete a groupnet from the system, unless it is the default groupnet. If the groupnet is associated with an access zone, an authentication provider, removal from the system might affect several other areas of OneFS and should be performed with caution. In several cases, the association between a groupnet and another OneFS component, such as access zones or authentication providers, is absolute. You cannot modify these components to associate them with another groupnet. In the event that you need to delete a groupnet, we recommend that you complete the these tasks in the following order: 1. Delete IP address pools in subnets associated with the groupnet. 2. Delete subnets associated with the groupnet. 3. Delete authentication providers associated with the groupnet. 4. Delete access zones associated with the groupnet. 1. Run the isi network groupnet delete command.. 2. At the prompt to confirm deletion, type yes.
The following command deletes a groupnet named groupnet1:
isi network groupnet delete groupnet1
The following command attempts to delete groupnet1, which is still associated with an access zone:
isi network modify groupnet groupnet1
The system displays output similar to the following example:
Groupnet groupnet1 is not deleted; groupnet can't be deleted while pointed at by zone(s) zoneB

View groupnets

You can retrieve and sort a list of all groupnets on the system and view the details of a specific groupnet.
1. To retrieve a list of groupnets in the system, run the isi network groupnets list command. The following command sorts the list of groupnets by ID in descending order:

isi network groupnets list --sort=id --descending

The system displays output similar to the following example:

ID

DNS Cache DNS Search

DNS Servers Subnets

------------------------------------------------------------

groupnet2 True

data.company.com 192.0.2.75 subnet2

192.0.2.67 subnet4

groupnet1 True

192.0.2.92 subnet1

192.0.2.83 subnet3

groupnet0 False

192.0.2.11 subnet0

192.0.2.20

--------

Total: 3

2. To view the details of a specific groupnet, run the isi network groupnets view command. The following command displays the details of a groupnet named groupnet1:

isi network groupnets view groupnet1

Networking 435

The system displays output similar to the following example: ID: groupnet1
Name: groupnet1 Description: Data storage groupnet DNS Cache Enabled: True DNS Options: -
DNS Search: data.company.com DNS Servers: 192.0.1.75, 10.7.2.67 Server Side DNS Search: True
Subnets: subnet1, subnet3
Enabling Router Advertisement
With IPv6 configured on your cluster, you can enable router advertisement support. Router advertisement provides detailed information about MTUs and routes to IPv6 configured devices. Your cluster must have at least one groupnet or subnet configured to use IPv6. To enable IPv6 auto-configuration, run the following command:
isi network external modify -ipv6-auto-config-enabled true
Managing external network subnets
You can create and manage subnets on a cluster.
Create a subnet
You can add a subnet to the external network of a cluster. Subnets must be associated with a groupnet. Ensure that the groupnet you want to associate with this subnet exists in the system. An IP address family designation and prefix length are required when creating a subnet. Run the isi network subnets create command and specify a subnet ID, IP address family, and prefix length. Specify the subnet ID you want to create in the following format:
<groupnet_name>.<subnet_name>
The subnet name must be unique in the system. The following command creates a subnet associated with groupnet1, designates the IP address family as IPv4 and specifies an IPv4 prefix length:
isi network subnets create \ groupnet1.subnet3 ipv4 255.255.255.0
The following command creates a subnet with an associated IPv6 prefix length: isi network subnets create \ groupnet1.subnet3 ipv6 64
Modify a subnet
You can modify a subnet on the external network. NOTE: Modifying an external network subnet that is in use can disable access to the cluster.
436 Networking

1. Optional: To identify the ID of the external subnet you want to modify, run the following command:
isi network subnets list 2. Run the isi networks modify subnet command
Specify the subnet ID you want to modify in the following format:
<groupnet_name>.<subnet_name>
The following command changes the name of subnet3 under groupnet1 to subnet5:
isi network subnets modify groupnet1.subnet3 \ --name=subnet5
The following command sets the MTU to 1500, specifies the gateway address as 198.162.205.10, and sets the gateway priority to 1:
isi network subnets modify groupnet1.subnet3 --mtu=1500 --gateway=198.162.205.10 --gateway-priority=1

Delete a subnet
You can delete an external network subnet that you no longer need. NOTE: Deleting an external network subnet also deletes any associated IP address pools. Deleting a subnet that is in use can prevent access to the cluster.
1. Optional: To identify the name of the subnet you want to delete, run the following command: isi network subnets list
2. Run the isi networks delete subnet command. Specify the subnet ID you want to delete in the following format: <groupnet_name>.<subnet_name>
The following command deletes subnet3 under groupnet1: isi network subnets delete groupnet1.subnet3
3. At the prompt, type yes.

View subnets

You can view all subnets on the external network, sort subnets by specified criteria, or view details for a specific subnet. 1. To view all subnets, run the isi network subnets list command.
The system displays output similar to the following example:

ID

Subnet

Gateway|Priority Pools SC Service

----------------------------------------------------------------------

groupnet1.subnet0 203.0.113.10/24 203.0.113.12|1 pool0 198.51.100.10

groupnet1.subnet3 192.0.2.20/24 192.0.2.22|2

pool3 198.51.100.15

----------------------------------------------------------------------

2. To view the details of a specific subnet, run the isi network subnets view command and specify the subnet ID. Specify the subnet ID you want to view in the following format:
<groupnet_name>.<subnet_name>

Networking 437

The following command displays details for subnet3 under groupnet1:
isi network subnets view groupnet1.subnet3
The system displays output similar to the following example:
ID: groupnet1.subnet3 Name: subnet3 Groupnet: groupnet1 Pools: pool3 Addr Family: ipv4 Base Addr: 192.0.2.20 CIDR: 192.0.2.20/24 Description: Sales subnet Dsr Addrs: Gateway: 192.0.2.22 Gateway Priority: 2
MTU: 1500 Prefixlen: 24
Netmask: 255.255.255.0 Sc Service Addr: 198.51.100.15
VLAN Enabled: False VLAN ID: -
Enable or disable VLAN tagging
You can partition the external network into Virtual Local Area Networks or VLANs. VLAN tagging requires a VLAN ID that corresponds to the ID number for the VLAN set on the switch. Valid VLAN IDs are 2 to 4094. 1. Optional: To identify the name of the external subnet you want to modify for VLAN tagging, run the following command:
isi network subnets list
2. Enable or disable VLAN tagging on the external subnet by running the isi networks modify subnet command. Specify the subnet ID you want to modify in the following format:
<groupnet_name>.<subnet_name>
The following command enables VLAN tagging on subnet3 under groupnet1 and sets the required VLAN ID to 256:
isi network subnets modify groupnet1.subnet3 \ --vlan-enabled=true --vlan-id=256
The following command disables VLAN tagging on subnet3 under groupnet1:
isi network subnets modify groupnet1.subnet3 \ --vlan-enabled=false
3. At the prompt, type yes.
Add or remove a DSR address
You can specify a Direct Server Return (DSR) address for a subnet if your cluster contains an external hardware load balancing switch that uses DSR. 1. Optional: To identify the name of the external subnet you want to modify for DRS addresses, run the following command:
isi network subnets list
2. Run the isi network subnets modify command.
438 Networking

Specify the subnet ID you want to modify in the following format: <groupnet_name>.<subnet_name>
The following command adds a DSR address to subnet3 under groupnet1: isi network subnets modify groupnet1.subnet3 \ --add-dsr-addrs=198.51.100.20
The following command removes a DSR address from subnet3 under groupnet1: isi network subnets modify groupnet1.subnet3 \ --remove-dsr-addrs=198.51.100.20
Managing Multi-SSIP
You can configure Multi-SSIP on a cluster.
Configure a SmartConnect service IP address
You can specify a SmartConnect service IP address on a subnet. 1. Optional: To identify the name of the external subnet you want to modify, run the following command:
isi network subnets list 2. Run the isi network subnets modify command
Specify the subnet ID you want to modify in the following format: <groupnet_name>.<subnet_name>
The following command specifies the SmartConnect service IP address on subnet3 under groupnet1: isi network subnets modify groupnet1.subnet3 \ --sc-service-addrs=198.51.100.15
Assign this subnet to one or more IP address pools in order to handle DNS requests for those pools.
Configure Multi-SSIP
You can configure SmartConnect Multi-SSIP from the CLI. 1. Optional: To identify the name of the external subnet you want to configure with Multi-SSIP, run the following command:
isi network subnets list 2. Run the isi network subnets modify command with the --sc-service-addrs option, specifying an IP address
range, in the following format: isi network subnets modify <groupnet_name>.<subnet_name> --sc-serviceaddrs=<ip_address_range>
The following command specifies the SmartConnect service Multi-SSIP addresses on subnet0: isi network subnets modify subnet0 --sc-service-addrs=192.168.25.10-192.168.25.11
Networking 439

Add, clear, or remove a SmartConnect Multi-SSIP IP address range
You can add, clear, or remove the Multi-SSIP IP address ranges. 1. To add IPs to the Multi-SSIP address range, run the isi network subnets modify command with the --add-sc-
service-addrs option. The format is: isi network subnets modify <groupnet_name>.<subnet_name> --add-sc-serviceaddrs=<ip_address_range>
Specify --add-sc-service-addrs for each IP address to add. 2. To clear the entire Multi-SSIP address range, run the isi network subnets modify command with the --clear-
sc-service-addrs option. The format is: : isi network subnets modify <groupnet_name>.<subnet_name> --clear-sc-service-addrs
3. To remove IPs from the Multi-SSIP address range, run the isi network subnets modify command with the -remove-sc-service-addrs option. The format is: isi network subnets modify <groupnet_name>.<subnet_name> --remove-sc-serviceaddrs=<ip_address_range> Specify --remove-sc-service-addrs for each IP address to remove.
Managing IP address pools
You can create and manage IP address pools on the cluster.
Create an IP address pool
You can partition the external network interface into groups, or pools, of unique IP address ranges. NOTE: If you have not activated a SmartConnect Advanced license, the cluster is allowed one IP address pool per subnet. If you activate a SmartConnect Advanced license, the cluster is allowed unlimited IP address pools per subnet.
When you create an address pool, you must assign it to a subnet. If the subnet is not under the default groupnet, groupnet0, then you must also assign an access zone to the pool. Run the isi network pools create command. Specify the ID of the pool you want to create in the following format:
<groupnet_name>.<subnet_name>.<pool_name> The following command creates a pool named pool5 and assigns it to subnet3 under groupnet1:
isi network pools create groupnet1.subnet3.pool5 The following command creates a pool named pool5, assigns it to groupnet1.subnet3, and specifies zoneB as the access zone:
isi network pools create groupnet1.subnet3.pool5 \ --access-zone=zoneB
440 Networking

Modify an IP address pool
You can modify IP address pools to update pool settings. 1. Optional: To identify the name of the IP address pool you want to modify, run the following command:
isi network pools list
2. Run the isi networks modify pool command. Specify the pool ID you want to modify in the following format: <groupnet_name>.<subnet_name>.<pool_name>
The following command changes the name of the pool from pool3 to pool5: isi network pools modify groupnet1.subnet3.pool3 --name=pool5

Delete an IP address pool
You can delete an IP address pool that you no longer need. When a pool is deleted, the pool and pool settings are removed from the assigned subnet. 1. Optional: To identify the name of the IP address pool you want to delete, run the following command:
isi network pools list
2. Run the isi networks delete pool command. Specify the pool ID you want to delete in the following format: <groupnet_name>.<subnet_name>.<pool_name>
The following command deletes the pool name pool5 from groupnet1.subnet3: isi network pools delete groupnet1.subnet3.pool5
3. At the prompt, type yes.

View IP address pools

You can view all IP address pools within a groupnet or subnet, sort pools by specified criteria, or view details for a specific pool.
1. To view all IP address pools within a groupnet or subnet, run the isi network pools list command. The following command displays all IP address pools under groupnet1.subnet3:

isi network pools list groupnet1.subnet3

The system displays output similar to the following example:

ID

SC Zone

Allocation Method

----------------------------------------------------------

groupnet1.subnet3.pool5 data.company.com static

groupnet1.subnet3.pool7 data.company.com dynamic

----------------------------------------------------------

2. To view the details of a specific IP address pool, run the isi network pools view command and specify the pool ID. Specify the pool ID you want to view in the following format:
<groupnet_name>.<subnet_name>.<pool_name>

Networking 441

The following command displays the setting details of pool5 under groupnet1.subnet3:
isi network pools view groupnet1.subnet3.pool5
The system displays output similar to the following example:
ID: groupnet0.subnet3.pool5 Groupnet: groupnet1
Subnet: subnet3 Name: pool5
Rules: Access Zone: zone3 Allocation Method: static Aggregation Mode: lacp SC Suspended Nodes: Description: -
Ifaces: 1:ext-2, 2:ext-2, 3:ext-2 IP Ranges: 203.0.223.12-203.0.223.22 Rebalance Policy: auto SC Auto Unsuspend Delay: 0 SC Connect Policy: round_robin
SC Zone: data.company.com SC DNS Zone Aliases: -
SC Failover Policy: round_robin SC Subnet: groupnet0.subnet3 SC Ttl: 0
Static Routes: -
Add or remove an IP address range
You can configure a range of IP addresses for a pool. All IP address ranges in a pool must be unique. 1. Optional: To identify the name of the IP address pool you want to modify for IP address ranges, run the following command:
isi network pools list
2. Run the isi network pools modify command. Specify the pool ID you want to modify in the following format:
<groupnet_name>.<subnet_name>.<pool_name>
The following command adds an address range to pool5 under groupnet1.subnet3:
isi network pools modify groupnet1.subnet3.pool5 \ --add-ranges=203.0.223.12-203.0.223.22
The following command deletes an address range from pool5:
isi network pools modify groupnet1.subnet3.pool5 \ --remove-ranges=203.0.223.12-203.0.223.14
Configure IP address allocation
You can specify whether the IP addresses in an IP address pool are allocated to network interfaces statically or dynamically. To configure dynamic IP address allocation, you must activate a SmartConnect Advanced license. 1. Optional: To identify the name of the IP address pool you want to modify, run the following command:
isi network pools list
2. Run the isi network pools modify command.
442 Networking

Specify the pool ID you want to modify in the following format: <groupnet_name>.<subnet_name>.<pool_name>
The following command specifies dynamic distribution of IP addresses in pool5 under groupnet1.subnet 3: isi network pools modify groupnet1.subnet3.pool5 \ --alloc-method=dynamic
Managing SmartConnect Settings
You can configure SmartConnect settings within each IP address pool on the cluster.
Configure a SmartConnect DNS zone
You can specify a SmartConnect DNS zone and alternate DNS zone aliases for an IP address pool. 1. Optional: To identify the name of the IP address pool you want to modify, run the following command:
isi network pools list
2. To configure a SmartConnect DNS zone, run the isi networks modify pool command: Specify the pool ID you want to modify in the following format: <groupnet_name>.<subnet_name>.<pool_name>
The following command specifies a SmartConnect DNS zone in pool5 under subnet3 and groupnet1: isi network pools modify groupnet1.subnet3.pool5 \ --sc-dns-zone=www.company.com
It is recommended that the SmartConnect DNS zone be a fully-qualified domain name (FQDN). 3. To configure a SmartConnect DNS zone alias, run the isi networks modify pool command:
The following command specifies SmartConnect DNS aliases in pool5 under subnet3 and groupnet1: isi network pools modify groupnet1.subnet3.pool5 \ --add-sc-dns-zone-aliases=data.company.com,storage.company.com
You cannot specify more than three SmartConnect DNS zone aliases. 4. To remove a SmartConnect DNS zone alias, run the isi networks modify pool command:
The following command removes a SmartConnect DNS aliases from pool5 under subnet3 and groupnet1: isi network pools modify groupnet1.subnet3.pool5 \ --remove-dns-zone-aliases=data.company.com
SmartConnect requires that you add a new name server (NS) record to the existing authoritative DNS zone that contains the cluster and that you delegate the FQDN of the SmartConnect DNS zone.
Specify a SmartConnect service subnet
You can designate a subnet as the SmartConnect service subnet for an IP address pool. The subnet that you designate as the SmartConnect service subnet must have a SmartConnect service IP address configured, and the subnet must be in the same groupnet as the IP address pool. For example, although a pool might belong to subnet3, you can designate subnet5 as the SmartConnect service subnet as long as both subnets are under the same groupnet.
Networking 443

1. Optional: To identify the name of the IP address pool you want to modify, run the following command: isi network pools list
2. Run the isi networks modify pool command: Specify the pool ID you want to modify in the following format: <groupnet_name>.<subnet_name>.<pool_name>
The following command specifies subnet0 as the a SmartConnect service subnet of pool5 under subnet3 and groupnet1: isi network pools modify groupnet1.subnet3.pool5 \ --sc-subnet=subnet0
Suspend or resume a node
You can suspend and resume SmartConnect DNS query responses on a node. 1. To suspend DNS query responses for an node:
a. Optional: To identify a list of nodes and IP address pools, run the following command: isi network interfaces list
b. Run the isi network pools sc-suspend-nodes command and specify the pool ID and logical node number (LNN). Specify the pool ID you want in the following format: <groupnet_name>.<subnet_name>.<pool_name>
The following command suspends DNS query responses on node 3 when queries come through IP addresses in pool5 under groupnet1.subnet 3:
isi network pools sc-suspend-nodes groupnet1.subnet3.pool5 3 2. To resume DNS query responses for an IP address pool, run the isi network pools sc-resume-nodes command
and specify the pool ID and logical node number (LNN). The following command resumes DNS query responses on node 3 when queries come through IP addresses in pool5 under groupnet1.subnet 3:
isi network pools sc-resume-nodes groupnet1.subnet3.pool5 3
Configure a connection balancing policy
You can set a connection balancing policy for an IP address pool. SmartConnect supports the following balancing methods:  Round robin
NOTE: Round robin is the only method available without activating a SmartConnect Advanced license.  Connection count  Network throughput  CPU usage 1. Optional: To identify the name of the IP address pool you want to modify, run the following command:
isi network pools list
2. Run the isi network pools modify command.
444 Networking

Specify the pool ID you want to modify in the following format: <groupnet_name>.<subnet_name>.<pool_name>
The following command specifies a connection balancing policy based on connection count in pool5 under subnet 3 and groupnet1:
isi network pools modify groupnet1.subnet3.pool5 \ --sc-connect-policy=conn_count
Configure an IP failover policy
You can set an IP failover policy for an IP address pool. To configure an IP failover policy, you must activate a SmartConnect Advanced license. SmartConnect supports the following distribution methods:  Round robin  Connection count  Network throughput  CPU usage 1. Optional: To identify the name of the IP address pool you want to modify, run the following command:
isi network pools list
2. Run the isi network pools modify command. Specify the pool ID you want to modify in the following format: <groupnet_name>.<subnet_name>.<pool_name>
The following command specifies a IP failover policy based on CPU usage in pool5 under subnet 3 and groupnet0: isi network pools modify groupnet0.subnet3.pool5 \ --sc-failover-policy=cpu_usage
Managing connection rebalancing
You can configure and manage a connection rebalancing policy that specifies when to rebalance IP addresses after a previously unavailable node becomes available again.
Configure an IP rebalance policy
You can configure a manual or automatic rebalance policy for an IP address pool. To configure a rebalance policy for an IP address pool, you must activate a SmartConnect Advanced license and set the allocation method to dynamic. 1. Optional: To identify the name of the IP address pool you want to modify, run the following command:
isi network pools list
2. Run the isi network pools modify command. Specify the pool ID you want to modify in the following format: <groupnet_id>.<subnet_name>.<pool_name>
Networking 445

The following command specifies manual rebalancing of IP addresses in pool5 under groupnet1.subnet 3: isi network pools modify groupnet1.subnet3.pool5 \ --rebalance-policy=manual
If you configure an automatic rebalance policy, you can specify a rebalance delay which is a period of time (in seconds) that should pass after a qualifying event before an automatic rebalance is performed. The default value is 0 seconds. You can specify the delay by running the isi network external modify command with the --sc-balance-delay option.
Manually rebalance IP addresses
You can manually rebalance a specific IP address pool or all of the pools on the external network. You must activate a SmartConnect Advanced license. 1. To manually rebalance IP addresses in a pool:
a. Optional: To identify the name of the IP address pool you want to rebalance, run the following command: isi network pools list
b. Run the isi network pools rebalance-ips command. Specify the pool ID you want to modify in the following format: <groupnet_id>.<subnet_name>.<pool_name>
The following command rebalances the IP addresses in pool5 under groupnet1.subnet 3: isi network pools rebalance-ips groupnet1.subnet3.pool5
c. Type yes at the confirmation prompt. 2. To manually rebalance all IP address pools:
a. Run the isi network sc-rebalance-all command. b. Type yes at the confirmation prompt.
Managing network interface members
You can add and remove network interfaces to IP address pools.
Add or remove a network interface
You can configure which network interfaces are assigned to an IP address pool. Network interfaces must be specified in the following format <lnn>:<interface_name>. Run the isi network interfaces list command to identify the node numbers and interface names that you need. If you add an aggregated interface to the pool, you cannot individually add any interfaces that are part of the aggregated interface. 1. Optional: To identify the name of the IP address pool you want to modify, run the following command:
isi network pools list
2. Run the isi networks modify pool command. Specify the pool ID you want to modify in the following format: <groupnet_name>.<subnet_name>.<pool_name>
446 Networking

The following command modifies pool5 under groupnet1.subnet3 to add the first external network interfaces on nodes 1 through 3:
isi network pools modify groupnet1.subnet3.pool5 --add-ifaces=1-3:ext-1
The following command removes the first network interface on node 3 from pool5:
isi network pools modify groupnet1.subnet3.pool5 --remove-ifaces=3:ext-1

Specify a link aggregation mode
You can combine multiple, physical external network interfaces on a node into a single logical interface through link aggregation. You can add an aggregated interface to a pool and specify one of the following aggregation modes:  LACP  Round robin  Failover  Loadbalance
NOTE: As of OneFS 8.2.0, references to the FEC link aggregation mode are replaced with loadbalance link aggregation mode. Since OneFS 8.0.0, FEC link aggregation has been an alias to loadbalance link aggregation mode. While the name has changed, the underlying mode has not. No configuration changes are necessary. 1. Optional: To identify the name of the IP address pool you want to modify, run the following command:
isi network pools list
2. Run the isi networks modify pool command. Specify the pool ID you want to modify in the following format:
<groupnet_name>.<subnet_name>.<pool_name>
The following command modifies pool5 under groupnet1.subnet3 to specify loadbalance as the aggregation mode for all aggregated interfaces in the pool:
isi network pools modify groupnet1.subnet3.pool5 --aggregation-mode=loadbalance
The following command modifies pool5 under groupnet1.subnet3 to add ext-agg on node 1 and specify LACP as the aggregation mode:
isi network pools modify groupnet1.subnet3.pool5 --add-ifaces=1:ext-agg -aggregation-mode=lacp

Link aggregation modes

The link aggregation mode determines how traffic is balanced and routed among aggregated network interfaces. The aggregation mode is selected on a per-pool basis and applies to all aggregated network interfaces in the IP address pool.
OneFS supports dynamic and static aggregation modes. A dynamic aggregation mode enables nodes with aggregated interfaces to communicate with the switch so that the switch can use an analogous aggregation mode. Static modes do not facilitate communication between nodes and the switch.
OneFS provides support for the following link aggregation modes:

Link Aggregation Control Protocol (LACP)

Dynamic aggregation mode that supports the IEEE 802.3ad Link Aggregation Control Protocol (LACP). You can configure LACP at the switch level, which allows the node to negotiate interface aggregation with the switch. LACP balances outgoing traffic across the interfaces based on hashed protocol header information that includes the source and destination address and the VLAN tag, if available. This option is the default aggregation mode.

Loadbalance (FEC)

Static aggregation method that accepts all incoming traffic and balances outgoing traffic over aggregated interfaces based on hashed protocol header information that includes source and destination addresses.

Networking 447

Active/Passive Failover
Round-robin

Static aggregation mode that switches to the next active interface when the primary interface becomes unavailable. The primary interface handles traffic until there is an interruption in communication. At that point, one of the secondary interfaces will take over the work of the primary.
Static aggregation mode that rotates connections through the nodes in a first-in, first-out sequence, handling all processes without priority. Balances outbound traffic across all active ports in the aggregated link and accepts inbound traffic on any port.
NOTE: This method is not recommended if your cluster is handling TCP/IP workloads.

View network interfaces
You can retrieve and sort a list of all external network interfaces on the cluster. Run the isi network interfaces list command. The system displays output similar to the following example:

LNN Name Status VLAN ID Owners

Owner Type IP Addresses

---------------------------------------------------------------------------------------

1 ext-1 Up

-

groupnet0.subnet0.pool0

Static

10.205.232.166

1 int-a Up

-

internal.int-a-subnet.int-a-pool Internal 1.205.232.166

2 ext-1 Up

-

groupnet0.subnet0.pool0

Static

10.205.232.167

2 int-a Up

-

internal.int-a-subnet.int-a-pool Internal

1.205.232.167

3 ext-1 Up

-

groupnet0.subnet0.pool0

Static

10.205.232.168

3 int-a Up

-

internal.int-a-subnet.int-a-pool Internal

1.205.232.168

---------------------------------------------------------------------------------------

Total: 6

The following command displays interfaces only on nodes 1 and 3: isi network interfaces list --nodes=1,3
The system displays output similar to the following example:

LNN Name Status VLAN ID Owners

Owner Type IP Addresses

---------------------------------------------------------------------------------------

1 ext-1 Up

-

groupnet0.subnet0.pool0

Static

10.205.232.166

1 int-a Up

-

internal.int-a-subnet.int-a-pool Internal

1.205.232.166

3 ext-1 Up

-

groupnet0.subnet0.pool0

Static

10.205.232.168

3 int-a Up

-

internal.int-a-subnet.int-a-pool Internal

1.205.232.168

---------------------------------------------------------------------------------------

Total: 4

Managing node provisioning rules
You can create and manage node provisioning rules that automate the configuration of new network interfaces.

Create a node provisioning rule
You can create a node provisioning rule to specify how network interfaces on new nodes are configured when the nodes are added to the cluster. Run the isi network rules create command. Specify the ID of the rule you want to create in the following format:
<groupnet_name>.<subnet_name>.<pool_name>.<rule_name>

448 Networking

The following command creates a rule named rule7 that assigns the first external network interface on each new accelerator node to groupnet1.subnet3.pool5:
isi network rules create groupnet1.subnet3.pool5.rule7 \ --iface=ext-1 --node-type=accelerator

Modify a node provisioning rule
You can modify node provisioning rules settings. 1. Optional: To identify the name of the provisioning rule you want to modify, run the following command:
isi network rules list
2. Run the isi network rules modify command. Specify the ID of the rule you want to modify in the following format: <groupnet_name>.<subnet_name>.<pool_name>.<rule_name>
The following command changes the name of rule7 to rule7accelerator: isi network rules modify groupnet1.subnet3.pool5.rule7 \ --name=rule7accelerator
The following command changes rule7 so that it applies only to backup accelerator nodes: isi network rules modify groupnet1.subnet3.pool5.rule7 \ --node-type=backup-accelerator

Delete a node provisioning rule
You can delete an node provisioning rule that you no longer need. 1. Optional: To identify the name of the provisioning rule you want to delete, run the following command:
isi network rules list
2. Run the isi networks delete rule command. Specify the ID of the rule you want to delete in the following format: <groupnet_name>.<subnet_name>.<pool_name>.<rule_name>
The following command deletes rule7 from pool5: isi network rules delete groupnet1.subnet3.pool5.rule7
3. At the prompt, type yes.

View node provisioning rules

You can retrieve and sort a list of all node provisioning rules on the external network or view details of a specific rule. 1. To list all of the provisioning rules in the system, run the isi network rules list command:
The system displays output similar to the following example:

ID

Node Type Interface

---------------------------------------------------

groupnet0.subnet0.pool0.rule0 any

ext-1

groupnet0.subnet1.pool1.rule1 accelerator ext-3

groupnet1.subnet3.pool3.rule2 storage

ext-3

Networking 449

groupnet1.subnet3.pool5.rule7 storage

ext-2

---------------------------------------------------

The following command only lists rules in groupnet1:

isi network rules list --groupnet=groupnet1

The system displays output similar to the following example:

ID

Node Type Interface

---------------------------------------------------

groupnet1.subnet1.pool1.rule1 accelerator ext-3

groupnet1.subnet3.pool3.rule2 storage

ext-3

---------------------------------------------------

2. To view the details of a specific provisioning rule, run the isi network rules view command and specify the rule ID. Specify the rule ID you want to view in the following format:

<groupnet_name>.<subnet_name>.<pool_name>.<rule_name>

The following command displays the setting details of rule7 under groupnet1.subnet3.pool5:
isi network rules view groupnet1.subnet3.pool5.rule7
The system displays output similar to the following example:
ID: groupnet1.subnet3.pool5.rule7 Node Type: storage Interface: ext-2 Description: -
Name: rule7 Groupnet: groupnet1
Subnet: subnet3 Pool: pool5

Managing routing options
You can provide additional control of the direction of outgoing client traffic through source-based routing or static route configuration. If both source-based routing and static routes are configured, the static routes will take priority for traffic that matches the static routes.
Enable or disable source-based routing
You can enable source-based routing to ensure that outgoing client traffic is routed to the gateway of the source IP address in the packet header. If you disable source-based routing, outgoing traffic is destination-based or it follows static routes. Source-based routing is enabled or disabled globally on the cluster. Static routes are prioritized over source-based routing rules. You can check if there are static routes configured in any IP address pools by running the following command:
isi networks list pools -v
1. Enable source-based routing on the cluster by running the following command:
isi network external modify --sbr=true

450 Networking

2. Disable source-based routing on the clusetr by running the following command: isi network external modify --sbr=false

Add or remove a static route
You can configure static routes to direct outgoing traffic to specific destinations through a specific gateway. 1. Optional: Identify the name of the IP address pool that you want to modify for static routes by running the following
command:
isi network pools list
2. Run the isi networks modify pool command. Specify the route in classless inter-domain routing (CIDR) notation format. Specify the pool ID you want to modify in the following format:
<groupnet_name>.<subnet_name>.<pool_name>
The following command adds an IPv4 static route to pool5 and assigns the route to all network interfaces that are members of the pool:
isi network pools modify groupnet1.subnet3.pool5 --add-staticroutes=192.168.100.0/24-192.168.205.2
The following command removes an IPv6 static route from pool4:
isi network pools modify groupnet2.subnet2.pool4 --remove-staticroutes=2001:DB8:170:7c00::/64-2001:DB8:170:7cff::c008

Managing DNS cache settings
You can set DNS cache settings for the external network.

DNS cache settings
You can configure settings for the DNS cache. Setting TTL No Error Minimum
TTL No Error Maximum
TTL Non-existent Domain Minimum
TTL Non-existent Domain Maximum
TTL Other Failures Minimum

Description Specifies the lower boundary on time-to-live for cache hits. The default value is 30 seconds.
Specifies the upper boundary on time-to-live for cache hits. The default value is 3600 seconds.
Specifies the lower boundary on time-to-live for nxdomain. The default value is 15 seconds.
Specifies the upper boundary on time-to-live for nxdomain. The default value is 3600 seconds.
Specifies the lower boundary on time-to-live for nonnxdomain failures. The default value is 0 seconds.

Networking 451

Setting TTL Other Failures Maximum TTL Lower Limit For Server Failures TTL Upper Limit For Server Failures Eager Refresh Cache Entry Limit Test Ping Delta

Description Specifies the upper boundary on time-to-live for nonnxdomain failures. The default value is 60 seconds.
Specifies the lower boundary on time-to-live for DNS server failures. The default value is 300 seconds.
Specifies the upper boundary on time-to-live for DNS server failures. The default value is 3600 seconds.
Specifies the lead time to refresh cache entries that are nearing expiration. The default value is 0 seconds.
Specifies the maximum number of entries that the DNS cache can contain. The default value is 65536 entries.
Specifies the delta for checking the cbind cluster health. The default value is 30 seconds.

452 Networking

34
NFS3oRDMA
This section contains the following topics:
Topics:
· RDMA support for NFSoRDMA · Enable RDMA feature for NFSv3 protocol · Disable RDMA feature for NFSv3 protocol · View RDMA flag on network interface cards · Create an IP address pool with RDMA support · Modify an IP address pool with RDMA support
RDMA support for NFSoRDMA
The Network File System over Remote Direct Memory Access (NFSoRDMA) feature lets you perform memory-to-memory transfer of data over high speed networks. Network adapters that have RDMA support (known as RNICs) are used to transfer data directly, while using minimal amount of CPU, resulting in increased throughput. For applications that access large datasets on remote NFS, this feature enables:  Increased single-stream throughput to leverage the full throughput of high speed networks where the network interface
controllers coordinate the transfer of large amounts of data at line speed.  Low latency to provide fast responses to network requests, and, as a result, makes remote file storage feel as if it is directly
attached storage.  Low CPU utilization to use fewer CPU cycles when transferring data over the network, which leaves more power available to
other applications running on the client. The NFSoRDMA feature adds another front-end network transport communication mechanism between the client and OneFS node. The front-end network transport communication provides remote data transfer directly to and from memory without CPU intervention. This improves CPU utilization on the client machine and improve read or write throughput. Currently, NFSoRDMA is only supported for only NFSv3.
Enable RDMA feature for NFSv3 protocol
You can enable the RDMA feature for the NFSv3 protocol by modifying the NFS global actions. 1. Optional: Run the isi nfs settings global modify command with the RDMA parameter.
isi nfs settings global modify --nfsv3-rdma-enabled=true 2. Run the isi nfs settings global view command to verify if the RDMA feature is enabled for NFSv3 protocol.
isi nfs settings global view NFSv3 Enabled: Yes NFSv4 Enabled: No
NFSv3 RDMA Enabled: Yes Rquota Enabled: No
NFS Service Enabled: No
NFS3oRDMA 453

Disable RDMA feature for NFSv3 protocol
You can disable the RDMA feature for the NFSv3 protocol by modifying the NFS global actions. 1. Optional: Run the isi nfs settings global modify command with the RDMA parameter.
isi nfs settings global modify --nfsv3-rdma-enabled=false 2. Run the isi nfs settings global view command to verify if the RDMA feature is disabled for the NFSv3 protocol.
isi nfs settings global view NFSv3 Enabled: Yes NFSv4 Enabled: No
NFSv3 RDMA Enabled: No Rquota Enabled: No
NFS Service Enabled: No
View RDMA flag on network interface cards
You can view a new flag on the RDMA supported network interface cards. Run the isi network interfaces list - v command.
IP Addresses: 10.137.69.131, fe80::ba59:9fff:fe97:68f0 LNN: 1
Name: 40gige-1 NIC Name: mlxen2
Owners: groupnet0.subnet0.pool0, ipv6.link-local.mlxen2 Status: Up VLAN ID: Default IPv4 Gateway: 10.137.69.1 Default IPv6 Gateway: fe80::21c:73ff:feee:c249
MTU: 9000 Access Zone: System
Flags: ACCEPT_ROUTER_ADVERT, SUPPORTS_RDMA_RRoCE
Create an IP address pool with RDMA support
You can create an IP address pool to include RDMA support. NOTE: If you have not activated a SmartConnect Advanced license, the cluster is allowed one IP address pool per subnet. If you activate a SmartConnect Advanced license, the cluster is allowed unlimited IP address pools per subnet.
When you create an address pool, you must assign it to a subnet. If the subnet is not under the default groupnet, groupnet0, then you must also assign an access zone to the pool. 1. Run the isi network pools create command.
Specify the ID of the pool you want to create along with the RDMA parameter in the following format:
<groupnet_name>.<subnet_name>.<pool_name> <nfsv3_rroce_only>
The following command creates a RDMA supported pool named pool1 and assigns it to subnet0 under groupnet0:
isi network pools create groupnet0.subnet0.pool1 --nfsv3-rroce-only=true 2. Run the isi network pools view command.
Specify the ID of the pool to view its details in the following format:
<groupnet_name>.<subnet_name>.<pool_name>
454 NFS3oRDMA

The following command displays the details of the pool: isi network pools view groupnet0.subnet0.pool1
ID: groupnet0.subnet0.pool1 Groupnet: groupnet0 Subnet: subnet0 Name: pool1 Rules: -
Access Zone: System Allocation Method: static
Aggregation Mode: lacp SC Suspended Nodes: -
Description: Ifaces: -
IP Ranges: Rebalance Policy: auto SC Auto Unsuspend Delay: 0 SC Connect Policy: round_robin
SC Zone: SC DNS Zone Aliases: -
SC Failover Policy: round_robin SC Subnet: SC TTL: 0
Static Routes: NFSv3 RDMA RRoCE only: Yes
Modify an IP address pool with RDMA support
You can modify an IP address pool to include RDMA support. 1. Optional: To identify the name of the IP address pool you want to modify, run the following command:
isi network pools list
2. Run the isi network pools modify command. Specify the pool ID you want to modify in the following format: <groupnet_name>.<subnet_name>.<pool_name>
The following command modifies the pool to have RDMA support: isi network pools modify groupnet0.subnet0.pool0 --nfsv3-rroce-only=true
NFS3oRDMA 455

35
Partitioned Performance Monitoring
This section contains the following topics:
Topics:
· Partitioned Performance Monitoring · Workload monitoring · Create a standard dataset · Create a dataset with filters · Pin a workload · Apply filter(s) to a dataset · View statistics · Additional information
Partitioned Performance Monitoring
You can define and monitor performance-related issues on the cluster using OneFS Partitioned Performance monitoring. As clusters increase in size and the number of competing workloads place demands on system resources, more visibility is required to share cluster resources equitably. Partitioned Performance provides you with fine-grained accounting of the dynamic resources which helps in the better utilization and performance of the cluster. OneFS supports Partitioned Performance monitoring with several protocols, including NFS, SMB, and S3. You can use Partitioned Performance monitoring to define performance datasets to enable tracking any combination of directories, shares, users, clients, and access zones. You can view the associated performance statistics, including protocols operations, disk operations, read/write bandwidth, and CPU. You can configure customized settings and filters to match specific workloads for a dataset that meets the required criteria. Reported statistics are refreshed every 30 seconds. The performance dataset data is available to you through the CLI and PAPI.
Workload monitoring
Workload monitoring is a key for show-back and charge-back resource accounting. Workload is a set of identification metrics and resource consumption metrics. For example, {username bob zone_name System} consumed {cpu 1.2 s, bytes_in 10 K, bytes_out 20 M,...} Dataset is a specification of identification metrics to aggregate workloads by, and the workloads collected that match that specification. For instance, the workload above would be in a dataset that specified identification metrics {username, zone_name}. Filter is a method for including only workloads that match specific identification metrics, for example take the following workloads for a dataset with filter {zone_name:System}:  {username:bob , zone_name:System} would be included  {username:mary , zone_name:System} would be included  {username:bob , zone_name:Quarantine} would not be included A performance dataset automatically collects a list of the top workloads, with pinning and filtering to allow further customization to that list.
456 Partitioned Performance Monitoring

Create a standard dataset
You can create a standard dataset using the OneFS command line interface. Use the isi performance dataset create --name <name> <metrics> command to create a standard dataset. For example:
isi performance dataset create --name my_dataset username protocol export_id
A new performance dataset is created. Created new performance dataset 'my_dataset' with ID number 1.

View dataset list

You can view the list of configured performance datasets.
Use the isi performance dataset list command to view the list of performance datasets. The list of performance datasets appears.

ID Name

Metrics

Filters Statkey

Creation Time

--------------------------------------------------------------------------------

------

0 System

job_type -

cluster.performance.dataset.0 Never

system_name

1 my_dataset username -
:08:16 zone_name

cluster.performance.dataset.1 2021-03-30T07

--------------------------------------------------------------------------------
-----Total: 2

View details of dataset
You can view the properties of a configured performance dataset. Use the isi performance datasets view <dataset> command to view the details. For example:
isi performance datasets view my_dataset
The following details appear:
ID: 1 Name: my_dataset
Metrics: username, zone_name Filters: Statkey: cluster.performance.dataset.1 Creation Time: 2021-03-30T07:08:16

Partitioned Performance Monitoring 457

Modify details of dataset
You can modify the properties of a configured performance dataset. Use the isi performance datasets modify <dataset> command to modify the details. For example:
isi performance dataset modify my_dataset --name new_dataset The name of the dataset is now changed from "my_dataset" to "new_dataset"
name: my_dataset -> new_dataset
Delete dataset
You can delete a performance dataset. 1. Use the isi performance dataset delete command to delete a dataset.
isi performance dataset delete new_dataset The following message appears:
Are you sure you want to delete the performance dataset.? (yes/[no]): 2. Enter "yes".
The performance dataset is deleted.
Create a dataset with filters
You can create a dataset with filters using the OneFS command line interface. Use the isi performance dataset create --name <name> <metrics> --filters <filter-metrics> command to create a dataset with filters. For example:
isi performance dataset create --name my_filtered_dataset username export_id --filters export_id A new performance dataset with a filter is created. Created new performance dataset 'my_filtered_dataset' with ID number 4.
Pin a workload
If you want a workload to be always visible, you can pin it. A dataset shows the the top 1024 workloads. The rest of the workloads are aggregated into a single additional workload. Use the isi performance workload pin <dataset_name|id> <metric>:<value> command to make a workload always visible to you. For example,
isi performance workload pin my_dataset username:bob zone_name:System The performance workload is pinned.
458 Partitioned Performance Monitoring

Apply filter(s) to a dataset
You can apply one or more filters to a dataset. A workload is included if it matches any of the filters. Any workload that doesn't match a filter is aggregated into an excluded workload. Use the isi performance filter apply <dataset_name|id > <metric>:<value> command to apply a filter. For example,
isi performance filter apply my_filtered_dataset zone_name:System The filter is applied.
View statistics
You can view performance resource statistics by workloads. Use the isi statistics workload command to view the statistics.
isi statistics workload --dataset <dataset_name|id> You can view the statistics of the dataset you have created.
Partitioned Performance Monitoring 459

460 Partitioned Performance Monitoring

Additional information
These pointers provide you with some tips regarding the feature.  Name lookup failures, for example UID to username mappings, are reported in an additional column in the statistics output.  Statistics are updated every 30 s. A newly created dataset does not show up in the statistics.
output until the update has occurred. Similarly, an old dataset might show up until that update occurs.  Export ID tracking is only available post commit when upgrading. All other features and statistics are available during
upgrade.  export_id and share_name metrics can be combined in a dataset.
 Dataset with both metrics list workloads with either export_id or share_name  Dataset with only share_name metric excludes NFS workloads.  Dataset with only export_id metric excludes SMB workloads.  Path Tracking is only enabled for SMB.  Due to performance issues in NFS  A warning of this is output when creating a dataset with Path metric.  Pinned workloads/applying filters for NFS in a dataset with path are rejected.  A dataset with a filtered metric specified but with no filters applied do not output any workloads.  Paths and Non-Primary groups are only reported if they are pinned or have a filter applied.  Paths and Non-Primary groups might result in work being accounted twice within the same dataset, as they can match multiple workloads. The total amount that is overaccounted within a dataset is aggregated into the Overaccounted workload.
Partitioned Performance Monitoring 461

36
IPMI
This section contains the following topics.
Topics:
· IPMI overview · Enable IPMI · Enable IPMI power control and Serial over LAN · Configure IPMI username and password
IPMI overview
OneFS support for the Intelligent Platform Management Interface (IPMI) provides remote access to the OneFS console for performing power management operations remotely on your Generation 6 and PowerScale clusters. Support includes Serial over LAN (SoL). Using IPMI, you can:  Power on nodes or the cluster after shut down, for example, after maintenance or a power outage.  Power off nodes or the cluster, for example, after a power outage and when the cluster is operating on backup power.  Perform a Hard/Cold Reboot/Power Cycle, for example, if a node is unresponsive to OneFS. IPMI is disabled by default, and is disabled in releases earlier than OneFS 9.0.0.0. In OneFS 9.0.0.0, use the isi ipmi command to enable and configure IPMI for your cluster:  DHCP  Static IP address  IP address range IP addresses are assigned on a first-available basis. You cannot assign a specific IP address to a specific node. Use the isi ipmi nodes list command to confirm a node's IP address. For security purposes, restrict IPMI traffic to a management-only VLAN. Configure a single username and password for IPMI management for all the nodes in your cluster using isi ipmi user modify --username=<username> --set-password. User names can be up to 16 characters. Passwords must be 17-20 characters. To verify the username configuration, use isi ipmi user view. The physical serial port is disabled when a SoL session is active. The physical serial port is enabled when the SoL session is terminated with the deactivate command. You can also access the cluster using the IPMItool in Linux, available as part of most Linux distributions, or accessible through other proprietary tools. See the Linux man page for ipmitool for usage details.
NOTE: Support for IPMI on Isilon Generation 6 hardware requires node firmware package 10.3.2 and SSP firmware 02.81 or later.
Enable IPMI
Enable IPMI for DHCP, static IP, or a range of IP addresses. IP addresses are assigned on a first-available basis. You cannot assign a specific IP address to a specific node. IPMI commands require that you have the ISI_PRIV_IPMI privilege. 1. To enable IPMI for DHCP:
isi ipmi settings modify --enabled=True --allocation-type=dhcp
462 IPMI

2. To enable IPMI for a static IP address: isi ipmi settings modify --enabled=True --allocation-type=static
3. To enable IPMI for a range of IP addresses: isi ipmi network modify --gateway=[gateway IP] --prefixlen= --ranges=[IP Range]
Enable IPMI power control and Serial over LAN
You can enable the power control and Serial over LAN features and confirm the enabled features. IPMI commands require that you have the ISI_PRIV_IPMI privilege. 1. To enable the power control feature:
isi ipmi features modify Power-Control --enabled=True 2. To enable the Serial over LAN (SoL) feature:
isi ipmi features modify SOL --enabled=True 3. To confirm the enabled features:
isi ipmi features list
Configure IPMI username and password
Configure a single username and password to perform IPMI tasks across all nodes in your cluster. IPMI commands require that you have the ISI_PRIV_IPMI privilege. Usernames can be up to 16 characters in length. Passwords must be 17-20 characters in length. 1. To configure the username and password:
isi ipmi user modify --username [Username] --set-password 2. Confirm the username configuration:
isi ipmi user view
IPMI 463

37
Antivirus
This section contains the following topics:
Topics:
· Antivirus overview · On-access scanning · ICAP Antivirus policy scanning · Individual file scanning using ICAP · WORM files and antivirus · Antivirus scan reports · ICAP servers · CAVA servers · ICAP threat responses · CAVA threat responses · Configuring global antivirus settings · Managing ICAP servers · Managing CAVA servers · Create an ICAP antivirus policy · Managing ICAP antivirus policies · Managing antivirus scans · Managing antivirus threats · Managing antivirus reports
Antivirus overview
You can scan the files that you store on a PowerScale cluster for viruses, malware, and other security threats by integrating with third-party scanning services through the Internet Content Adaptation Protocol (ICAP) or the Common AntiVirus Agent (CAVA). OneFS sends files through ICAP or CAVA to a server running third-party antivirus scanning software. These servers are called ICAP servers or CAVA servers. These servers scan files for viruses. After a server scans a file, it notifies OneFS of whether the file is a threat. If a threat is detected, OneFSnotifies system administrators by creating an event, displaying near real-time summary information, and documenting the threat in an antivirus scan report. You can configure OneFS to request that ICAP or CAVA servers attempt to repair infected files. You can also configure OneFS to protect users against potentially dangerous files by truncating or quarantining infected files. Before OneFS sends a file for scanning, it ensures that the scan is not redundant. If a scanned file has not been modified since the last scan, the file will not be scanned again unless the virus database on the antivirus server has been updated since the last scan.
NOTE: Antivirus scanning is available only on nodes in the cluster that are connected to the external network.
On-access scanning
You can configure OneFS to send files to be scanned before they are opened, after they are closed, or both. This can be done through file access protocols such as SMB, NFS, and SSH. Sending files to be scanned after they are closed is faster but less secure. Sending files to be scanned before they are opened is slower but more secure. If OneFS is configured to ensure that files are scanned after they are closed, when a user creates or modifies a file on the cluster, OneFS queues the file to be scanned. OneFS then sends the file to an ICAP or CAVA server to be scanned when convenient. In this configuration, users can always access files without any delay. However, it is possible that after a user
464 Antivirus

modifies or creates a file, a second user might access the file before the file is scanned. If a virus was introduced to the file from the first user, the second user will be able to access the infected file. Also, if an ICAP or CAVA server is unable to scan a file, the file will still be accessible to users. If OneFS ensures that files are scanned before they are opened, when a user attempts to download a file from the cluster, OneFS first sends the file to an ICAP or CAVA server to be scanned. The file is not sent to the user until the scan is complete. Scanning files before they are opened is more secure than scanning files after they are closed, because users can access only scanned files. However, scanning files before they are opened requires users to wait for files to be scanned. You can also configure OneFS to deny access to files that cannot be scanned by an ICAP or CAVA server, which can increase the delay. For example, if no ICAP or CAVA servers are available, users will not be able to access any files until the servers become available again. If you configure OneFS to ensure that files are scanned before they are opened, it is recommended that you also configure OneFS to ensure that files are scanned after they are closed. Scanning files as they are both opened and closed will not necessarily improve security, but it will usually improve data availability when compared to scanning files only when they are opened. If a user wants to access a file, the file may have already been scanned after the file was last modified, and will not need to be scanned again if the antivirus server database has not been updated since the last scan.
NOTE: When scanning, do not exclude any file types (extensions). This ensures that any renamed files are caught.
ICAP Antivirus policy scanning
ICAP supports setting antivirus policies. You can use the OneFS Job Engine to create ICAP antivirus scanning policies that send files from a specified directory to be scanned. ICAP antivirus policies can be run manually at any time, or configured to run according to a schedule. ICAP antivirus policies target a specific directory on the cluster. You can prevent an antivirus policy from sending certain files within the specified root directory based on the size, name, or extension of the file. On-access scans also support filtering by size, name, and extensions, using the isi antivirus icap settings command. ICAP antivirus policies do not target snapshots. Only on-access scans include snapshots.
Individual file scanning using ICAP
You can send a specific file to an ICAP server to be scanned at any time. If a virus is detected in a file but the ICAP server is unable to repair it, you can send the file to the ICAP server after the virus database had been updated, and the ICAP server might be able to repair the file. You can also scan individual files to test the connection between the cluster and ICAP servers.
WORM files and antivirus
WORM (write-once, read-many) files can be scanned and quarantined by antivirus software, but cannot be repaired or deleted until their retention period expires. The SmartLock software module enables you to identify a directory in OneFS as a WORM domain. All files within the WORM domain will be committed to a WORM state, meaning that those files cannot be overwritten, modified, or deleted. As with other files in OneFS, WORM files can be scanned for viruses and other security threats. However, because of their protected read-only nature, WORM files cannot be repaired or deleted during an antivirus scan. If a WORM file is found to be a threat, the file is quarantined. When practical, you can initiate an antivirus scan on files before they are committed to a WORM state.
Antivirus scan reports
OneFS generates reports about antivirus scans. Each time that an ICAP antivirus policy is run, OneFS generates a report for that policy. OneFS also generates a report every 24 hours that includes all on-access scans that occurred during the day. Antivirus scan reports contain the following information:  The time that the scan started.
Antivirus 465

 The time that the scan ended.  The total number of files scanned.  The total size of the files scanned.  The total network traffic sent.  The network throughput that was consumed by virus scanning.  Whether the scan succeeded.  The total number of infected files detected.  The names of infected files.  The threats associated with infected files.  How OneFS responded to detected threats.

ICAP servers
The number of ICAP servers that are required to support a PowerScale cluster depends on how virus scanning is configured, the amount of data a cluster processes, and the processing power of the ICAP servers.
If you intend to scan files exclusively through anti-virus scan policies, it is recommended that you have a minimum of two ICAP servers per cluster. If you intend to scan files on access, it is recommended that you have at least one ICAP server for each node in the cluster.
If you configure more than one ICAP server for a cluster, ensure that the processing power of each ICAP server is relatively equal. OneFS distributes files to the ICAP servers on a rotating basis, regardless of the processing power of the ICAP servers. If one server is more powerful than another, OneFS does not send more files to the more powerful server.
CAUTION: When files are sent from the cluster to an ICAP server, they are sent across the network in cleartext. Ensure that the path from the cluster to the ICAP server is on a trusted network. Authentication is not supported. If authentication is required between an ICAP client and ICAP server, hop-by-hop Proxy Authentication must be used.

CAVA servers
CAVA uses industry-standard Server Message Block (SMB) protocol versions 2 and 3 in a Microsoft Windows Server environment. CAVA uses third-party antivirus software to identify and eliminate known viruses before they infect files on the system.
You can use the CAVA calculator and the CAVA sizing tool to determine the number of antivirus servers that the system requires. It is recommended that you start with one Common Event Enabler (CEE) server per two nodes and adjust the number as needed. For information about the sizing tool and using CAVA on Windows platforms, see the chapter Monitoring and Sizing the Antivirus Agent in the Dell Technologies CEE document Using the Common Event Enabler on Windows Platforms.

ICAP threat responses

You can configure the system to repair, quarantine, or truncate any files that the ICAP server detects viruses in. OneFS and ICAP servers react in one or more of the following ways when threats are detected:

Alert Repair Quarantine
Truncate

All threats that are detected cause an event to be generated in OneFS at the warning level, regardless of the threat response configuration.
The ICAP server attempts to repair the infected file before returning the file to OneFS.
OneFS quarantines the infected file. A quarantined file cannot be accessed by any user. However, a quarantined file can be removed from quarantine by the root user if the root user is connected to the cluster through secure shell (SSH). If you back up your cluster through NDMP backup, quarantined files will remain quarantined when the files are restored. If you replicate quarantined files to another PowerScale cluster, the quarantined files will continue to be quarantined on the target cluster. Quarantines operate independently of access control lists (ACLs).
OneFS truncates the infected file. When a file is truncated, OneFS reduces the size of the file to zero bytes to render the file harmless.

466 Antivirus

You can configure OneFS and ICAP servers to react in one of the following ways when threats are detected:

Repair or quarantine Repair or truncate
Alert only Repair only
Quarantine Truncate

Attempts to repair infected files. If an ICAP server fails to repair a file, OneFS quarantines the file. If the ICAP server repairs the file successfully, OneFS sends the file to the user. Repair or quarantine can be useful if you want to protect users from accessing infected files while retaining all data on a cluster.
Attempts to repair infected files. If an ICAP server fails to repair a file, OneFS truncates the file. If the ICAP server repairs the file successfully, OneFS sends the file to the user. Repair or truncate can be useful if you do not care about retaining all data on your cluster, and you want to free storage space. However, data in infected files will be lost.
Only generates an event for each infected file. It is recommended that you do not apply this setting.
Attempts to repair infected files. Afterwards, OneFS sends the files to the user, whether or not the ICAP server repaired the files successfully. It is recommended that you do not apply this setting. If you only attempt to repair files, users will still be able to access infected files that cannot be repaired.
Quarantines all infected files. It is recommended that you do not apply this setting. If you quarantine files without attempting to repair them, you might deny access to infected files that could have been repaired.
Truncates all infected files. It is recommended that you do not apply this setting. If you truncate files without attempting to repair them, you might delete data unnecessarily.

CAVA threat responses
You configure CAVA threat responses in the antivirus software you use. See your CAVA antivirus software documentation for information about how to configure your CAVA software to perform threat handling.
Configuring global antivirus settings
You can configure global antivirus settings that are applied to all antivirus scans by default.

Include specific files in antivirus scans
You can target specific files for scans by antivirus policies. Run the isi antivirus settings modify command. The following command configures OneFS to scan only files with the .txt extension:
isi antivirus settings modify --glob-filters-enabled true \ --glob-filters .txt

Configure on-access scanning settings
You can configure OneFS to automatically scan files as they are accessed by users. On-access scans operate independently of antivirus policies. Run the isi antivirus settings modify command. The following command configures OneFS to scan files and directories under /ifs/data/media when they are closed:
isi antivirus settings modify --scan-on-close true \ --path-prefixes /ifs/data/media

Antivirus 467

Configure antivirus threat response settings
You can configure how OneFS responds to detected threats. Run the isi antivirus settings modify command. The following command configures OneFS and ICAP servers to attempt to repair infected files and quarantine files that cannot be repaired:
isi antivirus settings modify --repair true --quarantine true
Configure antivirus report retention settings
You can configure how long OneFS retains antivirus reports before automatically deleting them. Run the isi antivirus settings modify command. The following command configures OneFS to delete antivirus reports older than 12 weeks.
isi antivirus settings modify --report-expiry 12w
Enable or disable antivirus scanning
You can enable or disable all antivirus scanning. This procedure is available only through the web administration interface. Run the isi antivirus settings modify command. The following command enables antivirus scanning
isi antivirus settings modify --service enable The following command disables antivirus scanning
isi antivirus settings modify --service disable
Managing ICAP servers
Before you can send files to be scanned on an ICAP server, you must configure OneFS to connect to the server. You can test, modify, and remove an ICAP server connection. You can also temporarily disconnect and reconnect to an ICAP server.
Add and connect to an ICAP server
You can add and connect to an ICAP server. After a server is added, OneFS can send files to the server to be scanned for viruses. Run the isi antivirus servers create command. The following command adds and connects to an ICAP server at 10.7.180.108:
isi antivirus servers create icap://10.7.180.108 --enabled yes
Temporarily disconnect from an ICAP server
If you want to prevent OneFS from sending files to an ICAP server, but want to retain the ICAP server connection settings, you can temporarily disconnect from the ICAP server. Run the isi antivirus servers modify command. The following command temporarily disconnects from an ICAP server with a URL of icap://10.7.180.108:
isi antivirus servers modify icap://10.7.180.108 --enabled yes
468 Antivirus

Reconnect to an ICAP server
You can reconnect to an ICAP server that you have temporarily disconnected from. Run the isi antivirus servers modify command. The following command reconnects to an ICAP server with a URL of icap://10.7.180.108:
isi antivirus servers modify icap://10.7.180.108 --enabled no
Remove an ICAP server
You can permanently disconnect from the ICAP server. 1. Run the isi antivirus servers delete command.
The following command removes an ICAP server with an ID of icap://10.7.180.108: isi antivirus servers delete icap://10.7.180.108
2. Type yes and then press ENTER.
Managing CAVA servers
To enable scanning files on a CAVA server, you create (or modify) the CAVA server configuration. Each node can have multiple connections to the CAVA server. The number of connections is based on an internal algorithm.
Add and connect to a CAVA server
You can add and connect to a CAVA server with the isi antivirus cava servers create command. After a server is added, OneFS can send files to the server to be scanned for viruses. OneFS supports the following as server URIs:  IPv4  IPv6  FQDN in UTF-8 (Unicode) If the port or path is missing, it will be added. Run the isi antivirus cava servers create command. The following command adds and connects to a CAVA server named CAVAServer1at cavaserver-1.mycompany.com:12228/ce:
isi antivirus cava servers create CAVAServer1 cava-server-1.mycompany.com:12228/ce -enabled true
Modify CAVA connection settings
You can modify the IP address, name, URL, and enable status of CAVA server connections with the isi antivirus cava servers modify command. The following command modifies the URI of a CAVA server to first-cava-server.mycompany.com:12228/cee:
isi antivirus cava servers modify CavaServer --server-uri first-cavaserver.mycompany.com:12228/cee
Antivirus 469

List or view CAVA servers

You can list the available CEE/CAVA antivirus servers and their configurations with the isi antivirus cava servers list command. You can view the properties of a particular CAVA server with the isi antivirus cava servers view command.
1. The following command shows the name, URI, enable status, and type (CEE/CAVA) of the available CAVA servers:

isi antivirus cava servers list

Server Name Server

URI Enabled Server Type

----------------------------------------------------------------------------

CAVA1

cava-server-1.mycompany.com:12228/cee Yes

CEE/CAVA

CAVA2

cava-server-2.mycompany.com:12228/cee Yes

CEE/CAVA

----------------------------------------------------------------------------

Total: 2

2. The following command shows the properties of a CAVA server named CAVA1:

isi antivirus cava servers view CAVA1 Server Name: CAVA1 Server URI: cava-server-1.mycompany.com:12228/cee Enabled: Yes Server Type: CEE/CAVA

Disable connection to a CAVA server
You can disable the connection to a CAVA server. Run the isi antivirus cava servers modify command. The following command disables, but does not delete, the connection to a CAVA server:
isi antivirus cava servers modify <server-name> --enabled=no

Delete a CAVA server configuration
You can delete a CAVA server configuration with the isi antivirus cava servers delete command. Specify the unique CAVA server name to delete that server. Specify --all to delete all CAVA servers. All connections to that server will be closed and that server will not be used to process antivirus scan requests. The following command deletes the CAVA server configuration CAVA1:
cava-user# isi antivirus cava servers delete CAVA1 Are you sure? (yes/[no]): yes

Create an IP pool in a CAVA server
You must create a dedicated IP pool for the exclusive use of the CAVA servers. Do not mix the IP range in this dedicated IP pool with others for regular SMB client connections. Any other protocol use will fail. 1. Set up an IP pool for the exclusive use of the CAVA servers. For instructions, see the section Managing IP address pools in
the Networking chapter of this guide. 2. Associate the IP pool with the CAVA configurations:
isi antivirus cava settings modify --ip-pool="<IP pool name>"
When prompted, enter yes. Output similar to the following displays:
IP Pool <IP pool name> added to CAVA antivirus. Note: The access zone of IP Pool <IP pool name> has been changed to AvVendor.

470 Antivirus

For example, the following command associates the IP pool groupnet0.subnet0.pool1 with the CAVA servers.
isi antivirus cava settings modify --ip-pool groupnet0.subnet0.pool1 This action will make the IP Pool unavailable to all other users except antivirus servers. Do you want to continue? (yes/[no]): yes
IP Pool groupnet0.subnet0.pool1 added to CAVA antivirus.
Note: The access zone of IP Pool groupnet0.subnet0.pool1 has been changed to AvVendor.

Create a dedicated Access Zone on a CAVA server
A dedicated Access Zone is automatically created when the CAVA service is enabled in the PowerScalecluster. The AvVendor access zone is restricted. You cannot create SMB shares and NFS exports in the AvVendor access zone.
NOTE: The AvVendor access zone is not deleted when CAVA is disabled, so access zone settings are preserved. 1. Run the following command to enable the CAVA service:
isi antivirus cava settings modify --service-enabled=1 2. View the CAVA settings and confirm Server Enable is set to Yes:
isi antivirus cava settings modify --service-enabled=1

isi antivirus cava settings view Service Enabled: Yes
Scan Access Zones: System IP Pool: groupnet0.subnet0.pool1
Report Expiry: 8 weeks, 4 days Scan Timeout: 1 minute
Cloudpool Scan Timeout: 1 minute Maximum Scan Size: 0.00kB
3. Verify the AvVendor creation in the Access Zone list:

isi zone zones list

Output similar to the following is displayed

Name

Path

--------------

System /ifs

AvVendor /ifs

--------------

Total: 2

Create an Active Directory authentication provider for the AvVendor access zone
All the anti-virus application servers and PowerScale cluster should be in the same domain. 1. Create the Active Directory by running a command similar to the following.
isi auth ads create <Domain name> --user administrator
For details, see the section Managing Active Directory providers in the Authentication chapter of this guide.

Antivirus 471

2. Add the authentication provider to the AvVendor access zone:
isi zone zones modify AvVendor --add-auth-providers=lsa-activedirectoryprovider:<Domain name>
For details, see the section Manage authentication providers in an access zone in the Access Zones chapter of this guide.
Update the role in the access zone
The CHECK$ share is a hidden share that allows access to all files on the cluster. It is used exclusively by the antivirus software running on a Windows server. Since the CHECK$ share allows access to all files on the cluster, any user accessing the share must have a unique privilege. The hidden ISI_PRIV_AV_VENDOR privilege within AVVendor role will be added to give the user account used by the antivirus software access to the CHECK$ share. The user account must be the same as used to configure the CAVA server. 1. Run the following command to assign the user ADS\cavausr to the role AVVendor in the AvVendor Access Zone:
isi auth roles modify AVVendor --zone=AvVendor --add-user ads\\cavausr
2. Confirm the System Status is RUNNING:
isi antivirus cava status
Output similar to the following is displayed:
System Status: RUNNING Fault Message: CEE Version: 8.7.7.0 DTD Version: 2.3.0 AV Vendor: Symantec
Scan CloudPool files in a CAVA server
You can scan files in CloudPools using a CAVA configuration. Note that by default, scanning of CloudPools files is disabled to prevent the unexpected cost of file callback, and scanning Cloudpool files will trigger file reads in the cloud. To enable this setting, modify the filter and job settings. 1. Run the following command to view the status of the Scan Cloudpool files setting:
isi antivirus cava filters view System
Output similar to the following displays:
Zone: System Enabled: Yes Open-on-fail: Yes File Extensions: * File Extension Action: include Scan If No Extension: No Exclude Paths: Scan-profile: standard Scan-on-read: No Scan-on-close: Yes Scan-on-rename: Yes Scan Cloudpool Files: No
2. Run the following command to modify the Scan Cloudpools files setting:
isi antivirus cava filters modify <zone> --scan-cloudpool-files true
This command applies to access to the specified zone using file protocols only. Each scheduled job has its own setting.
472 Antivirus

3. Run the following command to modify the CAVA jobs setting: isi antivirus cava jobs modify <job name> --scan-cloudpool-files true

View CloudPool scan timeout
You can view the time duration after which the CloudPool scan stops. Run the following command to view the scanning duration for the CloudPool files.
isi antivirus cava settings view
Output similar to the following displays:
Service Enabled: No Scan Access Zones: System, test-hdfs1
IP Pool: Report Expiry: 8 weeks, 4 days
Scan Timeout: 1 minute Cloudpool Scan Timeout: 1 minute
Maximum Scan Size: 0.00kB

Manage CAVA filters
You can manage CAVA antivirus filters per access zone using the isi antivirus cava filters <action> command. You can modify settings for on-demand or protocol access. Actions include:  list - List CAVA filter entries.  modify - Modify a CAVA filter.  view - View a CAVA antivirus filter. You must have the ISI_PRIV_ANTIVIRUS privilege to manage CAVA filters. See the OneFS CLI Command Reference for the complete list and usage details for isi antivirus cava filters.
NOTE: You can disable each access zone individually for CAVA antivirus.

List CAVA filters

You can view a list of CAVA antivirus filter entries for all access zones. Run the following command:

isi antivirus cava filters list

Output similar to the following displays:

Zone

Enabled Open-on-fail Scan-profile Scan Cloudpool Files

---------------------------------------------------------------------

System

Yes

Yes

standard

No

test-hdfs1 Yes

Yes

standard

No

---------------------------------------------------------------------

Antivirus 473

Modify CAVA filter
You can modify a cava filter for an access zone. Run the following command:
isi antivirus cava filters modify <zone>
View CAVA filter
You can view a CAVA antivirus filter for an access zone. Run the following command:
isi antivirus cava filters view <zone>
Output similar to the following displays for the System access zone:
Zone: System Enabled: Yes Open-on-fail: Yes File Extensions: * File Extension Action: include Scan If No Extension: No Exclude Paths: Scan-profile: standard Scan-on-read: No Scan-on-close: Yes Scan-on-rename: Yes Scan Cloudpool Files: No
Manage CAVA jobs
You configure and manage CAVA antivirus scans using the isi antivirus cava jobs <action> command. Actions include:  create - Create a CAVA antivirus job.  delete - Delete CAVA antivirus jobs.  list - List CAVA antivirus jobs.  start - Manually schedule a scan job.  modify - Modify CAVA antivirus jobs.  view - View a CAVA antivirus job. You must have the ISI_PRIV_ANTIVIRUS privilege to manage CAVA jobs. See the OneFS CLI Command Reference for the complete list and usage details for isi antivirus cava jobs. The following command creates a CAVA job named CAVAJob1. that scans /ifs/data every Friday at 11:00 PM.
isi antivirus cava jobs create CAVAJob1 -d "CAVA scan job #1" -e Yes --schedule 'every Friday at 23:00' --impact OFF_HOURS --paths-to-include /ifs/data
Create an ICAP antivirus policy
You can create an ICAP antivirus policy that causes specific files to be scanned for viruses each time the policy is run. Antivirus policies are specific to ICAP. Run the isi antivirus icap policies create command.
474 Antivirus

The following command creates an antivirus policy that scans /ifs/data every Friday at 12:00 PM: isi antivirus icap policies create WeekendVirusScan --paths /ifs/data \ --schedule "Every Friday at 12:00 PM"
Managing ICAP antivirus policies
Antivirus policies are specific to ICAP. You can modify and delete ICAP antivirus policies. You can also temporarily disable antivirus policies if you want to retain the policy but do not want to scan files.
Modify an ICAP antivirus policy
You can modify an ICAP antivirus policy. Run the isi antivirus icap policies modify command. The following command modifies a policy called WeekendVirusScan to be run on Saturday at 12:00 PM:
isi antivirus icap policies modify WeekendVirusScan \ --schedule "Every Friday at 12:00 PM"
Delete an ICAP antivirus policy
You can delete an ICAP antivirus policy. Run the isi antivirus icap policies delete command. The following command deletes a policy called WeekendVirusScan:
isi antivirus icap policies delete WeekendVirusScan
Enable or disable an ICAP antivirus policy
You can temporarily disable ICAP antivirus policies if you want to retain the policy but do not want to scan files. Run the isi antivirus icap policies modify command. The following command enables a policy called WeekendVirusScan:
isi antivirus icap policies modify WeekendVirusScan --enabled yes The following command disables a policy called WeekendVirusScan:
isi antivirus icap policies modify WeekendVirusScan --enabled no
View ICAP antivirus policies
You can view ICAP antivirus policies. Run the following command:
isi antivirus icap policies list
Managing antivirus scans
You can scan multiple files for viruses by manually running an antivirus policy, or scan an individual file without an antivirus policy. You can also stop antivirus scans.
Antivirus 475

Scan a file
You can manually scan an individual file for viruses. Run the isi antivirus scan command. The following command scans the /ifs/data/virus_file file for viruses:
isi antivirus scan /ifs/data/virus_file
Manually run an ICAP antivirus policy
You can manually run an ICAP antivirus policy at any time. This procedure is available only through the web administration interface. 1. Click Data Protection > Antivirus > ICAP > Policies. 2. In the Antivirus Policies table, in the row for a policy, click More > Run Policy.
Stop a running antivirus scan
You can stop a running antivirus scan. This procedure is available only through the web administration interface. 1. Click Cluster Management > Job Operations > Job Summary. 2. In the Active Jobs table, in the row with type AVScan, click More > Cancel Running Job.
Managing antivirus threats
You can repair, quarantine, or truncate files in which threats are detected. If you think that a quarantined file is no longer a threat, you can rescan the file or remove the file from quarantine.
Manually quarantine a file
You can quarantine a file to prevent the file from being accessed by users. Run the isi antivirus quarantine command. The following command quarantines /ifs/data/badFile.txt:
isi antivirus quarantine /ifs/data/badFile.txt
Rescan a file
You can rescan a file for viruses if, for example, you believe that a file is no longer a threat. Run the isi antivirus scan command. For example, the following command scans /ifs/data/virus_file:
isi antivirus scan /ifs/data/virus_file
Remove a file from quarantine
You can remove a file from quarantine if, for example, you believe that the file is no longer a threat. Run the isi antivirus release command.
476 Antivirus

The following command removes /ifs/data/badFile.txt from quarantine: isi antivirus release /ifs/data/newFile
Manually truncate a file
If a threat is detected in a file, and the file is irreparable and no longer needed, you can manually truncate the file. Run the truncate command on a file. The following command truncates the /ifs/data/virus_file file:
truncate -s 0 /ifs/data/virus_file
View threats
You can view files that have been identified as threats by an ICAP server. Run the following command:
isi antivirus reports threats list

Antivirus threat information

You can view information about the antivirus threats that are reported by an ICAP server. The following information is displayed in the output of the isi antivirus reports threats list command.

Scan ID:

The ID of the antivirus report.
The ID of the antivirus policy that detected the threat. If the threat was detected as a result of a manual antivirus scan of an individual file, MANUAL is displayed.

Remediation

How OneFS responded to the file when the threat was detected. If OneFS did not quarantine or truncate the file, Infected is displayed.

Threat Time

The name of the detected threat as it is recognized by the ICAP server. The time that the threat was detected.

Managing antivirus reports
You can view antivirus reports through the web administration interface. You can also view events that are related to antivirus activity.

View antivirus reports
You can view antivirus reports. Run the following command:
isi antivirus reports scans list

Antivirus 477

View antivirus events

You can view events that relate to antivirus activity. Run the following command:

isi event events list

All events related to antivirus scans are classified as warnings. The following events are related to antivirus activities:

AVScan Infected A threat was detected by an antivirus scan. These events refer to specific reports on the Antivirus

File Found

Reports page but do not provide threat details.

No ICAP Servers OneFS is unable to communicate with any ICAP servers. available

ICAP Server Misconfigured, Unreachable or Unresponsive

OneFS is unable to communicate with an ICAP server.

478 Antivirus



References

Antenna House PDF Output Library 7.1.1629

Search Any Device: