FabricWatch AdminGd v720
FabricWatch AdminGd v720
752
®
26 July 2013
Fabric Watch
Administrator’s Guide
Fabric Watch Administrator’s Guide 53-1001770-01 Updates to support Fabric OS v6.4.0: March 2010
portThConfig, sysMonitor, thConfig, and
portFencing commands (recommended
for use in configuring class areas instead
of the fwConfigure command);
recommended class settings added.
Fabric Watch Administrator’s Guide 53-1002153-01 Updates to support Fabric OS v7.0.0: April 2011
• Removed deprecated commands:
fwconfigure and fwshow.
• Removed RAPI trap support.
• Added information about “in-
between” thresholds.
• Updated portthconfig command
recommendations.
• Changed Port Fencing section.
• Added the thMonitor command,
which supports the SFP class.
• Added information about 16 Gbps
and QSFP SFPs.
• Added packet loss monitoring
enhancements for the VE_Port.
• Removed support for “changed”
action.
• Removed support for “seconds”
time base.
• Added pause / continue feature.
Fabric Watch Administrator’s Guide 53-1002153-02 Updated the Copyright section. December 2011
Added support for Brocade 6505.
Updates to support Fabric OS v7.0.1:
• Chapter 1—Added universal
temporary license support
information.
• Chapter 7—Added port fencing
thresholds for state change and link
reset areas.
• Chapter 8—Added information
about the switch status on the
power supply for the Brocade DCX
8510-8.
• Chapter 9—Removed information
regarding custom defined
percentage values using Web Tools.
Fabric Watch Administrator’s Guide 53-1002752-01 Very minor updates (this is the last December 2012
release for Fabric Watch; being replaced
by MAPS in next Fabric OS version)
• Continuous alerting mode has been
removed from Fabric Watch.
• A modification to the ITW counter.
Fabric Watch Administrator’s Guide 53-1002752-02 Corrected errors and omissions in the March 2013
guide.
Fabric Watch Administrator’s Guide 53-1002998-01 Corrected errors and omissions in the July 2013
guide.
Index
In this chapter
• How this document is organized . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xv
• Supported hardware and software. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvi
• What’s new in this document . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii
• Document conventions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii
• Additional information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xviii
• Getting technical help . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xix
• Document feedback . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xix
Document conventions
This section describes text formatting conventions and important notices formats.
Text formatting
The narrative-text formatting conventions that are used in this document are as follows:
bold text Identifies command names
Identifies the names of user-manipulated GUI elements
Identifies keywords and operands
Identifies text to enter at the GUI or CLI
italic text Provides emphasis
Identifies variables
Identifies paths and Internet addresses
Identifies document titles
code text Identifies CLI output
Identifies syntax examples
For readability, command names in the narrative portions of this guide are presented in mixed
lettercase: for example, switchShow. In actual examples, command lettercase is often all
lowercase. Otherwise, this manual specifically notes those cases in which a command is case
sensitive.
NOTE
A note provides a tip, guidance, or advice, emphasizes important information, or provides a
reference to related information.
ATTENTION
An Attention statement indicates potential damage to hardware or data.
DANGER
A Danger statement indicates conditions or situations that can be potentially lethal or extremely
hazardous to you. Safety labels are also attached directly to products to warn of these conditions
or situations.
Key terms
For definitions specific to Brocade and Fibre Channel, refer to the Brocade Glossary.
For definitions of SAN-specific terms, visit the Storage Networking Industry Association online
dictionary at http://www.snia.org/education/dictionary.
Additional information
This section lists additional Brocade and industry-specific documentation that you might find
helpful.
To get up-to-the-minute information, go to http://my.brocade.com to register at no cost for a user ID
and password.
White papers, online demonstrations, and data sheets are available through the Brocade website
at:
http://www.brocade.com/products-solutions/products/index.page
For additional Brocade documentation, visit the Brocade website:
http://www.brocade.com
Release notes are available on the MyBrocade website and are also bundled with the Fabric OS
firmware.
Document feedback
Quality is our first concern at Brocade and we have made every effort to ensure the accuracy and
completeness of this document. However, if you find an error or an omission, or you think that a
topic needs further development, we want to hear from you. Forward your feedback to:
[email protected]
Provide the title and version number of the document and as much detail as possible about your
comment, including the topic heading and page number and your suggestions for improvement.
Fabric Watch 1
In this chapter
• Fabric health . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
• Fabric Watch overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
• Role-based access control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
• Fabric Watch licensing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
• Reasons to customize Fabric Watch settings . . . . . . . . . . . . . . . . . . . . . . . . . 3
• Class, area, and element hierarchy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
• Switch monitoring components . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
• Logical switch support . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
• Threshold monitoring using SNMP tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
• Fabric Watch event settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
• Fabric Watch notification types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
• Fabric Watch audit messages. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
• Data values . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
• Fabric Watch support in Access Gateway mode . . . . . . . . . . . . . . . . . . . . . . 12
Fabric health
Fabric health refers to the capability of the fabric to route data. A healthy fabric enables effective
data transmission between networked devices.
One of the more obvious criteria for fabric health is the condition of the network hardware. A switch
or port failure can prevent data packets from reaching their destination. Network traffic can also
influence fabric health.
If the number of packets routed through a port exceeds the port bandwidth, it causes network
delays and packet loss. Receive (Rx) and Transmit (Tx) performance areas are used to monitor the
bandwidth utilization to help keep traffic flowing to avoid congestion.
Because of the varied factors involved in determining fabric health, Fabric Watch can help you to
detect, identify, and resolve fabric health issues by continuously monitoring possible issues and
reporting any potential concerns. Fabric Watch automatically provides detailed reports on detected
issues and helps you correct failures.
Alert configuration
When Fabric Watch is improperly configured, a large number of error messages can be sent over a
short period of time, making it difficult to find those messages that are actually meaningful. If this
happens, there are a few simple ways to improve the configuration.
When large numbers of unimportant messages are received, examining the source can identify
those classes that need to be reconfigured. To reduce the number of unimportant messages,
consider the following reconfiguration options:
• Recheck the threshold settings. If the current thresholds are not realistic for the class and
area, messages may be sent frequently without need. For example, a high threshold for
temperature monitoring set to less than room temperature is probably incorrectly configured.
These messages could cause other important messages to be missed.
• Examine the notification settings. If you are not interested in receiving messages under certain
conditions, ensure that the notification setting for that event is set to zero.
Brocade recommends using either SNMP trap alerting to your system management console or
event log entry in conjunction with syslog forwarding configured on your switches.
NOTE
You cannot configure different thresholds for server and storage ports, because threshold
configuration is an area-wide setting and cannot be configured on an element (port).
Monitoring
Do you want to monitor all class areas, or implement the monitoring in incremental stages? If you
monitor class areas incrementally, you should configure Fabric Watch to monitor the classes in the
following order:
1. Monitor Fabric class areas using the thConfig command.
Refer to Chapter 6, “Fabric, Security, SFP, and Performance Monitoring,” for details.
2. Monitor Port class areas using the portThConfig command.
Refer to Chapter 7, “Port Monitoring,” for details.
3. Monitor FRU class areas using the fwFruCfg command.
Refer to Chapter 8, “System Monitoring,” for details.
NOTE
For each class area, there are setting guidelines and recommendations for whether you should
leave the setting at the default or change the settings. If a change is recommended, the reason for
the change and the suggested settings are provided in each of the configuration chapters. The
default settings are listed in these chapters as well.
Post-processing of messages
After you have configured thresholds and alerts, determine to where the messages will be sent.
Then, monitor the messages frequently and take the appropriate actions.
An example of a very simple Class --> Area --> Element hierarchy follows.
Port
--> Cyclic redundancy check (CRC)
-->port 0
For specific information about classes, areas, and elements, refer to Chapter 3, “Fabric Watch
Threshold Components”.
Performance monitoring
Performance monitoring groups areas that track the source and destination of traffic. Use the
Performance Monitor class thresholds and alarms to determine traffic load and flow and to
reallocate resources appropriately.
You can customize Performance Monitor class and area parameters using the thConfig command.
The fmConfig command Manages frame monitor configuration, replacing deprecated advanced
performance monitoring commands. Use the fmConfig command to configure, install, and display
frame monitors across port ranges on a switch. Refer to the Fabric OS Command Reference for
details.
The Performance Monitor class is divided into the following areas:
• EE (end-to-end) Performance Monitor - Monitors RX and TX performance between two devices.
• Filter Performance Monitor - Measures the number of frames transmitted through a port that
match specific values in the first 64 bytes of the frame. Because the entire Fibre Channel
frame header and many of upper protocol's header fall within the first 64 bytes of a frame,
filter-based monitoring can measure different types of traffic transmitted through a port.
NOTE
Performance Monitoring is not supported on VE_Ports, EX_Ports, and VEX _Ports.
Security monitoring
The Security class monitors different security violations on the switch and takes action based on
the configured thresholds and their actions. You can customize Security class and area parameters
using the thConfig command.
For complete information about security monitoring, refer to “Security monitoring guidelines and
default settings” on page 37.
SFP monitoring
The SFP class groups areas that monitor the physical aspects of an SFP, such as voltage, current,
RXP, and TXP in physical ports, E_Ports, FOP_Ports, and FCU_Ports. An SFP class alarm alerts you
to an SFP fault. You can customize SFP class and area parameters using the thConfig command.
Use the thMonitor command to monitor the Brocade 10 Gbps and 16 Gbps SFP modules and 16
Gbps QSFPs. By default, the 10 Gbps SFP and the 16 Gbps SFP and QSFP are disabled. Refer to
“16 Gbps SFP and QSFP monitoring” on page 44 for more information.
NOTE
SFPs connected to any GbE ports are not monitored by Fabric Watch.
For complete information about SFP monitoring, refer to “SFP monitoring guidelines and default
settings” on page 39.
Port monitoring
Port monitoring monitors port statistics and takes action based on the configured thresholds and
actions. You can configure thresholds per port type and apply the configuration to all ports of the
specified type using the portThConfig command. Configurable ports include physical ports,
E_Ports, optical F_Ports (FOP_Ports), copper F_Ports (FCU_Ports), and Virtual E_Ports (VE_Ports).
NOTE
The execution of the portThConfig command is subject to Virtual Fabric or Admin Domain
restrictions that may be in place. Refer to the Fabric OS Command Reference for more information
and for details about the portThConfig command.
For complete information about port monitoring, including configuration examples, port setting
guidelines, and default settings, refer to “Port Monitoring” on page 49.
Port persistence
The data collected in port monitoring can vary a great deal over short time periods. Therefore, the
port can become a source of frequent event messages (the data can exceed the threshold range
and return to a value within the threshold range).
Fabric Watch uses port persistence for a port event that requires the transition of the port into a
marginal status. Fabric Watch does not record any event until the event persists for a length of time
equal to the port persistence time. If the port returns to normal boundaries before the port
persistence time elapses, Fabric Watch does not record any event.
To set the port persistence time, refer to “Setting the port persistence time” on page 62.
Port fencing
A port that is consistently unstable can harm the responsiveness and stability of the entire fabric
and diminish the ability of the management platform to control and monitor the switches within the
fabric. Port fencing is a Fabric Watch enhancement that takes the ports offline if the user-defined
thresholds are exceeded. Supported port types include physical ports, E_Ports, optical F_Ports
(FOP_Ports), copper F_Ports (FCU_Ports), and Virtual E_Ports (VE_Ports).
NOTE
Port fencing is not enabled by default. You must manually enable port fencing. Refer to “Port fencing
configuration” on page 63 for instructions.
When a port that has exceeded its user-defined thresholds is fenced by the software, the port is
placed into the disabled state and held offline. After a port is disabled, the user must manually
enable the port for frame traffic to resume on the port.
Switch policies
Switch policies are a series of rules that define specific health states for the overall switch. Fabric
OS interacts with Fabric Watch using these policies. Each rule defines the number of types of errors
that transitions the overall switch state into a state that is not healthy. For example, you can specify
a switch policy so that if a switch has two port failures, it is considered to be in a marginal state; if it
has four failures, it is in a down state.
You can define these rules for a number of classes and field replaceable units, including ports,
power supplies, and flash memory.
Refer to “Switch status policy planning” on page 75 for information on configuring switch policies.
Refer to Chapter 10, “Fabric Watch Reports,” for information on viewing the current switch policies
using the Switch Status Policy report.
You must first use the fwSetToCustom command to switch from default to custom settings, and
then use the advanced configuration options provided with the portThConfig, thConfig, and
sysMonitor commands to configure event behavior, actions, and time bases at the port level.
Use the advanced configuration option provided with the portThConfig, thConfig, and sysMonitor
commands to view and modify custom and default values for specified classes and areas in Fabric
Watch. You can customize the information reported by Fabric Watch by configuring event behavior
types, threshold values, time bases, and event settings. These area attributes are used to define
and detect events in Fabric Watch.
E-mail alert
An e-mail alert sends information about a switch event to a one or multiple specified e-mail
addresses. An e-mail alert can send information about any error from any element, area, and class
(only one e-mail recipient can be configured per class). The e-mail specifies the threshold and
describes the event, much like an error message. You can configure multiple e-mail recipients per
class using the fwMailCfg command. You must separate the e-mail addresses with a comma and
include the complete e-mail address. For example, [email protected] is a valid e-mail address; abc@12
is not.
For a recipient to receive the e-mail alert, you must configure one of the following settings:
• Use the dnsConfig command to configure DNS settings to connect the switch to a DNS server.
• In case a DNS server is not available, e-mail alerts can be forwarded through a relay host. You
can configure the relay host IP address using the fwMailCfg command.
Enabling e-mail alerts for the Changed threshold state in several areas can quickly result in a
significant amount of e-mail. Fabric Watch discards e-mail alerts when more than 100 are
generated within a minute, which minimizes memory use.
SNMP traps
In environments where you have a high number of messages coming from a variety of switches, you
might want to receive them in a single location and view them using a graphical user interface
(GUI). In this type of scenario, the Simple Network Management Protocol (SNMP) notifications
might be the most efficient notification method. You can avoid having to log in to each switch
individually as you would have to do for error log notifications.
SNMP performs an operation called a trap that notifies a management station using SNMP when
events occur. Log entries can also trigger SNMP traps if the SNMP agent is configured. When the
SNMP agent is configured to a specific error message level, error messages at that level trigger
SNMP traps.
NOTE
Audit messages are generated for port fencing configuration changes, whether port fencing is
enabled or disabled.
You can set up an external host to receive Audit messages so you can easily monitor unexpected
changes. For information on error messages generated by Fabric Watch, refer to the Fabric OS
Message Reference. For information on configuring an Audit Log, refer to the “Audit Log
Configuration” section of the Fabric OS Administrator’s Guide.
Data values
A data value represents a measured value or a state value, described as follows:
• Measured value is the current, measurable value of a fabric or fabric element, such as
environmental temperature.
• State value, which is the only qualitative data value, provides information on the overall state
of a fabric component. Instead of numerical data, state values contain information on whether
components are faulty, active, or in another state.
NOTE
Either measured values or state values can be used; mixed values are not supported.
Fabric Watch compares the measured values to a set of configurable limits to determine whether
fabric monitoring has occurred and whether to notify you. You must set appropriate threshold
boundaries to trigger an event.
State values are handled differently, as Fabric Watch monitors state values for certain states which
you can select. When a state value transitions to one of the monitored states, an event is triggered.
Time bases specify the time interval between two samples to be compared. You can set the time
base to day (samples are compared once a day), hour (samples are compared once an hour), or
minute (samples are compared every minute). Second samples are not advisable. This
configurable field affects the comparison of sensor-based data with user-defined threshold values.
Refer to “Time bases” on page 16 for more information.
In this chapter
• Threshold values . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
• Time bases. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
• Threshold triggers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
• Fabric Watch alarm behavior . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
Threshold values
High and low threshold values are the values at which potential problems might occur. For
example, in configuring a temperature threshold, you can select the temperatures at which a
potential problem can occur because of overheating or freezing.
With Fabric Watch v6.4 and later, the concept of high and low thresholds have been expanded to
include four threshold categories, listed below. There are two setting types (the threshold and the
above and below trigger) for four configuration possibilities. A combination of these settings can
cause the following actions to occur:
• Above high threshold—Fabric Watch takes this action when the current value is above the high
threshold.
• Below high threshold—Fabric Watch takes this action when the current value is between the
high and low threshold. This replaces the previous “in-between” action, which no longer exists.
• Above low threshold—This action is only applicable to port classes (physical port, FOP_Port,
FCU_Port, and VE_Port). Fabric Watch takes this action when the current value crosses the low
threshold towards the high threshold.
• Below low threshold—Fabric Watch takes this action when the current value is below the low
threshold.
NOTE
The above low threshold action applies only to the portThConfig command. It does not apply to the
thConfig and sysMonitor commands.
Threshold triggers
This section describes how Fabric Watch compares a fabric element’s data value against a
threshold value to determine whether or not to trigger an event. It describes how a specified buffer
zone affects event triggering.
For Fabric Watch to monitor data values for one of the following conditions, the alarm setting must
be set to a nonzero value.
Time bases
Time bases specify the time interval between two samples to be compared. You can set the time
base to day (samples are compared once a day), hour (samples are compared once an hour),
minute (samples are compared every minute). This configurable field affects the comparison of
sensor-based data with user-defined threshold values.
Figure 4 shows a sample graph of data obtained by Fabric Watch (the type of data is irrelevant to
the example). A high threshold of 2 is specified to trigger an event. A time base of minute is
defined. An event occurs only if the rate of change in the specific interval (one minute in this
example) is across the threshold boundary. It should be either higher than the high threshold limit
or lower than the low threshold limit.
As illustrated on the tenth sample, the counter value changes from 0 to 1; thus the calculated rate
of change is 1 per minute. At the thirteenth sample, the rate of change is 2 per minute. The rate of
change must be at least 3 per minute to exceed the event-triggering requirement of 2, which is met
on the eighteenth sample.
In this chapter
• Fabric Watch classes, areas, and elements . . . . . . . . . . . . . . . . . . . . . . . . . 19
Classes
Classes are wide groupings of similar fabric devices or fabric data. Table 1 on page 20 describes
the classes into which Fabric Watch groups all switch and fabric elements.
In some cases, classes are divided into subclasses. This additional level in the hierarchy increases
the flexibility of setting monitoring thresholds. You can use subclasses to add additional event
monitoring to fabric objects that meet the requirements of a subclass.
For example, ports connected to another switch can be monitored using both the Port class and
E_Port subclass. You can configure general port monitoring using the Port class and monitoring
specific to a type of port using the E_Port class. Ports connected to another switch can trigger
events based on either of these configurations. Ports that are not connected to another switch are
not affected by the additional monitoring configured into the E_Port class.
Class areas
While classes represent large groupings of information, areas represent the information that
Product Name monitors. For example, switch temperature, one of the values tracked by Fabric
Watch, is an area within the class Environment.
For detailed information about how to configure areas, including recommended threshold and
action settings for the classes listed in Table 1, refer to one of the following chapters:
• Chapter 6, “Fabric, Security, SFP, and Performance Monitoring”
Fabric class, Security class, SFP class, and Performance class areas and actions are
configured using the thConfig command.
Elements
Fabric Watch defines an element as any fabric or switch component that the software monitors.
Within each area, the number of elements is equivalent to the number of components being
monitored. For instance, on a 64-port switch, each area of the Port class includes 64 elements.
Each element contains information pertaining to the description suggested by the area. To
continue the Ports example, each element in the Invalid Transmission Words area of the Ports
class would contain exactly 64 ports, each of which would contain the number of times invalid
words had been received by the port over the last time interval. Each of these elements maps to an
index number, so that all elements can be identified in terms of class, area, and index number. As
an example, the monitoring of the temperature sensor with an index of 1 can be viewed by
accessing the first temperature sensor within the temperature area of the environment class.
Subclasses are a minor exception to the preceding mapping rule. Subclasses, such as E_Ports,
contain areas with elements equivalent to the number of valid entries. Within the same example
used thus far in this section, in a 64-port switch in which eight ports are connected to another
switch, each area within the E_Port class would contain eight elements.
Each area of a subclass with defined thresholds will act in addition to the settings applied to the
element through the parent class. Assignment of elements to subclasses does not need to be
performed by a network administrator. These assignments are seamlessly made through
automated detection algorithms.
Table 1 describes the classes into which Fabric Watch groups all switch and fabric elements.
TABLE 1 Fabric Watch classes
Class Description
Environment Includes information about the physical environment in which the switch resides
and the internal environment of the switch. For example, an Environment-class
alarm alerts you to problems or potential problems with temperature.
Configure the Environment class using the sysMonitor command.
Fabric Groups areas of potential problems arising between devices, including interswitch
link (ISL) details, zoning, and traffic. A Fabric-class alarm alerts you to problems or
potential problems with interconnectivity.
Configure the Fabric class using the thConfig command.
Field Replaceable Unit (FRU) Monitors the status of FRUs and provides an alert when a part replacement is
needed. This class monitors states, not thresholds.
Configure the FRU class using the fwFruCfg command.
Performance Monitor Serves as a tuning tool. The Performance Monitor class groups areas that track the
source and destination of traffic. Use the Performance Monitor class thresholds
and notifications to determine traffic load and flow and to reallocate resources
appropriately.
The Performance Monitor class is divided into the following areas: EE (end-to-end)
Performance Monitor, and Filter Performance Monitor.
Note: Performance Monitoring is not supported on VE_Ports, EX_Ports, and
VEX _Ports.
Configure the Performance class using the thConfig command.
Port Enables you to set additional thresholds specific to different types of ports.
The Port class is made up of the following sub-classes:
• E_Port class—Represents ports connected to another switch.
• FOP_Port class — Represents fabric or fabric loop ports that are made of
optical fiber.
• FCU_Port class — Represents fabric or fabric loop ports that are made of
copper.
• VE_Port — Represents a port that is similar to the E_Port but terminates at the
switch and does not propagate fabric services from one edge fabric to
another.
Configure the Port class using the portThConfig command.
Resource Manages your system’s memory or CPU usage.
Monitors flash memory. It calculates the amount of flash space consumed and
compares it to a defined threshold.
Configure the Resource class using the sysMonitor command.
Security Monitors all attempts to breach your SAN security, helping you fine-tune your
security measures.
Configure the Security class using the thConfig command.
SFP Groups areas that monitor the physical aspects of SFPs. An SFP class alarm alerts
you to an SFP malfunction fault. SFP performance monitoring is not supported on
VE_Ports.
Note: SFPs connected to any GbE ports are not monitored.
Configure the SFP class using the thConfig command.
In this chapter
• Interfaces for activating Fabric Watch . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
• Activating Fabric Watch using a Telnet session. . . . . . . . . . . . . . . . . . . . . . . 23
• Activating Fabric Watch using SNMP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
• Activating Fabric Watch using Web Tools. . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
After you enter this command, respond to the prompts for a user name and password.
3. Enter the licenseShow command to determine if the Fabric Watch license is installed.
switch:admin> licenseshow
edzbzQStu4ecS:
Fabric Watch license
Performance Monitor license
Trunking license
Full Ports on Demand license - additional 16 port upgrade license
If the Fabric Watch license is not listed, continue to step 4; otherwise, you are ready to use
Fabric Watch.
4. Enter the license key with the licenseAdd key command, where key is the Fabric Watch license
key. License keys are case-sensitive, so type the license key exactly as it appears.
switch:admin> licenseadd "R9cQ9RcbddUAdRAX"
5. Enter the licenseShow command to verify successful activation. If the license is not listed,
verify that you typed the key correctly; if you did not, then repeat step 4.
If you still do not see the license, verify that the entered key is valid, and that the license key is
correct before repeating step 4.
6. Enter the fwClassinit command to initialize the Fabric Watch classes.
NOTE
The following instructions apply to the AdvantNet MIB browser. There may be some variation in the
procedures when other MIB browsers are used.
In Figure 5, the MIB browser populated the left side of the window with a MIB tree that you can
navigate.
3. Open Web Tools and select Tasks > Manage > Switch Admin.
4. Click Show Advanced Mode.
5. On the SNMP tab, enter the IP address of the trap receiver and the severity level, and click
Apply.
NOTE
The severity level must be informational (4) in order to forward threshold alerts.
6. Start a Telnet session, and enter the snmpConfig --set mibcapability command at the prompt to
set the SNMP MIB capability.
switch:admin> snmpConfig --set mibcapability
The SNMP Mib/Trap Capability has been set to support
FE-MIB
SW-MIB
FA-MIB
SW-TRAP
FA-TRAP
FA-MIB (yes, y, no, n): [yes]
FICON-MIB (yes, y, no, n): [no]
HA-MIB (yes, y, no, n): [no]
SW-TRAP (yes, y, no, n): [yes] yes
swFCPortScn (yes, y, no, n): [no]
swEventTrap (yes, y, no, n): [no]
swFabricWatchTrap (yes, y, no, n): [no] yes
swTrackChangesTrap (yes, y, no, n): [no]
7. Enter the snmpConfig command to configure the SNMP management host IP address.
switch:admin> snmpConfig
8. Enter the IP address for the switch in the Host field in the MIB browser. Enter the community
string in the Community field. To perform set operations, enter the write community in the
Write Community field.
9. View and listen for trap details from a MIB browser menu.
NOTE
Any changes related to Fabric Watch, such as changing the status of the temperature sensor, will
generate traps.
10. Expand the tree on the left to find the Fabric Watch OID information. To find the OID, navigate
the following hierarchy: SW-MIB. bcsi. commDev; fibrechannel, fcSwitch, sw, swFWSystem.
Fabric Watch displays a window similar to the one shown in Figure 6.
11. Obtain the specific identifier for the element that will be modified. To get the identifier, click the
swFwThresholdTable and swFwThresholdEntry directory, and run a get operation on
swFwName. A list of elements appears in which each element is preceded by an identifier.
Remember the numeric portion of the identifier, which appears before the “==>” symbol. You
can scroll through the list to find the numeric identifier for the element in which you are
interested.
For detailed descriptions of the SNMP fields in both Telnet and Web Tools, refer to the Fabric
OS MIB Reference.
2. Press Enter.
A browser window opens to open Web Tools. A Login dialog box opens.
3. Enter your user name and password.
4. Select a switch from the Fabric Tree and log in if necessary.
5. Select Tasks > Manage > Fabric Watch.
For information about how to configure Fabric Watch using Web Tools, refer to Chapter 9, “Fabric
Watch Configuration Using Web Tools”.
In this chapter
• Fabric Watch configuration tasks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
• Setting Fabric Watch custom and default values . . . . . . . . . . . . . . . . . . . . . 30
• E-mail notification configuration. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
• Notification configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
Initialize all Fabric Watch classes. fwClassInit “Activating Fabric Watch using a
Telnet session” on page 23.
Set the boundary and alarm level to custom or fwSetToCustom “Setting Fabric Watch custom and
default. fwSetToDefault default values” on page 30.
Note: These commands reset all thresholds
for all classes and cannot be configured on
individual ports.
Configure Fabric Watch e-mail alerts for all fwMailCfg “E-mail notification configuration” on
classes. page 31.
Configure and show alarms filtering for Fabric fwAlarmsFilterSet “Configuring alarm notifications” on
Watch for all classes. fwAlarmsFilterShow page 34.
Set the following parameters for SFP, Fabric, thConfig Chapter 6, “Fabric, Security, SFP, and
Security, and Performance monitoring: Performance Monitoring”.
• Class
• Area type
• Time base
• Threshold level
• Trigger (boundary level)
• Action (notification type)
• Buffer
Enable or disable monitoring for the 10 Gbps thMonitor Chapter 6, “Fabric, Security, SFP, and
and 16 Gbps SFPs and QSFPs. Performance Monitoring”.
Set the following parameters for port portThConfig Chapter 7, “Port Monitoring”.
monitoring: portFencing
• Port type
• Area type
• Time base
• Threshold level
• Trigger (boundary level)
• Action (notification type)
• Buffer
• Port fencing
Set the port persistence time. fwSet --port -persistence “Setting the port persistence time” on
page 62.
Configure port fencing. portFencing “Port fencing” on page 62.
Set the following parameters for system sysMonitor “System monitoring using the
monitoring: sysMonitor command” on page 71.
• Class
• Area type
• Threshold level
• Trigger (boundary level)
• Action (notification type)
• Buffer
Set and display the switch status policy switchStatusPolicySet Chapter 8, “System Monitoring”.
parameters. switchStatusPolicyShow
Show the overall switch status. switchStatusShow Chapter 8, “System Monitoring”.
Configure FRU state and notifications, and fwFruCfg Chapter 8, “System Monitoring”.
monitor power supply, fan, and SFP FRUs.
Display fan status. fanShow Chapter 8, “System Monitoring”.
Show sensor readings. sensorShow Chapter 8, “System Monitoring”.
Show switch temperature readings. tempShow Chapter 8, “System Monitoring”.
Create a detailed port report. fwPortDetailShow “Generating a Port Detail report” on
page 92.
Show the availability of monitor information. fwSamShow “Switch Availability Monitor report” on
page 90.
The Config Show menu lists each class for which you can provide a separate e-mail address.
2. Enter the number corresponding to the class for which the e-mail configuration should be
displayed.
If the class does not have an e-mail configuration (there is no e-mail address assigned to the
class), the following error message displays:
Mail configuration for class Environment is not done.
Email Alert is not enabled!
NOTE
To ensure that the mail server address and domain name are configured correctly, use the
dnsConfig command. For more details, refer to the Fabric OS Command Reference.
If the e-mail configuration for the class is complete, the following confirmation message
displays:
Email has been sent
If the e-mail configuration for the class is not complete, the following error message displays:
Email has not been sent.
Check Mail configuration for Environment class!
The e-mail address specified in the mail configuration receives a test e-mail message.
The system returns to the fwMailCfg menu.
3. Enter the e-mail address of the person responsible for the specific class of alerts.
Fabric Watch uses the default value, located between the brackets in the prompt, as the
current e-mail address for the class. A value of NONE indicates that no e-mail address has
been provided.
The system displays a confirmation message and returns to the fwMailCfg menu.
Notification configuration
Notifications act as a signal or alert that notifies you when a threshold has been crossed.
When you use alarm notifications, error messages are sent to designated locations such as an
error log, SNMP trap view, or e-mail. With an error log, you can log in to a particular switch to view
the error messages that have been captured for that particular switch. You can parse the log file to
make error message searches quicker and easier.
In this chapter
• Fabric monitoring guidelines and default settings . . . . . . . . . . . . . . . . . . . . 35
• Security monitoring guidelines and default settings. . . . . . . . . . . . . . . . . . . 37
• SFP monitoring guidelines and default settings . . . . . . . . . . . . . . . . . . . . . . 39
• Performance monitoring guidelines and default settings. . . . . . . . . . . . . . . 41
• thConfig command . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
• Recommended settings for Fabric, SFP, Performance, and Security classes 47
Domain ID changes Monitors forced domain ID changes. Forced domain ID changes occur when there is a
(DC) conflict of domain IDs in a single fabric and the principal switch must assign another
domain ID to a switch.
Fabric logins (FLOGI) Activates when ports and devices initialize with the fabric.
Fabric reconfigurations Tracks the number of reconfigurations of the fabric. Fabric reconfiguration occurs when:
(FC) • Two fabrics with the same domain ID are connected.
• Two fabrics are joined.
• An E_Port or VE_Port goes offline.
• A principal link segments from the fabric.
E_Port downs (ED) Tracks the number of times that an E_Port or VE_Port goes down. E_Ports and VE_Ports
go down each time you remove a cable or an SFP (where there are SFP failures or
transient errors).
Segmentation changes Tracks the cumulative number of segmentation changes. Segmentation changes occur
(SC) because of one of the following:
• Zone conflicts.
• Incompatible link parameters. During E_Port and VE_Port initialization, ports
exchange link parameters, and incompatible parameters result in segmentation.
This is a rare event.
• Domain conflicts.
• Segmentation of the principal link between two switches.
Zone changes (ZC) Tracks the number of zone changes. Because zoning is a security provision, frequent
zone changes might indicate a security breach or weakness. Zone change messages
occur whenever there is a change in zone configurations.
When a port goes offline, the RXP and TXP area values of the SFP become zero. Brocade
recommends non-zero low thresholds for RXP and TXP; therefore, Fabric Watch stops monitoring
RXP and TXP parameters of the SFP once the port goes offline.
NOTE
SFPs connected to GbE ports are not monitored.
Temperature Measures the physical temperature of the SFP, in degrees Celsius. A high temperature
indicates that the SFP might be in danger of damage.
Receive power Measures the amount of incoming laser, in µwatts, to help determine if the SFP is in good
(RXP) working condition. If the counter often exceeds the threshold, the SFP is deteriorating.
Transmit power Measures the amount of outgoing laser, in µwatts. Use this to determine the condition of the
(TXP) SFP. If the counter often exceeds the threshold, the SFP is deteriorating.
Current Measures the amount of supplied current to the SFP transceiver. Current area events
indicate hardware failures.
Voltage Measures the amount of voltage supplied to the SFP. If this value exceeds the threshold, the
SFP is deteriorating.
NOTE
Performance Monitoring is not supported on VE_Ports.
RXP The percentage of word frames traveling from the configured S_ID to the D_ID exceeds
(EE performance monitor) the configured thresholds.
TXP The percentage of word frames traveling from the configured D_ID to the S_ID; user
(EE performance monitor) configuration triggers these messages, so you can use the Transmit Performance area
to tune your network.
Table 11 provides default settings for areas in the End-to-End Performance Monitor class.
thConfig command
You can use the thConfig command to customize event monitoring thresholds for the Fabric,
Security, SFP, and Performance classes, or to display the configuration. It is recommended,
however, that you use the default settings for these classes.
If configured areas exceed the currently-effective threshold settings, the Fabric Watch daemon can
take one of the following actions:
• Send an SNMP alarm.
• Log a RASlog message.
• Send an e-mail alert.
Table 12 lists the configuration options for thConfig command. For complete information about
using the thConfig command, refer to the Fabric OS Command Reference.
Fabric ED - Number of E_Ports Default or Default or It is recommended that you leave the
down Custom1 Custom2 entire Fabric class in its default state
FC - Fabric reconfiguration (no alerts).
DC - Domain ID changes Refer to “Fabric monitoring setting
SC - Segmentation changes guidelines” on page 36 for more
ZC - Zone changes information.
FL - Fabric logins
Security TV - Telnet violations Default or Default or Use the Security class default settings
HV - HTTP violations Custom1 Custom2 for areas and alarm configuration.
SV - Serial violations There is no reason to alter the default
DV - DCC violations settings.
IC - Invalid certifications
LV - Login violations
TS - TS out-of-sync
FF - SLAP failures
NF - No FCS
ISB - Incompatible security
IV - Illegal command
SFP TXP - Transmit areas Default or Default or Use the SFP default settings. The
RXP - Receive areas Custom1 Custom2 traits are SFP-specific and there is no
Current reason to alter them. Refer to “SFP
Voltage monitoring default settings” on
Temperature page 40 for more information.
PWROnHours
Filter CUSTDEF Default or Default or It is recommended that you use the
Custom1 Custom2 Filter default settings.
EE (End-to-end RXF - Receive areas Default or Default or It is recommended that you leave the
performance) TXP - Transmit areas Custom1 Custom2 entire Performance Monitor Class and
End-to-End Performance Monitor
Class area settings in their default
state (no alerts).
NOTE
Both the APM license and the Fabric Watch license must be installed on the platform configured in
AG mode to use the frame monitoring and EE monitoring capabilities. The APM license provides the
counters and the Fabric Watch license provides the monitoring and alert mechanisms for these
counters.
To set the high threshold of the RX area, enter the thConfig command using the following
parameters.
switch:admin> thconfig --set ee -area RX -timebase minute -high -val 12
NOTE
You cannot specify all for all classes but you can specify all for all areas.
NOTE
The Security and Fabric classes do not have a port or index value. For those classes, a value of 0 is
assumed.
The PERFPT area provides threshold values to the user-defined frame type. When a new
user-defined frame type is created using the fmMonitor command, the threshold value is
automatically based on the PERFPT configuration at the time the frame type is created. In the
above example, the high threshold value is 10. Therefore, all frame monitors configured hereafter
will have a high threshold value of 10.
• TXP
• Power on Hours (Power on Hours is not supported on the 10 Gbps SFP or the QSFP.)
Fabric Watch also monitors the Brocade Quad SFP (QSFP) and, as with the 16 Gbps SFP, if
configured thresholds are crossed, Fabric Watch generates an SNMP alarm, a RASlog message,
and an e-mail alert for the following SFP areas:
• Current
• Voltage
• Temperature
• RXP
NOTE
On core blades, only the 16 Gbps QSFPs can be installed.
Fabric Watch monitors QSFPs in each logical switch. Temperature and voltage are monitored on the
first available port of the QSFP unit to minimize the display of redundant information. However, if
individual ports of a QSFP belong to different logical switches, then there will be separate action
notifications for each logical switch.
To disable or stop the monitoring of the SFP and QSFP, enter the thMonitor command using the
following parameter:
switch:admin> thmonitor --disable brcdSfp
To show the monitoring status of the SFP and QSFP, enter the thMonitor command using the
following parameter:
switch:admin> thmonitor --show brcdSfp
SFP types for the 10 Gbps SFPs and 16 Gbps SFPs and QSFPs are listed in Table 13.
Fabric Watch monitors the current, voltage, receiver (RXP) of the SFP, and transmitter (TXP) and if
any of these parameters crosses the low or high thresholds, the state of the SFP is yellow;
otherwise, the state is green. The SFP can also be in one of the following states:
• Paused—Health monitoring is not enabled.
• No license—The switch does not have the Fabric Watch license.
• Unknown—Fabric Watch cannot determine the state of the SFP.
TABLE 14 Recommended settings for Fabric, SFP, Performance, and Security classes
E=Error_Log, S=SNMP_Trap,
P=Port_LOG_LOCK,
M=EMAIL_ALERT, F=Port Fence
Trait Configuration
High Thresh
Low Thresh
Time Base
Custom
Custom
Default
Default
Above
Below
Buffer
Notes
Class
Area
Unit
Fabric E_Port downs X Downs None 0 0 0 X
Fabric reconfig X Reconfigs None 0 0 0 X
Domain ID changes X DID changes None 0 0 0 X
Segmentation X Segmentations None 0 0 0 X
Zone changes X Zone changes None 0 0 0 X
Fabric logins X Logins None 0 0 0 X
SFP Temperature X C None -10 85 3 X E E
RX power (RXP) X uWatts None 0 5000 25 X E E
TX power (TXP) X uWatts None 0 5000 25 X E E
Current X mA None 0 50 1 X E E
Voltage X uV None 2970 3630 10 X E E
PWR on Hours X Changes None 0 0 0 X
End-to-End RX performance X KB/s None 0 0 0 X
Performance
TX performance X KB/s None 0 0 0 X
Filter-based Custom filter counter X Frames None 0 0 0 X
Performance
Security Telnet violations X Violations Minute 1 2 0 X E,S
HTTP violations X Violations Minute 1 2 0 X E,S
SCC violations X Violations Minute 1 2 0 X E,S
DCC violations X Violations Minute 1 4 0 X E,S
Login violations X Violations Minute 1 2 0 X E,S
SLAP failures X Violations Minute 1 2 0 X E,S
TS out-of-sync X Violations Minute 1 2 0 X E,S
No FCS X Violations Minute 1 2 0 X E,S
Incompatible security DB X Violations Minute 1 2 0 X E,S
Illegal commands X Violations Minute 1 2 0 X E,S
Port Monitoring 7
In this chapter
• Port class areas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
• Port class guidelines and default settings. . . . . . . . . . . . . . . . . . . . . . . . . . . 50
• Port configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
• portThConfig command procedures. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
• Port fencing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
• Recommended port configuration settings . . . . . . . . . . . . . . . . . . . . . . . . . . 65
NOTE
Fabric Watch monitors and reports the status of physical and virtual FC ports. Physical GbE ports
and iSCSI ports are not monitored and are not included in the Port Class area.
Cyclic redundancy check The number of times an invalid cyclic redundancy check error occurs on a port or
(CRC) a frame that computes to an invalid CRC. Invalid CRCs can represent noise on the
network. Such frames are recoverable by retransmission. Invalid CRCs can
indicate a potential hardware problem.
Invalid transmission words The number of times an invalid transmission word error occurs on a port. A word
(ITW) did not transmit successfully, resulting in encoding errors. Invalid word messages
usually indicate a hardware problem.
NOTE: For Fabric OS versions 7.1.0 and later, the ITW counter includes a physical
coding sublayer (PCS) violation. ITW violations can occur due to an ITW
violation, a PCS violation, or both.
Class 3 discards (C3TX_TO) The number of Class 3 discards frames because of time-outs.
Link loss (LOS) The number of times a link failure occurs on a port or sends or receives NOS.
Both physical and hardware problems can cause link failures. Link failures also
frequently occur due to a loss of synchronization or a loss of signal.
Signal loss The number of times that a signal loss occurs in a port. Signal loss indicates that
no data is moving through the port. A loss of signal usually indicates a hardware
problem.
Sync loss The number of times a synchronization error occurs on the port. Two devices
failed to communicate at the same speed. Synchronization errors are always
accompanied by a link failure. Loss of synchronization errors frequently occur due
to a faulty SFP or cable.
Packet loss (VE_Port only) The number of packets routed through a port exceeds the port bandwidth.
Protocol errors (PE) The number of times a protocol error occurs on a port. Invalid state due to LRR on
an online link. Occasionally these errors occur due to software glitches. Persistent
errors occur due to hardware problems.
Received packets (RXP) The percentage of maximum bandwidth consumed in packet receipts.
State changes (ST) The state of the port has changed for one of the following reasons:
(Port and VE_Port) • The port has gone offline.
• The port has come online.
• The port is faulty.
Transmitted packets (TXP) The percentage of maximum bandwidth consumed in packet transmissions.
Trunk utilization The percent of utilization for the trunk at the time of the last poll.
(E_Port, FCU_Port, and
FOP_Port)
Utilization (VE_Port only) The percent of utilization for the trunk at the time of the last poll.
Link reset The ports on which the number of link resets exceed the specified threshold
value.
NOTE
Only the Packet loss, State changes, and Utilization areas are supported on the VE_Port.
NOTE
E_Ports and VE_Ports are not supported in Access Gateway mode.
Port configuration
Use the portThConfig command to configure thresholds for Fabric Watch event monitoring for all
ports of a specified type and to display the configuration and current port status in real time. The
command syntax is detailed in the Fabric OS Command Reference.
Before you configure thresholds, you must first identify and select the appropriate class and areas,
which are described in “Port class areas” on page 49.
NOTE
The FCU_Port, supported on Fabric Watch version 6.4.0 and later, is applicable to copper ports.
The portThConfig command follows a transaction model. When you configure thresholds and
actions with the --set option, the changes are saved persistently to non-volatile storage, but the
changes do not become effective until you execute portThConfig --apply. The --apply option allows
you to toggle between default settings and your own saved custom configuration and to apply
actions and thresholds separately. You may choose to use default thresholds together with a
customized subset of available actions, or you may modify some of the thresholds and use the
default actions. Use the -nosave option to save the configuration non-persistently, and use --cancel
to remove a non-persistent configuration.
NOTE
The execution of this command is subject to Virtual Fabrics or Admin Domain restrictions that may
be in place. Refer to the Fabric OS Command Reference for more information and for details about
the portThConfig command.
CAUTION
When you use --set with the --nosave option and the switch reboots, your changes are lost.
• In this example, the alarms are set at two points: a low threshold of 1 and a high threshold
of 100 (the default is 1000). The goal is to be notified as the number of invalid CRCs per
minute rises above the low boundary and again when it rises above the high boundary.
• Triggers specify actions for below the high threshold. Here, the trigger for both is above.
• Set the action to take when a trigger occurs. Here, for the high threshold, log the event in
the RASlog, send an e-mail message, and issue an SNMP trap. For the low threshold, only
log the event in the RASlog.
• Set the buffer setting to 0 (the default is 100). Note that if you do not specify the buffer
value, Fabric Watch automatically recalculates the buffer.
• Apply the new custom settings so they become effective.
2. Apply the new custom settings so they become effective.
switch:admin> portthconfig --apply port -area crc -action cust -thresh_level
custom
3. To display the port threshold configuration for the Port class and all areas, use the following
command.
switch:admin> portthconfig --show port
NOTE
For Fabric OS versions 7.1.0 and later, the ITW counter includes a physical coding sublayer (PCS)
violation. ITW violations can occur due to an ITW violation, a PCS violation, or both.
• In this example, the alarms are set at two points: a high threshold of 40 and a low
threshold of 25. The goal is to be notified as the number of invalid transmission words per
minute rises above the low boundary and again when it rises above the high boundary.
• Triggers specify actions for in-range port behavior. Here, the trigger for both is above.
• Set the action to take when a trigger occurs. Here, for the low threshold, only log the event
in the RASlog. For the high threshold, log the event in the RASlog and issue an SNMP trap.
• Set the buffer to 0 (the default is 100).
2. Apply the new custom settings so they become effective.
switch:admin> portthconfig --apply port -area itw -action cust -thresh_level
custom
3. To display the port threshold configuration for the Port class and all areas, use the following
command.
switch:admin> portthconfig --show port
To pause the monitoring of a class, area, and port or index, enter the portThConfig command using
the following parameters.
NOTE
You cannot specify all for all classes, but you can specify all for all areas.
NOTE
If you are using a Brocade 48000 or DCX Backbone with an FR4-18i blade or the Brocade 7500, the
E_Port class monitors the following additional ports and creates monitors for each of the logical
ports: FCR ports (includes EX_Ports); FCIP (includes VE_Ports and VEX_Ports). In these
configurations, state changes are applicable for all ports and utilization and packet loss are
applicable to VE_Ports only.
NOTE
The settings in these subclasses include settings for the host bus adapter (HBA) ports as well as the
storage ports.
NOTE
With the increased use of virtual environments, alerts from device ports are increasing more
than ever in the past. This provides a good gauge as to the overall bandwidth requirement
changes and utilization and could indicate that additional ISL trunks are required.
Transmitted packets Monitors the transmit rate, Unit: Percentage (%) Below: 0 Informative
(TXP) by percentage. Time Base: minute Above: 0 Informative
Low: 0
High: 100
Buffer: 0
Trunk utilization The percent of utilization Unit: Percentage (%) Below: 0 Informative
(E_Port, FCU_Port, for the trunk at the time of Time Base: minute Above: 0 Informative
and FOP_Port) the last poll. Low: 0
High: 100
Buffer: 0
Link reset Monitors the number of Unit: Errors Below: 0 Informative
link resets sent by a given Time Base: minute Above: 0 Out_of_range
port (LR-Out) and received Low: 0
on a given port (LR-In). High: 500
Buffer: 50
NOTE
Only a subset of areas, shown in Table 19, can be configured for the VE_Port class. When setting
VE_Port thresholds for the Packet Loss area, the threshold value accepts up to two decimal points;
for example: -value 0.60, as shown in “Packet loss monitoring enhancements on the VE_Port”.
Port fencing
Port fencing monitors ports for erratic behavior and disables a port if specified error conditions are
met. You can customize the thresholds and configure the ports to report errors for one or more
areas using the portThConfig command. After the ports are configured, you can enable port fencing
for specific areas of the physical ports, E_Ports, FOP_Ports, and FCU_Ports using the portFencing
command. Port fencing is not supported on VE_Ports.
Table 20 shows the areas that support port fencing for the different physical port class and E_Port,
FOP_Port, and FCU subclasses. Port fencing is not supported for the Loss of Sync (LOS) and Link
Failure (LF) areas.
NOTE
Port fencing configuration on the VE_Port is not supported on the Brocade DCX backbone blade.
NOTE
The execution of the portFencing command is subject to Virtual Fabrics (VF) or Admin Domain
restrictions that may be in place. For example, in non-VF chassis environments, the state change
counter of a trunked slave port gets incremented by more than 1 when the master EX_Port changes
its state. Therefore, it is advisable to set the port fencing high threshold for the State Change area
to a value greater than 4 in this environment.
The allowed threshold configuration settings are displayed on a per-class basis. FOP_Port class
thresholds apply to the entire switch. You can set different thresholds for Storage and Host
FOP_Ports if they are on different switches, based on the fabric configuration.
NOTE
For Fabric OS versions 7.1.0 and later, the ITW counter includes a physical coding sublayer (PCS)
violation. ITW violations can occur due to an ITW violation, a PCS violation, or both.
When establishing thresholds for CRC errors and Invalid Words, consider the following:
• In general, “cleaner” interconnects can have lower thresholds as they should be less likely to
introduce errors on the links.
Trait Configuration
High Thresh
Low Thresh
Time Base
Custom
Custom
Default
Default
Above
Below
Buffer
Notes
Class
Area
Unit
Port Link Loss X Errors Minute 0 500 50 X
Sync Loss X Errors Minute 0 500 50 X
Signal Loss X Errors Minute 0 5 0 X
Protocol Error X Errors Minute 0 5 0 X
Invalid Words X Errors Minute 0 25 0 X E
Invalid CRCs X Errors Minute 0 5 0 X E
RX Performance X Percentage Minute 0 100 0 X
TX Performance X Percentage Minute 0 100 0 X
State Changes X Changes Minute 0 50 0 X
Link Reset X Errors Minute 0 500 50 X
C3 Discard X Errors Minute 0 5 0 X
E_Port Link Loss X Errors Minute 0 0 0 X
Sync Loss X Errors Minute 0 45 0 X E,S
Signal Loss X Errors Minute 0 45 0 X E,S
Protocol Error X Errors Minute 0 5 0 X
Invalid Words X Errors Minute 0 40 0 X E,S,F pf
Invalid CRCs X Errors Minute 0 20 0 X E,S,F pf
RX Performance X Percentage Minute 0 75 0 X E E
TX Performance X Percentage Minute 0 75 0 X E E
State Changes X Changes Minute 0 50 0 X
(E/VE_Port)
Link Reset X Errors Minute 0 500 50 X
Utilization (VE_Port) X Percentage Minute 0 100 0 X
Packet Loss (VE_Port) X Errors Minute 0 10 0 X
C3 Discard X Errors Minute 0 5 0 X E
Trunk Util X Percentage Minute 0 75 0 X E
Trait Configuration
High Thresh
Low Thresh
Time Base
Custom
Custom
Default
Default
Buffer
Below
Above
Notes
Class
Area
Unit
FOP_Port and Link Loss X Errors Minute 0 15 0 X E,S
FCU_Port
Sync Loss X Errors Minute 0 45 0 X E,S
System Monitoring 8
In this chapter
• Environment monitoring . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
• Resource class settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
• System monitoring using the sysMonitor command . . . . . . . . . . . . . . . . . . . 71
• Recommended environment and resource monitoring settings . . . . . . . . . 74
• Switch monitoring . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
• FRU monitoring . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
Environment monitoring
The Environment class provides information about the internal temperature of the switch. You can
configure the Environment class using the sysMonitor command.
Temperature Refers to the ambient temperature inside the switch, in degrees Celsius. Temperature sensors
monitor the switch in case the temperature rises to levels at which damage to the switch might
occur.
NOTE
Event Manager (EM) now manages fan monitoring; the switch status is calculated based on fan
status reported by EM. You can use the fanShow command to view the fan status.
NOTE
Fabric Watch no longer supports fan monitoring. Event Manager (EM) now manages fan monitoring
and the switch status is calculated based on the fan status reported by EM.
Below: 3 Out-of-range
Above: 3 Out-of-range
70 53-1002998-01
Fabric Watch Administrator’s Guide
System monitoring using the sysMonitor command 8
Flash Monitors the compact flash space available by calculating the percentage of flash space
consumed and comparing it with the configured high threshold value.
NOTE
Spikes in memory and CPU utilization are normal during the firmware download process and you
may see threshold warning messages while the process is running. After the firmware download
process has completed, memory and CPU utilization should return to normal.
System monitoring is disabled by default. You must run both the --config -mem and the --config -cpu
commands to enable both memory and CPU system monitoring.
CAUTION
When you use --config with the --nosave option and the switch reboots, your changes are lost.
NOTE
You cannot specify all for all classes, but you can specify all for all areas.
72 53-1002998-01
Fabric Watch Administrator’s Guide
Examples of the sysMonitor command 8
Trait Configuration
High Thresh
Low Thresh
Time Base
Custom
Custom
Default
Default
Above
Below
Buffer
Notes
Class
Area
Unit
74 53-1002998-01
Fabric Watch Administrator’s Guide
Switch monitoring 8
Switch monitoring
Before entering the switchStatusPolicySet command, plan your switch status policy. Determine
your system requirements and the factors that affect its monitors.
NOTE
Based on the configuration of the core blade component of the switch status policy, Fabric Watch
generates two RASlogs when a core blade is removed either on the Brocade DCX or the Brocade
DCX-4S. For example, if the Down and Marginal configuration is 0 and 1 on the DCX, upon removal
of the first core blade, Fabric Watch generates one RASlog for the switch status policy and the other
RASlog for the error itself.
NOTE
The default setting for a MARGINAL state is 0, which prevents Fabric Watch from generating
notifications due to missing power supplies. For configurations with a 2+2 power supply (PS)
combination, it is recommended that you change the default Fabric Watch default setting of 0 to 2
power supplies, which forces the overall switch status to a MARGINAL state.
Power Supplies Power supply thresholds detect absent or failed power supplies, and power supplies that are
not in the correct slot for redundancy.
Temperatures Temperature thresholds, faulty temperature sensors.
Fans Fan thresholds, faulty fans.
WWN Faulty WWN card (applies to modular switches).
CP Switch does not have a redundant CP (applies to modular switches).
Blades Faulty blades (applies to modular switches).
Core Blade Faulty core blades.
Flash Flash thresholds.
Marginal Ports1 Port, E_Port, FOP_Port (optical), and FCU_Port (copper) port thresholds. Whenever these
thresholds are persistently high, the port is Marginal.
Faulty Ports1 Hardware-related port faults.
Missing SFPs1 Ports that are missing SFP media.
Error Ports1 Ports with errors.
1. Marginal ports, faulty ports, error ports, and missing SFPs are calculated as a percentage of the physical ports
(excluding FCoE and VE_Ports).
NOTE
The presence of four or more FS8-18 encryption blades in the DCX Data Center Backbone causes
the Fabric Watch switch status policy for power supplies to assume a policy setting of 2,1.
Switch status policies are saved in a nonvolatile memory, and therefore are persistent until
changed.
NOTE
FCoE and VE_Ports are not considered in marginal port or faulty port calculations.
• Power Supplies—The power supply thresholds detect absent or failed power supplies.
76 53-1002998-01
Fabric Watch Administrator’s Guide
FRU monitoring 8
FRU monitoring
Supported FRU areas depend on the type of Brocade switch. For the following switches, the slot
and WWN areas are not supported:
• Brocade 300, 5100, and 5300 switches
• Brocade DCX and DCX-4S Data Center Backbone
• Brocade Encryption Switch
Configuring FRUs
The configuration of field-replaceable units (FRUs) is an exception to the procedures described
thus far in this chapter. FRUs are monitored using state values, as opposed to the quantitative
values used to monitor the rest of the fabric. As a result of the qualitative nature of this monitoring,
the concept of thresholds does not apply.
NOTE
The Off state is applicable only to fans on some platforms, such as the Brocade DCX and Brocade
DCX-4S. The Off state is not applicable to the power supply, slot, or WWN FRUs.
Note that the value 0 for a parameter means that it is NOT used in the
calculation
78 53-1002998-01
Fabric Watch Administrator’s Guide
FRU monitoring 8
For example, to trigger events using the Absent, Off, and Faulty states, add the assigned values and
enter that value at the prompt. In this case, the values are 1, 8, and 16, respectively, and the total
is 25.
FRU Slot X X
Power Supply X X
Fan X X
WWN X X
80 53-1002998-01
Fabric Watch Administrator’s Guide
Chapter
In this chapter
• Using Web Tools to configure Fabric Watch. . . . . . . . . . . . . . . . . . . . . . . . . . 81
NOTE
Unless the switch is a member of the current Admin Domain context, Fabric Watch is view-only.
The Fabric Watch Explorer pane, on the left side of the window, displays the available classes. Not
all classes are available for all switches. The status bar at the bottom of the window provides you
with a summary of recent actions, and the date and time the module was last updated.
NOTE
Use the procedures in this section to configure threshold traits for all classes except for the FRU
class. Use the procedure described in “Configuring alarms for FRUs using Web Tools” on page 85 for
the FRU class.
NOTE
When you are configuring the VE-Port > Packet Loss area thresholds, the packet loss threshold
values are a percentage. You can configure from 0.01 percent (low boundary) to 100.00 percent
(high boundary).
NOTE
The module displays two tables of alarm configuration information, labeled System Default
and Custom Defined. You cannot modify the information in the System Default table.
• SNMP_TRAP
• PORT_LOG_LOCK
• EMAIL_ALERT
7. Click Apply.
If a FRU of the selected type is determined that it is one of the selected states, an event will
occur.
6. Select the methods by which you want to be notified about the FRU alarms.
For FRUs, the only options are error log and e-mail alert.
7. Click Apply to apply the changes to the switch.
A confirmation dialog box displays, asking if you want to apply the changes to the switch.
8. Click OK to save the changes to the switch.
Displaying alarms
Using the Alarm Notification tab, you can view a list of all alarms that occurred for a selected class
or area (Figure 7 on page 82). Table 31 describes the columns in this report. You can click the
header of each column to change the way the information is sorted in your view. You can also
right-click the column header and select sort options from a menu.
NOTE
For the FRU class, only the Name, State, and Time columns are displayed. In addition, if the FRU area
is Fan, the Name column refers to either a fan or a fan FRU, depending on the switch model.
5. In the Domain Name field, enter the domain name (between 4 and 32 characters).
6. Click Apply.
NOTE
You must execute the fwalfilterset 1 command to enable e-mail notification. Refer to the Fabric OS
Command Reference.
In this chapter
• Fabric Watch reports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
• Switch Availability Monitor report . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
• Switch Health report . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
• Switch Status Policy report . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
• Port Detail report . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
You can generate the following types of reports using Fabric Watch:
• Switch Availability Monitor report
• Switch Health report
• Switch Status Policy report
• Port Detail report
NOTE
SAM report details do not display the health status of GbE ports. Fabric Watch only monitors and
reports the status for physical and virtual FC ports.
You can run reporting commands in Fabric Watch to get instant access to switch information.
Although the switchShow command provides basic switch information, the Fabric Watch reports
provide detailed information, which enables you to track marginal or faulty ports that can affect
throughput or switch performance.
You can generate reports from the command line using a Telnet session or by using Web Tools. The
examples in this chapter use the command line interface.
NOTE
Switch Health report details do not display the health status of GbE ports. Fabric Watch only
monitors and reports the status for physical and virtual FC ports.
NOTE
The final portion of the report, detailing port health, is not available without a Fabric Watch license.
NOTE
Port Detail reports do not display the health status of GbE ports. Fabric Watch only monitors and
reports the status for physical and virtual FC ports.
--------Port-Errors------------ -----SFP-Errors----
Port# Type State Dur(H:M) LFA LSY LSI PER INW CRC PSC BLP STM SRX STX SCU SVO
--------------------------------------------------------------------------------
080 U OFFLINE 062:17 - - - - - - - - - - - - -
081 U OFFLINE 062:17 - - - - - - - - - - - - -
082 U OFFLINE 062:17 - - - - - - - - - - - - -
083 U OFFLINE 062:17 - - - - - - - - - - - - -
084 U OFFLINE 062:17 - - - - - - - - - - - - -
NOTE
Output of the Port Detail report depends on the ports that belong to the current Admin Domain
context. If a port does not belong to the current Admin Domain, nothing other than the port number
is displayed for that port; for example:
“000 ----------------Not a member of current Admin Domain-------------------”
Table 33 lists and describes each item in the Port Detail report.
LFA Link Loss: The number of link loss occurrences out of range for a specified time period.
LSY Sync Loss: The number of sync loss occurrences out of range for a specified time period.
LSI Signal Loss: The number of signal loss occurrences out of range for a specified time period.
PER Protocol Error: The number of protocol errors out of range for a specified time period.
INW Invalid word: The number of invalid words out of range for a specified time period.
CRC Invalid CRC: The number of CRC errors out of range for a specified time period.
PSC Port hardware state changed too often because of fabric reconfiguration.
BLP Buffer limited port: The switch status changes when a port is in a buffer limited mode based on
the switch status policy.
STM SFP temperature is out of specifications.
SRX SFP receive power is out of specifications.
STX SFP transmit power is out of specifications.
SCU SFP current is out of specifications.
SVO SFP voltage is out of specifications.
A C
above event triggers, 15 class
access gateway mode, restrictions in Fabric Watch, 12 environment, 20
action configuration guidelines, 4 fabric, 20
activating Fabric Watch, 23, 81 FRU, 20
using a Telnet session, 23 Performance Monitor, 21
using SNMP, 24 port, 21
using Web Tools, 28 resource, 21
alarm behavior, 17 security, 21
alarm configuration report for Fabric Watch, 86 SFP, 21
classes, description of, 20
alarm notification configuration, 34
command
alarms, Fabric Watch
dnsConfig, 9
configuring, 84, 85
displaying, 87 errShow, 10
enabling and disabling, 85 fmConfig, 5
alerts configuration recommendations, 3 fwclassinit, 24
fwFruCfg, 4, 78
area
fwMailCfg, 9
environment class, 69
licenseAdd, 24
fabric class, 35
portThConfig, 4, 6, 49, 52
FRU class, 77
snmpConfig, 10
performance monitor class, 41
switchStatusPolicySet, 75
port class, 49
sysMonitor, 7, 71, 72, 73
resource class, 70
thConfig, 4, 5, 6, 42
security class, 37
thMonitor, 6, 45
SFP class, 40
configuration guidelines
areas, 19
actions, 4
audit messages, 11
threshold, 4
configuration tasks, list of, 29
configuring
B email notifications, 87
Fabric Watch thresholds, 83
below event trigger, 15 FRU alarms, 85
BNA, port fencing configuration, 65 threshold alarms, Fabric Watch, 84
configuring alarm notifications, 34
configuring ports, 52
core blades, notification when removed, 75
CPU and memory configuration, 73
T
Telnet, using to activate Fabric Watch, 23
temperature, configuring using sysMonitor command, 72
thConfig command
configuration options, 42