Preventive Health Check
The information or statements given in this documentation concerning the suitability, capacity, or
performance of the mentioned hardware or software products are given “as is” and all liability arising
in connection with such hardware or software products shall be defined conclusively and finally in
a separate agreement between Nokia Solutions and Networks and the customer. However, Nokia
Solutions and Networks has made all reasonable efforts to ensure that the instructions contained
in the document are adequate and free of material errors and omissions. Nokia Solutions and
Networks will, if deemed necessary by Nokia Solutions and Networks, explain issues which may not
be covered by the document.
Nokia Solutions and Networks will correct errors in this documentation as soon as possible. IN NO
EVENT WILL NOKIA SOLUTIONS AND NETWORKS BE LIABLE FOR ERRORS
IN THIS DOCUMENTATION OR FOR ANY DAMAGES, INCLUDING BUT NOT
LIMITED TO SPECIAL, DIRECT, INDIRECT, INCIDENTAL OR CONSEQUENTIAL
OR ANY LOSSES, SUCH AS BUT NOT LIMITED TO LOSS OF PROFIT, REVENUE,
BUSINESS INTERRUPTION, BUSINESS OPPORTUNITY OR DATA, THAT MAY
ARISE FROM THE USE OF THIS DOCUMENT OR THE INFORMATION IN IT.
NSN is a trademark of Nokia Solutions and Networks. Nokia is a registered trademark of Nokia
Corporation. Other product names mentioned in this document may be trademarks of their
respective owners, and they are mentioned for identification purposes only.
Nokia Solutions and Networks is continually striving to reduce the adverse environmental effects of
its products and services. We would like to encourage you as our customers and users to join us
in working towards a cleaner, safer environment. Please recycle product packaging and follow the
recommendations for power use and proper disposal of our products and their components. If you
should have questions regarding our Environmental Policy or any of the environmental services we
offer, please contact us at Nokia Solutions and Networks for additional information.
Contents
1 About Preventive Health Check
1.1 Preventive Health Check contents
About Preventive Health Check
• NetAct
• NetAct Configurator (both Core and Radio)
• Optimizer
After installation, the tool is located in the /opt/oss/NSN-mhcf directory on all nodes of the NetAct
cluster.
The Directory folders figure below shows the contents of the Preventive Health Check installation
directory.
Directory Description
Note:
• netact8 (including NAC scripts)
• optimizer
1. Log in as the omc user to the node where you want to run the health check. This applies to all
NetAct nodes except the performance management nodes.
where:
<value> is the log level: small, medium, or all.
For example,
• To execute the PHC for NetAct scenarios (including NetAct for Core and Performance
Management), the command is:
mhcf.pl -c /opt/oss/NSN-mhcf/product/netact8/cfg_netact8.xml
Note:
During NetAct Upgrade, the PHC tool should not be executed. If PHC is scheduled
for automatic execution, such scheduling should be disabled.
mhcf.pl -c /opt/oss/NSN-mhcf/product/optimizer/cfg.optimizer.xml
Note:
If a password policy is enforced in the system, the password of an internal backend user
such as mhcf may expire on a node. Test cases executed on that node then fail because
the Preventive Health Check tool cannot make an SSH connection to that node.
In such cases the following error message is displayed on the screen and in the output
files.
You must manually change the password on all nodes of the system. The user security
hardening measures described in the Administering NetAct System Security
document also apply to the mhcf user.
The password expiry duration follows the password policy, which is 180 days by default.
Expected outcome
The command output is either passed or failed. A result folder is created in the
/var/opt/nokia/oss/global/NSN-mhcf/<user>/<product>/ directory. For more information, see Report
directory structure on page 10.
Note:
You can use the timeout option in the configuration to define the maximum execution time for a
single test case. If the execution of a test case exceeds the defined maximum execution time,
the test case fails.
<defaultTimeOut>90</defaultTimeOut>
A few test cases might have their own timeout defined in the test configuration.
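The timeout behaviour described above can be illustrated with a minimal Python sketch. It is not the tool's implementation: the function name run_test_case is hypothetical, and the unit of defaultTimeOut is assumed to be seconds, which the configuration fragment does not state.

```python
import subprocess
import sys

# Mirrors <defaultTimeOut>90</defaultTimeOut>; the unit (seconds) is an assumption.
DEFAULT_TIMEOUT_SECONDS = 90


def run_test_case(argv, timeout=DEFAULT_TIMEOUT_SECONDS):
    """Run one command and report 'failed' if it errors or exceeds the timeout."""
    try:
        completed = subprocess.run(argv, capture_output=True, timeout=timeout)
    except subprocess.TimeoutExpired:
        # Exceeding the maximum execution time counts as a failed test case.
        return "failed"
    return "passed" if completed.returncode == 0 else "failed"


if __name__ == "__main__":
    quick = [sys.executable, "-c", "pass"]
    slow = [sys.executable, "-c", "import time; time.sleep(5)"]
    print(run_test_case(quick, timeout=30))
    print(run_test_case(slow, timeout=0.5))
```

A per-test timeout, where one is defined, would simply be passed as the timeout argument in place of the default.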
The basic symptoms needed for quick analysis of a problem are collected on every execution
of the tool with loglevel = small, regardless of whether a test case passes or fails. If a test
case fails, Preventive Health Check can be executed again with loglevel = medium and
filter=module name or testSet=testset name.
Example
Additional Information
• To view the list of supported commands of the Preventive Health Check, execute:
mhcf.pl -help
• To run a specific configuration file that contains the test case, execute:
mhcf.pl -c /opt/oss/NSN-mhcf/product/netact8/cfg_netact8.xml
mhcf.pl -c /opt/oss/NSN-mhcf/product/netact8/cfg_netact8.xml -t product/netact8/subcfg_netact8.xml
Note:
• To run test case for NetAct for Core based on filterTags=cm specified in the test case,
execute:
mhcf.pl -c /opt/oss/NSN-mhcf/product/netact8/cfg_netact8.xml -f cm
Note:
filterTags can be cm, rhel, nas_common, fm, ha, was, pm. You can provide
multiple filterTags as follows:
mhcf.pl -c /opt/oss/NSN-mhcf/product/netact8/cfg_netact8.xml -f cm,rhel,nas_common,fm,ha,was,pm
For information on the available test cases, see Test cases on page 12.
• To run a specific test case assigned to a specific role, such as db, execute:
mhcf.pl -c /opt/oss/NSN-mhcf/product/netact8/cfg_netact8.xml -r db
Note:
Role can be db, dmgr, fm_pipe or was. Multiple role values can be provided in the
same command as follows:
mhcf.pl -c /opt/oss/NSN-mhcf/product/netact8/cfg_netact8.xml -r db,dmgr,fm_pipe,was
mhcf.pl -c /opt/oss/NSN-mhcf/product/netact8/cfg_netact8.xml -q
• To run a specific test case based on the test case number, execute:
Note:
To view the test case numbers, execute the command with the -list option.
Note:
– small: quick logs; defined as level=4 in each module log configuration file
– medium: detailed logs, including core dumps; defined as level=2 in each
module log configuration file
– all: both small and medium logs
• Logs are collected based on the tag collectLogs="always" or
collectLogs="onError" in the log configuration file:
• The preferred and default command is
mhcf.pl -c /opt/oss/NSN-mhcf/product/netact8/cfg_netact8.xml -loglevel small.
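For scripted use, the command lines above can be assembled programmatically. The following Python sketch only builds the argument list; it does not run the tool. The function build_mhcf_command is hypothetical, and it assumes that the -f, -r and -loglevel options can be combined in one invocation, which this document does not confirm. Only the filterTags and log levels listed in this document are accepted.

```python
# Values taken from this document; the helper itself is a hypothetical illustration.
KNOWN_FILTER_TAGS = {"cm", "rhel", "nas_common", "fm", "ha", "was", "pm"}
KNOWN_LOG_LEVELS = {"small", "medium", "all"}


def build_mhcf_command(config, filter_tags=(), roles=(), loglevel=None):
    """Return the argv list for one mhcf.pl invocation."""
    unknown = set(filter_tags) - KNOWN_FILTER_TAGS
    if unknown:
        raise ValueError(f"unknown filterTags: {sorted(unknown)}")
    if loglevel is not None and loglevel not in KNOWN_LOG_LEVELS:
        raise ValueError(f"unknown log level: {loglevel}")

    argv = ["mhcf.pl", "-c", config]
    if filter_tags:
        # Multiple filterTags are passed as one comma-separated value.
        argv += ["-f", ",".join(filter_tags)]
    if roles:
        # Multiple roles are passed the same way.
        argv += ["-r", ",".join(roles)]
    if loglevel:
        argv += ["-loglevel", loglevel]
    return argv


if __name__ == "__main__":
    cfg = "/opt/oss/NSN-mhcf/product/netact8/cfg_netact8.xml"
    print(" ".join(build_mhcf_command(cfg, filter_tags=["cm", "fm"], loglevel="small")))
```

Rejecting unknown tags before the command is built keeps a typing error from silently selecting no test cases.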
To see simple statistics at the end of the report, use the -stats option:
Example:
/var/opt/nokia/oss/global/NSN-mhcf/omc/netact8/
The summary report lists the test cases executed on each node and provides a summary of the test
results. The summary contains the description of each test and result of the test execution. If any test
case fails, the reason for the failure and suggested action are described.
The collected symptoms are stored in a .tar.gz file. The summary report is available in the following
formats:
• .txt - The summary report in text format is the same as the console output. Open the file to view the
report on a console.
• .html - The summary report in .html format can be viewed in a browser.
• .xml - The summary report in .xml format is ASWS compatible for integration with care tools.
time: reports and logs are cleaned when they are older than the value in reportsRetentionTimeInDays
size: reports and logs are cleaned when the size of the report directory exceeds the value given in
reportsSizeThresholdInMB. Files are removed starting from the oldest until the size of the report
directory becomes smaller than this threshold.
This value tells the tool which files should be removed during cleanup.
logsonly: only log file archives are removed; the reports are retained.
<reportsCleanUpRange>all</reportsCleanUpRange>
3. Reports cleanup retention time in days
Reports are removed when size of the reports directory exceeds this value
<reportsSizeThresholdInMB>200</reportsSizeThresholdInMB>
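The two cleanup modes described above (by time and by size) can be sketched in Python as follows. This is an illustration under stated assumptions, not the tool's code: the function names are hypothetical, file age is taken from the modification time, and the report directory is treated as a flat directory of files.

```python
import os
import time
from pathlib import Path

SECONDS_PER_DAY = 86400


def clean_by_time(report_dir, retention_days, now=None):
    """Remove files older than retention_days (cf. reportsRetentionTimeInDays)."""
    now = time.time() if now is None else now
    cutoff = now - retention_days * SECONDS_PER_DAY
    removed = []
    for path in sorted(Path(report_dir).iterdir()):
        if path.is_file() and path.stat().st_mtime < cutoff:
            path.unlink()
            removed.append(path.name)
    return removed


def clean_by_size(report_dir, threshold_mb):
    """Remove oldest files first until the directory fits threshold_mb
    (cf. reportsSizeThresholdInMB)."""
    files = sorted(
        (p for p in Path(report_dir).iterdir() if p.is_file()),
        key=lambda p: p.stat().st_mtime,
    )
    total = sum(p.stat().st_size for p in files)
    limit = threshold_mb * 1024 * 1024
    removed = []
    for path in files:
        if total <= limit:
            break
        total -= path.stat().st_size
        path.unlink()
        removed.append(path.name)
    return removed
```

The size mode deletes in oldest-first order, so the most recent reports are the last to be removed.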
• RHEL test cases check the operating system and hardware. For more information, see RHEL
module on page 28.
• NetAct test cases cover NetAct for Core, Fault Management, high availability, NetAct system
information, Performance Management, and WebSphere application server. For more information, see
NetAct module on page 12.
• Optimizer test cases are listed in Optimizer module on page 27.
For information on how to run a test case, see Running Preventive Health Check on page 7.
• Configuration management
• Fault management
• High Availability
• NetAct system information
• Performance management
• WebSphere application server
• License Management
• Software Asset Monitoring (SWAM)
Note:
Check the filterTag in the test case for the module name. The filterTag can be cm,
rhel, nas_common, fm, ha, was, pm. You can execute the modules separately by providing
the module name as filter in the command. For more information on how to execute modules,
see Running Preventive Health Check on page 7.
Test case title | Execute on node with Role | Test case description
Check is unable to retrieve
these data, the test fails.
Daily Alarm Rate | dmgr | Lists the daily alarm rate for the last 7 days, excluding the present day.
The rate is calculated from the alarm insertion time in the FM database. The report also displays
the daily count of active non-acknowledged, acknowledged and cancelled alarms for the last 7 days.
The test fails if the daily alarm rate crosses the threshold limit of 300K/day. The report shows "fail"
against the dates on which the alarm rate exceeds the limit. Thresholds can be changed in the file
/opt/oss/NSN-mhcf/product/netact8/fm/fm_config.xml.
Parameters:
• <var id="days" name="Number of days" value="7" />
• <var id="dailyAlarmThreshold" name="Threshold for daily alarm rate" value="300000" />
• <var id="numNE" name="This value indicates number of high alarm NEs to be shown in report" value="10" />
• <var id="highAlarmThreshold" name="Threshold for high alarm rate per NE" value="100000" />
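The way the Daily Alarm Rate parameters drive the pass or fail result can be sketched in Python. This is a minimal illustration, not the tool's logic: the <vars> wrapper element, the function names and the sample counts are hypothetical, and only the <var> elements are taken from this document.

```python
import xml.etree.ElementTree as ET

# The <vars> wrapper is hypothetical; the real layout of fm_config.xml is not shown here.
FM_CONFIG_SNIPPET = """
<vars>
  <var id="days" name="Number of days" value="7" />
  <var id="dailyAlarmThreshold" name="Threshold for daily alarm rate" value="300000" />
</vars>
"""


def load_vars(xml_text):
    """Map each <var> id to its value attribute (values stay strings)."""
    root = ET.fromstring(xml_text)
    return {var.get("id"): var.get("value") for var in root.iter("var")}


def evaluate_daily_alarm_rate(daily_counts, threshold):
    """Mark each day 'fail' when its alarm count exceeds the threshold.

    The overall result is 'fail' if any day fails. The 'pass' label is an assumption.
    """
    per_day = {
        day: ("fail" if count > threshold else "pass")
        for day, count in daily_counts.items()
    }
    overall = "fail" if "fail" in per_day.values() else "pass"
    return overall, per_day


if __name__ == "__main__":
    settings = load_vars(FM_CONFIG_SNIPPET)
    sample = {"day-1": 120000, "day-2": 310000}  # made-up counts for illustration
    print(evaluate_daily_alarm_rate(sample, int(settings["dailyAlarmThreshold"])))
```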
2. Check that the URL https://<loadbalancer_IP>/ opens without any error.
Note: High availability test cases must be run only once. Therefore, execute these test cases
on the node with the unique role db.
Test case title | Execute on node with Role | Test case description
NetAct System Info | db | Lists NetAct system information such as ClusterID, ClusterName, TargetID and domain name. Also displays the Oracle and WAS versions. The test fails if it cannot fetch the required information.
NetAct Release Info | dmgr | Lists the software release information. The test fails if it cannot fetch the required information.
DB Size Check | db | Checks that the DB usage is below the 80% threshold. The test fails if the usage exceeds the threshold.
Global Disk Size Check | nfs | Checks that the folders mounted on the NFS server are less than 95% (threshold) full. Folders considered:
• /home/
• /d/oss/global
• /home
• /packages/var/opt/nokia/oss/global
NFS Server check for exported folders | nfs | Checks that the /home/ and /d/oss/global folders are exported on the NFS server.
NFS Client check for mounted folders | netfs | Checks whether the /home and /d/oss/global folders are mounted successfully as 'nfs' file system.
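A usage check of the kind performed by Global Disk Size Check can be sketched in Python. This is an assumption-based illustration: the function names are hypothetical, and the tool may obtain its figures differently (for example from df output). The sketch uses shutil.disk_usage from the standard library and the 95% threshold stated in the table.

```python
import shutil


def percent_used(total_bytes, used_bytes):
    """Return the used share of a file system as a percentage."""
    return 100.0 * used_bytes / total_bytes


def check_folder(path, threshold_percent=95.0):
    """Return ('passed' | 'failed', percent used) for the file system holding path."""
    usage = shutil.disk_usage(path)
    pct = percent_used(usage.total, usage.used)
    return ("passed" if pct < threshold_percent else "failed"), pct


if __name__ == "__main__":
    # "." stands in for folders such as /home or /d/oss/global.
    result, pct = check_folder(".")
    print(f"{result}: {pct:.1f}% used")
```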
Parameters:
• <var id="pmHourlyAggregation" name="PM Maximum Aggregation Duration Hourly" value='900' />
• <var id="pmDailyAggregation" name="PM Maximum Aggregation Duration Daily" value='10800' />
Parameter: <var id="pmDeletionPart" name="PM Maximum Deletion Duration of Partitioned tables" value='500' />
Parameter: <var id="pmDeletionNonPart" name="PM Maximum Deletion Duration of Non-Partitioned tables" value='1800' />
is enabled. The log contains the number of NE integrations sorted by Adaptation, Abbreviation,
Version and Operational state.
<var id="days" name="Duration (in days) of Doctor logs to check" value="1" />
The following table lists the License Manager & SWAM test cases.
Test case title | Execute on node with Role | Test case description
Test case title | Execute on node with Role | Test case description
Optimizer Build Number and Customer Name | | Check the latest build number
Disk space details of UMA and Optimizer tables space | | Check and display table space in Optimizer
List of KPIs selected for Auto Retrieval – as zipped file attached to log | | Check and point out all active KPIs
Total Number of Plans in Optimizer and Total number of plans in optimizer which are older than 60 days | | Check and display all available plans in Optimizer and point out all older than 60 days
Count of Managed Objects - as zipped file attached to log | | Check and display the number of Managed Objects
Row Space Consumed in OPTDB – as zipped file attached to log | | Check and point out Row Space Consumed in OPTDB
Test case title | Execute on Node(s) with ROLE | Test case description
message file. The keywords can be specified in /opt/oss/NSN-mhcf/product/netact8/rhel/rhel_config.xml;
change the value for id=errString1
<var id="errString1" name="Error pattern to check in /var/log/messages file; pattern separated with /" value="error/Error/ERROR" />
To change the line count, the id errorString2Count value can be changed in the same XML file:
<var id="errString1Count" name="Trailing number of lines to check in the /var/log/secure file; default 500" value="500" />
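The pattern and line-count parameters of the RHEL log checks can be pictured with a short Python sketch. It is an assumption-based illustration, not the tool's code: the function names are hypothetical, the pattern value is split on "/" as the parameter name describes, and a plain substring match is assumed.

```python
from collections import deque


def split_patterns(value):
    """Split a value such as 'error/Error/ERROR' into individual patterns."""
    return [pattern for pattern in value.split("/") if pattern]


def scan_trailing_lines(lines, patterns, line_count):
    """Return the lines among the last line_count that contain any pattern."""
    tail = deque(lines, maxlen=line_count)  # keeps only the trailing lines
    return [line for line in tail if any(p in line for p in patterns)]


if __name__ == "__main__":
    sample_log = [
        "service started",
        "disk error on device",  # made-up log lines for illustration
        "login accepted",
        "ERROR: authentication failure",
    ]
    patterns = split_patterns("error/Error/ERROR")
    for hit in scan_trailing_lines(sample_log, patterns, line_count=500):
        print(hit)
```

Because a deque with maxlen discards older entries as it fills, the same function also works on an open file object without loading the whole file into memory.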
2.2.4 ORACLE
Test case title | Execute on node with Role | Test case description
<var id="proc_threshold" name="Oracle Process Threshold%" value="95" />
• <var id="topCount" name="Top Oracle Processes Count" value="20" />
• <var id="minTime" name="Minimum TIME in hundreds of second to be listed in top command" value="1000" />
above the threshold of 1200 seconds (default)
• <var id="minInterval" name="Minimum Interval (seconds) between archive files" value="1200" />
• <var id="hoursToFull" name="Minimum Hours until archive partition is full" value="72" />
<var id="oldestFile" name="Time Limit (minutes) for archive file before backup" value="1440" />
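One possible reading of the minInterval and hoursToFull parameters is sketched below in Python. The interpretation is an assumption: the check is taken to fail when two archive files are created less than minInterval seconds apart, or when the archive partition is projected to fill in fewer than hoursToFull hours. The function names and the linear fill-rate estimate are hypothetical and are not taken from the tool.

```python
def shortest_interval(timestamps):
    """Return the smallest gap in seconds between consecutive archive files."""
    ordered = sorted(timestamps)
    gaps = [later - earlier for earlier, later in zip(ordered, ordered[1:])]
    return min(gaps) if gaps else None


def hours_until_full(free_bytes, bytes_per_hour):
    """Estimate hours until the partition is full at a constant fill rate."""
    if bytes_per_hour <= 0:
        return float("inf")
    return free_bytes / bytes_per_hour


def check_archive(timestamps, free_bytes, bytes_per_hour,
                  min_interval=1200, hours_to_full=72):
    """Apply both limits; the defaults mirror the documented parameter values."""
    gap = shortest_interval(timestamps)
    interval_ok = gap is None or gap >= min_interval
    capacity_ok = hours_until_full(free_bytes, bytes_per_hour) >= hours_to_full
    return "passed" if interval_ok and capacity_ok else "failed"


if __name__ == "__main__":
    # Made-up archive creation times (seconds) and partition figures.
    print(check_archive([0, 1500, 3200], free_bytes=50_000_000_000,
                        bytes_per_hour=400_000_000))
```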
The PHC tool is scheduled to run automatically every day at 22:23 on the node with the DNS-Master
role. The cron job is added for the mhcf user during installation.
Note:
Before starting NetAct upgrade, scheduling of Preventive Health Check must be disabled for
mhcf user on the node with DNS-Master role.
1. Log in as root to the VM with the DNS-Master role (use the smanager.pl status service dns
command to get the DNS-Master VM node name).
2. Disable the execute permission for the cron job file:
After the NetAct upgrade is completed, the Preventive Health Check should be enabled again. Enable
the PHC cron job for the mhcf user again on the node with the DNS-Master role.
1. Log in as root to the VM with the DNS-Master role (use the smanager.pl status service dns
command to get the DNS-Master VM node name).
2. Enable the execute permission for the cron job file:
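The effect of disabling and enabling the execute permission can be illustrated with a small Python sketch. It is not the documented procedure: the function set_executable is hypothetical, the cron job file path is not given here and must be supplied by the administrator, and the sketch toggles the execute bits for user, group and others alike, which is an assumption.

```python
import os
import stat
import tempfile

EXEC_BITS = stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH


def set_executable(path, enabled):
    """Set or clear the execute bits on path and return the resulting mode."""
    mode = stat.S_IMODE(os.stat(path).st_mode)
    new_mode = (mode | EXEC_BITS) if enabled else (mode & ~EXEC_BITS)
    os.chmod(path, new_mode)
    return new_mode


if __name__ == "__main__":
    # A temporary file stands in for the cron job file.
    with tempfile.TemporaryDirectory() as workdir:
        job_file = os.path.join(workdir, "cron_job_placeholder")
        open(job_file, "w").close()
        print(oct(set_executable(job_file, False)))  # before the upgrade
        print(oct(set_executable(job_file, True)))   # after the upgrade
```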