0% found this document useful (0 votes)
99 views

Predictive Failure HDD

1) The document provides steps for safely replacing a hard drive that has been predicted to fail based on an Array Diagnostic Utility (ADU) report. 2) It instructs to take full backups, identify the predicted failure drive, remove it after shutting down, replace it with a new drive which will then rebuild. 3) During rebuilding, the array controller sequentially rebuilds each logical drive one by one.

Uploaded by

Akbarvali Guntur
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
99 views

Predictive Failure HDD

1) The document provides steps for safely replacing a hard drive that has been predicted to fail based on an Array Diagnostic Utility (ADU) report. 2) It instructs to take full backups, identify the predicted failure drive, remove it after shutting down, replace it with a new drive which will then rebuild. 3) During rebuilding, the array controller sequentially rebuilds each logical drive one by one.

Uploaded by

Akbarvali Guntur
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

SAW pagina 1 van 1

Compaq Array Configuration Utility - Replace a Hard


Drive Safely After a Predictive Failure
ISSUE: The hard drive fault LED is blinking and the Array Diagnostic Utility (ADU) report has
determined that there is a predictive failure on a hard drive.
Here is an example from an ADU report:

ErrorReport SLOT 0 Smart Array 6i Controller ERROR REPORT:


ErrorReport SCSI Port 1 Drive ID 0 has exceeded the following threshold(s) ErrorReport Pred failure
errors
ErrorReport SOLUTION: Please replace this drive when conditions permit.
ErrorReport SCSI Port 1, Drive ID 0 ... S.M.A.R.T. predictive failure errors have been
ErrorReport detected in the factory Monitor and Performance data. SOLUTION: Please ErrorReport
replace this drive when conditions permit.
ErrorReport SCSI Port 1, Drive ID 0 ... S.M.A.R.T. predictive failure errors have been
ErrorReport detected in the since power Monitor and Performance data. SOLUTION: Please
ErrorReport replace this drive when conditions permit.

NOTE: Pulling an online hard disk drive to replace it while the server is powered on there is a
chance to lose data. Online meaning that the HDD has not fully failed. A failed HDD is indicated
by the Amber/Red Solid FAULT LED being on and ACU showing that the physical drive has
failed.

SOLUTION:
1. Take TWO FULL BACKUPS of server and test the restore process in your lab.
2. Use ACU & ADU to identify which drive is the predicted failure and make sure there is only one drive in this state or it might not
be able to rebuild the array.
3. Mark the drive you need to remove.
4. ring down the server gracefully.
5. Remove the pre-failure drive.
6. Power up the server - without adding another hard drive to the removed open slot. POST screen error will pop up as the
following: [Slot 0 HP Smart Array 5i Controller (32MB, v2.62) 2 Logical Drives
1789-Slot 0 Drive Array SCSI Drive(s) Not Responding
Check cables or replace the following SCSI drive(s):
SCSI Port 2: SCSI ID 0
Select F1 to continue. All logical drive(s) will remain disabled
Select F2 to fail drive(s) that are not responding - Interim Recovery
mode will be enabled if configured for fault tolerance
(RESUME = "F1" OR "F2" KEY)
7. Press F2. Read your choices before assuming that F2 is the correct choice.You should see a similar message after pressing
F2: 1787-Slot 0 Drive Array Operating in Interim Recovery Mode
The following SCSI drive(s)
should be replaced:
SCSI Port 2: SCSI ID 0
8. Operating System come up and login.
9. Add in a Good Known hard drive to the failed open slot. If you had a spare drive then that should already be rebuilding.
10. Once the hard drive is added and spins up it will start rebuilding. If there is a spare it will stop rebuilding and go back to being a
spare drive.
11. Use the ACU to see the progress and make sure the drive is rebuilding and check the LEDs on the hard drive as well. In the
Array Configuration Utility 7.50.23.0 or higher it should show a Status Message for the Array. Click on the STATUS MESSAGE:
This is just one example! The current array controller is rebuilding logical drive 1 (RAID
1+0).
The current array controller is rebuilding logical drive 1 (RAID 1+0)

NOTE: The process of rebuilding starts with the first logical drive and then goes to the next and
so on. So, each individual logical drive will get rebuilt one at a time and sequentially.

http://h41297.www4.hp.com/km/saw/print.do?docId=emr_na-c00791056-1 3-6-2010

You might also like