Predictive Failure HDD
NOTE: Pulling an online hard disk drive to replace it while the server is powered on carries a
risk of data loss. "Online" means that the HDD has not fully failed. A failed HDD is indicated
by a solid amber/red FAULT LED and by ACU showing that the physical drive has
failed.
SOLUTION:
1. Take TWO FULL BACKUPS of the server and test the restore process in your lab.
2. Use ACU and ADU to identify which drive is in the predictive failure state, and make sure only one drive is in this state;
otherwise the array might not be able to rebuild.
3. Mark the drive you need to remove.
4. Bring down the server gracefully.
5. Remove the pre-failure drive.
6. Power up the server without adding another hard drive to the open slot. A POST error similar to the following will
appear:
Slot 0 HP Smart Array 5i Controller (32MB, v2.62) 2 Logical Drives
1789-Slot 0 Drive Array SCSI Drive(s) Not Responding
Check cables or replace the following SCSI drive(s):
SCSI Port 2: SCSI ID 0
Select F1 to continue. All logical drive(s) will remain disabled
Select F2 to fail drive(s) that are not responding - Interim Recovery
mode will be enabled if configured for fault tolerance
(RESUME = "F1" OR "F2" KEY)
7. Press F2, but read the on-screen choices first rather than assuming that F2 is the correct choice. After pressing F2 you
should see a message similar to the following:
1787-Slot 0 Drive Array Operating in Interim Recovery Mode
The following SCSI drive(s) should be replaced:
SCSI Port 2: SCSI ID 0
8. Let the operating system come up, then log in.
9. Insert a known good hard drive into the open slot. If the array had a spare drive, the spare should already be rebuilding.
10. Once the new hard drive is added and spins up, it will start rebuilding. If there is a spare, it will stop rebuilding and go
back to being a spare drive.
11. Use the ACU to check progress and confirm that the drive is rebuilding, and check the LEDs on the hard drive as well. In
Array Configuration Utility 7.50.23.0 or higher, a Status Message should be shown for the array. Click on the STATUS MESSAGE.
This is just one example of what it may show:
The current array controller is rebuilding logical drive 1 (RAID 1+0)
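As an illustration of steps 6 and 7, the two POST messages can be compared to confirm that the drive reported as "Not Responding" is the same single drive later reported in Interim Recovery Mode. The following is a minimal Python sketch, assuming the message text has been transcribed by hand from the screen. The function name `parse_post_message` and the regular expressions are hypothetical helpers written for this example; they are not part of any HP tool.

```python
import re

# Matches drive locations as printed in the POST messages above,
# e.g. "SCSI Port 2: SCSI ID 0".
DRIVE_RE = re.compile(r"SCSI Port\s+(\d+):\s*SCSI ID\s+(\d+)")
# Matches the leading POST code and slot, e.g. "1789-Slot 0".
CODE_RE = re.compile(r"^\s*(\d{4})-Slot\s+(\d+)", re.MULTILINE)


def parse_post_message(text):
    """Return (post_code, slot, [(port, scsi_id), ...]) from a POST message.

    Hypothetical helper: it only understands the message layout quoted
    in this article.
    """
    code_match = CODE_RE.search(text)
    if code_match is None:
        raise ValueError("no POST code found")
    drives = [(int(p), int(i)) for p, i in DRIVE_RE.findall(text)]
    return int(code_match.group(1)), int(code_match.group(2)), drives


# Message text transcribed from the examples in steps 6 and 7.
msg_1789 = """1789-Slot 0 Drive Array SCSI Drive(s) Not Responding
Check cables or replace the following SCSI drive(s):
SCSI Port 2: SCSI ID 0"""

msg_1787 = """1787-Slot 0 Drive Array Operating in Interim Recovery Mode
The following SCSI drive(s) should be replaced:
SCSI Port 2: SCSI ID 0"""

before = parse_post_message(msg_1789)
after = parse_post_message(msg_1787)
print(before)  # (1789, 0, [(2, 0)])
print(after)   # (1787, 0, [(2, 0)])

# Both messages should name exactly the one drive that was pulled.
assert before[2] == after[2] and len(before[2]) == 1
```

If the two lists differ, or more than one drive is listed, that suggests a different or additional drive is not responding, and it is worth stopping to investigate before continuing.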
NOTE: Rebuilding starts with the first logical drive and then moves on to the next, and so on.
Each logical drive is therefore rebuilt sequentially, one at a time.
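The sequential behaviour in the note can be sketched as a small bookkeeping helper. This is only an illustrative model, assuming that "first" means the lowest-numbered logical drive and that the number currently rebuilding is read from the ACU status message. The function `rebuild_plan` is hypothetical and does not query the controller.

```python
def rebuild_plan(logical_drives, currently_rebuilding):
    """Split logical drives into done / active / pending.

    Assumption: rebuilds run in ascending logical drive number,
    one logical drive at a time.
    """
    ordered = sorted(logical_drives)
    if currently_rebuilding not in ordered:
        raise ValueError("unknown logical drive")
    idx = ordered.index(currently_rebuilding)
    return {
        "done": ordered[:idx],
        "active": currently_rebuilding,
        "pending": ordered[idx + 1:],
    }


# The example controller reports 2 logical drives, and the status
# message says logical drive 1 is rebuilding.
plan = rebuild_plan([1, 2], currently_rebuilding=1)
print(plan)  # {'done': [], 'active': 1, 'pending': [2]}
```

In this example, logical drive 2 would still be waiting, so the array is not fully protected again until it has also finished rebuilding.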
http://h41297.www4.hp.com/km/saw/print.do?docId=emr_na-c00791056-1 3-6-2010