My monitoring recently started alerting me about a failing disk.
Replacing a failed disk is quite straightforward. First, I identified the device:
1) Locating the Drive
We start the locate function on the drive (enclosure 2, slot 11 in my case), which makes its LED blink, so that we can be sure we are replacing the correct disk.
/opt/MegaRAID/MegaCli/MegaCli64 -PdLocate -start -physdrv[2:11] -a0
Adapter: 0: Device at EnclId-2 SlotId-11 -- PD Locate Start Command was successfully sent to Firmware
Exit Code: 0x00
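The [2:11] argument above is enclosure:slot. If the alert does not already give you those numbers, one way to find them is to filter the physical drive list. The sketch below assumes the field labels "Enclosure Device ID", "Slot Number" and "Firmware state" in the -PDList output, which may differ between MegaCli versions; the function name list_non_online_drives is my own, not part of MegaCli. Hot spares and unconfigured drives are not "Online" either, so they will show up in the list too.

```shell
# Path taken from the commands in this post; override MEGACLI if yours differs.
MEGACLI="${MEGACLI:-/opt/MegaRAID/MegaCli/MegaCli64}"

# Reads -PDList output on stdin and prints "ENCLOSURE:SLOT STATE" for every
# drive whose firmware state does not start with "Online".
# Assumption: the field labels below match your MegaCli version.
list_non_online_drives() {
    awk -F': *' '
        { sub(/[ \t\r]+$/, "") }                 # drop trailing whitespace
        /^Enclosure Device ID/ { encl = $2 }
        /^Slot Number/         { slot = $2 }
        /^Firmware state/ && $2 !~ /^Online/ { print encl ":" slot " " $2 }
    '
}

# Usage:
#   "$MEGACLI" -PDList -aALL | list_non_online_drives
```

The enclosure and slot are remembered as they scroll past, and a line is printed only when the firmware state of that drive comes up as something other than Online.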
2) Physically Removing the Drive
After locating the drive, we can remove it and replace it with a spare one.
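Once the new drive is in, the locate LED can be switched off again with the -stop variant of the same command. Below is a small wrapper around both variants; the function name locate_led and the MEGACLI variable are my own additions, and the adapter defaults to 0 as in the command above.

```shell
# Path taken from the commands in this post; override MEGACLI if yours differs.
MEGACLI="${MEGACLI:-/opt/MegaRAID/MegaCli/MegaCli64}"

# locate_led start|stop ENCLOSURE:SLOT [ADAPTER]
# Starts or stops the locate LED on one physical drive.
locate_led() {
    action="$1"; drive="$2"; adapter="${3:-0}"
    case "$action" in
        start|stop) ;;
        *) echo "usage: locate_led start|stop ENCL:SLOT [ADAPTER]" >&2
           return 2 ;;
    esac
    # The quotes keep the shell from treating the brackets as a glob pattern.
    "$MEGACLI" -PdLocate "-$action" "-physdrv[$drive]" "-a$adapter"
}

# Usage:
#   locate_led start 2:11
#   locate_led stop 2:11
```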
3) Rebuilding the Array
My MegaRAID card is configured to auto-rebuild my arrays in case of a disk failure, so I did not have to start the rebuild by hand.
We can check the status of the rebuild with the command below:
/opt/MegaRAID/MegaCli/MegaCli64 -PDRbld -ShowProg -PhysDrv[2:11] -aALL
Rebuild Progress on Device at Enclosure 2, Slot 11 Completed 91% in 0 Minutes.
Exit Code: 0x00
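Rather than re-running the command by hand, the check can be wrapped in a loop. This is only a sketch: it assumes the progress line keeps the "Rebuild Progress ... Completed NN%" shape shown above, and it treats the absence of such a line as "no rebuild running" instead of relying on any particular completion message. The function names rebuild_percent and wait_for_rebuild are my own.

```shell
# Path taken from the commands in this post; override MEGACLI if yours differs.
MEGACLI="${MEGACLI:-/opt/MegaRAID/MegaCli/MegaCli64}"

# Reads MegaCli output on stdin and prints the percentage from the first
# "Rebuild Progress ... Completed NN%" line; prints nothing if there is none.
rebuild_percent() {
    sed -n 's/.*Rebuild Progress.*Completed \([0-9][0-9]*\)%.*/\1/p' | head -n 1
}

# wait_for_rebuild ENCLOSURE:SLOT [INTERVAL_SECONDS]
# Polls the rebuild progress until no progress line is reported any more.
wait_for_rebuild() {
    drive="$1"; interval="${2:-60}"
    while :; do
        pct=$("$MEGACLI" -PDRbld -ShowProg "-PhysDrv[$drive]" -aALL | rebuild_percent)
        [ -n "$pct" ] || break      # no progress line: rebuild is not running
        echo "$(date '+%H:%M:%S') rebuild at ${pct}%"
        sleep "$interval"
    done
    echo "No rebuild in progress on [$drive]"
}

# Usage:
#   wait_for_rebuild 2:11 30
```

Because the loop also ends when the drive was never rebuilding in the first place, it is worth checking the drive state afterwards before trusting the array.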