I have a Dell R900 with 5 drives: three 1 TB SAS drives in RAID 5 and two others in RAID 1. All of them are connected to the PERC 6/i controller. I received a warning from OpenManage that one of the three drives in the RAID 5 had failed, and ordered a replacement.

After inserting the new drive and starting a rebuild, I noticed the following. Several days later the array was still listed as "degraded" and the rebuild process was stuck at 0%. Additionally, according to OpenManage, the only drive listed under "Physical Drives" was the one that was rebuilding; all the other drives had disappeared. This generated no alerts, and the status of the controller itself was still listed as green. The machine and OS kept running without problems throughout.

I looked in the logs for the controller itself, and they showed that the rebuild started when I inserted the drive and completed normally, with the RAID array going into "Optimal" status once it finished.

I managed to fix the problem by rebooting the machine. After it came back up everything was showing as normal again, but I am concerned that OpenManage was showing incomplete, inaccurate, or impossible information, even after I restarted the OpenManage process on the machine. We rely on OpenManage monitoring the hardware to notify us when there is an issue, and I'm worried that a similar situation might crop up in the future where, instead of showing a problem that doesn't exist, it fails to recognize one that does. Has anyone here seen something similar to this?
BMC Firmware - 2.48
SAS Backplane Firmware - 1.06
PERC 6/i firmware - 6.3.1-0003
PERC 6/i driver - unsure; the machine is currently being used as a VMware host, and the only driver information given is "Non-SSD".
Server BIOS - 1.2.0
OpenManage Version - 6.5.0
According to OpenManage everything is up to date (all components have a green check mark next to them) except the other controller in this machine, a PERC 6/e that is not in use. It has a firmware update available that we haven't bothered with since we're not using it.
Originally Posted by benj114
I've seen outdated firmware/BIOS and older OpenManage versions cause false readings or a lack of readings.
Can you let us know what versions of the following you're running:
SAS Backplane Firmware
PERC 6/i firmware
PERC 6/i driver
The BMC and PERC 6/i firmware are each behind by one release. The release notes do not mention fixes for what you described. I don't advise upgrading the PERC 6/i firmware if you're unsure about the driver, given that the machine is a VMware host.
Your OpenManage version should be fine for a 10th generation PowerEdge. I'm not sure offhand whether OpenManage 7.x works with anything older than the 11th generation PowerEdges, so I wouldn't upgrade to that.
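If you want a safety net against OpenManage reporting an impossible state again, a small independent sanity check might help. Below is a rough Python sketch, not anything Dell ships: the `StorageSnapshot` fields, the `find_inconsistencies` name, and the 48-hour threshold are all my own assumptions, and you would have to fill in the values from whatever source you trust (OpenManage, the controller log, or both).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class StorageSnapshot:
    """Hypothetical summary of what the monitoring tool reports."""
    expected_disks: int             # how many physical disks you know are installed
    reported_disks: int             # how many the tool currently lists
    array_state: str                # e.g. "Optimal" or "Degraded"
    rebuild_percent: int            # 0-100 as reported by the tool
    hours_since_rebuild_start: float


def find_inconsistencies(snap: StorageSnapshot,
                         max_stalled_hours: float = 48.0) -> List[str]:
    """Return a list of human-readable warnings; empty means nothing odd."""
    problems = []

    # Disks that silently vanish from the listing are suspicious on their own.
    if snap.reported_disks < snap.expected_disks:
        problems.append(
            f"only {snap.reported_disks} of {snap.expected_disks} "
            "physical disks reported"
        )

    # A rebuild that shows no progress at all after a long time is likely stale data.
    degraded = snap.array_state.strip().lower() == "degraded"
    if (degraded and snap.rebuild_percent == 0
            and snap.hours_since_rebuild_start > max_stalled_hours):
        problems.append(
            f"rebuild still at 0% after {snap.hours_since_rebuild_start:.0f} hours"
        )

    return problems


if __name__ == "__main__":
    # Values roughly matching the situation described in this thread.
    stuck = StorageSnapshot(expected_disks=5, reported_disks=1,
                            array_state="Degraded", rebuild_percent=0,
                            hours_since_rebuild_start=72)
    for warning in find_inconsistencies(stuck):
        print("WARNING:", warning)
```

The point is only to alert on states that cannot be right (missing disks, a rebuild that never moves) rather than to trust the green check mark; the threshold would need tuning for your array size.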