hbouma
06-13-2002, 11:51 AM
Hi,
Does anyone have a favorite system for being notified of hardware failures? I've noticed that some systems send a notification when, say, a hard drive fails. I was wondering what methods exist for this and how effective people find them. Thanks!
Hal
allan
06-25-2002, 08:19 AM
What did you mean? What type of notification are you looking for?
Studio64
06-25-2002, 03:41 PM
Usual methods of notification:
[list=1]
Awful grinding sound.
Emails from mad people unable to access box
Odd smell coming from box
[/list=1]
Quite frankly... The box that breaks obviously can't inform you b/c.. Well.... It's broken..
The only way I can think of would be a cron script that cycles through a list of given IPs, pinging each one.
When a ping fails, it would shoot off an email to whoever needs to be notified, containing the non-responsive IP and maybe the corresponding name of the server.
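Something along these lines would do it. This is only a rough sketch, not a tested monitoring tool: the host list, the email addresses, and the use of a mail server on localhost are all made-up placeholders, and the ping flags assume the Linux `ping` command (`-c` for count, `-W` for timeout in seconds).

```python
import smtplib
import subprocess
from email.message import EmailMessage

# Hypothetical values for illustration only: replace with your own.
HOSTS = {"192.0.2.10": "web1", "192.0.2.11": "db1"}  # IP -> server name
ALERT_FROM = "monitor@example.com"
ALERT_TO = "admin@example.com"


def is_up(ip, timeout_s=2):
    """Send a single ping; True if the host answered (Linux ping flags assumed)."""
    result = subprocess.run(
        ["ping", "-c", "1", "-W", str(timeout_s), ip],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def find_down(hosts, probe=is_up):
    """Return (ip, name) pairs for every host that fails the probe."""
    return [(ip, name) for ip, name in hosts.items() if not probe(ip)]


def build_alert(down, sender, recipient):
    """Build one email listing all non-responsive hosts."""
    msg = EmailMessage()
    msg["Subject"] = f"{len(down)} host(s) not responding to ping"
    msg["From"] = sender
    msg["To"] = recipient
    msg.set_content("\n".join(f"{name} ({ip}) is not responding" for ip, name in down))
    return msg


def main():
    down = find_down(HOSTS)
    if down:
        # Assumes an SMTP server is listening on localhost port 25.
        with smtplib.SMTP("localhost") as smtp:
            smtp.send_message(build_alert(down, ALERT_FROM, ALERT_TO))

# Add a call to main() here before running the script from cron.
```

Run it from a different machine than the ones being watched, for the reason above: a dead box can't report on itself. A crontab entry that runs it every five minutes or so would be typical. Keep in mind a failed ping only tells you the box is unreachable, not why.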
hbouma
06-25-2002, 04:26 PM
Hi,
Sorry about the "deleted" prior post. When I wrote it last night I realized I had screwed up, so I deleted the contents. I would have deleted the whole message, but WHT wouldn't let me. What I mean is this: let's say the CPU starts to get hot. It would be nice if I could get an email or a page notifying me of this. I also saw someone complaining that they were getting emails saying their S.M.A.R.T. hard drive was reporting problems and should be replaced. After all, if I have, say, a 4-drive RAID 5 setup, how else am I to know that one of the drives has failed and that if I lose another drive, the ability to restore the data will be lost?
I am curious as to how these things are monitored. Someone earlier said to use SNMP to detect this. I looked around and saw that devices can use SNMP, but I don't know which programs make use of these features or where to get the MIBs needed for this to work.
Any suggestions would be appreciated. Thanks.
Hal