BixData 2.7 Server/Agents - Ping ServiceCheck Failures

Submitted by jnicpon on April 8, 2008 - 09:14.

I have been extensively testing the BixData products to see if it is an appropriate solution for monitoring our IT infrastructure. So far I'm very impressed with the feature set and the flexibility.

I am running my BixServer (2.7) within an Ubuntu 6.06 LTS Server Edition - VM Guest on my desktop. All 'ServiceChecks' seem to work flawlessly except 'Ping ServiceChecks' I have been playing with the settings and have found using external pings to be more stable, but I still run into trouble when hosts have been flagged as [DOWN] for more than 1 hour. If the connection to the host resumes, the Ping ServiceCheck continues to report that it is down and does not attempt to re-ping hosts marked as [DOWN] at regular intervals.

This is a big problem for my company, as we have several maintenance windows in which certain services are taking off-line for archival purposes.

Thanks,
-John


Submitted by bixdata on April 9, 2008 - 08:24.

Theres two things you can do.

If you setup a ping check for a group called 'Web Servers', if you take a machine offline for an extended period of time, just untag the machine from that 'Web Servers' group by right clicking on it. If you tag it again later, all settings will be restored.

If you setup your ping check for less than an hour, say 10 minutes, Bix should pickup the host coming back online.


Submitted by jnicpon on April 9, 2008 - 13:26.

Thanks!

=====
John Nicpon, Manager
Server Operations Center
Nevada State College