1 |
On Monday, 16 October 2017 16:12:53 BST Alan McKinnon wrote: |
2 |
> On 16/10/2017 17:08, Ian Zimmerman wrote: |
3 |
> > On 2017-10-16 14:11, Alan McKinnon wrote: |
4 |
> >> My needs here are pretty simple: |
5 |
> >> local watchdog that checks if a program is running and restart it if |
6 |
> >> not. If that fails 3 times or so, alert me. |
7 |
> >> Maybe a few file/dir/fifo monitors as well. Not much else. |
8 |
> >> |
9 |
> >> I don't need any of monit's graphing features or M/monit, I have other |
10 |
> >> tools for that. And mostly don't even need it's http API either. |
11 |
> > |
12 |
> > supervisor (aka supervisord) |
13 |
> > |
14 |
> > http://supervisord.org/ |
15 |
> > |
16 |
> > python based, not sure if that's okay with you |
17 |
> |
18 |
> I forgot about supervisord. Like monit, it runs everywhere and might be |
19 |
> easier for the team-mates to understand and work with. |
20 |
> |
21 |
> Python is not a problem, all these hosts are ansible-managed anyway, so |
22 |
> they all have to run python-2.7 |
23 |
> |
24 |
> Good find, thanks! |
25 |
|
26 |
I've used Nagios in the past, but have not kept up with its development and |
27 |
the many plugins it provides. It could do any of the above tasks and much |
28 |
more. It can run scripts (perl, or bash) via daemons (nrpe) on the remote |
29 |
systems to restart applications, et al. The Nagios server possessed the |
30 |
ability to set up quite intelligent monitoring and alert hierarchies with |
31 |
multilayered comms structures to make sure you are not woken up at 2 a.m. by |
32 |
your boss, just because a ping failed to his home NAS. I also found the logs |
33 |
which can be also stored on SQL quite useful both in troubleshooting problems |
34 |
and in producing reports. It can monitor network connectivity, remote OS |
35 |
parameters and applications. Writing your own plugin/module to monitor quite |
36 |
specialised use cases is not particularly difficult either. |
37 |
|
38 |
I expect you may find Nagios more complicated to set up than monit, at least |
39 |
initially, but if you don't have the luxury of time to invest on setting up |
40 |
Nagios monit may be a better fit. I don't have in depth experience with other |
41 |
monitoring software to comment, so something else may suit better your |
42 |
specific needs. |
43 |
-- |
44 |
Regards, |
45 |
Mick |