[OGo-Users] Webui killed by watchdog?
Helge Hess
users@opengroupware.org
Tue, 12 Feb 2008 22:43:27 +0100
On 12.02.2008, at 16:02, Albrecht Dreß wrote:
> Well, I have a script, called by cron, which reads using wget the
> login page, and if this fails, calls the init script with restart.
OK, so this is what triggers the TERM signal.
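For readers who want to reproduce such a check, here is a minimal sketch of the cron-driven test described above, written in Python instead of the wget call from the original script. The URL and the init script path are hypothetical placeholders, and the 30-second timeout is an arbitrary assumption. None of these values come from the OGo documentation.

```python
import http.client
import subprocess
import urllib.error
import urllib.request

# Hypothetical values, adjust to the local installation.
LOGIN_URL = "http://localhost/OpenGroupware"
INIT_SCRIPT = "/etc/init.d/ogo"


def login_page_ok(url, timeout=30.0):
    """Return True if the page answers with HTTP 200 within `timeout` seconds."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, http.client.HTTPException, OSError, ValueError):
        # Connection refused, timeout, HTTP error status, or malformed URL.
        return False


def check_and_restart(url=LOGIN_URL, init_script=INIT_SCRIPT,
                      timeout=30.0, run=subprocess.run):
    """Call '<init_script> restart' if the login page fails.

    Returns True if a restart was requested, False otherwise.
    """
    if login_page_ok(url, timeout=timeout):
        return False
    run([init_script, "restart"], check=False)
    return True


if __name__ == "__main__":
    check_and_restart()
```

Because the restart goes through the init script, the web UI process receives a TERM signal each time the check fails, which matches the log entries discussed in this thread.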
> It sounds as if this is not the right approach, as the watchdog is
> supposed to restart the webui automagically. Is that true?
Auto-Restart on crashes, yes.
> How does it detect any "hangs"?
No, it can't detect hangs. I think snsd can, but won't restart a
process either.
So your script is quite fine. So back to your original questions:
> Questions:
> - what is the reason for the watchdog timeout?
There is no timeout, the TERM signal is triggered by your script.
> I /might/ be related by OgO trying to contact an *extremely* slow M$
> Exchange server which sometimes needs 2 minutes (!!) to return a
> message list (this software is complete crap!). Any way to increase
> this timeout?
I think the socket timeouts are hardcoded, you might be able to change
them in the sources and recompile.
BUT: this only makes sense if you run multiple OGo processes anyways.
Otherwise your slow server will block all other sessions.
> - if the watchdog kills a process, is it possible to restart it
> immediately?
The watchdog only kills a process when it's requested to do so.
Hm, maybe your script does an 'ogo stop' instead of 'ogo restart'? Or
do you use kill in your script? In this case sending HUP might make
Hector restart the child process.
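As an illustration of the HUP variant, the following sketch reads a process ID from a pid file and sends SIGHUP instead of TERM. The pid file location is a hypothetical placeholder. Which process should receive the signal (the watchdog or its child) is not stated here, so treat that as an assumption to verify on the actual installation.

```python
import os
import signal

# Hypothetical pid file, the real location depends on the installation.
PID_FILE = "/var/run/ogo-webui.pid"


def read_pid(pid_file=PID_FILE):
    """Return the process ID stored as a decimal number in `pid_file`."""
    with open(pid_file) as fh:
        return int(fh.read().strip())


def send_hup(pid, kill=os.kill):
    """Send SIGHUP (not SIGTERM) to the process with the given pid."""
    kill(pid, signal.SIGHUP)


if __name__ == "__main__":
    send_hup(read_pid())
```

The `kill` parameter only exists so the function can be exercised without signalling a real process.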
Greets,
Helge
-- 
Helge Hess
http://www.helgehess.eu/