[Bug-openmcl] Debugging crashes of dppccl

Erik Pearson erik at adaptations.com
Thu Mar 4 16:21:51 MST 2004


Is there a procedure for compiling dppccl so that it will produce useful 
information upon crashing? I'm testing an app that runs continuously (i.e. 
a server), waking up a process every few minutes to run a set of registered 
functions. It is doing website monitoring, so each function goes out, 
fetches a web page, looks at it, and saves the results. The server then 
will write to a file or send an email alert.

After this process repeats about 250-300 times (at an interval of 15 
seconds for testing...) dppccl will just up and die, printing no messages, 
and returning to the shell, and leaving behind no core file. Well, that is 
not quite true. The last time this happened the message "Abort trap" was 
printed.

This is running under the bleeding edge, checked out a couple of days ago 
with a few manual tweaks to the threading code.

This just go away if it is just Gary fiddling with something. But in case 
not, or in any case, I'm wondering how to get diagnostic information on the 
cause of such crashes. I've looked _very_ briefly at gdb, and had it set up 
to monitor the dppccl process, but it caused dppccl to hang after a few 
minutes -- dppccl continued just fine after gdb was exited.

I'm also going to be looking at different methods of exercising the code 
and isolating the problem, but this is made difficult since it happens 
after many iterations, and the code that is being run is pretty "wild" in 
that it interacts with the network layer in multiple ways involving threads 
and timeouts. (e.g. I've run the code also with > 700 iterations when 
monitoring a simple, close website which has never timed out and which is 
on the local subnet.)

Thanks,

Erik.

--
Erik Pearson
Adaptations
"Adaptation: It's not just for finches anymore."


More information about the Bug-openmcl mailing list