[osiris-devel] Re: possible fix for osirismd defunct procs

David Vasil dmvasil at ornl.gov
Mon Jan 8 11:09:56 EST 2007


lemmings wrote:
> On Tue, Jan 02, 2007 at 02:39:32PM -0500, David Vasil wrote:
>> Regarding your patch, what happens if an osirismd scan finishes and
>> before it is able to set received_sigchld to 0 in check_for_signals()
>> another osirismd scan finishes.  Will that second scan become a defunct
>> process since received_sigchld was set to 0 before another wait() occurred?
> 
> The race condition won't cause a defunct process as there is a loop
> around waitpid() _after_ received_sigchld is set to 0.
> 
> e

I've been running your patched version of the defunct osirismd fix for a
week now and defunct processes are still created, but they are cleaned
up after the next sigchld is received by the osirismd scheduling
process.  Is this expected behavior?

It is better than the current 4.2.2 release since the defunct processes
are cleaned up.  It just seems that it would be ideal if the defunct
procs were never created.  Let me know your thoughts; I'm going to test
the build on BSD/Fedora/and WinNT platforms.

-- 
-dave



More information about the osiris-devel mailing list