Newsgroups: comp.parallel.mpi From: Miranda Barrows Subject: Problem with timeouts Organization: Draper Labs Date: Fri, 29 May 1998 16:33:52 GMT Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <356EE36A.6A36@draper.com> I got the following error message running MPICH on an SGI: Timeout in waiting for processes to exit. This may be due to a defective rsh program (Some versions of Kerberos rsh have been observed to have this problem). This is not a problem with P4 or MPICH but a problem with the operating environment. For many applications, this problem will only slow down process termination. I am running a program with seven processes, each of which runs for about one hour, with little communication. After getting this message, MPI appears to exit, but a ps command reveals that several processes continue to run for several minutes. Can anyone tell me what is causing this message and what it means? Miranda Barrows