[CIG-LONG] problem with gale on a cluster

js at cp.dias.ie js at cp.dias.ie
Mon Dec 10 12:50:49 PST 2007



Hello,
	firstly, I would like to thank you for your efforts in
releasing and documenting Gale.

	I recently downloaded both the stable (121) and svn versions
and successfully ran the extension.xml model included in stable on
both versions with a single processor.

	I have just tried to run the same model on four processors,
using gale_svn and it crashed about half way through with the following
message,

1: Something went horribly wrong in _PCDVC_Calculate2D: Problem has an
under resolved cell (Cell Id = 20), check or tune your population control
parameters
Gale: build/StGermain/Base/IO/src/Journal.c:603: Journal_Firewall:
Assertion `expression' failed.
[cli_0]: aborting job:
application called MPI_Abort(MPI_COMM_WORLD, 59) - process 0
[cli_2]: aborting job:
application called MPI_Abort(MPI_COMM_WORLD, 59) - process 2
[cli_3]: aborting job:
application called MPI_Abort(MPI_COMM_WORLD, 59) - process 3
mpiexec: Warning: tasks 0,2-3 exited with status 59.
mpiexec: Warning: task 1 died with signal 6 (Aborted).

Do I need an alternative input file for clusters?

I would appreciate any thoughts you may have on this.

All the best,

john.



More information about the CIG-LONG mailing list