[Pvfs2-users] MX help

Bradley Settlemyer bradles at parl.clemson.edu
Wed Mar 4 19:15:24 EST 2009


Hello

  I am trying to use PAV to run pvfs with the MX protocol.  I've
updated pav so that servers start and ping correctly.  But when I try
and run an mpi code, I'm getting client timeouts like the client
cannot contact the servers:

Lots of this stuff:

[E 19:11:02.573509] job_time_mgr_expire: job time out: cancelling bmi
operation, job_id: 3.
[E 19:11:02.583659] msgpair failed, will retry: Operation cancelled
(possibly due to timeout)


I have no problem acknowledging that I've done something wrong, but I
don't know how to debug MX at all.  Any pointers to at least get me
started?

Cheers,
brad


More information about the Pvfs2-users mailing list