[Pvfs2-developers] Re: bmi_ib resource constraints with older
hardware
Troy Benjegerdes
troy at scl.ameslab.gov
Thu Mar 13 13:07:14 EST 2008
A better patch which I hope will apply to CVSHEAD is at:
http://www.scl.ameslab.gov/~troy/pvfs/ibv_post_send/retry-ibv_post_send-cvshead.patch
This also includes several other minor changes I have been dragging
along in my tree.
>
> Just hack up anything you like to get it to work. If it fixes the
> situation, we'll go back and clean up the code later.
>
> It is optimistic, what you're trying to do, but I'm not sure if it
> will be sufficient. If there are no credits to get back from
> checking the CQ, you'll just deadlock. I'm also nervous about
> locking implications, as you're checking the CQ in the thread that
> is trying to do the send. Not sure if we have done this before.
>
> A simpler way would be just to just fail whatever operation got us
> into this RDMA, by abandoning it, with another state that says we're
> waiting on credits. An easier first step is just to add lots of
> printfs to track the credits and see if you can correlate a credit
> overflow with the rdma failures. If that works, a check at the top
> of "post rdma" can say whether we should even bother and we won't
> need your fixup step of looking at the CQ from the send.
>
> -- Pete
>
More information about the Pvfs2-developers
mailing list