[Pvfs2-developers] Re: openib-vfs failure
Kyle Schochenmaier
kschoche at scl.ameslab.gov
Fri Feb 8 16:55:31 EST 2008
Pete Wyckoff wrote:
> kschoche at gmail.com wrote on Wed, 06 Feb 2008 14:14 -0600:
>
>> I applied your patch, and got the following immediately on some io:
>>
>>
>> [E 02/06 14:12] Error: openib_check_cq: unknown opcode 11171.
>> [E 02/06 14:12] [bt] bin/sbin/pvfs2-server(error+0xca) [0x4293ba]
>> [E 02/06 14:12] [bt] bin/sbin/pvfs2-server [0x429f9b]
>> [E 02/06 14:12] [bt] bin/sbin/pvfs2-server [0x425dc3]
>> [E 02/06 14:12] [bt] bin/sbin/pvfs2-server [0x428129]
>> [E 02/06 14:12] [bt] bin/sbin/pvfs2-server(BMI_testunexpected+0x19f) [0x424d2f]
>> [E 02/06 14:12] [bt] bin/sbin/pvfs2-server [0x4397a7]
>> [E 02/06 14:12] [bt] /lib/libpthread.so.0 [0x2ba396d2df1a]
>> [E 02/06 14:12] [bt] /lib/libc.so.6(__clone+0x72) [0x2ba39711c602]
>>
>
> That "can't happen". My teensy patch didn't get anywhere near
> there. It just changes some printfs and adds an extra test in the
> RTS checking. Are you sure everything in the hardware is still
> working? And you recompiled okay? Maybe yank out the printfs in
> case there is some memory corruption going on somewhere.
>
> -- Pete
>
Looks like your patch works for me. I recently identified some hardware
failures and removed the faulty hardware from the test setup, and things
are looking good now: I haven't had a pvfs2 failure yet!
Thanks again, Pete!
~Kyle
--
Kyle Schochenmaier
kschoche at scl.ameslab.gov
Research Assistant, Dr. Brett Bode
AmesLab - US Dept.Energy
Scalable Computing Laboratory