[Pvfs2-developers] Re: openib-vfs failure

Kyle Schochenmaier kschoche at scl.ameslab.gov
Fri Feb 8 16:55:31 EST 2008


Pete Wyckoff wrote:
> kschoche at gmail.com wrote on Wed, 06 Feb 2008 14:14 -0600:
>   
>> I applied your patch, and got the following immediately on some io:
>>
>>
>> [E 02/06 14:12] Error: openib_check_cq: unknown opcode 11171.
>> [E 02/06 14:12]         [bt] bin/sbin/pvfs2-server(error+0xca) [0x4293ba]
>> [E 02/06 14:12]         [bt] bin/sbin/pvfs2-server [0x429f9b]
>> [E 02/06 14:12]         [bt] bin/sbin/pvfs2-server [0x425dc3]
>> [E 02/06 14:12]         [bt] bin/sbin/pvfs2-server [0x428129]
>> [E 02/06 14:12]         [bt] bin/sbin/pvfs2-server(BMI_testunexpected+0x19f) [0x424d2f]
>> [E 02/06 14:12]         [bt] bin/sbin/pvfs2-server [0x4397a7]
>> [E 02/06 14:12]         [bt] /lib/libpthread.so.0 [0x2ba396d2df1a]
>> [E 02/06 14:12]         [bt] /lib/libc.so.6(__clone+0x72) [0x2ba39711c602]
>>     
>
> That "can't happen".  My teensy patch didn't get anywhere near
> there.  It just changes some printfs and adds an extra test in the
> RTS checking.  Are you sure everything in the hardware is still
> working?  And you recompiled okay?  Maybe yank out the printfs in
> case there is some memory corruption going on somewhere.
>
> 		-- Pete
>   
Looks like your patch works for me. I recently identified some hardware
failures and removed the faulty hardware from the test scenario, and
things are looking good now: I haven't had a pvfs2 failure yet!

Thanks again Pete!

~Kyle

-- 
Kyle Schochenmaier
kschoche at scl.ameslab.gov
Research Assistant, Dr. Brett Bode
AmesLab - US Dept.Energy
Scalable Computing Laboratory 


