[lwhatley.ctr@navo.hpc.mil: Re: [Pvfs2-developers]
Re: [Pvfs2-users] PVFS2 on Infiniband]
Murali Vilayannur
vilayann at mcs.anl.gov
Thu May 25 16:14:12 EDT 2006
Hi Lee,
Does the kernel that you are running support NPTL threads at all?
I recall that using LinuxThreads on Opteron x86_64 is strongly discouraged.
Alternatively, maybe NPTL is the problem on outdated 2.4 kernels. Does
the RHEL3 update include the futex bug fix reported here?
http://www.uwsg.iu.edu/hypermail/linux/kernel/0409.0/2044.html
I am shooting arrows in the dark here :)
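A quick way to answer the first question is to ask glibc which thread
library it is using. This is only a sketch: it assumes a glibc-based
system whose getconf knows the GNU_LIBPTHREAD_VERSION variable, and the
"threadlib" variable name is just for illustration.

```shell
# Ask glibc which pthread implementation it is using.
# Falls back to "unknown" if getconf is missing or does not know the variable.
threadlib=$(getconf GNU_LIBPTHREAD_VERSION 2>/dev/null || echo unknown)

case "$threadlib" in
    NPTL*)         echo "NPTL in use: $threadlib" ;;
    linuxthreads*) echo "LinuxThreads in use: $threadlib" ;;
    *)             echo "could not determine thread library: $threadlib" ;;
esac
```

On systems that ship both libraries, the result can depend on
environment settings such as LD_ASSUME_KERNEL, so it is worth running
this in the same environment that starts the pvfs2 server.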
Thanks,
Murali
On Thu, 25 May 2006, Lee Whatley, Contractor wrote:
> Pete Wyckoff wrote:
> > I would like to get the full list involved. Can you make a more
> > concise trace for them. Do something like the following:
> >
> > cd /tmp
> > service pvfs2 start
> > /usr/local/bin/pvfs2-set-debugmask -m /u/data1 verbose
> > pvfs2-mkdir /u/data1/foo
> > /usr/local/bin/pvfs2-set-debugmask -m /u/data1 none
> > service pvfs2 stop
>
> Hey Pete,
>
> I'm attaching the log generated when running those commands to this email.
>
> > Can you also give me:
> >
> > - the ./configure line you used to build, including any CC= or
> > CFLAGS= that may have been set in your environment
> > - gcc --version
> > - uname -a
> > - cat /etc/redhat-release (if you have one, or equivalent)
> > - /lib/libc.so.6 (makes it say its version number, or equivalent)
>
> ./configure --enable-redhat24 --disable-nptl-workaround
> --with-kernel24=/usr/src/linux-2.4
> --with-ib=/usr/local/ibgd/driver/infinihost/
>
> gcc (GCC) 3.2.3 20030502 (Red Hat Linux 3.2.3-54)
>
> Linux scn1 2.4.21-40.ELsmp #1 SMP Thu Feb 2 22:13:55 EST 2006 x86_64
> x86_64 x86_64 GNU/Linux
>
> Red Hat Enterprise Linux ES release 3 (Taroon Update 7)
>
> libc-2.3.2.so <- Note that this is a 64-bit version so it *should* be
> using the one in /lib64
>
> If you need anything else just let me know.
>
> Thanks again for all of your help with this!
> -Lee
>
> > No clue what is going on at the server for so long, and so evenly a
> > multiple of 10ms, but we'll figure it out. All I can guess now is
> > these 10ms timeout periods in src/io/job/job.c:
> >
> > do_one_work_cycle_all(10);
> >
> > and maybe a broken thread scheduler?
> >
> > -- Pete
> >
>
>
> --
> Lee S. Whatley, Contractor
> NAVOCEANO MSRC
> Lockheed Martin Space Operations - Stennis Programs
> 1001 Balch Boulevard
> Stennis Space Center, MS 39522
> Phone: 228-688-4999 DSN: 828-4999
>
>