[Pvfs2-users] IOR errors

Murali Vilayannur murali.vilayannur at gmail.com
Thu Apr 5 07:30:59 EDT 2007


Hi Tim,
I don't know if anyone responded to this email or if it got lost..
You could try a couple of  things and also provide some more information,
- Are these Opteron/x86_64 boxes?
- Can you try this out on tcp if possible instead of ib? That will
help us rule out any IB specific oddities?
- writes may have hit ENOSPC on one or more servers.. Would it be
possible to check the amt of available disk space on all the servers?
 I will try to reproduce this on a much smaller run although I doubt
if anything would show up since the nightlies would have got those..
Sorry for not being able to help better..
Thanks,
Murali


On 3/21/07, Carlson, Timothy S <Timothy.Carlson at pnl.gov> wrote:
> Thanks to the folks who helped me out yesterday I got a nice little 2.3T
> pvfs2 (2.6.2) file system. I have 16 nodes that are all acting as I/O
> servers and clients. 1 of those boxes is also the meta data server.  All
> over Topspin IB and I am using all the default setting in my config file
> parameters.
>
> That being said, I wanted to test the bandwidth so I compiled the POSIX
> version of IOR against the Topspin mpich libraries.
>
> My run looks like this.
>
> IOR-2.9.4: MPI Coordinated Test of Parallel I/O
>
> Run began: Wed Mar 21 16:06:04 2007
> Command line used: /home/tim/IOR -i 8 -b 1024m -o /mnt/pvfs2/ior/ior_16g
> Machine: Linux compute-0-15.local
>
> Summary:
>         api                = POSIX
>         test filename      = /mnt/pvfs2/ior/ior_16g
>         access             = single-shared-file
>         clients            = 16 (1 per node)
>         repetitions        = 8
>         xfersize           = 262144 bytes
>         blocksize          = 1 GiB
>         aggregate filesize = 16 GiB
>
> access    bw(MiB/s)  block(KiB) xfer(KiB)  open(s)    wr/rd(s)
> close(s)   iter
> ------    ---------  ---------- ---------  --------   --------
> --------   ----
> write     613.70     1048576    256.00     0.177541   26.43      7.24
> 0
> read      1141.20    1048576    256.00     0.019199   14.34
> 0.329994   0
> write     589.05     1048576    256.00     0.154706   27.74      7.06
> 1
> read      1032.93    1048576    256.00     0.019723   15.84
> 0.417178   1
> write     550.66     1048576    256.00     0.991332   29.58      8.43
> 2
> read      1005.48    1048576    256.00     0.021340   16.28
> 0.448091   2
> write     555.06     1048576    256.00     0.232900   29.48      8.57
> 3
> read      1006.24    1048576    256.00     0.018788   16.27
> 0.263041   3
> WARNING: Expected aggregate file size       = 17179869184.
> WARNING: Stat() of aggregate file size      = 13958643712.
> WARNING: Using actual aggregate bytes moved = 17179869184.
> write     438.87     1048576    256.00     0.238877   37.23      15.80
> 4
> ** error **
> ERROR in aiori-POSIX.c (line 245): hit EOF prematurely.
> ERROR: Success
> ** exiting **
> ** error **
> ERROR in aiori-POSIX.c (line 245): hit EOF prematurely.
>
>
> I would say that the performance is quite good until I get to those
> errors. Nothing interesting in the client or server logs. Something in
> my IOR setup that might be stressing things a bit too hard?
>
> Thanks for any insights.
>
> Tim
>
>
> Tim Carlson
> Voice: (509) 376 3423
> Email: Tim.Carlson at pnl.gov
> Pacific Northwest National Laboratory
> HPCaNS: High Performance Computing and Networking Services
>
> _______________________________________________
> Pvfs2-users mailing list
> Pvfs2-users at beowulf-underground.org
> http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users
>


More information about the Pvfs2-users mailing list