[Pvfs2-users] pvfs2 performance with btio

Sam Lang slang at mcs.anl.gov
Tue Jul 17 16:18:13 EDT 2007


On Jul 17, 2007, at 2:26 PM, Murali Vilayannur wrote:

> Hi florin,
> I suspect this might have to do with your platform's AIO libraries
> being broken and us having to work around that.
> Could you try the alt-aio stuff and see if that improves things a bit?
> Sam, there are some config file settings for this, isn't it? I forget
> what we need to modify..

In the <StorageHints> context:

TroveMethod alt-aio

Unless the data he's just written is sitting in the kernel buffers, I  
would expect reads to have the same problem as writes if aio is the  
cause.  What makes you suspect AIO libraries for his platform?

> Would it make any difference if configure works around aio-threaded- 
> callbacks?

Do you mean configure is disabling threaded callbacks for his build,  
or that we should ask it to?  AIO results we've seen without threaded  
callbacks are worse than with them.

>
> I am really tempted to just write our own aio libraries and on-disk
> backends instead of relying on Linux/glibc and ext3..sigh

:-)  I hear you.  Are you running over ext3 Murali?  I've seen  
results that suggest xfs might be better for large IOs and multiple  
threads.

-sam

> Thanks,
> Murali
>
> On 7/17/07, Florin Isaila <florin.isaila at gmail.com> wrote:
>> Hi Sam, we start the pvfs2 servers on different machines than the
>> compute nodes (picking the nodes from the list provided by the batch
>> system). Was that your question?
>>
>> And I should have said, all the measurements are done with collective
>> I/O of ROMIO.
>>
>> On 7/17/07, Sam Lang <slang at mcs.anl.gov> wrote:
>> >
>> > Ah, I read your email wrong.  Hmm...so writes really tank.  Are you
>> > using the storage nodes as servers, or other compute nodes?
>> >
>> > -sam
>> >
>> > On Jul 17, 2007, at 11:15 AM, Sam Lang wrote:
>> >
>> > >
>> > > Hi Florin,
>> > >
>> > > Just one clarification question...are those are bandwidth numbers
>> > > not seconds as the plot label suggests?
>> > >
>> > > -sam
>> > >
>> > > On Jul 17, 2007, at 11:03 AM, Florin Isaila wrote:
>> > >
>> > >> Hi everybody,
>> > >>
>> > >> I have a question about the PVFS2 write performance.
>> > >>
>> > >> We did some measurements with BTIO over PVFS2 on lonestar at  
>> TACC
>> > >> (http://www.tacc.utexas.edu/services/userguides/lonestar/)
>> > >>
>> > >> and we get pretty bad write results with classes B and C:
>> > >>
>> > >> http://www.arcos.inf.uc3m.es/~florin/btio.htm
>> > >>
>> > >> We used 16 I/O servers, the default configuration parameters  
>> and upto
>> > >> 100 processes. We realized that all I/O servers were used  
>> also as
>> > >> metadata servers, but BTIO uses just one file.
>> > >>
>> > >> The times are in seconds, contain only I/O time (no compute  
>> time) and
>> > >> are aggregated per each BTIO run (BTIO performs several writes).
>> > >>
>> > >> TroveSyncMeta was set to yes (by default). Could this cause  
>> the I/
>> > >> O to
>> > >> be serialized? It looks as if there were a serialization.
>> > >>
>> > >> Or could the fact that all nodes were also launched as metadata
>> > >> managers affect the performance?
>> > >>
>> > >> Any clue why this happens?
>> > >>
>> > >> Many thanks
>> > >> Florin
>> > >> _______________________________________________
>> > >> Pvfs2-users mailing list
>> > >> Pvfs2-users at beowulf-underground.org
>> > >> http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users
>> > >>
>> > >
>> >
>> >
>> _______________________________________________
>> Pvfs2-users mailing list
>> Pvfs2-users at beowulf-underground.org
>> http://www.beowulf-underground.org/mailman/listinfo/pvfs2-users
>>
>



More information about the Pvfs2-users mailing list