[PVFS2-developers] metadata record pre-allocation
Phil Carns
pcarns at parl.clemson.edu
Tue Jul 27 16:51:14 EDT 2004
> Yes, that appears to be the problem. The internal db sync appears to
> sync a lot more than the metadata created -- and it's (partially) out
> of our control. Phil ran a test a while back that overloaded the
> fsync calls (with an empty function) inside of the DB code and sure
> enough the performance is heavily improved ... but the data in the
> system was totally unsafe all across the board. We talked for a
> minute about keeping it that way and doing our own syncs, but I think
> if we're going to go through the trouble (while complaining and
> 'fixing' the DB as it gets corrupt at weird times), we've so far
> worked around DB that it might be better to not use it all. Which
As a side note somewhat orthogonal to this conversation, I never did figure
out why calling fsync on a tiny db file clogged the whole server. I would
assume this is dependent on the kernel and underlying fs. It appeared that
an fsync call on any file, no matter how small, resulted in a full sync of
the file system. So for example, if you had just written 100 MB out to disk
but hadn't synced, then the next fsync on a completely unrelated db file
would suddenly pay the cost of hitting disk for 100 MB. It was pretty easy
to see this in action when importing large files.
If the fsyncs truly really worked as advertised and only caused the 4K
database file to be synced, it doesn't seem like these sync stalls would be
as painful. If anyone has any insight into that issue (Is this known linux
behavior? I am off base in expecting otherwise?) I would be curious to hear.
-Phil
More information about the PVFS2-developers
mailing list