[PVFS2-developers] Multiple trove implementations
Rob Ross
rross at mcs.anl.gov
Tue Oct 18 11:04:55 EDT 2005
Hi Julian,
Sorry for the delay in response.
We've talked about just this sort of thing before, and Phil had
implemented a different variation. I think it's a good idea.
I agree that were we to have more than one trove implementation actually
being used by people, that we would want something like what you
describe in the config file. It wouldn't hurt to go ahead and implement
such a keyword. TroveModule seems like a good name for it. It would
set a value in the filesystem_configuration structure that could be
passed to map_coll_id_to_method() (which should really be
trove_map_method() or something with a trove prefix, but whatever). For
the moment though, I think we would just ignore the keyword in the CVS
version (since there's only one Trove implementation at this time).
Out of curiosity, what file system are you using for the DBPF and
DBPF-nosync cases?
Looking at Meta-create-1Meta-5Data-1Clients-all.pdf, I see what you're
saying about the metadata dropoff. My guess would be that this is a
Berkeley DB issue? One thing that we haven't yet pursued is tuning
Berkeley DB. We've found some notes on this:
http://pybsddb.sourceforge.net/ref/mp/config.html
On the read/write side of things, it might be more useful to look at the
time rather than the bandwidth. This will more obviously show how much
time is lost in reading from/writing to the disk.
The IO-smallblock-10MB-1Meta-5Data-5Clients-all-Read+Write.pdf results
are confusing, in particular the read performance. Any ideas on why
performance drops as it does?
If you can replicate the situation where PVFS2 is crashing, we would of
course want to fix that! We don't see that sort of thing here...
Thanks!
Rob
More information about the PVFS2-developers
mailing list