[Pvfs2-developers] PVFS2 Removal of large files
Phil Carns
carns at mcs.anl.gov
Tue Oct 14 16:36:02 EDT 2008
Thanks David, this looks good. I committed the patch to our cvs trunk
so that it can get some mileage in nightly testing.
At some point before releasing this we will need a way to clean up the
stranded bstreams directory if the server dies before the files are
unlinked. That may just be a matter of adding files in that stranded
directory to the unlink queue on startup so that the server can work on
them in the background.
Here is a little more information about #3 that David mentioned in case
anyone is interested. The remove stall pointed out a flaw in the BMI
reference counting that at least led to some confusing log messages.
I'm not sure if there were any more serious side effects or not:
http://www.pvfs.org/fisheye/changelog/PVFS/?cs=MAIN:pcarns:20081008183827
-Phil
David Metheny wrote:
> XFS might be a good change for us in the future, thanks for the suggestion.
> For me, it won't be an option I can introduce immediately.
>
> I've created a patch that basically starts a persistent thread to take care
> of 'unlink' requests. This is patched against the 2.6.3 release.
>
> The unlink now renames the file (puts it in the stranded-bstreams dir), adds
> it to a shared queue, signals the unlink thread to delete it, and returns..
>
> There were a several side effects that the block on deletion caused us.
> 1) the pvfs2-check-server called via heartbeat would timeout, getting caught
> behind the remove, and would send heartbeat STONITH'ing away. After STONITH
> on a 'rm' command, that sent our 3+TB devices into fsck for a bit, and
> client jobs canceled away.
> 2) Other accesses, like 'ls' and reads/writes also queued behind deletes,
> and then they started canceling as well.
> 3) Once the BMI jobs started timing out and retries sent, there were some
> cases where canceling those jobs didn't quite stop accesses to the jobs
> memory. Phil Carns sent me a patch to fix this.
>
> I've ran a small sample of tests on a single PVFS2 server with a separate
> machine for the PVFS2 client. Both machines are running RHEL4 U7 i386 WS.
> The short version is that there was little affect on current performance for
> small files, and a large performance increase from a clients perspective for
> large files.
>
> +-----------+-----------+--------------+----------------+
> + Num Files + File Size + Time (stock) + Time (threaded)+
> +-----------+-----------+--------------+----------------+
> + 10 + 0 bytes + 0.55 (secs) + 0.51 (secs) +
> +-----------+-----------+--------------+----------------+
> + 500 + 100 bytes + 35 (secs) + 33 (secs) +
> +-----------+-----------+--------------+----------------+
> + 10,000 + 100 bytes + 728 (secs) + 702 (secs) +
> +-----------+-----------+--------------+----------------+
> + 10 + 500 MB + 7 (secs) + 0.73 (secs) +
> +-----------+-----------+--------------+----------------+
>
>
> -----Original Message-----
> From: Emmanuel Florac [mailto:eflorac at intellique.com]
> Sent: Friday, October 03, 2008 1:50 AM
> To: david.metheny at gmail.com
> Cc: pvfs2-developers at beowulf-underground.org
> Subject: Re: [Pvfs2-developers] PVFS2 Removal of large files
>
> Le Thu, 2 Oct 2008 16:57:19 -0500 vous écriviez:
>
>> What are you thinking along the lines of tuning the client side?
>
> I'd strongly encourage you NOT to use ext3. ext3 is by far the slowest
> filesystem under Linux, and it behaves very poorly on big filesystems.
> I'd definetely go for XFS ( or JFS ) on the underlying device.
>
>
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> Pvfs2-developers mailing list
> Pvfs2-developers at beowulf-underground.org
> http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers
More information about the Pvfs2-developers
mailing list