[PVFS2-CVS] commit by robl in pvfs2-1/doc: pvfs2-faq.tex
CVS commit program
cvs at parl.clemson.edu
Tue May 4 15:54:50 EDT 2004
Update of /projects/cvsroot/pvfs2-1/doc
In directory acid:/tmp/cvs-serv29521/doc
Modified Files:
pvfs2-faq.tex
Log Message:
document performance issues and ROMIO
Index: pvfs2-faq.tex
===================================================================
RCS file: /projects/cvsroot/pvfs2-1/doc/pvfs2-faq.tex,v
diff -u -w -p -u -r1.1 -r1.2
--- pvfs2-faq.tex 29 Apr 2004 22:50:58 -0000 1.1
+++ pvfs2-faq.tex 4 May 2004 18:54:49 -0000 1.2
@@ -204,10 +204,47 @@ archived and that everyone has a chance
This section covers issues related to the performance of PVFS2.
-\subsection{I ran Bonnie and the performance is terrible. Why? Is there
- anything I can do?}
+\subsection{I ran Bonnie and/or IOzone and the performance is terrible.
+Why? Is there anything I can do?}
+\label{sec:badperf}
+
+We designed PVFS2 to work well for scientific applications in a cluster
+environment. In such an environment, a file system must either spend time
+ensuring all client-side caches are in sync or not use a cache at all. The
+\texttt{bonnie} and \texttt{bonnie++} benchmarks read and write very small
+blocks -- on the order of 1K. These many small requests must travel from the
+client to the server and back again. Without client-side caching, there
+is no sane way to speed this up.
+
+To improve benchmark performance, specify a bigger block size. PVFS2
+has several more agressive optimizations that can be turned on, but
+those optimizations require that applications accessing PVFS2 can cope
+with out-of-sync caches.
+
+In the future, PVFS2 is looking to provide optional semantics for use
+through the VFS that will allow some client-side caching to speed these
+kinds of serial benchmarks up. By offering a way to explicitly sync
+data at any given time or by providing 'close-to-open' semantics, these
+kinds of caching improvements become an option for some applications.
+
+Bear in mind that benchmarks such as IOzone and Bonnie were meant to
+stress local file systems. They do not accurately reflect the types of
+workloads for which we designed PVFS2. Furthermore, because of their
+serial nature, PVFS2 will be unable to deliver its full performance.
+Try running a parallel file system benchmark like IOR
+(\url{http://www.llnl.gov/asci/purple/benchmarks/limited/ior/}).
+
\subsection{Why is program XXX so slow?}
+See question \ref{sec:badperf}. If the program uses small block sizes to
+access a PVFS2 file, performance will suffer.
+
+The \texttt{TroveSyncMode}
+config file option can improve performance in some situations, but if the
+server dies, you will lose data. PVFS2 has an Attribute Cache, which can
+speed up applications which read a lot of attributes (\texttt{ls}, for
+example). Playing around with the \texttt{AttrCache*} config file settings
+might yield some performance improvements, or it might not.
%
% REDUNDANCY
@@ -258,9 +295,8 @@ This section covers issues related to ac
\subsection{How do I get MPI-IO for PVFS2?}
The ROMIO MPI-IO implementation, as provided with MPICH2 and others, supports
-PVFS2.
-
-\emph{Add pointer to notes on installing?}
+PVFS2. You can find more information in the ROMIO section of the
+pvfs2-quickstart: \url{http://www.pvfs.org/pvfs2/pvfs2-quickstart.html\#sec:romio}
\subsection{Can I directly manipulate PVFS2 files on the PVFS2 servers
without going through some client interface?}
More information about the PVFS2-CVS
mailing list