[PVFS-users] Re: PVFS access hangs
Rob Ross
rross at mcs.anl.gov
Tue Jul 20 11:10:18 EDT 2004
Hi Murali,
I've fixed the initialization of lost_data; thanks!
Rob
On Fri, 16 Jul 2004, Murali Vilayannur wrote:
> Hi Rob,
>
> Attached is a patch that fixes some I/O errors that I get when using PVFS1
> from the latest CVS. It is somewhat related to the error scenario that you
> mention below.
> The problem was that in function do_access() in iod/jobs.c, the variable
> lost_data is not initialized to 0, hence bad things happen below, when
> a_p->u.rw.off et. al are updated. the job layer function i.e do_job()'s
> return value is then interpreted as socket errors and the client socket is
> closed. Hence clients see EPIPE's.
>
> In addition, the patch addresses some more cleanups that bring the
> kpvfsd
> code in synch. with some more recent changes that have come up recently.
> Hopefully, this will fix some of the errors that Brannen is noticing.
> Thanks,
> Murali
>
> > That "empty job" message occurs when a client asks for a region of a file
> > and the server has no data in that region -- a common case would be trying
> > to read beyond EOF.
> >
> > Are you sure that those messages are from the "cat" case below? If they
> > are, then they imply that some data was in fact getting to the iods from
> > the client node. But my first guess would be that that isn't happening.
> >
> > Another thing that is possible is that somehow the IP addresses of your
> > iods are not quite right? For example, if you listed an address in the
> > .iodtab that is accessible from the local node but not remotely, then you
> > would have issues. Can you send your .iodtab to us? The pvfs-ping output
> > too, in both cases?
> >
> > > I could use pvfs-ping, pvfs-ls, etc (forgot to try u2p though)
> > > without a problem. After mounting the pvfs filesystem I could use 'ls',
> > > etc. However my test program hangs while trying to write files (though it
> > > can be interrupted). I also tried to 'cat' one of the files, but that hung
> > > also (the last errors above are from that attempt).
> >
> > What does your test program do, and what interface does it use?
> >
> > Please do try u2p also if you have a chance, I'm guessing that it won't
> > work.
> >
> > > I'm using the Jan patch to pvfs-1.6.2, the two patches for redhat to
> > > the kernel code. Not sure what else to try - I could set up one of the
> > > other two machines as the Manager and see what happens then - so far my
> > > experience has been that unless a machine is the Manager node, I cannot
> > > access the pvfs filesystem for read or write even though the utilities work
> > > fine.
> >
> > I'm almost certain that you're seeing either a misconfiguration problem
> > due to odd IP addresses in a PVFS config file, or a routing problem, or a
> > firewall problem.
> >
> > Regards,
> >
> > Rob
> > _______________________________________________
> > PVFS-users mailing list
> > PVFS-users at www.beowulf-underground.org
> > http://www.beowulf-underground.org/mailman/listinfo/pvfs-users
> >
> >
More information about the PVFS-users
mailing list