[Pvfs2-developers] ncache causes shared creat problems
Phil Carns
pcarns at wastedcycles.org
Mon Aug 28 13:02:09 EDT 2006
Pete Wyckoff wrote:
> The simul code, test #14, does a shared create: all processes
> try to do "creat(file, 0644)" at the same time through the VFS.
> There is no O_EXCL, so what should happen here is that they all
> succeed, although under the hood, all but one will probably have
> to unwind the SYS_CREATE when they notice that the dirent already
> exists from another process.
>
> This used to work just fine. With the addition of the ncache code
> to pvfs2-client, I'm guessing, things break. The test works again
> if I add "-n 0" to the pvfs2-client command line.
>
> My setup is all x86_64. Two IO servers, one of which does MD too.
> Two other nodes as clients, running:
>
> mpiexec -pernode -np 2 $simul/simul -d /pvfs-ib -i 14 -n 200 -N 1
>
> eventually one will fail, usually around the second iteration, with
>
> 14:46:54: Process 1(ib26): FAILED in simul_creat, creat failed:
> No such file or directory
>
> Does anybody know the ncache code well enough to figure this out?
> I find the -EEXIST fixup code in client-core, but can't see what
> kind of ncache invalidation should presumably happen around there.
>
> -- Pete
What happens on each iteration? Does the code at some point delete a
file with a particular name and then create a new one with the same name?
-Phil
More information about the Pvfs2-developers
mailing list