[PVFS-users] [Fwd: problem writing to one of the disks in a PVFS fs (fwd)]

Rob Ross rross at mcs.anl.gov
Tue Sep 14 11:32:49 EDT 2004


Hi Helen,

The file existing with a length of zero indicates that the file metadata 
was created but no data was written.

Are there errors in the mgr's logfile?

It appears that for whatever reason your client is unable to get to that 
particular server.

When a file is opened/created, the mgr returns an IP address to the client 
that the client will then use to access that server.  One possibility is 
that the IP address for that server in the .iodtab file is incorrect.  
Another is that the mgr can get to that iod on that address but the client 
cannot (different network connectivity).  Another possibility is that the 
routing was somehow screwed up on the client.  Another options is that a 
firewall is preventing traffic to the server.

Do any of those sound reasonable?

Rob

On Tue, 14 Sep 2004, Phil Carns wrote:

> I am forwarding along this message from Helen Katz, who is currently 
> unable to post to the list; please preserve the cc when responding.
> 
> thanks,
> -Phil
> 
> ---------- Forwarded message ----------
> Date: Thu, 09 Sep 2004 02:22:04 -0400
> From: pvfs-users-owner at beowulf-underground.org
> To: helen.katz at weizmann.ac.il
> Subject: problem writing to one of the disks in a PVFS fs
> 
> 
> Hello,
> We are using pvfs-1.5.4 and  pvfs-kernel-1.5.4 under RH-7.3
> without problems and we have several stable PVFS filesystems,for
> several years .
> In 5'th of Sept. I had this problem, which I solved by rebooting
> the mgr machine:
> In one of the pvfs's we have problem writing to one of the disks.
> The disk /data08 from within eio08 is part of /pvfsa disk,
> and it's the only one to which I can't write.
> The /pvfsa disk is build from disks in eio05-eio12
>   pvfsa     Size(Kb)         Used         Free   Use%
>     0      115377640      5596960    103919768     6% eio05
>     1      115380192      6626576    102892580     7% eio06
>     2      115377640    104998728      4518000    96% eio07
>     3      115380192        65068    109454088     1% eio08
>     4      115380192      7469356    102049800     7% eio09
>     5      115380192      8845144    100674012     9% eio10
>     6      115380192      6037972    103481184     6% eio11
>     7      115345508     10353536     99132668    10% eio12
> When I use u2p command to write directly to a specific disk
> in that pvfs , I got this reply:
> ---------------------------------------------------------
> [physdsk2/fhkatzh1]~ [114] u2p -b 3 u2p.zsh /pvfsa/zgrp/fhkatzh1/u2p.zsh3
> add_accesses: connect: Connection refused
> pvfs_write: build_rw_jobs failed
> pvfs_write: short write
> 1 node(s); ssize = 65536; buffer = 3165; 9.896MBps (3165 bytes total)
> build_simple_jobs: connect: Connection refused
> pvfs_close: build_simple_jobs failed
> 
> The file exists with the length of zero.
> 
> When I use 'pvstat'  command I got the following:
> [physdsk2/fhkatzh1]~ [130] pvstat /pvfsa/zgrp/fhkatzh1/u2p.zsh3
> /pvfsa/zgrp/fhkatzh1/u2p.zsh3: base = 3, pcount = 1, ssize = 65536
> build_simple_jobs: connect: Connection refused
> pvfs_close: build_simple_jobs failed
> 
> while to other disks in the same pvfs it shows correctly:
> [physdsk2/fhkatzh1]~ [137] pvstat /pvfsa/zgrp/fhkatzh1/u2p.zsh2
> /pvfsa/zgrp/fhkatzh1/u2p.zsh2: base = 2, pcount = 1, ssize = 65536
> ----------------------------------------------------------
> When I use other '-b X' except 3, the writing is performed without
> problems.
> Can you ,please, suggest were to look to resolv this
> 'Connection refused' ?
> Thank you,
>    Helen
> helen.katz at weizmann.ac.il
> Particle Physics Dept.
> Weizmann Institute of Science
> Tel:  972-08-9342629
> 
> 
> _______________________________________________
> PVFS-users mailing list
> PVFS-users at www.beowulf-underground.org
> http://www.beowulf-underground.org/mailman/listinfo/pvfs-users
> 
> 


More information about the PVFS-users mailing list