[PVFS-users] [Fwd: problem writing to one of the disks in a PVFS fs (fwd)]

Phil Carns pcarns at parl.clemson.edu
Tue Sep 14 11:33:16 EDT 2004


I am forwarding along this message from Helen Katz, who is currently 
unable to post to the list; please preserve the cc when responding.

thanks,
-Phil

---------- Forwarded message ----------
Date: Thu, 09 Sep 2004 02:22:04 -0400
From: pvfs-users-owner at beowulf-underground.org
To: helen.katz at weizmann.ac.il
Subject: problem writing to one of the disks in a PVFS fs


Hello,
We are using pvfs-1.5.4 and pvfs-kernel-1.5.4 under RH-7.3
without problems, and we have had several stable PVFS filesystems
for several years.
On the 5th of Sept. I had this problem, which I solved by rebooting
the mgr machine:
In one of the PVFS filesystems we have a problem writing to one of
the disks. The disk /data08 on eio08 is part of the /pvfsa disk,
and it is the only one to which I can't write.
The /pvfsa disk is built from disks on eio05-eio12:
  pvfsa     Size(Kb)         Used         Free   Use%
    0      115377640      5596960    103919768     6% eio05
    1      115380192      6626576    102892580     7% eio06
    2      115377640    104998728      4518000    96% eio07
    3      115380192        65068    109454088     1% eio08
    4      115380192      7469356    102049800     7% eio09
    5      115380192      8845144    100674012     9% eio10
    6      115380192      6037972    103481184     6% eio11
    7      115345508     10353536     99132668    10% eio12
When I use the u2p command to write directly to a specific disk
in that PVFS, I get this reply:
---------------------------------------------------------
[physdsk2/fhkatzh1]~ [114] u2p -b 3 u2p.zsh /pvfsa/zgrp/fhkatzh1/u2p.zsh3
add_accesses: connect: Connection refused
pvfs_write: build_rw_jobs failed
pvfs_write: short write
1 node(s); ssize = 65536; buffer = 3165; 9.896MBps (3165 bytes total)
build_simple_jobs: connect: Connection refused
pvfs_close: build_simple_jobs failed

The file exists with the length of zero.

When I use the 'pvstat' command I get the following:
[physdsk2/fhkatzh1]~ [130] pvstat /pvfsa/zgrp/fhkatzh1/u2p.zsh3
/pvfsa/zgrp/fhkatzh1/u2p.zsh3: base = 3, pcount = 1, ssize = 65536
build_simple_jobs: connect: Connection refused
pvfs_close: build_simple_jobs failed

while for files on other disks in the same PVFS it reports correctly:
[physdsk2/fhkatzh1]~ [137] pvstat /pvfsa/zgrp/fhkatzh1/u2p.zsh2
/pvfsa/zgrp/fhkatzh1/u2p.zsh2: base = 2, pcount = 1, ssize = 65536
----------------------------------------------------------
When I use any other '-b X' value except 3, the write is performed
without problems.
Can you please suggest where to look to resolve this
'Connection refused'?
Thank you,
   Helen
helen.katz at weizmann.ac.il
Particle Physics Dept.
Weizmann Institute of Science
Tel:  972-08-9342629
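
One way to narrow down a 'Connection refused' like the one above is to
check whether each I/O node accepts TCP connections on the iod port.
The following is only a minimal sketch: the host names are taken from
the table in the message, while the port number (7000) and the probe()
helper are assumptions for illustration, so substitute the port set in
the site's iod.conf.

```python
import socket

# Host names come from the /pvfsa table in the message.
IOD_HOSTS = ["eio05", "eio06", "eio07", "eio08",
             "eio09", "eio10", "eio11", "eio12"]

# ASSUMPTION: 7000 is used here as the iod listening port; verify it
# against the "port" setting in iod.conf on the I/O nodes.
IOD_PORT = 7000


def probe(host, port, timeout=3.0):
    """Try a TCP connect and return 'open', 'refused', or 'error: ...'."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        # The host answered, but nothing is listening on that port.
        return "refused"
    except OSError as exc:
        # Covers timeouts, name-resolution failures, unreachable hosts.
        return f"error: {exc}"


if __name__ == "__main__":
    for host in IOD_HOSTS:
        print(f"{host}:{IOD_PORT} {probe(host, IOD_PORT)}")
```

If only eio08 reports "refused", that would suggest no process is
listening on that port on eio08, which is consistent with the errors
shown for '-b 3'.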



