[Pvfs2-users] Question about redundancy
Erich Weiler
weiler at soe.ucsc.edu
Mon Apr 30 12:40:30 EDT 2007
IBM's GPFS has done this quite nicely with primary and redundant server
disks, also they use a concept called 'failover' groups that provide
backups for nodes with common failure points. It's a sort of
replication technique, not exactly a RAID 5 type of redundancy but it
works. I understand this kind of thing is not trivial to code; but
conceptually it seems doable.
-erich
Rob Ross wrote:
> Hi Steve,
>
> We get this question a lot.
>
> Software redundancy in a parallel file system is a very challenging
> problem, particularly to provide efficient access at the same time.
>
> The group at Clemson has been looking into this as a research project,
> and I believe that others have as well. If a group creates a solution
> that performs well, reliably operates, and fits into the rest of the
> PVFS system, then we would certainly consider integrating it into the
> production releases. This hasn't happened so far...
>
> Regards,
>
> Rob
>
> Steve wrote:
>> Is built in redundancy planned ? Or not in the scope of the project ?
>>
>> Steve
>>
>> Trusting my 1.1Tb to the reliability of my drives, and touch wood in 20
>> years of computing had never had a drive fail. Now ive just put a
>> curse on
>> them!
>>
>> -------Original Message-------
>> From: Robert Latham Date: 24/04/2007 14:14:13 To: Erich Weiler Cc:
>> pvfs2-users at beowulf-underground.org Subject: Re: [Pvfs2-users]
>> Question about redundancy
>> On Mon, Apr 23, 2007 at 05:03:39PM -0700, Erich Weiler wrote:
>>> I need to be clear on this before putting a lot of time into it, but
>>> it sounds like this might be a good solution for our firm, as we have
>>> a 200 node cluster each with one 500GB disk, 400GB of which can be
>>> leveraged to a massive parallel file system (400GB x 200 nodes = one
>>> big ~80TB distributed file system). But that assumes that there is no
>>> redundancy, other wise that 80TB would be more like 50-60TB max or
>>> something because there would be some redundancy in there... ?
>>
>> Murali's explanation is spot-on: no software-based reduncancy scheme.
>> For users concerned with redundancy, we suggest hardware failover to
>> Shared storage, which works quite well.
>> ==rob
More information about the Pvfs2-users
mailing list