<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://ideas.4brad.com" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>Brad Ideas - Slow RAID for the home network - Comments</title>
 <link>http://ideas.4brad.com/node/300</link>
 <description>Comments for &quot;Slow RAID for the home network&quot;</description>
 <language>en</language>
<item>
 <title>How fast?</title>
 <link>http://ideas.4brad.com/node/300#comment-2366</link>
 <description>&lt;p&gt;How fast do you think it could be with separate gigabit NIC to avery node? Every node fast computer, host (server) core 2 duo. 100GBps? More? What about latency? and the worst = is it possible to connect more hosts?&lt;/p&gt;
</description>
 <pubDate>Wed, 27 Sep 2006 19:09:32 -0700</pubDate>
 <dc:creator>Petrik</dc:creator>
 <guid isPermaLink="false">comment 2366 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>What about RAID-6</title>
 <link>http://ideas.4brad.com/node/300#comment-1097</link>
 <description>&lt;p&gt;With RAID-6 you can have TWO disks go down and your data is still accessible.  If sharing a networked raid array with my roommates / neighbors, I&#039;d certainly want this.&lt;/p&gt;
</description>
 <pubDate>Sat, 31 Dec 2005 16:17:07 -0800</pubDate>
 <dc:creator>Anonymous</dc:creator>
 <guid isPermaLink="false">comment 1097 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>Write/Read Issues Resolved</title>
 <link>http://ideas.4brad.com/node/300#comment-957</link>
 <description>&lt;p&gt;Let&#039;s look at this conundrum, which is flawless management of files across computers. The confusion over where data gets stored and fetched from can be resolved owing to remote login. Assuming we speak of high-bandwidth networks here (100Mbit LAN in my case), the &#039;cost&#039; of using SSH bandwidth is almost negligible. Even 10Mbit connections would give you a very responsive UI behaviour over the network. I know this because I tried it. The exception are cases where you stream many large frames over the network (e.g. video, games).&lt;/p&gt;
&lt;p&gt;I have never looked deep into RAID technologies. Why not just scp (or rsync) the entire content of a hard-drive periodically. I maintain files on just a single machine, which I always SSH to from elsewhere. Its contents get mirrored on two other remote machines. Files that change on a daily basis are backed on the SAN overnight using cron jobs. This simplifies life a great deal. I still see no compelling reason for using RAID, especially with the existence high-speed networks.&lt;/p&gt;
</description>
 <pubDate>Sat, 26 Nov 2005 00:03:22 -0800</pubDate>
 <dc:creator>Roy Schestowitz</dc:creator>
 <guid isPermaLink="false">comment 957 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>Backup is good</title>
 <link>http://ideas.4brad.com/node/300#comment-954</link>
 <description>&lt;p&gt;Yes, the continual backup would be good, as is raid-1.   Raid-5 has the advantage of using less disk.  Ie. with 5 200gb drives in raid-5 you get 800gb of storage.   With raid-1 or other plain backup, then not counting compression, you get only 500gb.&lt;/p&gt;
&lt;p&gt;Of course you do not get the performance hits.   I have had a lot of drive failures of late.  The drives are cheaper, and the systems are running hotter as well, I suspect.&lt;/p&gt;
</description>
 <pubDate>Thu, 24 Nov 2005 23:13:55 -0800</pubDate>
 <dc:creator>brad</dc:creator>
 <guid isPermaLink="false">comment 954 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>I&#039;ve been fortunate enough</title>
 <link>http://ideas.4brad.com/node/300#comment-953</link>
 <description>&lt;p&gt;I&#039;ve been fortunate enough to not yet have any HD failures (knock on wood), so I don&#039;t know this:  What&#039;s the typical failure mode for the newer, inexpensive drives?  Is is really bit-failures, randomly strewn across the device?  Block failures?  Complete failures?&lt;/p&gt;
&lt;p&gt;Do we really need a RAID solution to address it?  (Well, RAID does provide an idealised solution.)  Or would a continual (incremental) backup, in the background (e.g. from a journalling file system in the form perhaps of a log-structured file system), with &quot;instant, on-demand restore&quot; serve the bill?  The latter, of course, offers a time lag on backup, data compression opportunities, and likely a momentary delay on restore.&lt;/p&gt;
&lt;p&gt;Because such a system would buffer file churn, i/o performance should theoretically be levelled out over a longer interval as well, making such a solution more feasible over limited-bandwidth pipes.  A bit fault can be corrected fairly quickly; a complete HD failure would take some time to recover, but would be recoverable.&lt;/p&gt;
&lt;p&gt;For that matter, are there any popular Error-Correcting Codes that could supplement this?  E.g. offshoring the ECC while retaining the source data locally, and retrieving (only) the redundancy data to correct the local data when a local error is detected?&lt;/p&gt;
&lt;p&gt;Of course, if the nature of the typical failure (and thus the nature of the demand for data restoration) demands more immediate results, then a RAID solution would be the better approach.&lt;/p&gt;
</description>
 <pubDate>Thu, 24 Nov 2005 23:03:16 -0800</pubDate>
 <dc:creator>Paul</dc:creator>
 <guid isPermaLink="false">comment 953 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>RAID6 with 2 drives per machine</title>
 <link>http://ideas.4brad.com/node/300#comment-948</link>
 <description>&lt;p&gt;With RAID6 you can lose two disks and keep your data, but of course the CPU required is greater. That would mean you could run mini-ATX boxes with big disks and still have some chance of keeping your data intact.&lt;/p&gt;
&lt;p&gt;You could probably do this with some of the NAS drive cases that I&#039;ve seen, although it would be easier if you could load the netblock system onto each drive, but since mostly then run linux it might be possible. We may even see it as a standard option...&lt;/p&gt;
&lt;p&gt;I think a purpose-designed system will work better than trying to run RAID over the network though - in theory you should be able to satisfy  any RAID5 request by sending full data from N-1 drives and a checksum from the last one (at a computational price on the requesting drive) which would lower your newtwork demands.&lt;/p&gt;
</description>
 <pubDate>Thu, 24 Nov 2005 00:41:38 -0800</pubDate>
 <dc:creator>Anonymous</dc:creator>
 <guid isPermaLink="false">comment 948 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>Well, I&#039;m always discovering</title>
 <link>http://ideas.4brad.com/node/300#comment-946</link>
 <description>&lt;p&gt;Well, I&#039;m always discovering packages for linux I didn&#039;t know existed.  This is most of the way there, though in the vision articulated it also runs on other OSs (indeed, many people don&#039;t have 3 or 4 linux boxes but do have extra windows boxes.)  And I had the idea that it could be shared and efficient for all the machines hosting it.  Ie. in the example above with 2 machines with 2 disks each, one is the host and the other the client, and if the client wants to write a block, he has to send it to the host, which then sends the distributed blocks back to the client, and likewise for read.    Ideally you want a system where you know which blocks are local and which are remote.&lt;/p&gt;
&lt;p&gt;Probably need a central server to maintain the actual filesystem cohesion though with a bit of work you could minimize the network traffic on that.&lt;/p&gt;
&lt;p&gt;Also, as fully visualized, in a RAID-5 with one disk per machine and 3 or more machines, you want it so that any machine can be taken down.  That requires a truly peer based system.&lt;/p&gt;
</description>
 <pubDate>Wed, 23 Nov 2005 11:56:15 -0800</pubDate>
 <dc:creator>brad</dc:creator>
 <guid isPermaLink="false">comment 946 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>your technology has arrived!</title>
 <link>http://ideas.4brad.com/node/300#comment-945</link>
 <description>&lt;p&gt;the stock linux kernel has a driver which exactly such a thing, called &lt;a href=&quot;http://www.it.uc3m.es/~ptb/nbd/&quot; rel=&quot;nofollow&quot;&gt;network block device&lt;/a&gt;.  people are using it, plus software raid, in production environments.  i&#039;ve heard of it used more often for raid-0 striping (eg to make petabyte filesystems out of a cluster of PC&#039;s for free) than raid-5, but it is capable for the full range of raid types that linux&#039;s  software raid driver supports.  i tested it once with raid-1 just for giggles and the performance was acceptable for home directory service (mp3 streaming was probably my highest bandwidth activity), but only have one &quot;always-on&quot; server so it wasn&#039;t practical long term.&lt;/p&gt;
</description>
 <pubDate>Wed, 23 Nov 2005 06:51:07 -0800</pubDate>
 <dc:creator>fool</dc:creator>
 <guid isPermaLink="false">comment 945 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>Slow RAID for the home network</title>
 <link>http://ideas.4brad.com/node/300</link>
 <description>&lt;p&gt;Hard disk drives these days are cheap.  Too cheap, in that while we love paying 30 cents/GB, the reliability is getting pretty poor.   Doing backups, especially automatic backups is a must, but what about RAID?&lt;/p&gt;

&lt;p&gt;One of the problems with RAID, at least RAID-5 is that you need to have 3, and ideally 4 or 5 drives in a machine.  That&amp;#8217;s a lot of drives, a lot of power, a lot of heat, a lot of noise.   And many machines only have two IDE controllers so they can barely do 3 drives and can&amp;#8217;t readily do more even if they had the slots and power for them.&lt;/p&gt;

&lt;p&gt;So I propose a software RAID-5, done over a LAN with 3 to 5 drives scattered over several machines on the LAN.&lt;/p&gt;

&lt;p&gt;Slow as hell, of course, having to read and write your data out over the LAN even at 100mbits.  Gigabit would obviously be better.   But what is it we have that&amp;#8217;s taking up all this disk space &amp;#8212; it&amp;#8217;s video, music and photos.   Things which, if just being played back, don&amp;#8217;t need to be accessed very fast.   If you&amp;#8217;re not editing video or music, in particular, you can handle having it on a very slow device.   (Photos are a bigger issue, as they do sometimes need fast access when building thumbnails etc.)&lt;/p&gt;

&lt;p&gt;This could even be done among neighbours over 802.11g, with suitable encryption.  In theory.&lt;/p&gt;

&lt;p&gt;Not that there aren&amp;#8217;t some major issues to overcome.  The machines must be on most of the time.  (A single disk can be taken out of a RAID temporarily, and thus a single machine hosting one disk can be turned off or rebooted, but not for long periods.)
If you lose access to two disks (or your LAN) you can&amp;#8217;t get access to the data.   And it&amp;#8217;s going to use a lot of your network capacity, though gigabit networking is starting to get cheap.  And the idea gets better&amp;#8230;&lt;/p&gt;
</description>
 <comments>http://ideas.4brad.com/node/300#comments</comments>
 <category domain="http://ideas.4brad.com/archives/cat_inventions.html">Inventions</category>
 <category domain="http://ideas.4brad.com/archives/cat_media.html">Media</category>
 <pubDate>Tue, 22 Nov 2005 22:38:42 -0800</pubDate>
 <dc:creator>brad</dc:creator>
 <guid isPermaLink="false">300 at http://ideas.4brad.com</guid>
</item>
</channel>
</rss>
