<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Cleversafe&#8217;s dispersed storage network</title>
	<atom:link href="http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/</link>
	<description>Data storage info &#38; analysis</description>
	<lastBuildDate>Sun, 01 Aug 2010 02:16:15 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<item>
		<title>By: Bob</title>
		<link>http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/comment-page-1/#comment-196899</link>
		<dc:creator>Bob</dc:creator>
		<pubDate>Fri, 25 Jul 2008 14:29:53 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/#comment-196899</guid>
		<description>I agree with Bill. The greatest concerns I have is the network costs and latency.  I don&#039;t see how this is going to work for anyone with any significant amount of data.  This is a solution for a smaller company or one which is looking to hide data by scattering it across multiple countries.</description>
		<content:encoded><![CDATA[<p>I agree with Bill. The greatest concerns I have is the network costs and latency.  I don&#8217;t see how this is going to work for anyone with any significant amount of data.  This is a solution for a smaller company or one which is looking to hide data by scattering it across multiple countries.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Prof. John</title>
		<link>http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/comment-page-1/#comment-179218</link>
		<dc:creator>Prof. John</dc:creator>
		<pubDate>Mon, 10 Mar 2008 03:16:51 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/#comment-179218</guid>
		<description>First of all,  it is a well known joke in the coding community that Rabin invented the Reed-Solomon code: IDA is indeed the more famous Reed-Solomon code in its exact form invented in JPL in 1960&#039;s, and now used everywhere from DVDs to cell phones. This shows how ignorant the storage and networking people are!

Second, the M out of N concept has been around for at least 3 decades.  The devil is in details. Computationally, Reed-Solomon code is the only code that can support any M and N, but at a cost of high computation overhead,  even with the Cauchy implementation ( which CS is using ). That is why from DVD player to cell phone, there are special circuits doing those decoding operations; and that is why Jim Plank dedicated so much his time to find good encoding and decoding algorithms for the Reed-Solomon code, and still with little significant results to change the landscape. 

Third, from published CS white paper, they only implemented the Reed-Solomon codes up to N = 11, i.e., at most 11 data centers. Hopefully they have gained better knowledge of coding, last time I checked a year and a half ago, they hard-coded all the equations for encoding and decoding, without any systematic way of expanding or shrinking the system.  

I like to see CS succeed, but unless they already have new competent technical blood,   they had a very shaky technical base to start with, and as an ISP I wouldn&#039;t buy their products now even if they are (relatively) cheap; and as EMC, I certainly wouldn&#039;t buy them now, since there is nothing there in-house R&amp;D people don&#039;t know or cannot build. Admittedly,  CS did a much better job in marketing than product developing, such as the NYT article a year ago ...</description>
		<content:encoded><![CDATA[<p>First of all,  it is a well known joke in the coding community that Rabin invented the Reed-Solomon code: IDA is indeed the more famous Reed-Solomon code in its exact form invented in JPL in 1960&#8242;s, and now used everywhere from DVDs to cell phones. This shows how ignorant the storage and networking people are!</p>
<p>Second, the M out of N concept has been around for at least 3 decades.  The devil is in details. Computationally, Reed-Solomon code is the only code that can support any M and N, but at a cost of high computation overhead,  even with the Cauchy implementation ( which CS is using ). That is why from DVD player to cell phone, there are special circuits doing those decoding operations; and that is why Jim Plank dedicated so much his time to find good encoding and decoding algorithms for the Reed-Solomon code, and still with little significant results to change the landscape. </p>
<p>Third, from published CS white paper, they only implemented the Reed-Solomon codes up to N = 11, i.e., at most 11 data centers. Hopefully they have gained better knowledge of coding, last time I checked a year and a half ago, they hard-coded all the equations for encoding and decoding, without any systematic way of expanding or shrinking the system.  </p>
<p>I like to see CS succeed, but unless they already have new competent technical blood,   they had a very shaky technical base to start with, and as an ISP I wouldn&#8217;t buy their products now even if they are (relatively) cheap; and as EMC, I certainly wouldn&#8217;t buy them now, since there is nothing there in-house R&amp;D people don&#8217;t know or cannot build. Admittedly,  CS did a much better job in marketing than product developing, such as the NYT article a year ago &#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bill Todd</title>
		<link>http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/comment-page-1/#comment-178433</link>
		<dc:creator>Bill Todd</dc:creator>
		<pubDate>Thu, 06 Mar 2008 09:13:22 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/#comment-178433</guid>
		<description>I&#039;m afraid that Jim is a confused about the difference between how a technology *may* be used and whether it is in fact actually useful in any practical sense.

For any practical purpose, resiliency through any 3 our of &#039;M&#039; failures using RAID-5 at two separate sites (or any 5 out of &#039;M&#039; failures using RAID-6) is more than sufficient:  the IDA approach may be &#039;cool&#039;, but that&#039;s about the limit of its interest (especially given its performance and cost drawbacks) - though the ability to use a storage &#039;utility&#039; to store copies of your data off site certainly has value in many situations (its value just isn&#039;t increased by using IDAs to do so).

And if you can place servers in locations where the law can&#039;t compel you to divulge their contents (Ausmith1&#039;s assumption above), then there&#039;s no need to use IDAs in that situation either.

- bill</description>
		<content:encoded><![CDATA[<p>I&#8217;m afraid that Jim is a confused about the difference between how a technology *may* be used and whether it is in fact actually useful in any practical sense.</p>
<p>For any practical purpose, resiliency through any 3 our of &#8216;M&#8217; failures using RAID-5 at two separate sites (or any 5 out of &#8216;M&#8217; failures using RAID-6) is more than sufficient:  the IDA approach may be &#8216;cool&#8217;, but that&#8217;s about the limit of its interest (especially given its performance and cost drawbacks) &#8211; though the ability to use a storage &#8216;utility&#8217; to store copies of your data off site certainly has value in many situations (its value just isn&#8217;t increased by using IDAs to do so).</p>
<p>And if you can place servers in locations where the law can&#8217;t compel you to divulge their contents (Ausmith1&#8242;s assumption above), then there&#8217;s no need to use IDAs in that situation either.</p>
<p>- bill</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ausmith1</title>
		<link>http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/comment-page-1/#comment-178002</link>
		<dc:creator>Ausmith1</dc:creator>
		<pubDate>Tue, 04 Mar 2008 23:31:05 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/#comment-178002</guid>
		<description>Strong encryption won&#039;t help you if the law of the land says that you must divulge your encryption key. The RIPA law in Great Britain is a good example. 
http://www.theregister.co.uk/2007/11/14/ripa_encryption_key_notice/
Excerpt:
&quot;Section Three of the Regulation of Investigatory Powers Act (RIPA) came into force at the start in October 2007, seven years after the original legislation passed through parliament. Intended primarily to deal with terror suspects, it allows police to demand encryption keys or provide a clear text transcript of encrypted text.
Failure to comply can result in up to two years imprisonment for cases not involving national security, or five years for terrorism offences and the like. Orders can be made to turn over data months or even years old.&quot;
Now if your data does not reside in a single country you might be able to make the case that you don&#039;t have to turn over those parts that are not in the country with the restrictive laws...
It will be interesting to see how this develops.</description>
		<content:encoded><![CDATA[<p>Strong encryption won&#8217;t help you if the law of the land says that you must divulge your encryption key. The RIPA law in Great Britain is a good example.<br />
<a href="http://www.theregister.co.uk/2007/11/14/ripa_encryption_key_notice/" rel="nofollow">http://www.theregister.co.uk/2007/11/14/ripa_encryption_key_notice/</a><br />
Excerpt:<br />
&#8220;Section Three of the Regulation of Investigatory Powers Act (RIPA) came into force at the start in October 2007, seven years after the original legislation passed through parliament. Intended primarily to deal with terror suspects, it allows police to demand encryption keys or provide a clear text transcript of encrypted text.<br />
Failure to comply can result in up to two years imprisonment for cases not involving national security, or five years for terrorism offences and the like. Orders can be made to turn over data months or even years old.&#8221;<br />
Now if your data does not reside in a single country you might be able to make the case that you don&#8217;t have to turn over those parts that are not in the country with the restrictive laws&#8230;<br />
It will be interesting to see how this develops.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jim Thompson</title>
		<link>http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/comment-page-1/#comment-177938</link>
		<dc:creator>Jim Thompson</dc:creator>
		<pubDate>Tue, 04 Mar 2008 19:24:31 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/#comment-177938</guid>
		<description>I think &#039;bill&#039; is confused about how IDAs are used.  They&#039;re not primary storage, but rather a really cool way of having secondary storage that is resilient to N out of M failures.

Consider a &#039;Time Machine&#039; volume that runs over iSCSI (were it that Apple hadn&#039;t pulled the iSCSI initiator from Leopard).  With a bit of front-end software, Cleversafe would be a really interesting offering, much safer (and more convenient) than a USB disk, or even Apple&#039;s new &#039;Time Capsule&#039; with its single (non-&quot;server class&quot;!) drive.</description>
		<content:encoded><![CDATA[<p>I think &#8216;bill&#8217; is confused about how IDAs are used.  They&#8217;re not primary storage, but rather a really cool way of having secondary storage that is resilient to N out of M failures.</p>
<p>Consider a &#8216;Time Machine&#8217; volume that runs over iSCSI (were it that Apple hadn&#8217;t pulled the iSCSI initiator from Leopard).  With a bit of front-end software, Cleversafe would be a really interesting offering, much safer (and more convenient) than a USB disk, or even Apple&#8217;s new &#8216;Time Capsule&#8217; with its single (non-&#8221;server class&#8221;!) drive.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bill Todd</title>
		<link>http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/comment-page-1/#comment-177682</link>
		<dc:creator>Bill Todd</dc:creator>
		<pubDate>Tue, 04 Mar 2008 03:30:54 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/#comment-177682</guid>
		<description>Yet another case of interesting technology in search of a problem, with the hope that enough people won&#039;t think too hard about more conventional solutions with equivalent (and sometimes superior) features.

Michael Rabin at Harvard has been describing IDAs at least since 1989 - FAIK he invented them (and/or the term).  But finding uses for them has proven elusive:  about the only sensible one I&#039;ve heard of involves circumventing deliberate governmental censorship - but my impression is that even Freenet uses simple (usually encrypted) copies rather than IDAs, though OceanStore uses IDAs IIRC.

The problem with distributing data around the world as Cleversafe describes is that you have to visit (p-m) locations around the world (or locations on disk, even if all the disks are local) to get *anything* you can use - which is expensive in terms of both disk utilization and (if the pieces are far-flung) latency and network bandwidth.  Contrast this with a parity-style approach where you need just *one* access (unless you&#039;re in the process of recovering from a disk failure) to read useful data, and at most four (rather than the full &#039;p&#039;) accesses to write useful data - and can often arrange for that single read access (or two of the four accesses required for a write) to be local (even if your parity arrangement is distributed) when data exhibits significant geographic locality of reference.

Worried about security?  That&#039;s what strong encryption&#039;s for - you don&#039;t need any slicing and dicing and distribution to the four winds, and it works just fine in combination with a distributed parity-based redundancy scheme (if indeed that makes sense for you, though for most users a single remote mirror location will probably be more reasonable, with additional, possibly parity-based, redundancy at one or both sites as appropriate:  tolerating a single full-site disaster is usually considered pretty adequate, and the savings in both site cost and performance overheads compared with distribution around, say, a dozen sites is likely to dwarf the cost of mirroring the storage - and RAID-6 will allow up to two disks to fail at each site if RAID-5 isn&#039;t considered sufficient).

So while Cleversafe is indeed both clever and safe, it&#039;s not clear that it&#039;s particularly useful compared with the available alternatives.

- bill</description>
		<content:encoded><![CDATA[<p>Yet another case of interesting technology in search of a problem, with the hope that enough people won&#8217;t think too hard about more conventional solutions with equivalent (and sometimes superior) features.</p>
<p>Michael Rabin at Harvard has been describing IDAs at least since 1989 &#8211; FAIK he invented them (and/or the term).  But finding uses for them has proven elusive:  about the only sensible one I&#8217;ve heard of involves circumventing deliberate governmental censorship &#8211; but my impression is that even Freenet uses simple (usually encrypted) copies rather than IDAs, though OceanStore uses IDAs IIRC.</p>
<p>The problem with distributing data around the world as Cleversafe describes is that you have to visit (p-m) locations around the world (or locations on disk, even if all the disks are local) to get *anything* you can use &#8211; which is expensive in terms of both disk utilization and (if the pieces are far-flung) latency and network bandwidth.  Contrast this with a parity-style approach where you need just *one* access (unless you&#8217;re in the process of recovering from a disk failure) to read useful data, and at most four (rather than the full &#8216;p&#8217;) accesses to write useful data &#8211; and can often arrange for that single read access (or two of the four accesses required for a write) to be local (even if your parity arrangement is distributed) when data exhibits significant geographic locality of reference.</p>
<p>Worried about security?  That&#8217;s what strong encryption&#8217;s for &#8211; you don&#8217;t need any slicing and dicing and distribution to the four winds, and it works just fine in combination with a distributed parity-based redundancy scheme (if indeed that makes sense for you, though for most users a single remote mirror location will probably be more reasonable, with additional, possibly parity-based, redundancy at one or both sites as appropriate:  tolerating a single full-site disaster is usually considered pretty adequate, and the savings in both site cost and performance overheads compared with distribution around, say, a dozen sites is likely to dwarf the cost of mirroring the storage &#8211; and RAID-6 will allow up to two disks to fail at each site if RAID-5 isn&#8217;t considered sufficient).</p>
<p>So while Cleversafe is indeed both clever and safe, it&#8217;s not clear that it&#8217;s particularly useful compared with the available alternatives.</p>
<p>- bill</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tony</title>
		<link>http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/comment-page-1/#comment-177605</link>
		<dc:creator>Tony</dc:creator>
		<pubDate>Mon, 03 Mar 2008 22:16:41 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/#comment-177605</guid>
		<description>Another potential plus to the hardware route - better customer satisfaction.  It can be non-trivial to get software to work reliably with all hardware -- for example, this is one plus Apple has over Vista (Apple&#039;s supported hardware list is very small versus MS).  I also wonder how much impact drive/controller compatibility issues can have on reliability - especially consider the research you&#039;ve highlighted recently.  If Cleversafe is pushing reliability, that could be another reason for starting with a hardware box.</description>
		<content:encoded><![CDATA[<p>Another potential plus to the hardware route &#8211; better customer satisfaction.  It can be non-trivial to get software to work reliably with all hardware &#8212; for example, this is one plus Apple has over Vista (Apple&#8217;s supported hardware list is very small versus MS).  I also wonder how much impact drive/controller compatibility issues can have on reliability &#8211; especially consider the research you&#8217;ve highlighted recently.  If Cleversafe is pushing reliability, that could be another reason for starting with a hardware box.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
