<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The computer science behind EMC&#8217;s cloud storage</title>
	<atom:link href="http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/</link>
	<description>Data storage info &#38; analysis</description>
	<lastBuildDate>Tue, 07 Feb 2012 16:02:02 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Protecting Your Data and Hurricanes &#171; On-Site Computer Services, Inc. in New Orleans Blog 504-469-6991</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-203919</link>
		<dc:creator>Protecting Your Data and Hurricanes &#171; On-Site Computer Services, Inc. in New Orleans Blog 504-469-6991</dc:creator>
		<pubDate>Tue, 07 Jul 2009 13:49:06 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-203919</guid>
		<description>[...] Microsoft Corp., Symantec Corp, EMC Corp., Iron Mountain and Seagate Technology, to name a few. Cloud storage maintains copies of files safely and securely off-site.  Those who utilize a cloud data storage service claim it eliminates concerns like loss,  damage [...]</description>
		<content:encoded><![CDATA[<p>[...] Microsoft Corp., Symantec Corp, EMC Corp., Iron Mountain and Seagate Technology, to name a few. Cloud storage maintains copies of files safely and securely off-site.  Those who utilize a cloud data storage service claim it eliminates concerns like loss,  damage [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Storage Short Take #4 - blog.scottlowe.org - The weblog of an IT pro specializing in virtualization, storage, and servers</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-198718</link>
		<dc:creator>Storage Short Take #4 - blog.scottlowe.org - The weblog of an IT pro specializing in virtualization, storage, and servers</dc:creator>
		<pubDate>Tue, 02 Dec 2008 13:01:07 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-198718</guid>
		<description>[...] Harris of StorageMojo provides some great insight into the details behind EMC&#8217;s Atmos cloud storage product. I won&#8217;t even begin to try to summarize some of that information here [...]</description>
		<content:encoded><![CDATA[<p>[...] Harris of StorageMojo provides some great insight into the details behind EMC&#8217;s Atmos cloud storage product. I won&#8217;t even begin to try to summarize some of that information here [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rich Whiffen &#187; Blog Archive &#187; EMC and cloud storage</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-198450</link>
		<dc:creator>Rich Whiffen &#187; Blog Archive &#187; EMC and cloud storage</dc:creator>
		<pubDate>Sat, 15 Nov 2008 01:46:52 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-198450</guid>
		<description>[...] example, Robin Harris has a link and some discussion about the theoretical underpinnings of the product, and the project in which the idea is based. Clearly the EMC product isn&#8217;t exactly the [...]</description>
		<content:encoded><![CDATA[<p>[...] example, Robin Harris has a link and some discussion about the theoretical underpinnings of the product, and the project in which the idea is based. Clearly the EMC product isn&#8217;t exactly the [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: drsan</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-198447</link>
		<dc:creator>drsan</dc:creator>
		<pubDate>Fri, 14 Nov 2008 13:49:01 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-198447</guid>
		<description>to answer a few of these questions: 
1) is Atmos patentable?  well, only the EMC engineers/IP attorneys know for sure.  i suspect a few parts of it might be, otherwise not.
2) is EMC innovative?  of course they are, but no more so than any other cloud approach or wide-area/object-oriented filesystem approach.

but it&#039;s really a shame that Atmos is implemented, at least currently, using technology that does not include ANSI T10 DIF.  Without DIF, the nice, large cloud of petabytes of data is going to eventually end up as nice, large, cloud of petabytes of data, many gigabytes of which is corrupt, and you don&#039;t know which ones.  file or object replication doesn&#039;t help here - it doesn&#039;t really protect the data, it merely makes potentially corrupt copies of it.  Read the CERN paper from the spring of 2007.</description>
		<content:encoded><![CDATA[<p>to answer a few of these questions:<br />
1) is Atmos patentable?  well, only the EMC engineers/IP attorneys know for sure.  i suspect a few parts of it might be, otherwise not.<br />
2) is EMC innovative?  of course they are, but no more so than any other cloud approach or wide-area/object-oriented filesystem approach.</p>
<p>but it&#8217;s really a shame that Atmos is implemented, at least currently, using technology that does not include ANSI T10 DIF.  Without DIF, the nice, large cloud of petabytes of data is going to eventually end up as nice, large, cloud of petabytes of data, many gigabytes of which is corrupt, and you don&#8217;t know which ones.  file or object replication doesn&#8217;t help here &#8211; it doesn&#8217;t really protect the data, it merely makes potentially corrupt copies of it.  Read the CERN paper from the spring of 2007.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: drsan</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-198446</link>
		<dc:creator>drsan</dc:creator>
		<pubDate>Fri, 14 Nov 2008 13:48:21 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-198446</guid>
		<description>to answer a few of these questions: 
1) is Atmos patentable?  well, only the EMC engineers/IP attorneys know for sure.  i suspect a few parts of it might be, otherwise not.
2) is EMC innovative?  of course they are, but no more so than any other cloud approach or wide-area/object-oriented filesystem approach.

but it&#039;s really a shame that Atmos is implemented, at least currently, using technology that does not include ANSI T10 DIF.  Without DIF, the nice, large cloud of petabytes of data is going to end up as nice, large, cloud of petabytes of data, many gigabytes of which is corrupt, and you don&#039;t know which ones.  file or object replication doesn&#039;t help here - it doesn&#039;t really protect the data, it merely makes potentially corrupt copies of it.  Read the CERN paper from the spring of 2007.</description>
		<content:encoded><![CDATA[<p>to answer a few of these questions:<br />
1) is Atmos patentable?  well, only the EMC engineers/IP attorneys know for sure.  i suspect a few parts of it might be, otherwise not.<br />
2) is EMC innovative?  of course they are, but no more so than any other cloud approach or wide-area/object-oriented filesystem approach.</p>
<p>but it&#8217;s really a shame that Atmos is implemented, at least currently, using technology that does not include ANSI T10 DIF.  Without DIF, the nice, large cloud of petabytes of data is going to end up as nice, large, cloud of petabytes of data, many gigabytes of which is corrupt, and you don&#8217;t know which ones.  file or object replication doesn&#8217;t help here &#8211; it doesn&#8217;t really protect the data, it merely makes potentially corrupt copies of it.  Read the CERN paper from the spring of 2007.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Christophe Bertrand &#187; Blog Archive &#187; Cloud or vaporware?</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-198441</link>
		<dc:creator>Christophe Bertrand &#187; Blog Archive &#187; Cloud or vaporware?</dc:creator>
		<pubDate>Fri, 14 Nov 2008 00:36:30 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-198441</guid>
		<description>[...] about the cloud computing announcement from EMC on Monday: project Maui. It&#8217;s funny because Maui is not only an Island of the state our President-elect comes from, amongst other meanings, it is [...]</description>
		<content:encoded><![CDATA[<p>[...] about the cloud computing announcement from EMC on Monday: project Maui. It&#8217;s funny because Maui is not only an Island of the state our President-elect comes from, amongst other meanings, it is [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Jones</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-198433</link>
		<dc:creator>Steve Jones</dc:creator>
		<pubDate>Thu, 13 Nov 2008 14:08:17 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-198433</guid>
		<description>It would be possible to engineer a storage system like this on a peer-to-peer basis. There&#039;s a neat little job for the open source devotees. With P2P file sharing some of the mechanisms are already there, but some really clever work would be required on locality, public quota controls, resilience. Not a great thing for mastgering data, but a neat way of using lots of cheap storage otherwise tied up in underused PC storage for backup, network access and so on.

It shouldn&#039;t be assumed that this sort of service has to be constrained to commercial providers. 

More heavyweight commercial DP is more of an issue. The killer there is often performance - high latency is inherent to anything geographically spread. Global latency is truly awful - up to about 400ms round-trip, but even continental (typically 50-100ms) is unusable for many functions. In that case you want your compute capability close to the data.

There are also issues over consistency of multiple copies. Still, it&#039;s an interesting approach and I can see it&#039;s use for low access rate storage, archival stores (email looks a lot like that), remote backups etc.</description>
		<content:encoded><![CDATA[<p>It would be possible to engineer a storage system like this on a peer-to-peer basis. There&#8217;s a neat little job for the open source devotees. With P2P file sharing some of the mechanisms are already there, but some really clever work would be required on locality, public quota controls, resilience. Not a great thing for mastgering data, but a neat way of using lots of cheap storage otherwise tied up in underused PC storage for backup, network access and so on.</p>
<p>It shouldn&#8217;t be assumed that this sort of service has to be constrained to commercial providers. </p>
<p>More heavyweight commercial DP is more of an issue. The killer there is often performance &#8211; high latency is inherent to anything geographically spread. Global latency is truly awful &#8211; up to about 400ms round-trip, but even continental (typically 50-100ms) is unusable for many functions. In that case you want your compute capability close to the data.</p>
<p>There are also issues over consistency of multiple copies. Still, it&#8217;s an interesting approach and I can see it&#8217;s use for low access rate storage, archival stores (email looks a lot like that), remote backups etc.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Recommended reading about EMC Atmos &#8212; Storage Soup</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-198418</link>
		<dc:creator>Recommended reading about EMC Atmos &#8212; Storage Soup</dc:creator>
		<pubDate>Wed, 12 Nov 2008 23:31:33 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-198418</guid>
		<description>[...] StorageMojo&#8217;s Robin Harris has links to all the relevant technical whitepapers for anyone looking to truly geek out to their [...]</description>
		<content:encoded><![CDATA[<p>[...] StorageMojo&#8217;s Robin Harris has links to all the relevant technical whitepapers for anyone looking to truly geek out to their [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anonymous</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-198414</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Wed, 12 Nov 2008 20:20:10 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-198414</guid>
		<description>Robin said:
&quot;EMC’s engineers chose to use a 3rd party cluster product - IBRIX I think - for the local data stores so they could focus on the layer that glues the sites together. Each local store can itself be a petabyte or more.&quot;

EMC did use IBRIX for some 9+ months during the initial development of Maui/Hulk.  But the use of IBRIX merely bought EMC time to develop their own (theoretically) scalable &quot;local data store&quot;.  The verdict is still very much out on EMC&#039;s ability to compete in the local clustered storage market (let alone cloud).

Mike Feinberg, vice president of EMC&#039;s cloud infrastructure group, shared the following with techtarget.com:

&quot;The talk in the storage industry is that EMC was using Ibrix software for its clustered file system. And while Atmos does include capabilities (such as global namespace) that Ibrix offers, Feinberg said that no other vendor&#039;s IP is used outside of EMC. &quot;It&#039;s all EMC technology,&quot; he said. &quot;Frankly, there&#039;s no Ibrix software and no other people&#039;s software. It&#039;s built from the ground with home-grown technology, although we do use open source technology as we see fit.&quot;</description>
		<content:encoded><![CDATA[<p>Robin said:<br />
&#8220;EMC’s engineers chose to use a 3rd party cluster product &#8211; IBRIX I think &#8211; for the local data stores so they could focus on the layer that glues the sites together. Each local store can itself be a petabyte or more.&#8221;</p>
<p>EMC did use IBRIX for some 9+ months during the initial development of Maui/Hulk.  But the use of IBRIX merely bought EMC time to develop their own (theoretically) scalable &#8220;local data store&#8221;.  The verdict is still very much out on EMC&#8217;s ability to compete in the local clustered storage market (let alone cloud).</p>
<p>Mike Feinberg, vice president of EMC&#8217;s cloud infrastructure group, shared the following with techtarget.com:</p>
<p>&#8220;The talk in the storage industry is that EMC was using Ibrix software for its clustered file system. And while Atmos does include capabilities (such as global namespace) that Ibrix offers, Feinberg said that no other vendor&#8217;s IP is used outside of EMC. &#8220;It&#8217;s all EMC technology,&#8221; he said. &#8220;Frankly, there&#8217;s no Ibrix software and no other people&#8217;s software. It&#8217;s built from the ground with home-grown technology, although we do use open source technology as we see fit.&#8221;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dave Graham</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-198413</link>
		<dc:creator>Dave Graham</dc:creator>
		<pubDate>Wed, 12 Nov 2008 19:59:50 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-198413</guid>
		<description>Hey Robin! 

love reading your articles.

just to note that Atmos is NOT using IBRIX (or Clustered XFS) for the underlying filesystem.  This is a truly organic, object based filesystem designed by Eaton and Co.  

cheers,

Dave Graham</description>
		<content:encoded><![CDATA[<p>Hey Robin! </p>
<p>love reading your articles.</p>
<p>just to note that Atmos is NOT using IBRIX (or Clustered XFS) for the underlying filesystem.  This is a truly organic, object based filesystem designed by Eaton and Co.  </p>
<p>cheers,</p>
<p>Dave Graham</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Darcy</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-198411</link>
		<dc:creator>Jeff Darcy</dc:creator>
		<pubDate>Wed, 12 Nov 2008 18:20:09 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-198411</guid>
		<description>I wouldn&#039;t quite refer to the OceanStore papers as the technical underpinnings for Atmos.  Certainly there&#039;s some relationship there, which I find pleasing because I was involved in making that connection at one time (I was one of the first two industry representatives to see OceanStore run - across two laptops at Granlibakken).  However, Atmos might not look at all like OceanStore, for three reasons.

(1) It&#039;s highly likely to be a new codebase even if it implements the same ideas.

(2) Atmos doesn&#039;t quite have the same goals as OceanStore (especially wrt multi-tenancy) and the state of the industry has changed in quite a few ways since OceanStore was being actively developed.

(3) Even if Atmos is exactly like OceanStore, which OceanStore?  Some of the basic approaches, including in areas you mention above, changed quite a bit during the project lifetime.

From what I&#039;ve read so far, Atmos seems more clearly related to an EMC Cambridge project I used to know as RAIDiant than to OceanStore, and might even be related to at least one other that I&#039;ll keep quiet about.  I think I&#039;ve succeeded in prodding Steve Todd to write more about the aspects that are most likely to be OceanStore-like, though, so that&#039;s probably where we should look for a more authoritative answer.</description>
		<content:encoded><![CDATA[<p>I wouldn&#8217;t quite refer to the OceanStore papers as the technical underpinnings for Atmos.  Certainly there&#8217;s some relationship there, which I find pleasing because I was involved in making that connection at one time (I was one of the first two industry representatives to see OceanStore run &#8211; across two laptops at Granlibakken).  However, Atmos might not look at all like OceanStore, for three reasons.</p>
<p>(1) It&#8217;s highly likely to be a new codebase even if it implements the same ideas.</p>
<p>(2) Atmos doesn&#8217;t quite have the same goals as OceanStore (especially wrt multi-tenancy) and the state of the industry has changed in quite a few ways since OceanStore was being actively developed.</p>
<p>(3) Even if Atmos is exactly like OceanStore, which OceanStore?  Some of the basic approaches, including in areas you mention above, changed quite a bit during the project lifetime.</p>
<p>From what I&#8217;ve read so far, Atmos seems more clearly related to an EMC Cambridge project I used to know as RAIDiant than to OceanStore, and might even be related to at least one other that I&#8217;ll keep quiet about.  I think I&#8217;ve succeeded in prodding Steve Todd to write more about the aspects that are most likely to be OceanStore-like, though, so that&#8217;s probably where we should look for a more authoritative answer.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: the storage anarchist</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-198410</link>
		<dc:creator>the storage anarchist</dc:creator>
		<pubDate>Wed, 12 Nov 2008 17:29:55 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-198410</guid>
		<description>Nice coverage, Robin. I for one sincerely appreciate that you&#039;ve dug deep into this and revealed the science behind Atmos. And honestly, I don&#039;t think EMC necessarily wanted it kept secret.

That said, dare I ask - do you think this means EMC actually *IS* innovative?

Or is recognizing, funding and then converting a research project into a real-world product no more innovative than say, acquiring a smart-but-struggling startup and turning their technology into more than it ever was going to be on its own?</description>
		<content:encoded><![CDATA[<p>Nice coverage, Robin. I for one sincerely appreciate that you&#8217;ve dug deep into this and revealed the science behind Atmos. And honestly, I don&#8217;t think EMC necessarily wanted it kept secret.</p>
<p>That said, dare I ask &#8211; do you think this means EMC actually *IS* innovative?</p>
<p>Or is recognizing, funding and then converting a research project into a real-world product no more innovative than say, acquiring a smart-but-struggling startup and turning their technology into more than it ever was going to be on its own?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ryan Malayter</title>
		<link>http://storagemojo.com/2008/11/12/the-computer-science-behind-emcs-cloud-storage/comment-page-1/#comment-198409</link>
		<dc:creator>Ryan Malayter</dc:creator>
		<pubDate>Wed, 12 Nov 2008 17:03:14 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1010#comment-198409</guid>
		<description>Sounds great in theory. Assuming the technology actually works well, and is low latency, the big issue I see is with the economic model.
First, if this is an EMC-only solution, without open APIs, then the &quot;global&quot; network mentioned in the first paper will simply neverhappen.
Second, it sounds like getting SaaS providers to cooperate in a global mesh would have even more pitfalls than the issues surrounding peering between Tier-1 ISPs. How do they assign value to not only bandwidth, but also capacity, latency, customer count, etc. And how long before a &quot;storage Cogent&quot; came in with low prices, angering the &quot;established&quot; cloud storage vendors, getting de-peered, and segmenting the global storage network?</description>
		<content:encoded><![CDATA[<p>Sounds great in theory. Assuming the technology actually works well, and is low latency, the big issue I see is with the economic model.<br />
First, if this is an EMC-only solution, without open APIs, then the &#8220;global&#8221; network mentioned in the first paper will simply neverhappen.<br />
Second, it sounds like getting SaaS providers to cooperate in a global mesh would have even more pitfalls than the issues surrounding peering between Tier-1 ISPs. How do they assign value to not only bandwidth, but also capacity, latency, customer count, etc. And how long before a &#8220;storage Cogent&#8221; came in with low prices, angering the &#8220;established&#8221; cloud storage vendors, getting de-peered, and segmenting the global storage network?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

