<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The new storage pyramid</title>
	<atom:link href="http://storagemojo.com/2008/12/02/the-new-storage-pyramid/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com/2008/12/02/the-new-storage-pyramid/</link>
	<description>Data storage info &#38; analysis</description>
	<lastBuildDate>Fri, 19 Mar 2010 09:23:11 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Ernst Lopes Cardozo</title>
		<link>http://storagemojo.com/2008/12/02/the-new-storage-pyramid/comment-page-1/#comment-198747</link>
		<dc:creator>Ernst Lopes Cardozo</dc:creator>
		<pubDate>Thu, 04 Dec 2008 00:06:35 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1027#comment-198747</guid>
		<description>Robin,
What I think is missing in your picture is backup/restore. The storage-part has a clean interface - anybody can do read and write. Where it gets messy is  snapshot, replication and other data security features. They bind to OSses and applications in very specific ways, to the extend that a change of array implies redesigning, rescripting and retesting these pesky but unavoidable issues. 
Since a true and open standard to solve these integration issues is not even in the works, the alternative is to solve all data security issues at the OS and application level. Microsoft is doing just that, with recent additions in Windows, SQLserver, Exchange and Data Protection Manager. With such tools, storage becomes just a pool of r/w blocks, organised in volumes. Commodity pure and simple.</description>
		<content:encoded><![CDATA[<p>Robin,<br />
What I think is missing in your picture is backup/restore. The storage-part has a clean interface &#8211; anybody can do read and write. Where it gets messy is  snapshot, replication and other data security features. They bind to OSses and applications in very specific ways, to the extend that a change of array implies redesigning, rescripting and retesting these pesky but unavoidable issues.<br />
Since a true and open standard to solve these integration issues is not even in the works, the alternative is to solve all data security issues at the OS and application level. Microsoft is doing just that, with recent additions in Windows, SQLserver, Exchange and Data Protection Manager. With such tools, storage becomes just a pool of r/w blocks, organised in volumes. Commodity pure and simple.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jerry Leichter</title>
		<link>http://storagemojo.com/2008/12/02/the-new-storage-pyramid/comment-page-1/#comment-198745</link>
		<dc:creator>Jerry Leichter</dc:creator>
		<pubDate>Wed, 03 Dec 2008 23:07:53 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1027#comment-198745</guid>
		<description>One thing missing from this analysis is just what the future role of flash will be.  As with most new technologies, flash is being used to re-implement existing technologies:  The first industrial electric motors simply replaced the waterwheel that drove the shafts and belts the provided power throughout a factory.  In fact, the actual characteristics of flash memory makes it a pretty poor replacement for a disk.  You have to play all kinds of games to make flash memory look like a disk.  Then you implement complex protocols designed to make talking to a disk efficient.  Then the OS, and the application, plays all sorts of games in an attempt to make efficient use of a disk.

Of course, all this makes perfect sense because we really only have three well-defined, widely implemented, storage interfaces today:   Main memory, disks, and tapes.  Flash is closest to disk, so of course we initially pay the costs of using it as disk.  But is this what will make sense in the long run?  Certainly not.  It will take some time, but we&#039;ll eventually figure out the right abstractions for presenting flash to programs.  It will almost certainly look much more like main memory than disks do - but it will have all kinds of special characteristics of its own.

Once that happens, the pyramid changes.  Flash is physically small, doesn&#039;t use all that much power, and will be very fast.  It&#039;s likely to be physically interconnected to CPU&#039;s using technologies much more like today&#039;s memory busses than today&#039;s disk interfaces.  One possible future:  Flash moves into the box with the CPU&#039;s.  Arrays today allow sharing, but most sharing is by only a few programs.  With virtualization, f the flash is only directly accessible to a fraction of the CPU&#039;s, it may make sense to move the programs to the data, reshuffling the data to other clusters of CPU and flash as required.  Disks remain outside of the CPU/flash combo, in big arrays/cluster storage boxes (the distinction fades away), which no concentrate on bulk storage, not on speed - or, more to the point, on speed for big transfers, since all &quot;small&quot; stuff takes place in flash.

Exactly what systems like this will look like isn&#039;t clear, but if they&#039;re at all like this, one can make some obvious predictions.  For example, the market for 10K and 15K RPM disk drives disappears - their huge extra cost becomes unjustifiable when flash takes over speed-critical roles.  The old vision of RAID - using inexpensive disks and using redundancy for reliability - becomes more interesting again.  (Of course, the cluster approach of geographic redundancy just takes this an extra step further.)</description>
		<content:encoded><![CDATA[<p>One thing missing from this analysis is just what the future role of flash will be.  As with most new technologies, flash is being used to re-implement existing technologies:  The first industrial electric motors simply replaced the waterwheel that drove the shafts and belts the provided power throughout a factory.  In fact, the actual characteristics of flash memory makes it a pretty poor replacement for a disk.  You have to play all kinds of games to make flash memory look like a disk.  Then you implement complex protocols designed to make talking to a disk efficient.  Then the OS, and the application, plays all sorts of games in an attempt to make efficient use of a disk.</p>
<p>Of course, all this makes perfect sense because we really only have three well-defined, widely implemented, storage interfaces today:   Main memory, disks, and tapes.  Flash is closest to disk, so of course we initially pay the costs of using it as disk.  But is this what will make sense in the long run?  Certainly not.  It will take some time, but we&#8217;ll eventually figure out the right abstractions for presenting flash to programs.  It will almost certainly look much more like main memory than disks do &#8211; but it will have all kinds of special characteristics of its own.</p>
<p>Once that happens, the pyramid changes.  Flash is physically small, doesn&#8217;t use all that much power, and will be very fast.  It&#8217;s likely to be physically interconnected to CPU&#8217;s using technologies much more like today&#8217;s memory busses than today&#8217;s disk interfaces.  One possible future:  Flash moves into the box with the CPU&#8217;s.  Arrays today allow sharing, but most sharing is by only a few programs.  With virtualization, f the flash is only directly accessible to a fraction of the CPU&#8217;s, it may make sense to move the programs to the data, reshuffling the data to other clusters of CPU and flash as required.  Disks remain outside of the CPU/flash combo, in big arrays/cluster storage boxes (the distinction fades away), which no concentrate on bulk storage, not on speed &#8211; or, more to the point, on speed for big transfers, since all &#8220;small&#8221; stuff takes place in flash.</p>
<p>Exactly what systems like this will look like isn&#8217;t clear, but if they&#8217;re at all like this, one can make some obvious predictions.  For example, the market for 10K and 15K RPM disk drives disappears &#8211; their huge extra cost becomes unjustifiable when flash takes over speed-critical roles.  The old vision of RAID &#8211; using inexpensive disks and using redundancy for reliability &#8211; becomes more interesting again.  (Of course, the cluster approach of geographic redundancy just takes this an extra step further.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Darcy</title>
		<link>http://storagemojo.com/2008/12/02/the-new-storage-pyramid/comment-page-1/#comment-198742</link>
		<dc:creator>Jeff Darcy</dc:creator>
		<pubDate>Wed, 03 Dec 2008 16:36:41 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1027#comment-198742</guid>
		<description>The problem with Big Storage is not the internals, but the vendors&#039; insistence on selling at high margins.  Take any commodity-based storage (or for that matter compute) system, and any competent designer will probably find several areas where judicious application of some more specialized hardware will make it just a bit better than competitors - even if it&#039;s just power and cooling to support higher density.  Look under the covers of any big system and you&#039;ll probably find something that&#039;s about 90% familiar already, which is *not* like the mainframes of old which were almost entirely custom.  In other words, storage has already gone through most of the transformation you&#039;re predicting.

&quot;Availability, performance, scalability and supportability&quot; aren&#039;t just gravy.  They&#039;re the main course.  It actually does take expertise to put together commodity parts to meet these goals.  That&#039;s expertise a few customers have, but most don&#039;t and in any case IT-staff time has value.  If knowledge about how to make and maintain a scalable system is represented in the design of product, then a rational customer would pay up to the equivalent value of IT-staff time to get it even if the physical components are all available off the shelf.  Arrays won&#039;t go away until every customer is an expert on optimizing performance and availability for a system with multiple tiers, dozens of ports and hundreds of spindles.  Google and LLNL can do better by hiring their own experts, but they&#039;re not typical customers.</description>
		<content:encoded><![CDATA[<p>The problem with Big Storage is not the internals, but the vendors&#8217; insistence on selling at high margins.  Take any commodity-based storage (or for that matter compute) system, and any competent designer will probably find several areas where judicious application of some more specialized hardware will make it just a bit better than competitors &#8211; even if it&#8217;s just power and cooling to support higher density.  Look under the covers of any big system and you&#8217;ll probably find something that&#8217;s about 90% familiar already, which is *not* like the mainframes of old which were almost entirely custom.  In other words, storage has already gone through most of the transformation you&#8217;re predicting.</p>
<p>&#8220;Availability, performance, scalability and supportability&#8221; aren&#8217;t just gravy.  They&#8217;re the main course.  It actually does take expertise to put together commodity parts to meet these goals.  That&#8217;s expertise a few customers have, but most don&#8217;t and in any case IT-staff time has value.  If knowledge about how to make and maintain a scalable system is represented in the design of product, then a rational customer would pay up to the equivalent value of IT-staff time to get it even if the physical components are all available off the shelf.  Arrays won&#8217;t go away until every customer is an expert on optimizing performance and availability for a system with multiple tiers, dozens of ports and hundreds of spindles.  Google and LLNL can do better by hiring their own experts, but they&#8217;re not typical customers.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pete Steege</title>
		<link>http://storagemojo.com/2008/12/02/the-new-storage-pyramid/comment-page-1/#comment-198739</link>
		<dc:creator>Pete Steege</dc:creator>
		<pubDate>Wed, 03 Dec 2008 13:54:13 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1027#comment-198739</guid>
		<description>I agree with Steve.  

There&#039;s a bit of an apples &amp; oranges confusion in your pyramind.  A clearer story would show 2 pyramids: one focused on media - RAM, flash, disk drive (2 levels?) - and the other focused on black box solutions.</description>
		<content:encoded><![CDATA[<p>I agree with Steve.  </p>
<p>There&#8217;s a bit of an apples &amp; oranges confusion in your pyramind.  A clearer story would show 2 pyramids: one focused on media &#8211; RAM, flash, disk drive (2 levels?) &#8211; and the other focused on black box solutions.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Jones</title>
		<link>http://storagemojo.com/2008/12/02/the-new-storage-pyramid/comment-page-1/#comment-198733</link>
		<dc:creator>Steve Jones</dc:creator>
		<pubDate>Wed, 03 Dec 2008 09:38:09 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1027#comment-198733</guid>
		<description>I can&#039;t help but think some of the differentiation between storage clusters and arrays as presented here isn&#039;t a little bit artificial. For many organisations then they would be extremely happy is they were just presented with a &quot;black box&quot; which presented certain, standardised storage services with known and predicable service levels. It doesn&#039;t greatly matter to them how is it organised internally, but for sure they will want it very easy to setup, manage, expand  and so on. If it turns out that what underlies this is commodity hardware, servers, operating systems and software then fine - but I stil think that&#039;s a storage array. Perhaps we ought to adopt the term &quot;storage appliance&quot; to avoid religious wars over what consititutes an array. Then it just becomes a matter of how this &quot;storage appliance&quot; is built internally which may well be using what we think of as &quot;storage clusters&quot; - which I would just define as a storage architecture that is largely defined by a software layer unifying the storage functionality over a number of discreet nodes.
I still see a need for some hardware packaging which will make this rather different to just putting a few discrete servers into a rack and calling it an appliance. However, it&#039;s very easy to see that there are certain hardware packaging techniques, such as rumours about the way that future blade enclosures can be used in new ways (such as building very large scale SMPs with cross-blade NUMA memory) which could mean a storage array could be built out of many of the same building blocks as compute farms. Of course some of these techniques aren&#039;t completely new - but they have been hamstrung by sufficiently high-speed, standardised, cheap data interfaces. 
For the true low-end - no doubt the software and processing will be similar, but there&#039;s still a lot of value in optimised hardware design. After all, it&#039;s perfectly possible to run Linux on a PC as a home router, but how much more convenient to buy a small, low-powered dedicated box of a tenth of the size and power consumption optimised for the purpose. That it runs the same basic software as the full sized Linux router hardly matters to the user - it is the packaging and the user interface that matters. 
So arrays won&#039;t go away, and neither do I believe that they will low huge market share to &quot;storage clusters&quot; unless we redefine an array as a storage cluster because of the way it is built (in which case, why isn&#039;t a NetApp a storage clustger?). They will just adapt and commoditise - tha major storage vendors (at least if they have their wits about them) will adapt. DEC failed as they got fat an lazy on over-priced and slow VAX. IBM almost went the same way on mainframes.</description>
		<content:encoded><![CDATA[<p>I can&#8217;t help but think some of the differentiation between storage clusters and arrays as presented here isn&#8217;t a little bit artificial. For many organisations then they would be extremely happy is they were just presented with a &#8220;black box&#8221; which presented certain, standardised storage services with known and predicable service levels. It doesn&#8217;t greatly matter to them how is it organised internally, but for sure they will want it very easy to setup, manage, expand  and so on. If it turns out that what underlies this is commodity hardware, servers, operating systems and software then fine &#8211; but I stil think that&#8217;s a storage array. Perhaps we ought to adopt the term &#8220;storage appliance&#8221; to avoid religious wars over what consititutes an array. Then it just becomes a matter of how this &#8220;storage appliance&#8221; is built internally which may well be using what we think of as &#8220;storage clusters&#8221; &#8211; which I would just define as a storage architecture that is largely defined by a software layer unifying the storage functionality over a number of discreet nodes.<br />
I still see a need for some hardware packaging which will make this rather different to just putting a few discrete servers into a rack and calling it an appliance. However, it&#8217;s very easy to see that there are certain hardware packaging techniques, such as rumours about the way that future blade enclosures can be used in new ways (such as building very large scale SMPs with cross-blade NUMA memory) which could mean a storage array could be built out of many of the same building blocks as compute farms. Of course some of these techniques aren&#8217;t completely new &#8211; but they have been hamstrung by sufficiently high-speed, standardised, cheap data interfaces.<br />
For the true low-end &#8211; no doubt the software and processing will be similar, but there&#8217;s still a lot of value in optimised hardware design. After all, it&#8217;s perfectly possible to run Linux on a PC as a home router, but how much more convenient to buy a small, low-powered dedicated box of a tenth of the size and power consumption optimised for the purpose. That it runs the same basic software as the full sized Linux router hardly matters to the user &#8211; it is the packaging and the user interface that matters.<br />
So arrays won&#8217;t go away, and neither do I believe that they will low huge market share to &#8220;storage clusters&#8221; unless we redefine an array as a storage cluster because of the way it is built (in which case, why isn&#8217;t a NetApp a storage clustger?). They will just adapt and commoditise &#8211; tha major storage vendors (at least if they have their wits about them) will adapt. DEC failed as they got fat an lazy on over-priced and slow VAX. IBM almost went the same way on mainframes.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
