<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Responding to Marc Farley of EqualLogic</title>
	<atom:link href="http://storagemojo.com/2007/04/16/responding-to-marc-farley-of-equallogic/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com/2007/04/16/responding-to-marc-farley-of-equallogic/</link>
	<description>Data storage info &#38; analysis</description>
	<lastBuildDate>Sun, 20 May 2012 13:26:14 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>By: Csaba Csoma</title>
		<link>http://storagemojo.com/2007/04/16/responding-to-marc-farley-of-equallogic/comment-page-1/#comment-163351</link>
		<dc:creator>Csaba Csoma</dc:creator>
		<pubDate>Mon, 14 Jan 2008 21:36:01 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=432#comment-163351</guid>
		<description>As end user I don&#039;t like RAID and I don&#039;t get the price tag either.

Server:
I think clustered file system makes more sense, even if you keep 2 or 3 copies of the same data. Database can be kept on cluster also.
Google proved that this will work even with a lot of data and it can be done fast enough.

Now hopefully the software will get cheap enough to be used. Or Google might release their GFS as open source...


Desktop:
You should not keep your important data there anyways. So a RAID0 on SAS is pretty much all you can have there performance wise. (RAID 0 to see 3 SAS drives as 1 TB...)</description>
		<content:encoded><![CDATA[<p>As end user I don&#8217;t like RAID and I don&#8217;t get the price tag either.</p>
<p>Server:<br />
I think clustered file system makes more sense, even if you keep 2 or 3 copies of the same data. Database can be kept on cluster also.<br />
Google proved that this will work even with a lot of data and it can be done fast enough.</p>
<p>Now hopefully the software will get cheap enough to be used. Or Google might release their GFS as open source&#8230;</p>
<p>Desktop:<br />
You should not keep your important data there anyways. So a RAID0 on SAS is pretty much all you can have there performance wise. (RAID 0 to see 3 SAS drives as 1 TB&#8230;)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bill Todd</title>
		<link>http://storagemojo.com/2007/04/16/responding-to-marc-farley-of-equallogic/comment-page-1/#comment-53232</link>
		<dc:creator>Bill Todd</dc:creator>
		<pubDate>Wed, 18 Apr 2007 23:50:26 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=432#comment-53232</guid>
		<description>And for anyone who feels that I have not already run on at more than sufficient length:

In writing the above I was, I admit, thinking mostly about *file system* use of storage, where the existing lower-level storage interface actively impedes significant potential optimization (as ZFS helps illustrate) and where I see the most potential for dramatic improvement (given that the entire storage stack that lies beneath a file or object abstraction can be rethought).

However, even within the confines of that existing traditional block-level interface there are also possibilities for improvement, and products like Xiotech&#039;s &#039;Magnitude&#039; and perhaps also Compaq/HP&#039;s &#039;EVA&#039; have been exploring them for quite a few years now - primarily by replacing the traditional RAID static, algorithmic mapping between externally-visible logical array addresses and internal disk addresses with an explicit internal mapping that facilitates more flexible variation in numbers and sizes of disks and more flexible use of the space thereon - with considerably less management pain as well.  For that matter, throw in HP&#039;s innovative AutoRAID product from the &#039;90s as well, though AFAIK it never quite achieved the recognition that it might have (did its implementation fail to live up to its potential?).

I still think of such products as &#039;RAID&#039; (well, perhaps &#039;RAID V2.0&#039;), but though they share a common abstraction (use of redundancy in the form of mirroring or parity across disks to achieve increased availability) they certainly don&#039;t fit the 1988 definition at the detail level.  Perhaps this is more what you had in mind (and I just stumbled upon an article from last month that may as well - http://searchstorage.techtarget.com/columnItem/0,294698,sid5_gci1246543,00.html).

- bill</description>
		<content:encoded><![CDATA[<p>And for anyone who feels that I have not already run on at more than sufficient length:</p>
<p>In writing the above I was, I admit, thinking mostly about *file system* use of storage, where the existing lower-level storage interface actively impedes significant potential optimization (as ZFS helps illustrate) and where I see the most potential for dramatic improvement (given that the entire storage stack that lies beneath a file or object abstraction can be rethought).</p>
<p>However, even within the confines of that existing traditional block-level interface there are also possibilities for improvement, and products like Xiotech&#8217;s &#8216;Magnitude&#8217; and perhaps also Compaq/HP&#8217;s &#8216;EVA&#8217; have been exploring them for quite a few years now &#8211; primarily by replacing the traditional RAID static, algorithmic mapping between externally-visible logical array addresses and internal disk addresses with an explicit internal mapping that facilitates more flexible variation in numbers and sizes of disks and more flexible use of the space thereon &#8211; with considerably less management pain as well.  For that matter, throw in HP&#8217;s innovative AutoRAID product from the &#8217;90s as well, though AFAIK it never quite achieved the recognition that it might have (did its implementation fail to live up to its potential?).</p>
<p>I still think of such products as &#8216;RAID&#8217; (well, perhaps &#8216;RAID V2.0&#8242;), but though they share a common abstraction (use of redundancy in the form of mirroring or parity across disks to achieve increased availability) they certainly don&#8217;t fit the 1988 definition at the detail level.  Perhaps this is more what you had in mind (and I just stumbled upon an article from last month that may as well &#8211; <a href="http://searchstorage.techtarget.com/columnItem/0,294698,sid5_gci1246543,00.html" rel="nofollow">http://searchstorage.techtarget.com/columnItem/0,294698,sid5_gci1246543,00.html</a>).</p>
<p>- bill</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bill Todd</title>
		<link>http://storagemojo.com/2007/04/16/responding-to-marc-farley-of-equallogic/comment-page-1/#comment-52653</link>
		<dc:creator>Bill Todd</dc:creator>
		<pubDate>Tue, 17 Apr 2007 01:36:24 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=432#comment-52653</guid>
		<description>Oh, my - where to begin?

RAID costs too much?  When you can get mature open-source software implementations for free?  Especially as I suspect that you&#039;re just itching to discuss an alternative like ZFS, which is itself, of course, a software implementation...

Or perhaps you meant that *good hardware RAID implementations* cost too much.  I think that Marc did an adequate job of defending their pricing - sophisticated NVRAM optimizations, rigorous drive-level and system-level testing, etc.

In anything resembling a free market, things *never* &#039;cost too much&#039;:  they cost what they&#039;re worth, or occasionally even less during market-share wars.  If buyers didn&#039;t feel that current hardware RAID products were worth what they cost, their prices would come down (there&#039;s clearly at least some room for them to, and indeed some activity on this front with the increased acceptance of ATA disk technology).  One could perhaps more reasonably suggest that industry innovation (i.e., competitiveness) has been sluggish in providing lower-cost, at least equally effective alternatives - but that&#039;s a rather different issue.

Management is based on a broken concept?  I think not:  RAIDs require very minimal managment - just give them another disk once in a while if they need one (or if you need more space), and they&#039;ll take care of the rest, just as virtually as virtual-memory systems handle mapping from one view of memory to a different one.

Don&#039;t confuse support for the LUN abstraction with anything RAID-specific:  it&#039;s just a way of continuing support at the RAID level for a pre-existing SCSI single-disk abstraction.  Or, if you were referring to using LUNs in a complex multi-RAID array to segregate access and/or allow software striping across multiple internal RAIDs instead of having a single large, somewhat more failure-prone RAID, then (again) this is not a characteristic of RAID itself.

RAID per se simply virtualizes a single-disk abstraction to encompass multiple cooperating disks for increased capacity and/or performance and/or availability.  As such, it does so easily and effectively.  So perhaps you really meant to suggest that the interface between higher-level software like file systems or databases and the underlying storage (*any* underlying storage - this is hardly specific to RAID) is awkward.  If so, don&#039;t blame RAID for supplying what its consumers demand (and demanded long before RAID as such even appeared):  blame the consumers for not having adjusted such that they could support something better (because until that happens, anyone developing a better RAID interface won&#039;t have any market at all - which is usually a significant impediment to such innovation from below)..

Parity RAID is architecturally doomed?  Rubbish - especially as (again) you confuse more general limitations with something specific to parity RAID.

Disks have indeed increased significantly in size to the point where the likelihood of encountering an unreadable sector during reconstruction is far greater than it once was.  But that&#039;s just as true for mirroring as for parity RAID, save for a small constant (1 for a mirror copy vs., say, 4 for a 5-disk RAID-5 array):  if we&#039;ve been able to tolerate the 2 to 3 orders of magnitude increase in the probability of encountering such rebuild errors as disk sizes grew over the past couple of decades, a mere factor of 4 seems pretty well down in the noise.

Or, to look at it another way, if the risk of this is excessive for RAID-5, it&#039;s really not all that much less for RAID-1, and *neither* will remain adequate for long.  So you&#039;ll be left with the choice between using 3 copies of each datum (for 3x the cost of the underlying storage) or using double-parity RAID (for 1.1x - 1.2x the cost of the underlying storage:  double-parity-protected stripes can be quite a bit wider and still retain significant MTTDL advantages):  some workloads may be sufficiently performance-critical to justify the expense of the former for all their storage, but most installations will find the latter attractive for at least some of their storage.

There is, of course, also evidence that background disk-scrubbing decreases the probability of encountering bad sectors when you least want to by an order of magnitude of more - but this is applicable to all uses, of course, not just to parity arrays.

Your objection to I/O rates is equally flawed, since IOPS/GB have decreased just as much for single-disk or mirrored configurations as they have for parity RAID.  Again, if we&#039;ve been able to tolerate a 2-3 order of magnitude decrease in this value over the past couple of decades, tolerating a mere factor of 2 *only* during degraded read operation seems laughably minor by comparison (especially since in large configurations striped across many small RAID-5 arrays only a small percentage of the data will see *any* such degradation when a disk in one small array fails).

Your suggestion that &quot;those precious IOPS get squandered on read/modify/write cycles&quot; is of course true only for small-write-intensive workloads (full-stripe writes incur no such penalty).  Even for small writes, the impact is hardly major (again, compared to the several-orders-of-magnitude decrease in IOPS/GB we&#039;ve tolerated over the last couple of decades):  at worst, a small write must first read the preexisting data and parity (two parallel reads, including seek and rotational latency, on separate disks) and then update both (two parallel writes on the same two disks, but without seek latency, though a full rotation is required).  One of the reads is often unnecessary, since the original data is still in the disk&#039;s cache.  In any event, while using RAID-5 for small-write-intensive workloads may be inadvisable if performance is absolutely critical, there are plenty of workloads where either performance is just *not* that super-critical or small writes just aren&#039;t that frequent, in which case RAID-5 clearly remains a good choice (with the more expensive RAID-1 capacity reserved for workloads that actually *need* it).

One reason why parity RAID may become at least somewhat relatively less attractive in the future is increasing use of off-site replication, where mirroring between sites is usually the only practical approach.  Even then, though, if your data is sufficiently important to be mirrored off-site, it may well be sufficiently important that mirroring alone is insufficient protection (or may have sufficiently high availability requirements that the probability of finding the remote copy unavailable after a local disk or sector has failed is unacceptable) - so using local parity RAID as well as remote mirroring may wind up the most cost-effective choice (use RAID-5 at both sites and you can tolerate the failure of any 3 disks without data loss at little more storage cost than just having the two sites mirrored entails, and without the added performance impact of RAID-6).  Or you could use 3-way mirroring when performance is more important and you only feel the need to tolerate the loss of any 2 disks.

That pretty much covers my objections, I think.  To sum them up, the &#039;oncoming train wreck&#039; that you describe has nothing whatsoever to do with RAID per se, but with the interface between storage (RAIDed or not) and its higher-level clients.  ZFS has indeed provided a glimpse of A Better Way to manage this interaction - though I question some of their specific design details (the RAID-Z approach that you describe glowingly elsewhere squanders disk utilization - *every* small write writes an entire stripe, tying up N disks for an I/O apiece, and, worse yet *so does every subsequent read of that data*) and will note that they still use mechanisms similar to RAID-1, RAID-5, and now RAID-6 internally to achieve availability.

By the way, recent Newegg pricing for reputable desktop SATA drives has nearly reached $0.25/GB, with little sign that it&#039;s yet hit rock-bottom.  I got my last PATA drive - a 160 GB Seagate - for $0.125/GB, but that was after rebate, whereas Newegg&#039;s prices are out-the-door ones, and last time I checked their prices for &#039;near-line&#039; allegedly more enterprise-quality 7200 rpm SATA drives weren&#039;t a great deal higher.

- bill</description>
		<content:encoded><![CDATA[<p>Oh, my &#8211; where to begin?</p>
<p>RAID costs too much?  When you can get mature open-source software implementations for free?  Especially as I suspect that you&#8217;re just itching to discuss an alternative like ZFS, which is itself, of course, a software implementation&#8230;</p>
<p>Or perhaps you meant that *good hardware RAID implementations* cost too much.  I think that Marc did an adequate job of defending their pricing &#8211; sophisticated NVRAM optimizations, rigorous drive-level and system-level testing, etc.</p>
<p>In anything resembling a free market, things *never* &#8216;cost too much&#8217;:  they cost what they&#8217;re worth, or occasionally even less during market-share wars.  If buyers didn&#8217;t feel that current hardware RAID products were worth what they cost, their prices would come down (there&#8217;s clearly at least some room for them to, and indeed some activity on this front with the increased acceptance of ATA disk technology).  One could perhaps more reasonably suggest that industry innovation (i.e., competitiveness) has been sluggish in providing lower-cost, at least equally effective alternatives &#8211; but that&#8217;s a rather different issue.</p>
<p>Management is based on a broken concept?  I think not:  RAIDs require very minimal managment &#8211; just give them another disk once in a while if they need one (or if you need more space), and they&#8217;ll take care of the rest, just as virtually as virtual-memory systems handle mapping from one view of memory to a different one.</p>
<p>Don&#8217;t confuse support for the LUN abstraction with anything RAID-specific:  it&#8217;s just a way of continuing support at the RAID level for a pre-existing SCSI single-disk abstraction.  Or, if you were referring to using LUNs in a complex multi-RAID array to segregate access and/or allow software striping across multiple internal RAIDs instead of having a single large, somewhat more failure-prone RAID, then (again) this is not a characteristic of RAID itself.</p>
<p>RAID per se simply virtualizes a single-disk abstraction to encompass multiple cooperating disks for increased capacity and/or performance and/or availability.  As such, it does so easily and effectively.  So perhaps you really meant to suggest that the interface between higher-level software like file systems or databases and the underlying storage (*any* underlying storage &#8211; this is hardly specific to RAID) is awkward.  If so, don&#8217;t blame RAID for supplying what its consumers demand (and demanded long before RAID as such even appeared):  blame the consumers for not having adjusted such that they could support something better (because until that happens, anyone developing a better RAID interface won&#8217;t have any market at all &#8211; which is usually a significant impediment to such innovation from below)..</p>
<p>Parity RAID is architecturally doomed?  Rubbish &#8211; especially as (again) you confuse more general limitations with something specific to parity RAID.</p>
<p>Disks have indeed increased significantly in size to the point where the likelihood of encountering an unreadable sector during reconstruction is far greater than it once was.  But that&#8217;s just as true for mirroring as for parity RAID, save for a small constant (1 for a mirror copy vs., say, 4 for a 5-disk RAID-5 array):  if we&#8217;ve been able to tolerate the 2 to 3 orders of magnitude increase in the probability of encountering such rebuild errors as disk sizes grew over the past couple of decades, a mere factor of 4 seems pretty well down in the noise.</p>
<p>Or, to look at it another way, if the risk of this is excessive for RAID-5, it&#8217;s really not all that much less for RAID-1, and *neither* will remain adequate for long.  So you&#8217;ll be left with the choice between using 3 copies of each datum (for 3x the cost of the underlying storage) or using double-parity RAID (for 1.1x &#8211; 1.2x the cost of the underlying storage:  double-parity-protected stripes can be quite a bit wider and still retain significant MTTDL advantages):  some workloads may be sufficiently performance-critical to justify the expense of the former for all their storage, but most installations will find the latter attractive for at least some of their storage.</p>
<p>There is, of course, also evidence that background disk-scrubbing decreases the probability of encountering bad sectors when you least want to by an order of magnitude of more &#8211; but this is applicable to all uses, of course, not just to parity arrays.</p>
<p>Your objection to I/O rates is equally flawed, since IOPS/GB have decreased just as much for single-disk or mirrored configurations as they have for parity RAID.  Again, if we&#8217;ve been able to tolerate a 2-3 order of magnitude decrease in this value over the past couple of decades, tolerating a mere factor of 2 *only* during degraded read operation seems laughably minor by comparison (especially since in large configurations striped across many small RAID-5 arrays only a small percentage of the data will see *any* such degradation when a disk in one small array fails).</p>
<p>Your suggestion that &#8220;those precious IOPS get squandered on read/modify/write cycles&#8221; is of course true only for small-write-intensive workloads (full-stripe writes incur no such penalty).  Even for small writes, the impact is hardly major (again, compared to the several-orders-of-magnitude decrease in IOPS/GB we&#8217;ve tolerated over the last couple of decades):  at worst, a small write must first read the preexisting data and parity (two parallel reads, including seek and rotational latency, on separate disks) and then update both (two parallel writes on the same two disks, but without seek latency, though a full rotation is required).  One of the reads is often unnecessary, since the original data is still in the disk&#8217;s cache.  In any event, while using RAID-5 for small-write-intensive workloads may be inadvisable if performance is absolutely critical, there are plenty of workloads where either performance is just *not* that super-critical or small writes just aren&#8217;t that frequent, in which case RAID-5 clearly remains a good choice (with the more expensive RAID-1 capacity reserved for workloads that actually *need* it).</p>
<p>One reason why parity RAID may become at least somewhat relatively less attractive in the future is increasing use of off-site replication, where mirroring between sites is usually the only practical approach.  Even then, though, if your data is sufficiently important to be mirrored off-site, it may well be sufficiently important that mirroring alone is insufficient protection (or may have sufficiently high availability requirements that the probability of finding the remote copy unavailable after a local disk or sector has failed is unacceptable) &#8211; so using local parity RAID as well as remote mirroring may wind up the most cost-effective choice (use RAID-5 at both sites and you can tolerate the failure of any 3 disks without data loss at little more storage cost than just having the two sites mirrored entails, and without the added performance impact of RAID-6).  Or you could use 3-way mirroring when performance is more important and you only feel the need to tolerate the loss of any 2 disks.</p>
<p>That pretty much covers my objections, I think.  To sum them up, the &#8216;oncoming train wreck&#8217; that you describe has nothing whatsoever to do with RAID per se, but with the interface between storage (RAIDed or not) and its higher-level clients.  ZFS has indeed provided a glimpse of A Better Way to manage this interaction &#8211; though I question some of their specific design details (the RAID-Z approach that you describe glowingly elsewhere squanders disk utilization &#8211; *every* small write writes an entire stripe, tying up N disks for an I/O apiece, and, worse yet *so does every subsequent read of that data*) and will note that they still use mechanisms similar to RAID-1, RAID-5, and now RAID-6 internally to achieve availability.</p>
<p>By the way, recent Newegg pricing for reputable desktop SATA drives has nearly reached $0.25/GB, with little sign that it&#8217;s yet hit rock-bottom.  I got my last PATA drive &#8211; a 160 GB Seagate &#8211; for $0.125/GB, but that was after rebate, whereas Newegg&#8217;s prices are out-the-door ones, and last time I checked their prices for &#8216;near-line&#8217; allegedly more enterprise-quality 7200 rpm SATA drives weren&#8217;t a great deal higher.</p>
<p>- bill</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Chris M Evans</title>
		<link>http://storagemojo.com/2007/04/16/responding-to-marc-farley-of-equallogic/comment-page-1/#comment-52591</link>
		<dc:creator>Chris M Evans</dc:creator>
		<pubDate>Mon, 16 Apr 2007 18:14:01 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=432#comment-52591</guid>
		<description>Robin, I think it&#039;s a little simplistic to blame RAID on the way disk drive manufacturers have failed to keep the ratio of capacity to IOPS consistent over the years.  We&#039;re still using the same protocol to store and retrieve data on directly attached (SCSI/SATA) devices as we do when they&#039;re behind a RAID controller.  Perhaps what we need is an improved version of SATA/SCSI which doesn&#039;t provide a &quot;normal&quot; view of a HDD but is optimised to read/write as it knows it is behind a RAID or other type of controller.</description>
		<content:encoded><![CDATA[<p>Robin, I think it&#8217;s a little simplistic to blame RAID on the way disk drive manufacturers have failed to keep the ratio of capacity to IOPS consistent over the years.  We&#8217;re still using the same protocol to store and retrieve data on directly attached (SCSI/SATA) devices as we do when they&#8217;re behind a RAID controller.  Perhaps what we need is an improved version of SATA/SCSI which doesn&#8217;t provide a &#8220;normal&#8221; view of a HDD but is optimised to read/write as it knows it is behind a RAID or other type of controller.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

