<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: The virtual machine I/O blender</title>
	<atom:link href="http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/</link>
	<description>Data storage info &#38; analysis</description>
	<pubDate>Thu, 20 Nov 2008 20:34:15 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.3</generator>
		<item>
		<title>By: Rich Whiffen &#187; Blog Archive &#187; I/O and virtual machines&#8230;</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-197828</link>
		<dc:creator>Rich Whiffen &#187; Blog Archive &#187; I/O and virtual machines&#8230;</dc:creator>
		<pubDate>Sun, 28 Sep 2008 02:16:37 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-197828</guid>
		<description>[...] The Virtual Machine I/O Blender [...]</description>
		<content:encoded><![CDATA[<p>[...] The Virtual Machine I/O Blender [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Darcy</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196932</link>
		<dc:creator>Jeff Darcy</dc:creator>
		<pubDate>Mon, 28 Jul 2008 02:53:38 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196932</guid>
		<description>"if my Enterprise Flash Drive, or anyone else’s for that matter, looks like a single drive and functions like a block device I should consider this complex? I would assume just about any level system administrator could manage this type of storage"

Yes, and I'm sure they could manage any of a dozen alternatives too, but that didn't stop you from presenting their complexity as a problem or a barrier.  Why make an exception for your own kind of complexity?

"I wasn’t trying to affect my brand in any way shape or form"

Touting the key differentiating technology in your brand is practically the same as pushing the brand itself.  When EMC was touting the advantages of cache-centric RAID, or NetApp was preaching the NAS gospel, everyone knew it was their way of enhancing their own brands.  When you try to follow in their footsteps, you are most definiely trying to affect your brand.

As for your "fun with math" Bill and I are hardly ever on the same page but he's pretty much right this time.  Just adding up your IOPS numbers and comparing the total to a competitors' measured whole-system result is fishy.  It's like HPC vendors adding up per-CPU GFLOPS, acting as though load can be distributed perfectly and communication won't matter.  Savvy customers in either market know their workloads don't distribute and scale the way you'd like them to, and if they decide you don't Get It then you've lost.

Also, your product provides lots of IOPS per dollar, but what happens when the customer wants something that they get from enterprise storage besides raw IOPS?  Want RAID or multipathing?  Set up MD/DM, which is kludgier than any of the dozen array interfaces I've used, to get RAID; forget about multipathing.  What about disaster recovery or backup?  Install more host software (hoping its supported on your particular platform) and expect to burn more host cycles running it.  Even if your speeds and feeds look good, even if your prices look good, any comparison to enterprise storage is still suspect if you don't have enterprise-storage features.  Does http://archives.postgresql.org/pgsql-performance/2008-07/msg00010.php say "enterprise" to anyone here?

"The Linux driver for this
piece of hardware is pretty dodgy.  Sub-alpha quality actually.  But
they seem to be working on it.  Also there's no driver for
OpenSolaris, Mac OS X, or Windows right now.  In fact there's not even
anything available for Debian or other respectable Linux distros, only
Red Hat and its clones."

Anecdotal, admittedly, but are anecdotes any less reliable than marketing claims?  Like Anonymous, I think you guys have a lot to be proud of.  Unlike Anonymous, I'm willing to sign my name to what I say.  Where you and I differ, I think, is that I see your product as a building block to be combined with existing technologies, whereas you seem to see it as a complete solution that displaces them.  Only time will tell which of us is right, but my experience building enterprise storage systems and watching them get sold (or not) by both incumbents and upstarts makes me pretty confident about my guess. I know what "science project" means to those kinds of customers.  That's why I sell my science projects to scientists now.  ;)</description>
		<content:encoded><![CDATA[<p>&#8220;if my Enterprise Flash Drive, or anyone else’s for that matter, looks like a single drive and functions like a block device I should consider this complex? I would assume just about any level system administrator could manage this type of storage&#8221;</p>
<p>Yes, and I&#8217;m sure they could manage any of a dozen alternatives too, but that didn&#8217;t stop you from presenting their complexity as a problem or a barrier.  Why make an exception for your own kind of complexity?</p>
<p>&#8220;I wasn’t trying to affect my brand in any way shape or form&#8221;</p>
<p>Touting the key differentiating technology in your brand is practically the same as pushing the brand itself.  When EMC was touting the advantages of cache-centric RAID, or NetApp was preaching the NAS gospel, everyone knew it was their way of enhancing their own brands.  When you try to follow in their footsteps, you are most definiely trying to affect your brand.</p>
<p>As for your &#8220;fun with math&#8221; Bill and I are hardly ever on the same page but he&#8217;s pretty much right this time.  Just adding up your IOPS numbers and comparing the total to a competitors&#8217; measured whole-system result is fishy.  It&#8217;s like HPC vendors adding up per-CPU GFLOPS, acting as though load can be distributed perfectly and communication won&#8217;t matter.  Savvy customers in either market know their workloads don&#8217;t distribute and scale the way you&#8217;d like them to, and if they decide you don&#8217;t Get It then you&#8217;ve lost.</p>
<p>Also, your product provides lots of IOPS per dollar, but what happens when the customer wants something that they get from enterprise storage besides raw IOPS?  Want RAID or multipathing?  Set up MD/DM, which is kludgier than any of the dozen array interfaces I&#8217;ve used, to get RAID; forget about multipathing.  What about disaster recovery or backup?  Install more host software (hoping its supported on your particular platform) and expect to burn more host cycles running it.  Even if your speeds and feeds look good, even if your prices look good, any comparison to enterprise storage is still suspect if you don&#8217;t have enterprise-storage features.  Does <a href="http://archives.postgresql.org/pgsql-performance/2008-07/msg00010.php" rel="nofollow">http://archives.postgresql.org/pgsql-performance/2008-07/msg00010.php</a> say &#8220;enterprise&#8221; to anyone here?</p>
<p>&#8220;The Linux driver for this<br />
piece of hardware is pretty dodgy.  Sub-alpha quality actually.  But<br />
they seem to be working on it.  Also there&#8217;s no driver for<br />
OpenSolaris, Mac OS X, or Windows right now.  In fact there&#8217;s not even<br />
anything available for Debian or other respectable Linux distros, only<br />
Red Hat and its clones.&#8221;</p>
<p>Anecdotal, admittedly, but are anecdotes any less reliable than marketing claims?  Like Anonymous, I think you guys have a lot to be proud of.  Unlike Anonymous, I&#8217;m willing to sign my name to what I say.  Where you and I differ, I think, is that I see your product as a building block to be combined with existing technologies, whereas you seem to see it as a complete solution that displaces them.  Only time will tell which of us is right, but my experience building enterprise storage systems and watching them get sold (or not) by both incumbents and upstarts makes me pretty confident about my guess. I know what &#8220;science project&#8221; means to those kinds of customers.  That&#8217;s why I sell my science projects to scientists now.  <img src='http://storagemojo.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rick White</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196931</link>
		<dc:creator>Rick White</dc:creator>
		<pubDate>Mon, 28 Jul 2008 01:56:58 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196931</guid>
		<description>So, as much as I would like to address the great questions and the jab or two, I would rather just end it here with one point of clarification. I wasn't prepared to discuss Fusion's products in detail but to be fair since there is some confusion, I wasn't talking about our ioDrives. What I was referring to when I said "hypothetical" was definitely mirrored and networked but not yet announced. So Bill's assumptions don't work but he had no way of knowing this so I'll just leave it at that.

No matter what happens it should be interesting to watch and if Bill or anyone else wants to get together to share ideas about the storage industry then dinners on me at SNW this fall.</description>
		<content:encoded><![CDATA[<p>So, as much as I would like to address the great questions and the jab or two, I would rather just end it here with one point of clarification. I wasn&#8217;t prepared to discuss Fusion&#8217;s products in detail but to be fair since there is some confusion, I wasn&#8217;t talking about our ioDrives. What I was referring to when I said &#8220;hypothetical&#8221; was definitely mirrored and networked but not yet announced. So Bill&#8217;s assumptions don&#8217;t work but he had no way of knowing this so I&#8217;ll just leave it at that.</p>
<p>No matter what happens it should be interesting to watch and if Bill or anyone else wants to get together to share ideas about the storage industry then dinners on me at SNW this fall.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bill Todd</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196930</link>
		<dc:creator>Bill Todd</dc:creator>
		<pubDate>Sun, 27 Jul 2008 23:50:26 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196930</guid>
		<description>Rick, I can assure you that I'm not at all 'like you', but it does help to know that I'm talking with a marketeer with an agenda rather than just with a confused, enthusiastic amateur.

So let's get right down to the specifics about just how disingenuous you're really being here:

1.  Plugging your performance estimates above into the 'performance calculator' on your own Web site indicates that you're positioning 32 of your io-drive 320 drives in an *unmirrored* configuration against a *mirrored* Clariion configuration - but reporting only the net (not raw) storage capacity of the latter.  Hence right off the bat your relative $/GB comparison is off by a factor of over 2.5 (given that you weren't giving the Clariion any credit for its hot spares either) - unless you're seriously suggesting that there's no need to mirror and provide spares for your own storage to guard against failure (in which case we can just stop paying any attention to you right now, since you're not only completely clueless in general but aren't even aware of the material on your own Web site which provides both mirrored and parity-based configurations in its performance calculator).

2.  While you do quote the actual max measured IOPS value for the Clariion, that's hardly representative of typical *response times* - because in *typical* use the latter are well under 10 ms. rather than well over 20 ms. (as becomes the case as one approaches complete saturation).

3.  As Jeff pointed out you're comparing local storage of your own (which appears to be the only kind that you offer) against SAN-located storage.  Not only does the latter include the cost of separate enclosures to terminate the required external busses and house the controller and the storage to which it connects, but in the case of the Clariion it includes a controller capable of supporting 60% more drives (and over 200% more overall capacity if using 300 GB drives) than were configured in the test you cite (i.e., a lot of the controller value was left unused but still contributed to the overall price that you cited).

4.  Jeff also mentioned the relative priciness of EMC storage, but didn't quite do that subject justice:  a quick look around the Web (e.g., at NewEgg and ServersDirect) indicates that comparable internal Seagate and Fujitsu 15krpm drives in the 147 - 450 GB range are available at $1.40 - $1.55/GB - so that's the *real* apples-to-apples price competition with your own $33/GB internal flash storage (if you don't wish to include SATA drives at $0.15 - $0.20/GB, that is - though they're eminently suitable for anything but very demanding IOPS-intensive applications).

5.  Such as streaming data, for example - where Seagate's new 450 GB SAS 15krpm drives sport a maximum data rate of 164 MB/sec *apiece* and bottom out at over 100 MB/sec.  Even the older drives in the Clariion you cited should support average rates in the 100 MB/sec area - so it would take just *2* of them to sustain the 204 MB/sec of bandwidth that you so casually claim for the Clariion box (perhaps the result of multiplying its measured IOPS by a small, random request size rather than assessing its streaming bandwidth), and it would take only about 40 of them (at an aggregate cost of under $10,000) to match the claimed 4 GB/sec bandwidth of your $340,000 flash solution (though in this case using around 60 SATA drives at a total cost of well under $5,000 might make more sense - and would offer considerably greater capacity as well).

6.  Oh, yeah - to be complete, I should address your product's single real claim to fame:  its supposedly fabulous IOPS/$.  If we remove the mirroring and hot-spare overhead from the Clariion box and remove the box itself (since we want to perform an apples-to-apples internal-storage comparision), we find that the actual cost of the 60 or so disks when purchased directly at retail rather than through EMC should be under $14,000 - which works out to under $0.56/SPC-1 IOP (lower than the $18/SPC-1 IOP figure you'd like people to believe by a factor of more than 30, and entirely consistent with measured IOPS for this calibre of disk with the significant queue lengths that you assumed).  Thus the claimed $0.35/SPC-1 IOP for your own product, while superior, is not at all *dramatically* superior - especially considering the fact that one has to pay at least 20x the price per GB to obtain it (though to be fair the rare application that is so response-time sensitive that it can't tolerate sub-10 ms. access latencies might still be tempted to consider it as an option).

In the end your 'fun with math' reminds me of the old saying that figures may not lie, but liars sure can figure.  Henceforth your company might be better served if you confined your misrepresentations to non-technical venues where they're less likely to get called out for what they are.

- bill</description>
		<content:encoded><![CDATA[<p>Rick, I can assure you that I&#8217;m not at all &#8216;like you&#8217;, but it does help to know that I&#8217;m talking with a marketeer with an agenda rather than just with a confused, enthusiastic amateur.</p>
<p>So let&#8217;s get right down to the specifics about just how disingenuous you&#8217;re really being here:</p>
<p>1.  Plugging your performance estimates above into the &#8216;performance calculator&#8217; on your own Web site indicates that you&#8217;re positioning 32 of your io-drive 320 drives in an *unmirrored* configuration against a *mirrored* Clariion configuration - but reporting only the net (not raw) storage capacity of the latter.  Hence right off the bat your relative $/GB comparison is off by a factor of over 2.5 (given that you weren&#8217;t giving the Clariion any credit for its hot spares either) - unless you&#8217;re seriously suggesting that there&#8217;s no need to mirror and provide spares for your own storage to guard against failure (in which case we can just stop paying any attention to you right now, since you&#8217;re not only completely clueless in general but aren&#8217;t even aware of the material on your own Web site which provides both mirrored and parity-based configurations in its performance calculator).</p>
<p>2.  While you do quote the actual max measured IOPS value for the Clariion, that&#8217;s hardly representative of typical *response times* - because in *typical* use the latter are well under 10 ms. rather than well over 20 ms. (as becomes the case as one approaches complete saturation).</p>
<p>3.  As Jeff pointed out you&#8217;re comparing local storage of your own (which appears to be the only kind that you offer) against SAN-located storage.  Not only does the latter include the cost of separate enclosures to terminate the required external busses and house the controller and the storage to which it connects, but in the case of the Clariion it includes a controller capable of supporting 60% more drives (and over 200% more overall capacity if using 300 GB drives) than were configured in the test you cite (i.e., a lot of the controller value was left unused but still contributed to the overall price that you cited).</p>
<p>4.  Jeff also mentioned the relative priciness of EMC storage, but didn&#8217;t quite do that subject justice:  a quick look around the Web (e.g., at NewEgg and ServersDirect) indicates that comparable internal Seagate and Fujitsu 15krpm drives in the 147 - 450 GB range are available at $1.40 - $1.55/GB - so that&#8217;s the *real* apples-to-apples price competition with your own $33/GB internal flash storage (if you don&#8217;t wish to include SATA drives at $0.15 - $0.20/GB, that is - though they&#8217;re eminently suitable for anything but very demanding IOPS-intensive applications).</p>
<p>5.  Such as streaming data, for example - where Seagate&#8217;s new 450 GB SAS 15krpm drives sport a maximum data rate of 164 MB/sec *apiece* and bottom out at over 100 MB/sec.  Even the older drives in the Clariion you cited should support average rates in the 100 MB/sec area - so it would take just *2* of them to sustain the 204 MB/sec of bandwidth that you so casually claim for the Clariion box (perhaps the result of multiplying its measured IOPS by a small, random request size rather than assessing its streaming bandwidth), and it would take only about 40 of them (at an aggregate cost of under $10,000) to match the claimed 4 GB/sec bandwidth of your $340,000 flash solution (though in this case using around 60 SATA drives at a total cost of well under $5,000 might make more sense - and would offer considerably greater capacity as well).</p>
<p>6.  Oh, yeah - to be complete, I should address your product&#8217;s single real claim to fame:  its supposedly fabulous IOPS/$.  If we remove the mirroring and hot-spare overhead from the Clariion box and remove the box itself (since we want to perform an apples-to-apples internal-storage comparision), we find that the actual cost of the 60 or so disks when purchased directly at retail rather than through EMC should be under $14,000 - which works out to under $0.56/SPC-1 IOP (lower than the $18/SPC-1 IOP figure you&#8217;d like people to believe by a factor of more than 30, and entirely consistent with measured IOPS for this calibre of disk with the significant queue lengths that you assumed).  Thus the claimed $0.35/SPC-1 IOP for your own product, while superior, is not at all *dramatically* superior - especially considering the fact that one has to pay at least 20x the price per GB to obtain it (though to be fair the rare application that is so response-time sensitive that it can&#8217;t tolerate sub-10 ms. access latencies might still be tempted to consider it as an option).</p>
<p>In the end your &#8216;fun with math&#8217; reminds me of the old saying that figures may not lie, but liars sure can figure.  Henceforth your company might be better served if you confined your misrepresentations to non-technical venues where they&#8217;re less likely to get called out for what they are.</p>
<p>- bill</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rick White</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196928</link>
		<dc:creator>Rick White</dc:creator>
		<pubDate>Sun, 27 Jul 2008 19:37:34 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196928</guid>
		<description>I always find it interesting when someone is willing to post their whole name and not hide who they are yet they get accused of not disclosing who they are by someone using the name "Anonymous"
 
Anonymous said, "...those opinions mean nothing without data and less than nothing when the people stating them fail to disclose their interest in a particular conclusion." -- I thought I was perfectly clear when I started my response out by saying "First off I’ll admit I’m biased…" I didn't think it was appropriate to plug my companies specific products so I spoke vaguely about very REAL solutions entering the market today, which is why I used a SAN model number Bill would be familiar with while not calling it out specifically.
 
Anonymous said, "Flash has not yet reached to capacity or cost per gigabyte of disk..." -- I apologize for my direct response but you're just wrong on this point and I don’t know how else to say it. If you're talking about the raw media, a single NAND chip versus a single enterprise hard disk-drive then you're right. When you aggregate NAND for performance versus disk an interesting thing happens, disks become more expensive per gigabyte and I mean a LOT more expensive.
 
Anonymous said, "...nor has it reached the speed of RAM." -- I never meant to imply that it has. If 357-microsend response time is close to your RAM's latency I would suggest buying better RAM because that's 100-times slower than what I would expect from my systems RAM.
 
Anonymous said, "Maybe it will develop to the point where it can displace one or the other some day, but not this day..." -- I don't recall claiming that Enterprise Flash Drives will displace system RAM. Although, I do believe you'll need less of it, which means lower density chips at a lower price per GB and that's a good thing for customers.
 
Also, Rick White said, "However I do agree that disk arrays make sense for archival storage..." -- So as I said before I don't believe Enterprise Flash Drives will displace hard disk-drives altogether, but again, I believe we will use smaller (2.5"), lower power, larger capacity, slower SATA like drives for archival storage versus faster, higher power, lower capacity FC like drives for high performance storage.
 
Anonymous said, "Flash will continue to be yet another layer in the memory/storage hierarchy, perhaps a revolutionary one but still one adding and not subtracting complexity - both internally (with non-trivial flash translation layers etc.) and externally (with more staging between hierarchy levels)..." -- So if my Enterprise Flash Drive, or anyone else’s for that matter, looks like a single drive and functions like a block device I should consider this complex? I would assume just about any level system administrator could manage this type of storage or even a RAID of disks like this with very little effort and zero training but with some amazing performance implications (let's say more than 50,000+ IOPS).
 
Anonymous said, "Until latency reaches *zero*..." -- I'm not sure how we would ever reach *zero* latency but if you say so.
 
Anonymous said, "...read-ahead and write-behind have the potential to be useful with any possible technology." -- Would you mind elaborating on this?
 
Anonymous said, "The uninformed can make all the guesses they want about how this or that technology *might* work with virtual machines, but those opinions mean nothing without data..." -- I'm not sure how you want me to reply to this because I think we both know where this rabbit hole would take us and I don't think it's appropriate for companies to directly plug their specific products in forums like this but that's just my opinion. Needless to say I wasn't nearly as hypothetical as I was implying but I was trying to play nice.
 
Anonymous said, "...and less than nothing when the people stating them fail to disclose their interest in a particular conclusion." -- Which I assume you're referring to someone else considering I did state who I was (not Anonymous) and that I'm biased.
 
Anonymous said, "Your company makes a great product with a great value proposition..." -- Thank you.
 
Anonymous said, "...I wish you well." -- If I had any idea who you were or what company you represented I would return the niceties.
 
Anonymous said, "Using FUD and astroturf, though, will only hurt your brand instead of helping it." -- I wasn't trying to affect my brand in any way shape or form, I was merely discussing a new media and the impact it could have on the way we architect storage in the future. It would appear you are trying to affect my brand by making unfounded and what I consider unfair accusations without even saying who you are. I can only assume you're a very frightened competitor who sells lots of disks for performance and your world is getting turned upside down by the possibilities about to unfold in the high performance storage market.
 
In closing, I'm not trying to pick a fight I was just sharing a different opinion but if this makes you uncomfortable I apologize. I do feel bad for the potential impact this may or may not have on your company but please don't take it out on me just because Enterprise Flash Drives are going to have an impact on the high performance storage Market...in my opinion.</description>
		<content:encoded><![CDATA[<p>I always find it interesting when someone is willing to post their whole name and not hide who they are yet they get accused of not disclosing who they are by someone using the name &#8220;Anonymous&#8221;</p>
<p>Anonymous said, &#8220;&#8230;those opinions mean nothing without data and less than nothing when the people stating them fail to disclose their interest in a particular conclusion.&#8221; &#8212; I thought I was perfectly clear when I started my response out by saying &#8220;First off I’ll admit I’m biased…&#8221; I didn&#8217;t think it was appropriate to plug my companies specific products so I spoke vaguely about very REAL solutions entering the market today, which is why I used a SAN model number Bill would be familiar with while not calling it out specifically.</p>
<p>Anonymous said, &#8220;Flash has not yet reached to capacity or cost per gigabyte of disk&#8230;&#8221; &#8212; I apologize for my direct response but you&#8217;re just wrong on this point and I don’t know how else to say it. If you&#8217;re talking about the raw media, a single NAND chip versus a single enterprise hard disk-drive then you&#8217;re right. When you aggregate NAND for performance versus disk an interesting thing happens, disks become more expensive per gigabyte and I mean a LOT more expensive.</p>
<p>Anonymous said, &#8220;&#8230;nor has it reached the speed of RAM.&#8221; &#8212; I never meant to imply that it has. If 357-microsend response time is close to your RAM&#8217;s latency I would suggest buying better RAM because that&#8217;s 100-times slower than what I would expect from my systems RAM.</p>
<p>Anonymous said, &#8220;Maybe it will develop to the point where it can displace one or the other some day, but not this day&#8230;&#8221; &#8212; I don&#8217;t recall claiming that Enterprise Flash Drives will displace system RAM. Although, I do believe you&#8217;ll need less of it, which means lower density chips at a lower price per GB and that&#8217;s a good thing for customers.</p>
<p>Also, Rick White said, &#8220;However I do agree that disk arrays make sense for archival storage&#8230;&#8221; &#8212; So as I said before I don&#8217;t believe Enterprise Flash Drives will displace hard disk-drives altogether, but again, I believe we will use smaller (2.5&#8243;), lower power, larger capacity, slower SATA like drives for archival storage versus faster, higher power, lower capacity FC like drives for high performance storage.</p>
<p>Anonymous said, &#8220;Flash will continue to be yet another layer in the memory/storage hierarchy, perhaps a revolutionary one but still one adding and not subtracting complexity - both internally (with non-trivial flash translation layers etc.) and externally (with more staging between hierarchy levels)&#8230;&#8221; &#8212; So if my Enterprise Flash Drive, or anyone else’s for that matter, looks like a single drive and functions like a block device I should consider this complex? I would assume just about any level system administrator could manage this type of storage or even a RAID of disks like this with very little effort and zero training but with some amazing performance implications (let&#8217;s say more than 50,000+ IOPS).</p>
<p>Anonymous said, &#8220;Until latency reaches *zero*&#8230;&#8221; &#8212; I&#8217;m not sure how we would ever reach *zero* latency but if you say so.</p>
<p>Anonymous said, &#8220;&#8230;read-ahead and write-behind have the potential to be useful with any possible technology.&#8221; &#8212; Would you mind elaborating on this?</p>
<p>Anonymous said, &#8220;The uninformed can make all the guesses they want about how this or that technology *might* work with virtual machines, but those opinions mean nothing without data&#8230;&#8221; &#8212; I&#8217;m not sure how you want me to reply to this because I think we both know where this rabbit hole would take us and I don&#8217;t think it&#8217;s appropriate for companies to directly plug their specific products in forums like this but that&#8217;s just my opinion. Needless to say I wasn&#8217;t nearly as hypothetical as I was implying but I was trying to play nice.</p>
<p>Anonymous said, &#8220;&#8230;and less than nothing when the people stating them fail to disclose their interest in a particular conclusion.&#8221; &#8212; Which I assume you&#8217;re referring to someone else considering I did state who I was (not Anonymous) and that I&#8217;m biased.</p>
<p>Anonymous said, &#8220;Your company makes a great product with a great value proposition&#8230;&#8221; &#8212; Thank you.</p>
<p>Anonymous said, &#8220;&#8230;I wish you well.&#8221; &#8212; If I had any idea who you were or what company you represented I would return the niceties.</p>
<p>Anonymous said, &#8220;Using FUD and astroturf, though, will only hurt your brand instead of helping it.&#8221; &#8212; I wasn&#8217;t trying to affect my brand in any way shape or form, I was merely discussing a new media and the impact it could have on the way we architect storage in the future. It would appear you are trying to affect my brand by making unfounded and what I consider unfair accusations without even saying who you are. I can only assume you&#8217;re a very frightened competitor who sells lots of disks for performance and your world is getting turned upside down by the possibilities about to unfold in the high performance storage market.</p>
<p>In closing, I&#8217;m not trying to pick a fight I was just sharing a different opinion but if this makes you uncomfortable I apologize. I do feel bad for the potential impact this may or may not have on your company but please don&#8217;t take it out on me just because Enterprise Flash Drives are going to have an impact on the high performance storage Market&#8230;in my opinion.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joe Kraska</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196927</link>
		<dc:creator>Joe Kraska</dc:creator>
		<pubDate>Sun, 27 Jul 2008 15:23:14 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196927</guid>
		<description>$53/GB for a SAN?

Gentlemen, you seriously need to get better procurement teams, or better yet, stop using price lists... no one, and I mean no one, buys at those prices.

Anyway, I have no doubt that high speed flash will improve storage... I can envision FusionIO's device as an ideal dedicated journal device, and no doubt technologies like Compellent will leverage flash soon through dynamic automated block-level ILM.

Joe Kraska
San Diego CA
USA</description>
		<content:encoded><![CDATA[<p>$53/GB for a SAN?</p>
<p>Gentlemen, you seriously need to get better procurement teams, or better yet, stop using price lists&#8230; no one, and I mean no one, buys at those prices.</p>
<p>Anyway, I have no doubt that high speed flash will improve storage&#8230; I can envision FusionIO&#8217;s device as an ideal dedicated journal device, and no doubt technologies like Compellent will leverage flash soon through dynamic automated block-level ILM.</p>
<p>Joe Kraska<br />
San Diego CA<br />
USA</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Darcy</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196926</link>
		<dc:creator>Jeff Darcy</dc:creator>
		<pubDate>Sun, 27 Jul 2008 15:11:02 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196926</guid>
		<description>Apples to oranges, Rick.  First, using the EMC Clariion CX-40 for the comparison is hardly representative because everyone knows they sell the most overpriced storage in the industry.  Second, your (unaudited) numbers only work if all of the computation is colocated with the storage - unlikely in the real world.  If some separate set of servers are doing the computation, then you have to account for the cost and performance impact of going from those servers to the ones with the Fusion cards.  Internal storage is not directly comparable to shared storage; if it were, you'd have to compare not to a Clariion but to much cheaper drives inside each server.

Server and OS vendors are indeed working on some amazing alternatives.  Some are based on flash.  Some are not.</description>
		<content:encoded><![CDATA[<p>Apples to oranges, Rick.  First, using the EMC Clariion CX-40 for the comparison is hardly representative because everyone knows they sell the most overpriced storage in the industry.  Second, your (unaudited) numbers only work if all of the computation is colocated with the storage - unlikely in the real world.  If some separate set of servers are doing the computation, then you have to account for the cost and performance impact of going from those servers to the ones with the Fusion cards.  Internal storage is not directly comparable to shared storage; if it were, you&#8217;d have to compare not to a Clariion but to much cheaper drives inside each server.</p>
<p>Server and OS vendors are indeed working on some amazing alternatives.  Some are based on flash.  Some are not.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anonymous</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196924</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Sun, 27 Jul 2008 13:33:28 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196924</guid>
		<description>Rick White, VP of Marketing for Fusion io, is misrepresenting the facts.  Flash has not yet reached to capacity or cost per gigabyte of disk, nor has it reached the speed of RAM.  Maybe it will develop to the point where it can displace one or the other some day, but not this day.  Flash will continue to be yet another layer in the memory/storage hierarchy, perhaps a revolutionary one but still one adding and not subtracting complexity - both internally (with non-trivial flash translation layers etc.) and externally (with more staging between hierarchy levels).  Until latency reaches *zero* read-ahead and write-behind have the potential to be useful with any possible technology.  The uninformed can make all the guesses they want about how this or that technology *might* work with virtual machines, but those opinions mean nothing without data and less than nothing when the people stating them fail to disclose their interest in a particular conclusion.

Your company makes a great product with a great value proposition, Rick, and I wish you well.  Using FUD and astroturf, though, will only hurt your brand instead of helping it.</description>
		<content:encoded><![CDATA[<p>Rick White, VP of Marketing for Fusion io, is misrepresenting the facts.  Flash has not yet reached to capacity or cost per gigabyte of disk, nor has it reached the speed of RAM.  Maybe it will develop to the point where it can displace one or the other some day, but not this day.  Flash will continue to be yet another layer in the memory/storage hierarchy, perhaps a revolutionary one but still one adding and not subtracting complexity - both internally (with non-trivial flash translation layers etc.) and externally (with more staging between hierarchy levels).  Until latency reaches *zero* read-ahead and write-behind have the potential to be useful with any possible technology.  The uninformed can make all the guesses they want about how this or that technology *might* work with virtual machines, but those opinions mean nothing without data and less than nothing when the people stating them fail to disclose their interest in a particular conclusion.</p>
<p>Your company makes a great product with a great value proposition, Rick, and I wish you well.  Using FUD and astroturf, though, will only hurt your brand instead of helping it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rick White</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196921</link>
		<dc:creator>Rick White</dc:creator>
		<pubDate>Sun, 27 Jul 2008 08:08:31 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196921</guid>
		<description>Bill said, "We’re not talking about kludges here: we’re talking about economically extracting the most from what we’ve got to work with (not that bad a definition for engineering in general). If your mind is boggled, it’s because it’s untutored in the subject."

Bill I apologize if I frustrated you when I used the term, "kludge" and I hope I won’t offend you if my untutored mind has some fun with math...

Let's start by creating a hypothetical SAN and we'll call it the Model-40 for kicks. I would suspect it could do about 24,998 SPC-1 IOPS, sustain 204 MB/s bandwidth, with a 24,180-microsecond response time, taking up 38 rack units of space and a total ASU capacity of roughly 8,466 GB. Now I wonder what a SAN like this would cost...

Total TSC Price:  $448,435
SPC-1 Price-Performance:  $18/SPC-1 IOPS 
Price per GB:  $53

Now let's create a hypothetical blade server and let's say this blade server could hold...I don't know, let's say it could hold 32 Enterprise Flash Drives in a single blade chassis. I would suspect this system could do well over 1,000,000 IOPS, sustain 3,990 MB/s bandwidth, with a 357-microsecond response time, taking up 10 rack units of space and a total ASU capacity of roughly 10,208 GB. I wonder what a storage system like this would cost...

Total TSC Price:  $340,230
SPC-1 Price-Performance:  $0.35/SPC-1 IOPS 
Price per GB:  $33
*Including a fully configured sixteen-blade server.

If you're like me your probably thinking, "Hmm? Disks are expensive when you aggregate them for performance." Of course when it's your only option you make the best of it but the market is changing and it won't be our only option for long. I would suspect the server and OS vendors are working on some amazing alternatives right now.

However I do agree that disk arrays make sense for archival storage. But in my untutored and biased opinion I believe in the next few years it will be considered fiscally irresponsible to continue aggregating disks for performance (both CAPEX and OPEX). Of course this is just my untutored and probably ignorant opinion but it should be fun to watch as the market evolves no matter what ends up happening.</description>
		<content:encoded><![CDATA[<p>Bill said, &#8220;We’re not talking about kludges here: we’re talking about economically extracting the most from what we’ve got to work with (not that bad a definition for engineering in general). If your mind is boggled, it’s because it’s untutored in the subject.&#8221;</p>
<p>Bill I apologize if I frustrated you when I used the term, &#8220;kludge&#8221; and I hope I won’t offend you if my untutored mind has some fun with math&#8230;</p>
<p>Let&#8217;s start by creating a hypothetical SAN and we&#8217;ll call it the Model-40 for kicks. I would suspect it could do about 24,998 SPC-1 IOPS, sustain 204 MB/s bandwidth, with a 24,180-microsecond response time, taking up 38 rack units of space and a total ASU capacity of roughly 8,466 GB. Now I wonder what a SAN like this would cost&#8230;</p>
<p>Total TSC Price:  $448,435<br />
SPC-1 Price-Performance:  $18/SPC-1 IOPS<br />
Price per GB:  $53</p>
<p>Now let&#8217;s create a hypothetical blade server and let&#8217;s say this blade server could hold&#8230;I don&#8217;t know, let&#8217;s say it could hold 32 Enterprise Flash Drives in a single blade chassis. I would suspect this system could do well over 1,000,000 IOPS, sustain 3,990 MB/s bandwidth, with a 357-microsecond response time, taking up 10 rack units of space and a total ASU capacity of roughly 10,208 GB. I wonder what a storage system like this would cost&#8230;</p>
<p>Total TSC Price:  $340,230<br />
SPC-1 Price-Performance:  $0.35/SPC-1 IOPS<br />
Price per GB:  $33<br />
*Including a fully configured sixteen-blade server.</p>
<p>If you&#8217;re like me your probably thinking, &#8220;Hmm? Disks are expensive when you aggregate them for performance.&#8221; Of course when it&#8217;s your only option you make the best of it but the market is changing and it won&#8217;t be our only option for long. I would suspect the server and OS vendors are working on some amazing alternatives right now.</p>
<p>However I do agree that disk arrays make sense for archival storage. But in my untutored and biased opinion I believe in the next few years it will be considered fiscally irresponsible to continue aggregating disks for performance (both CAPEX and OPEX). Of course this is just my untutored and probably ignorant opinion but it should be fun to watch as the market evolves no matter what ends up happening.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bill Todd</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196920</link>
		<dc:creator>Bill Todd</dc:creator>
		<pubDate>Sun, 27 Jul 2008 04:02:31 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196920</guid>
		<description>The answers to your questions are pretty straight-forward:

It makes sense to add RAM up to the point of (economically) diminishing returns when performance is important (and if it's not important, this entire discussion is not applicable).  For read-dominated workloads, the RAM needn't be redundant (as long as it has good error detection such that the very rare instances of bad data can be re-fetched from disk) and thus may be very price-competitive with the kind of flash drives that you're assuming will shortly become available, and for all workloads if the RAM can be placed in the host access to it will be far faster than RAM (let alone flash) out on a storage bus (not to mention being able to be used more flexibly for other purposes when that's more desirable than dedicating it to cache).

The reason to use more disks (rather than the kind of flash that you assume will shortly become available) is because it's far cheaper when you've got a great deal of data to store:  as I suggested earlier, few have the luxury of infinite budgets, and for most this trade-off will continue to make sense.  You don't then have to go through the exercise of segregating data with high performance requirements (and reshuffling when the portion of your data with such requirements varies over time) - and may even not require any more disks than are necessary to house your dataset (because if you disperse the hot data fairly uniformly across the spindles the number of spindles required to hold the entire dataset may also be sufficient to provide acceptable access performance to the hot portion).

Of course it makes sense to add intelligence to increase performance:  every single portion of the system does this in one way or another.  Yes, it's complex, but it has also become pretty well understood over the past decades:  if you're afraid of the complexity it's because you're not a specialist in the area (but the people who actually design and implement it are, and while they're still only human suggesting that complexity should therefore be avoided in this area because of the possible risk while leaving the rest of the system rife with it is just silly).

No, it's not "because this is how its always been done" - it's because this is what's been demonstrated empirically over the years to be effective.  And when someone improves upon it, those improvements get incorporated (after they've proved themselves:  data is too important to be experimented with cavalierly).

We're not talking about kludges here:  we're talking about economically extracting the most from what we've got to work with (not that bad a definition for engineering in general).  If your mind is boggled, it's because it's untutored in the subject.

The reason that "some folks aren’t prepared for the impact Enterprise Flash Drives are about to have on the way we architect our storage systems for performance" is because a lot of them (some of whom probably are far better acquainted with the subject than you are) just don't agree with your assessment.  Even those (one might suspect rare) cases that beat heavily upon a relatively small dataset may well find their access patterns sufficiently cacheable that a modicum of cache will provide comparable performance at a lower cost than placing the entire dataset on 'Enterprise Flash' (which if experience is any guide will not be priced cheaply), and even workloads with intense random-update activity can be handled by 'write-anywhere' approaches that convert the accesses to far more efficient bulk-sequential writes with reasonable latency.

Fast non-volatile storage with no moving parts is nothing new, and while flash *may* be poised to reduce its cost significantly it's not at all clear that this reduction (coupled with flash's limitations) will be sufficient to change the storage landscape dramatically (my own guess is that flash may earliest prove useful for relatively small sequential workloads with critical response requirements like database logs, though even here fronting a small dedicated array of conventional disks with a small NVRAM cache can work just as well, making such a flash approach primarily suitable for installations too small to be using such external storage).  We'll just have to stay tuned and see.

- bill</description>
		<content:encoded><![CDATA[<p>The answers to your questions are pretty straight-forward:</p>
<p>It makes sense to add RAM up to the point of (economically) diminishing returns when performance is important (and if it&#8217;s not important, this entire discussion is not applicable).  For read-dominated workloads, the RAM needn&#8217;t be redundant (as long as it has good error detection such that the very rare instances of bad data can be re-fetched from disk) and thus may be very price-competitive with the kind of flash drives that you&#8217;re assuming will shortly become available, and for all workloads if the RAM can be placed in the host access to it will be far faster than RAM (let alone flash) out on a storage bus (not to mention being able to be used more flexibly for other purposes when that&#8217;s more desirable than dedicating it to cache).</p>
<p>The reason to use more disks (rather than the kind of flash that you assume will shortly become available) is because it&#8217;s far cheaper when you&#8217;ve got a great deal of data to store:  as I suggested earlier, few have the luxury of infinite budgets, and for most this trade-off will continue to make sense.  You don&#8217;t then have to go through the exercise of segregating data with high performance requirements (and reshuffling when the portion of your data with such requirements varies over time) - and may even not require any more disks than are necessary to house your dataset (because if you disperse the hot data fairly uniformly across the spindles the number of spindles required to hold the entire dataset may also be sufficient to provide acceptable access performance to the hot portion).</p>
<p>Of course it makes sense to add intelligence to increase performance:  every single portion of the system does this in one way or another.  Yes, it&#8217;s complex, but it has also become pretty well understood over the past decades:  if you&#8217;re afraid of the complexity it&#8217;s because you&#8217;re not a specialist in the area (but the people who actually design and implement it are, and while they&#8217;re still only human suggesting that complexity should therefore be avoided in this area because of the possible risk while leaving the rest of the system rife with it is just silly).</p>
<p>No, it&#8217;s not &#8220;because this is how its always been done&#8221; - it&#8217;s because this is what&#8217;s been demonstrated empirically over the years to be effective.  And when someone improves upon it, those improvements get incorporated (after they&#8217;ve proved themselves:  data is too important to be experimented with cavalierly).</p>
<p>We&#8217;re not talking about kludges here:  we&#8217;re talking about economically extracting the most from what we&#8217;ve got to work with (not that bad a definition for engineering in general).  If your mind is boggled, it&#8217;s because it&#8217;s untutored in the subject.</p>
<p>The reason that &#8220;some folks aren’t prepared for the impact Enterprise Flash Drives are about to have on the way we architect our storage systems for performance&#8221; is because a lot of them (some of whom probably are far better acquainted with the subject than you are) just don&#8217;t agree with your assessment.  Even those (one might suspect rare) cases that beat heavily upon a relatively small dataset may well find their access patterns sufficiently cacheable that a modicum of cache will provide comparable performance at a lower cost than placing the entire dataset on &#8216;Enterprise Flash&#8217; (which if experience is any guide will not be priced cheaply), and even workloads with intense random-update activity can be handled by &#8216;write-anywhere&#8217; approaches that convert the accesses to far more efficient bulk-sequential writes with reasonable latency.</p>
<p>Fast non-volatile storage with no moving parts is nothing new, and while flash *may* be poised to reduce its cost significantly it&#8217;s not at all clear that this reduction (coupled with flash&#8217;s limitations) will be sufficient to change the storage landscape dramatically (my own guess is that flash may earliest prove useful for relatively small sequential workloads with critical response requirements like database logs, though even here fronting a small dedicated array of conventional disks with a small NVRAM cache can work just as well, making such a flash approach primarily suitable for installations too small to be using such external storage).  We&#8217;ll just have to stay tuned and see.</p>
<p>- bill</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rick White</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196917</link>
		<dc:creator>Rick White</dc:creator>
		<pubDate>Sat, 26 Jul 2008 23:06:35 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196917</guid>
		<description>First off I'll admit I'm biased...

Why add a lot of expensive RAM just to avoid going to disk? Worse yet why add a bunch of disks to make up for the fact that they're so slow compared to the processors they’re supposed to serve? And does it really make sense to add another "smart" software layer to your system isn't that just more complexity to manage?  Is it because this is how its always been done?

I understand the industry had to kluge a lot of these pieces together over the years but it boggles my mind that we want to keep doing it. I suppose some folks aren't prepared for the impact Enterprise Flash Drives are about to have on the way we architect our storage systems for performance. In my opinion a system with an Enterprise Flash solution from one of several server vendors will run virtualized environments just fine. I appreciate "smart" software, new network layers and the old tried and true methods but sometimes you just can't beat the simplicity of massive brute force from a single drive.</description>
		<content:encoded><![CDATA[<p>First off I&#8217;ll admit I&#8217;m biased&#8230;</p>
<p>Why add a lot of expensive RAM just to avoid going to disk? Worse yet why add a bunch of disks to make up for the fact that they&#8217;re so slow compared to the processors they’re supposed to serve? And does it really make sense to add another &#8220;smart&#8221; software layer to your system isn&#8217;t that just more complexity to manage?  Is it because this is how its always been done?</p>
<p>I understand the industry had to kluge a lot of these pieces together over the years but it boggles my mind that we want to keep doing it. I suppose some folks aren&#8217;t prepared for the impact Enterprise Flash Drives are about to have on the way we architect our storage systems for performance. In my opinion a system with an Enterprise Flash solution from one of several server vendors will run virtualized environments just fine. I appreciate &#8220;smart&#8221; software, new network layers and the old tried and true methods but sometimes you just can&#8217;t beat the simplicity of massive brute force from a single drive.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joe Kraska</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196916</link>
		<dc:creator>Joe Kraska</dc:creator>
		<pubDate>Sat, 26 Jul 2008 23:00:47 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196916</guid>
		<description>I can’t say VM’s are my area of expertise and maybe I’m totally missing something, but why not just use very fast storage in the physical servers that are hosting your VM’s...
--------------------

VMWare environments typically used a shared storage environment so that they can effectively do automated load balancing: move the vm from server to server in response to point loads on particular servers.

Problem is, basically, that shared storage loads with large VMWare systems are... intractable unless you have a pretty good budget.

The last I looked our own VMWare environment had 475 virtual machines in it. I cringe every time I look, actually. We have plenty of memory and compute. However, our SAN/enterprise NAS is lacking. 

This costs a lot, mon.

BTW, a strategy we have been taking to lately is moving some of the heavier IO hogs to direct attached storage, as you say. This has some disadvantages.

BTW, at least one vendor combines direct attached storage with a clustered iSCSI SAN, pretty much with the specific purpose in mind of solving the VMWare disk deployment problem: LeftHand networks.

(Not an endorsement, we don't have).

Joe Kraska
San Diego CA
USA</description>
		<content:encoded><![CDATA[<p>I can’t say VM’s are my area of expertise and maybe I’m totally missing something, but why not just use very fast storage in the physical servers that are hosting your VM’s&#8230;<br />
&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;</p>
<p>VMWare environments typically used a shared storage environment so that they can effectively do automated load balancing: move the vm from server to server in response to point loads on particular servers.</p>
<p>Problem is, basically, that shared storage loads with large VMWare systems are&#8230; intractable unless you have a pretty good budget.</p>
<p>The last I looked our own VMWare environment had 475 virtual machines in it. I cringe every time I look, actually. We have plenty of memory and compute. However, our SAN/enterprise NAS is lacking. </p>
<p>This costs a lot, mon.</p>
<p>BTW, a strategy we have been taking to lately is moving some of the heavier IO hogs to direct attached storage, as you say. This has some disadvantages.</p>
<p>BTW, at least one vendor combines direct attached storage with a clustered iSCSI SAN, pretty much with the specific purpose in mind of solving the VMWare disk deployment problem: LeftHand networks.</p>
<p>(Not an endorsement, we don&#8217;t have).</p>
<p>Joe Kraska<br />
San Diego CA<br />
USA</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Lane</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196900</link>
		<dc:creator>John Lane</dc:creator>
		<pubDate>Fri, 25 Jul 2008 14:57:07 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196900</guid>
		<description>The biggest difference we see (from a storage perspective) between a virtualized vs. non-virtualized environment is the amount of data and the number of systems that sit on shared storage. In a non-virtualized environment, we see the OS sit on the physical server while important apps and data site on the shared storage. In virtualized environments, we see ESX/Xen (or other hypervisor software) installed on the physical server. All of the VMs (OS, apps, and data) sit on the shared storage. 

So 100 Virtual Machines does not equal 100 Physical Machines. And the storage requirements go up significantly, but most users do not realize this by themselves. Sometimes customers run into disk thrashing when they implement virtualization without considering this difference.

I find that "smart" storage is the best approach. Personally, I like Pillar's approach to this problem with their QOS features, but there are solutions from other vendors that we recommend too.</description>
		<content:encoded><![CDATA[<p>The biggest difference we see (from a storage perspective) between a virtualized vs. non-virtualized environment is the amount of data and the number of systems that sit on shared storage. In a non-virtualized environment, we see the OS sit on the physical server while important apps and data site on the shared storage. In virtualized environments, we see ESX/Xen (or other hypervisor software) installed on the physical server. All of the VMs (OS, apps, and data) sit on the shared storage. </p>
<p>So 100 Virtual Machines does not equal 100 Physical Machines. And the storage requirements go up significantly, but most users do not realize this by themselves. Sometimes customers run into disk thrashing when they implement virtualization without considering this difference.</p>
<p>I find that &#8220;smart&#8221; storage is the best approach. Personally, I like Pillar&#8217;s approach to this problem with their QOS features, but there are solutions from other vendors that we recommend too.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bill Todd</title>
		<link>http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/#comment-196893</link>
		<dc:creator>Bill Todd</dc:creator>
		<pubDate>Fri, 25 Jul 2008 05:43:39 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=773#comment-196893</guid>
		<description>Amy -

The old racing adage "Speed costs money:  how fast can you afford to go?" applies to storage as well:  the fastest non-volatile storage is battery-backed RAM, but it's just not within the budget of most users - nor, for that matter, are the fastest disk drives, even though they cost orders of magnitude less (and it would not be wise to count on currently over-hyped technologies like flash and ZFS to address the issue, though both may eventually help with some aspects of it).

But there are far less costly ways to achieve performance for many workloads than brute-force storage speed, and that's what we've been discussing here ('smart' storage really means 'smart use of the underlying dumb storage by taking advantage of specific characteristics of the workload to achieve far better performance than would otherwise be the case') - specifically, whether such optimizations remain effective as the storage gets increasingly removed (in terms of software layering) from the application.

- bill</description>
		<content:encoded><![CDATA[<p>Amy -</p>
<p>The old racing adage &#8220;Speed costs money:  how fast can you afford to go?&#8221; applies to storage as well:  the fastest non-volatile storage is battery-backed RAM, but it&#8217;s just not within the budget of most users - nor, for that matter, are the fastest disk drives, even though they cost orders of magnitude less (and it would not be wise to count on currently over-hyped technologies like flash and ZFS to address the issue, though both may eventually help with some aspects of it).</p>
<p>But there are far less costly ways to achieve performance for many workloads than brute-force storage speed, and that&#8217;s what we&#8217;ve been discussing here (&#8217;smart&#8217; storage really means &#8217;smart use of the underlying dumb storage by taking advantage of specific characteristics of the workload to achieve far better performance than would otherwise be the case&#8217;) - specifically, whether such optimizations remain effective as the storage gets increasingly removed (in terms of software layering) from the application.</p>
<p>- bill</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in 2.652 seconds -->
