<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>StorageMojo &#187; Architecture</title>
	<atom:link href="http://storagemojo.com/category/architecture/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com</link>
	<description>Data storage info &#38; analysis</description>
	<lastBuildDate>Mon, 21 May 2012 22:16:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Real storage for a virtual world</title>
		<link>http://storagemojo.com/2012/05/21/real-storage-for-a-virtual-world/</link>
		<comments>http://storagemojo.com/2012/05/21/real-storage-for-a-virtual-world/#comments</comments>
		<pubDate>Mon, 21 May 2012 22:15:39 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[Management]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2680</guid>
		<description><![CDATA[More virtual machines than physical machines were sold last year. What does that mean for storage? As noted 4 years ago in The virtual machine I/O blender Engineers have spent decades optimizing the OS, drivers, caching, controllers and disks for specific workloads. Observed behavior such as locality of reference have informed many strategies. Like read-ahead. [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>More virtual machines than physical machines were sold last year. What does that mean for storage? </p>
<p>As noted 4 years ago in <a href="http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/" target="_blank">The virtual machine I/O blender</a></p>
<blockquote><p>
Engineers have spent decades optimizing the OS, drivers, caching, controllers and disks for specific workloads.</p>
<p>Observed behavior such as locality of reference have informed many strategies. Like read-ahead.</p>
<p> But when you put 25 virtual machines on a single server, what happens to all this hard-won empiricism? It’s gone.
</p></blockquote>
<p><strong>The new empiricism</strong><br />
Enter <a href="http://www.tintri.com/" target="_blank">Tintri</a>. Their mission: making storage for virtual machines more efficient.</p>
<p>Co-founder Kieran Harty was an engineering VP at VMware. The other co-founder, Mark Gritter, was part of Kealia, a server company founded by Andy Bechtolsheim and bought by Sun. Architect Ed Lee has deep storage experience dating from Berkeley&#8217;s RAID work in the late 80s. </p>
<p>Tintri brings several ideas to the issue of VM storage.</p>
<ul>
<li>A VM-aware file system</li>
<li>Inline dedup, compression and a hybrid file system that manages flash and SATA drives as a single pool</li>
<li>Managing VMs directly through interaction with vCenter</li>
<li>Manage VMs, not LUNS or volumes</li>
<li>Automatic VM alignment</li>
</ul>
<p><strong>The StorageMojo take</strong><br />
Much of this mirrors good ideas that other 21st century storage companies have adopted. The big win is the Tintri&#8217;s direct management of VMs. </p>
<p>What this means in practice is that an admin looks directly at the I/O activity of each VM, not on a LUN or volume basis, but at the VM level. Which means the array is doing the same thing, able to optimize performance for each VM based on what the VM is doing in real time.</p>
<p>If a VM starts gobbling up storage and bandwidth, it&#8217;s easy to see which VM is the problem. This should be tied to a good chargeback system as well.</p>
<p>There are many products that improve &#8211; often dramatically &#8211; VM storage and performance. But making the VM the unit of storage is brilliant and opens the door to many more enhancements.</p>
<p>Tintri is expecting to support other hypervisors in the not-too-distant future. </p>
<p><strong>Courteous comments welcome, of course.</strong> I attended the Tintri presentation as a Tech Field Day delegate. The vendors who presented at the TFD paid for the privilege, which funded my travel expenses.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2012/05/21/real-storage-for-a-virtual-world/&text=Real storage for a virtual world" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2012/05/21/real-storage-for-a-virtual-world/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Amplidata&#8217;s distributed object store</title>
		<link>http://storagemojo.com/2012/04/17/amplidatas-distributed-object-store/</link>
		<comments>http://storagemojo.com/2012/04/17/amplidatas-distributed-object-store/#comments</comments>
		<pubDate>Tue, 17 Apr 2012 18:19:05 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Clusters]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2647</guid>
		<description><![CDATA[Our digital civilization requires data integrity and long-term preservation, and neither is assured by our current storage infrastructure. But progress continues. Latest case in point: Amplidata. This 4 year old company, based in Belgium with a growing US footprint, brings a new level of erasure code goodness to the both problems with a cluster-based object [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Our digital civilization requires data integrity and long-term preservation, and neither is assured by our current storage infrastructure. But progress continues.</p>
<p>Latest case in point: Amplidata. This 4 year old company, based in Belgium with a growing US footprint, brings a new level of erasure code goodness to the both problems with a cluster-based object store.</p>
<p>What erasure code goodness, you ask? The first &#8211; AFAIK &#8211; rateless erasure code, AKA fountain code, storage system in production use.</p>
<p><strong>And that is a good thing because?</strong><br />
Robustness and efficiency. </p>
<p>Amplidata claims storage durability well beyond RAID 6: 10 9&#8242;s (spread across 16 drives with up to 4 failures) durability &#8211; though the spreads can be much larger logically and geographically. They do this by breaking the data object into segments and adding redundancy data. </p>
<p>The redundancy data adds about 50% to the object size &#8211; more efficient than mirroring or triple replication. The benefit is that the system can lose hundreds of segments and still reconstruct the data.</p>
<p>Each object is protected by checksums that can protect against more than 1000 simultaneous bit errors per object. And each write goes to at least to controllers before it is committed.</p>
<p>What kind of monster controller is able to perform all this magic? The minimum configuration is 3 Xeon-based commodity controller nodes with as many 10-drive Atom-based storage nodes as you need.</p>
<p>Amplidata is optimized for bandwidth, not IOPS. With their latest software update they now spec each controller at 750MB/sec, and you can have as many controllers as you can afford.</p>
<p><strong>Sounds like Cleversafe</strong><br />
Cleversafe thought so too, and they&#8217;ve sued Amplidata for patent infringement. But Intel &#8211; who knows about patents and due diligence &#8211; invested after the suit. </p>
<p>Like NetApp&#8217;s suit against ZFS, this seems like a vanity project. Surely Cleversafe has more important things to invest in. If they don&#8217;t they&#8217;re in bigger trouble than we know.</p>
<p><strong>The StorageMojo take</strong><br />
The need for robust, inexpensive and massive storage has been a theme of StorageMojo&#8217;s for years. Object storage is the best solution to the problem of scale, while the kind of redundancy and end-to-end checksumming that Amplidata uses seems as robust as anything on the market today.</p>
<p>As for inexpensive, that is in the eye of the beholder, but Amplidata tells me that their newest storage node lists for less than $0.60/GB while consuming only 60 watts. That should be attractive to people running tape silos who want faster access and better redundancy.</p>
<p><strong>Courteous comments welcome, of course.</strong> I&#8217;m working with Amplidata to produce a video white paper on their technology, so stay tuned for more info on a promising company.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2012/04/17/amplidatas-distributed-object-store/&text=Amplidata's distributed object store" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2012/04/17/amplidatas-distributed-object-store/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Violin&#8217;s clean-sheet architecture</title>
		<link>http://storagemojo.com/2012/04/11/violins-clean-sheet-architecture/</link>
		<comments>http://storagemojo.com/2012/04/11/violins-clean-sheet-architecture/#comments</comments>
		<pubDate>Wed, 11 Apr 2012 20:29:23 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Disk]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[Future Tech]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>
		<category><![CDATA[Video]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2637</guid>
		<description><![CDATA[Over 3 years ago StorageMojo saw that Violin Memory was &#8220;. . . on the winning architectural track.&#8221; Well, it took a lot of time and money, but Violin is making good on that early promise. StorageMojo&#8217;s enthusiasm was kindled by Violin&#8217;s unique architecture. Here&#8217;s a short video that shows how Violin&#8217;s architecture addresses key [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Over <a href="http://storagemojo.com/2009/01/04/the-top-storage-stories-of-2008/" target="_blank">3 years ago</a> StorageMojo saw that <a href="http://www.violin-memory.com/" target="_blank">Violin Memory</a> was &#8220;. . . on the winning architectural track.&#8221; Well, it took a lot of time and money, but Violin is making good on that early promise.</p>
<p>StorageMojo&#8217;s enthusiasm was kindled by Violin&#8217;s unique architecture. Here&#8217;s a short video that shows how Violin&#8217;s architecture addresses key problems with flash:</p>
<p><iframe width="425" height="349" src="http://www.youtube.com/embed/L2VibZhNFbE?hl=en&#038;fs=1" frameborder="0" allowfullscreen></iframe></p>
<p>Full screen mode recommended.</p>
<p><strong>The StorageMojo take</strong><br />
The industry is still in the early days of digesting the implications of fast persistent solid state storage. We&#8217;ve built up 50 years of cruft to deal with disk&#8217;s many issues. It will take a few more years for flash&#8217;s new options to ripple through the entire storage, server and application stack.</p>
<p>Take, for example, failover. If all apps and monitoring software could declare a failure in 10 seconds rather than, say, a minute, how much smoother would major apps run? How much better would be the perception of system uptime and response times be?</p>
<p>There are many other possibilities &#8211; what about metadata? &#8211; that flash and its successor technologies will affect. I&#8217;ll be offering more detail in my keynote at the <a href="http://techfieldday.com/2012/ssss12/" target="_blank">Solid State Storage Symposium</a> on Wednesday, April 25 in Silicon Valley. S4 is free and you can <a href="http://ssss12.eventbrite.com/" target="_blank">register here</a>.</p>
<p><strong>Courteous comments welcome, of course.</strong> The other flash company I liked in 2009 was Fusion-io, and they&#8217;ve done OK. And yes, Violin paid StorageMojo to produce the video white paper, but the opinions are my own.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2012/04/11/violins-clean-sheet-architecture/&text=Violin's clean-sheet architecture" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2012/04/11/violins-clean-sheet-architecture/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Nimble Storage: StorageMojo is wrong!</title>
		<link>http://storagemojo.com/2012/04/10/nimble-storage-storagemojo-is-wrong/</link>
		<comments>http://storagemojo.com/2012/04/10/nimble-storage-storagemojo-is-wrong/#comments</comments>
		<pubDate>Wed, 11 Apr 2012 01:05:53 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2629</guid>
		<description><![CDATA[Actually, StorageMojo is wrong, wrong, wrong, wrong and wrong. Umesh Maheshwari of Nimble Storage wrote a detailed and thoughtful response to the StorageMojo post Are SSD-based arrays a bad idea? The StorageMojo take Umesh makes good points, but perhaps due to Nimble&#8217;s hybrid disk/SSD architecture some seem to miss the mark. What&#8217;s missing is ample [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Actually, StorageMojo is wrong, wrong, wrong, wrong and wrong. Umesh Maheshwari of Nimble Storage wrote a detailed and <a href="http://www.nimblestorage.com/blog/are-ssd-based-arrays-a-bad-idea/" target="_blank">thoughtful response to</a> the StorageMojo post <a href="http://storagemojo.com/2012/03/05/are-ssd-based-arrays-a-bad-idea/" target="_blank">Are SSD-based arrays a bad idea?</a></p>
<p><strong>The StorageMojo take</strong><br />
Umesh makes good points, but perhaps due to Nimble&#8217;s hybrid disk/SSD architecture some seem to miss the mark. What&#8217;s missing is ample consideration of what the alternative to an SSD might be, a problem Nimble didn&#8217;t have because SSDs work fine, technically and economically, for their hybrid system. </p>
<p>For example, on the issue of reliability Umesh says:</p>
<blockquote><p>
Reliability. There are good reasons to be able to replace failed flash devices similar to how hard disks can be hot swapped. The raw bit error rate (RBER) of flash is actually worse than that of hard disks, and it gets worse as blocks are rewritten. It is also getting worse as manufacturers are moving to increase density. (See this paper from FAST 2012: The Bleak Future of NAND Flash and a related blog post.)
</p></blockquote>
<p>This is correct, but based on the Google/Bianca Schroeder <a href="http://storagemojo.com/2007/02/19/googles-disk-failure-experience/" target="_blank">research</a>, the StorageMojo point is that the disk electronics &#8211; apart from the head/media pieces &#8211; are a major &#8211; 40%-50% &#8211; source of HDD/SSD failures. The flash controller has to handle the RBER and declining flash performance, but why add the other HDD bits that account for a substantial percentage of drive failures?</p>
<p>I could niggle about Umesh&#8217;s other points, but what fun is that? StorageMojo readers are encouraged to check out Umesh&#8217;s post and make up their own minds.</p>
<p><strong>Courteous comments welcome, of course.</strong> I recently did a nifty <a href="http://storagemojo.com/2011/08/03/nimble-storage-architecture-video/" target="_blank">video white paper</a> for Nimble, which is a great intro to their innovative architecture. Check it out.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2012/04/10/nimble-storage-storagemojo-is-wrong/&text=Nimble Storage: StorageMojo is wrong!" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2012/04/10/nimble-storage-storagemojo-is-wrong/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>In thinking about SSDs, consider HA</title>
		<link>http://storagemojo.com/2012/04/10/in-thinking-about-ssds-consider-ha/</link>
		<comments>http://storagemojo.com/2012/04/10/in-thinking-about-ssds-consider-ha/#comments</comments>
		<pubDate>Wed, 11 Apr 2012 00:43:26 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2632</guid>
		<description><![CDATA[More is coming on SSDs RSN, but in the meantime there is the following piece from Virsto&#8217;s Eric Burgener on HA considerations for SSDs. Virsto is a software company focused on making VIRtual STOrage for VMware and HyperV much more functional than the physical kind. Thus Eric&#8217;s response has a very particular POV: what is [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>More is coming on SSDs RSN, but in the meantime there is the following piece from <a href="http://www.virsto.com/" target="_blank">Virsto&#8217;s</a> Eric Burgener on HA considerations for SSDs. Virsto is a software company focused on making VIRtual STOrage for VMware and HyperV much more functional than the physical kind. </p>
<p>Thus Eric&#8217;s response has a very particular POV: what is needed to use SSDs in a virtual environment to ensure high availability &#8211; from a company that DOESN&#8217;T sell HA hardware. One key point: virtual server environments are much more write intensive than most enterprise apps, so using SSDs as a cache is a losing strategy. </p>
<p>If that is intriguing, read on! </p>
<blockquote><p>
In Thinking About SSD, You Can’t Leave HA Considerations Out </p>
<p>In the “host vs array-based SSD” discussion as it pertains to enterprise accounts, the need for HA must play a critical role. This is true whether you’re working with physical or virtual environments.  Any committed data that is not sitting on a shared, non-volatile, external storage device and accessible by at least one other node cannot be recovered until that failed node (on which it resides locally) is brought back up.</p>
<p>There are technical ways to solve this using synchronous replication technologies, but that’s an extra credit project you do yourself for now – as of yet, that hasn’t been built into any host-based SSD products. The reality today is that using host-based SSD precludes the use of HA (but not necessarily things like vMotion, which is NOT HA).</p>
<p>This was touched on in some other posts, but I think it’s an increasingly critical issue in virtual computing environments that may have been a bit downplayed in other comments. If you’re either thinking about moving production server workloads to VMs or have already got them there, HA is critical for a high percentage of workloads. </p>
<p>I can’t imagine an enterprise customer spec’ing out a production virtual server environment without asking about HA. True, there are workloads that don’t require it, but most do.</p>
<p>And it’s not just virtual server environments. We’re running into an increasing number of VDI environments where they want to enable HA for at least a small percentage of the desktops – usually executive desktops. HA isn’t a deal breaker for VDI like it is for “VSI” (virtual server infrastructure), but there are clearly use cases in VDI where you want and/or need it.</p>
<p>Today SSD is pretty much only used as a cache, regardless of where its deployed. And to provide a given level of performance speedup, caches generally have been sized at somewhere around 2% &#8211; 4% of the primary data store (it varies by application and exactly what you’re trying to speed up).  </p>
<p>In virtual environments, write performance is much more critical because it tends to comprise a much higher percentage of the read/write workload – in VSI environments its not uncommon to see 50% reads/50% writes, and in VDI environments we’ve seen 70% write environments. Unless you’re using a write back cache (with all the attendant additional expense associated with that), you’re not going to get any write performance speedup from the conventional cache architectures, just read.</p>
<p>But now think about what a log architecture, applied at the storage layer, could add.  Circular logs that are continuously draining (asynchronously) as they are filling need very little storage capacity to speed up ALL writes for ALL VMs ALL the time. In our experience, you need a log of about 10GB in size for each heavily loaded physical host.  </p>
<p>Think about what that could mean for a 16 host environment with 20TB. You could get away with 2-4 200GB enterprise flash drives instead of the 10-12 that you might otherwise deploy in a 20TB environment.  If you have a “linked clone” type snapshot technology combined with storage tiering, you could take the extra SSD capacity and create a tier 0 for critical VMs that need very high read performance, like for example the golden masters in a VDI environment or common templates you use to create your server VMs.  </p>
<p>This covers both needs – read and write performance – using a lot less storage. That means pretty much the same performance you’d get with the more expensive configuration with more SSDs for a lot less money. If you want to use SSD efficiently, a log architecture is a great idea. And if the logs are placed in shared, non-volatile, external storage (like the SSDs hosted in a SAN array or SAN-based SSD appliance), you can fully support HA.</p>
<p>Host-based SSD cards are closer to the physical host so theoretically they’ll provide more performance speedup, but given Amdahl’s law, how much of that can you really use? Array-based SSD will still get you past storage latencies as your critical bottleneck, and if they’re implemented using a log based architecture you’ll get HA and large write performance speedups as well using a lot less of it.
</p></blockquote>
<p><strong>The StorageMojo take</strong><br />
Write-through caches avoid a lot of sticky update synchronization problems, but as Eric notes they aren&#8217;t the best choice in write-intensive environments. And HA adds to the requirements: the cache must be network accessible.</p>
<p>But his larger point bears repeating: SSDs are wonderful for handling metadata. And as we move to object storage and more metadata that capability will become even more valuable.</p>
<p><strong>Courteous comments welcome, of course.</strong> I think Virsto&#8217;s architecture is smart and fixes some real problems with VMware and vMotion.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2012/04/10/in-thinking-about-ssds-consider-ha/&text=In thinking about SSDs, consider HA " target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2012/04/10/in-thinking-about-ssds-consider-ha/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Tintri responds on SSD arrays</title>
		<link>http://storagemojo.com/2012/03/20/tintri-responds-on-ssd-arrays/</link>
		<comments>http://storagemojo.com/2012/03/20/tintri-responds-on-ssd-arrays/#comments</comments>
		<pubDate>Tue, 20 Mar 2012 23:27:50 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[Future Tech]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2618</guid>
		<description><![CDATA[StorageMojo offered its soapbox to any vendors willing to weigh in on the question of whether enterprise arrays should be built from flash SSDs or not. Ed Lee, architect at Tintri, formerly of Data Domain and a Berkeley Ph.D, elected to respond. It is a long piece but rich in insight. Tintri produces hybrid disk/flash [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>StorageMojo offered its soapbox to any vendors willing to weigh in on the question of whether enterprise arrays should be built from flash SSDs or not. Ed Lee, architect at <a href="http://www.tintri.com/products/technology/" target="_blank">Tintri</a>, formerly of Data Domain and a Berkeley Ph.D, elected to respond. It is a long piece but rich in insight. </p>
<p>Tintri produces hybrid disk/flash SSD appliances optimized for virtual environments, not Symm-killers. They use SSDs in their products, as do other folks like <a href="http://www.nimblestorage.com/" target="_blank">Nimble Storage</a>. </p>
<p>No money changed hands between Tintri and StorageMojo or related entities. My accountant is weeping in the next room.</p>
<p><strong>Begin Tintri&#8217;s response:</strong></p>
<blockquote><p>
<strong>Outside the SSD Box: More than Faster Disk</strong><br />
Robin Harris of Storage Mojo in his recent article, &#8220;<a href="http://storagemojo.com/2012/03/05/are-ssd-based-arrays-a-bad-idea/" target="_blank">Are SSD-based arrays a bad idea?</a> and Matt Kixmoeller of Pure in his response, <a href="http://www.purestorage.com/blog/the-ssd-is-key-to-economic-flash-arrays/" target="_blank">The SSD is Key to Economic Flash Arrays</a>, present interesting perspectives on whether or not SSDs are the best technology for building flash-based arrays. Robin argues that by rethinking how flash can be packaged outside the SSD box, you can achieve better performance, reliability, cost and flexibility. And these observations are supported by the experience of existing flash-based storage vendors who have developed their own custom flash modules and packaging. Matt argues that SSDs provide an industry-standard product that requires less investment to leverage, better economies of scale, and rapid improvement in technology. These are also very valid points, especially for startups with limited time and capital.</p>
<p><strong>Latency</strong><br />
Taking latency as a point for comparison, flash-based storage vendors using custom packaging often quote IO latencies in the tens of microseconds versus SSD latencies of low hundreds of microseconds. While this is a notable difference, software and interfaces can also add overhead and the final latency seen at the subsystem level may differ by only a factor of two to four. Server-side flash products can avoid more of the software and interface overhead and provide better latencies – but may require rewriting applications to capitalize on this advantage. Keep in mind that hard disk latencies can easily reach tens of milliseconds under even moderate load. ALL of these flash-based products have latencies that are hundreds of times faster than disk.</p>
<p><a href="http://storagemojo.com/wp-content/uploads//2012/03/Bottleneck-no-longer-storage.png"><img src="http://storagemojo.com/wp-content/uploads//2012/03/Bottleneck-no-longer-storage.png" alt="" title="Bottleneck no longer storage" width="500" height="351" class="aligncenter size-full wp-image-2619" /></a></p>
<p>In short, most of the performance improvement comes from simply replacing hard disk with some form of flash. This immediately shifts the performance bottleneck from storage to some other component in your system. As a result, you won’t be able to take full advantage of flash performance without also optimizing the performance of the rest of your infrastructure, and ultimately rewriting your applications as well.</p>
<p>The above phenomenon explains why replacing your hard disk with flash often speeds up your applications by only a factor of two to three rather than ten or a hundred. Congratulations! You’ve just moved the bottleneck from storage to some other component of your system. By Amdahl’s Law, further improving only storage performance has diminishing returns. So while custom packaging does provide significant advantages in latency, most applications are unlikely to benefit until the rest of the computing ecosystem is optimized to take full advantage of flash.</p>
<p>To take a closer look at SSD latencies, I ran the following simple experiment:<br />
1)	Erase an MLC SSD so that no logical blocks were actually mapped to flash, and then issue small random reads.<br />
2)	Overwrite the entire SSD so that all logical blocks are mapped, and issue the same small random reads in step 1.</p>
<p>The idea here is to measure the software and protocol overheads of accessing flash packaged as SSD separately from accessing the data on the SSD. Reads with no blocks mapped had latencies of around 70us, while the reads with all blocks mapped had latencies of 250us. In this case only a fraction of the overall IO latency was due to SW and protocol overhead, indicating that SSDs may still have significant room for improving latency.</p>
<p><strong>Form factor</strong><br />
Another important issue discussed by both Robin and Matt is the relative cost of flash packaged in SSD versus non-SSD form factors. Robin argues that an SSD costs significantly more $/GB than the underlying flash while Matt argues that non-SSD packaging is expensive to develop, and SSDs provide useful flash management functions as well as hot-swap capability. It’s certainly true that developing custom packaging has a high up front cost, although this is likely balanced by lower unit costs. But as Robin points out, there are also standard packaging options available for non-SSD form factor flash, which may make custom packaging for non-SSD flash unnecessary.</p>
<p>A very important point to keep in mind when thinking about commercially available SSD vs. non-SSD form factors is that SSDs are designed as a substitute for disk, while non-SSD form factors are often designed as substitutes for memory. This means that SSDs focus primarily on reducing $/GB (its greatest weakness vs. disk), while non-SSDs focus on reducing $/IOPS (its greatest weakness vs. DRAM). This explains why SSD is currently much cheaper on a $/GB basis than PCIe flash, while PCIe flash designed as memory expansion is cheaper on a $/IOPS basis than SSD. This is not to say that you can’t build a non-SSD form factor that has lower $/GB than SSD, just that the primary applications for these non-SSD form factors today is usually not as a replacement for disk.</p>
<p>Whether flash in SSD versus non-SSD form factors is better for use in storage subsystems in the long run primarily depends on the relative volumes of these products, and the feature and price sensitivity of the applications these products serve. At this point the ‘winning’ form-factor seems hard to predict. So as a flash subsystem vendor, it seems desirable to keep your options open and ensure that your technology will work well with a variety of packaging options.</p>
<p><strong>More than just a faster disk</strong><br />
But flash is about more than just performance and packing. Flash enables much more than just a faster, denser replacement for disk. With flash, we can finally remove a key mechanical barrier to scaling not only storage systems, but computing systems in general. Going forward, CPU, network and storage can now all scale with improvements in semiconductor technology. When transistors replaced vacuum tubes, we got more than just compact radios; we got simpler, more powerful computing systems. Similarly, flash is a catalyst that will enable far greater levels of automation and functionality for storage and computing systems than is possible today.</p>
<p>I tend to think of the value of new technology as the product of its simplicity times the functionality it offers. It&#8217;s clear why functionality is important, but why is simplicity so important? Technology that is simple to use will be used more often, to solve more problems, in less time. As a result, simplicity has a compounding effect on value:</p>
<p>Value = Simplicity * Functionality</p>
<p>How does one measure simplicity? One way is to list the basic steps it takes to perform a task and how long each step takes. One to three is good, four to six is manageable, and anything resembling a twelve step program will likely require written directions and a significant amount of focus. Note that in assessing the simplicity and functionality of a technology, one must do it in the context of the job that needs to be done. For example, a chainsaw has great features for cutting down trees but not for giving haircuts.</p>
<p>A common problem with many general purpose storage products when applied to applications such as virtualization is that they require executing long lists of steps to get anything done – and most of the features are not directly applicable to virtualization. Paradoxically, many of the features that try to make these products better suited to the application end up making the products more complex – resulting in little improvement in overall value. Kind of like adding too many tools to a Swiss army knife until you have so many that the attachments start to stick and rub against each other.</p>
<p><a href="http://storagemojo.com/wp-content/uploads//2012/03/Swiss-Army-Giant-Knife.jpg"><img src="http://storagemojo.com/wp-content/uploads//2012/03/Swiss-Army-Giant-Knife.jpg" alt="" title="Swiss Army Giant Knife" width="500" height="350" class="aligncenter size-full wp-image-2620" /></a></p>
<p><strong>Flash as a catalyst</strong><br />
Flash eliminates a key mechanical barrier to scaling computing systems and is 400 times faster than disk. To keep things in perspective, the speed of sound is “only” 250 times faster than walking! If I could get to work at supersonic speeds, I would no doubt save a lot of time each year. But would I do no more which such an ability? Similarly, is flash just a faster replacement for disk? Will it make no significant difference in the way storage is managed and used? We obviously don’t think so. Flash will greatly increase the value of storage by improving both the simplicity and functionality of enterprise storage products. But these gains will not come easily or without their own set of problems.</p>
<p>An obvious way flash promotes simplicity is by eliminating performance bottlenecks, but as flash enables more dense storage systems many of those gains will be converted to problems in quality-of-service. A more significant way flash promotes value is by providing a better building block for constructing storage systems: flash promotes simplicity by enabling higher levels of automation and allows the implementation of more powerful functionality.</p>
<p>Flash will fragment the enterprise storage market. The general purpose storage systems of today will be supplanted by new flash-based products that are far simpler and more powerful for the specific application areas that they target. This will amplify the simplicity and power that flash already makes possible, and further accelerate the fragmentation of the storage market. This is precisely what happened in the 1980’s when advances in networking technology caused a shift from centralized computing to networked computing – and in the process fragmented the direct attached storage market into ones based on networked storage technology. Over time, the networked storage markets consolidated into the current general purpose storage market dominated by a few major vendors. And so the cycle is repeating itself. </p>
<p><a href="http://storagemojo.com/wp-content/uploads//2012/03/market_fragmentation.jpg"><img src="http://storagemojo.com/wp-content/uploads//2012/03/market_fragmentation.jpg" alt="" title="market_fragmentation" width="500" height="400" class="aligncenter size-full wp-image-2621" /></a></p>
<p>We are at the start of a new technological shift. A shift that is made possible by flash and one that will disrupt the existing enterprise storage market. Just as transistors enabled new products such as personal computers and smart phones, flash will enable simple, intelligent and fast enterprise storage systems. In turn, this will lead to much higher value for end users, but only if we think outside the storage box and treat flash as more than just a faster, denser disk.
</p></blockquote>
<p><strong>The StorageMojo take</strong><br />
For the record the original post wasn&#8217;t looking at hybrid solutions, although it is obvious that SSDs can help legacy designs stay competitive without replacing all disks for a few years. For folks like Tintri and Nimble who want to speed up disk storage to stay affordable SSDs make sense. Why engineer a small part of your system when an off-the-shelf solution will suffice?</p>
<p>But for high end transactional SAN storage I still don&#8217;t see how SSDs are the right way to go. But I&#8217;m expecting more responses, so stay tuned.</p>
<p><strong>Courteous comments welcome, of course.</strong> I&#8217;m working on a post that reflects directly on Ed&#8217;s comment about SSD latency. You&#8217;ll like it.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2012/03/20/tintri-responds-on-ssd-arrays/&text=Tintri responds on SSD arrays" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2012/03/20/tintri-responds-on-ssd-arrays/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>SSDs in arrays: the Pure Storage view</title>
		<link>http://storagemojo.com/2012/03/12/ssds-in-arrays-the-pure-storage-view/</link>
		<comments>http://storagemojo.com/2012/03/12/ssds-in-arrays-the-pure-storage-view/#comments</comments>
		<pubDate>Mon, 12 Mar 2012 12:57:46 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2605</guid>
		<description><![CDATA[Pure&#8217;s Matt Kixmoeller saw the Are SSD-based arrays a bad idea post and, unsurprisingly, responded. The SSD is Key to Economic Flash Arrays is a good post and I urge interested readers to check it out. Pure has a stellar team with deep experience. Their views are worth considering. As Matt notes: This post caught [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Pure&#8217;s Matt Kixmoeller saw the <a href="http://storagemojo.com/2012/03/05/are-ssd-based-arrays-a-bad-idea/" target="_blank">Are SSD-based arrays a bad idea</a> post and, unsurprisingly, responded. <a href="http://www.purestorage.com/blog/the-ssd-is-key-to-economic-flash-arrays/" target="_blank">The SSD is Key to Economic Flash Arrays</a> is a good post and I urge interested readers to check it out.</p>
<p>Pure has a stellar team with deep experience. Their views are worth considering.</p>
<p>As Matt notes:</p>
<blockquote><p>
This post caught our eye for an obvious reason: Pure Storage did start “fresh” to build an all-flash enterprise storage array, and we did decide to use the SSD form factor, after quite exhaustive looks at all the other options. Quite simply, we found that SSDs are the most efficient and economic building blocks from which to build a flash array. Let’s explore why.
</p></blockquote>
<p>After dismissing disk arrays that add flash drives &#8211; as I do &#8211; Matt focuses on (1) all flash appliances built from raw NAND and (2) flash arrays using flash SSDs. </p>
<p><strong>SSDs are most efficient</strong><br />
Matt argues that SSD-based arrays have 3 key advantages:</p>
<ul>
<li><strong>Economics.</strong> SSDs are a commodity product that raw flash arrays will have a hard time out-engineering.</li>
<li><strong>Flash controller complexity.</strong> Matt notes, correctly, that the flash controller is at the heart of argument. Better to use a controller that goes into millions of SSDs or one purpose-built for a single vendor&#8217;s array? How will the single vendor be able to keep up?</li>
<li><strong>Servicability.</strong> Pure&#8217;s use of SSDs enables them to offer a familiar hot-swap experience that higher density designs may not offer. Futhermore, Pure&#8217;s data reduction features increase effective density to rival raw flash designs.</li>
</ul>
<p>In conclusion, Matt makes a couple of more points. First, that SSD form factors will become much more compact, such as Apple&#8217;s DIMM-like mini-SATA SSD used in the MacBook Air. Second, that the proof is in the pudding: Pure, he says, has &#8220;. . . delivered with break-through performance, at a cost below traditional spinning disk.&#8221;</p>
<p><strong>The StorageMojo take</strong><br />
How does Matt&#8217;s response stack up to the criteria in the original post? Not that there&#8217;s anything magic about them, but . . . .</p>
<ul>
<li><strong>Latency.</strong> No response, which doesn&#8217;t mean they&#8217;re worse.</li>
<li><strong>SSD bandwidth.</strong> No response, but to be fair with enough SSDs you should be able saturate 16Gb Fibre Channel.</li>
<li><strong>Reliability.</strong> No direct response. Instead a focus on servicability. More on that below.</li>
<li><strong>Cost.</strong> Says Pure is cost-effective using their data reduction technology. </li>
<li><strong>Flexibility.</strong> This is the heart of Matt&#8217;s argument: due to the commodity volume of the flash controllers flash SSDs will evolve faster &#8211; in functionality and cost &#8211; than any proprietary solution could. Proprietary flash controllers, he says, will be boat anchors for flash array vendors and are likely to end up controlled by flash manufacturers. </li>
</ul>
<p><strong>Servicability</strong> is an interesting response to the question of reliability. After all, the reason hot swap is important for some components but not others is because they either a)fail often &#8211; individually or in aggregate &#8211; b)failure compromises the product or c)online expansion, upgrading or reconfiguation is desirable.</p>
<p>Power supplies are routinely hot swappable because they have the lowest MTBF of any major system component. Disks are hot swappable because they come in multiples that reduce their aggregate MTBF while their standardized design makes hot swap cheap. I/O cards are often hot swappable because they are critical and needs change.</p>
<p>SSDs <i>should</i> be hot swappable because their failure rates are at best about half that of disks. But DIMMs, another critical component, especially if you invest in high-capacity ones, aren&#8217;t, because they rarely fail. </p>
<p>While I&#8217;m not aware of any non-SSD enterprise array vendor whose arrays don&#8217;t include hot swap components &#8211; love to be educated &#8211; which is more important: a short mean time to repair (MTTR) or a long mean time between failures (MTBF)? Because that is the argument about servicability.</p>
<p>I&#8217;d like to publish responses from vendors who feel strongly about this issue. Not in the comments, but as a blog post. Any takers?</p>
<p><strong>Courteous comments welcome, of course.</strong> I was so impressed with the Pure Storage team that I signed a rare NDA with them last spring to get briefed, the first of 2 visits to their Castro street HQ.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2012/03/12/ssds-in-arrays-the-pure-storage-view/&text=SSDs in arrays: the Pure Storage view" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2012/03/12/ssds-in-arrays-the-pure-storage-view/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Virtualizing storage controllers</title>
		<link>http://storagemojo.com/2012/02/28/virtualizing-storage-controllers/</link>
		<comments>http://storagemojo.com/2012/02/28/virtualizing-storage-controllers/#comments</comments>
		<pubDate>Tue, 28 Feb 2012 15:30:56 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[Future Tech]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2587</guid>
		<description><![CDATA[A hardware storage controller is an expensive guarantee that you&#8217;re using old technology to handle your most important data. Hardware specs are frozen early in the typical 18-24 month development cycle so by the time you get your &#8220;new&#8221; controller it is already 2 years old. But it may not have to be that way. [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>A hardware storage controller is an expensive guarantee that you&#8217;re using old technology to handle your most important data. Hardware specs are frozen early in the typical 18-24 month development cycle so by the time you get your &#8220;new&#8221; controller it is already 2 years old.</p>
<p>But it may not have to be that way. In <a href="http://static.usenix.org/event/fast12/tech/full_papers/Ben-Yehuda2-2-12.pdf" target="_blank">Adding Advanced Storage Controller Functionality via Low-Overhead Virtualization</a> researchers Muli Ben-Yehuda, Michael Factor, Eran Rom, Avishay Traeger, Eran Borovik and Ben-Ami Yassour of IBM Research–Haifa wanted to find out if virtualized storage controller features are feasible.</p>
<p>Short answer: with some tweaking, yes.</p>
<p>The big question is overhead. Storage controllers are typically in the data path, so latency, as well as compute efficiency on out-of-date processors, are real concerns.</p>
<p>Unlike the gateway approach of virtual storage appliances (VSA), the team ran the VMs directly on storage controllers using the Linux KVM hypervisor.</p>
<p><strong>Overhead</strong><br />
The team identified 3 sources of performance overhead:</p>
<ul>
<li><strong>Base.</strong> System work such as virtual memory managment or process switching.</li>
<li><strong>External communication.</strong> Important if a new function is layered on top of the storage system, such as a file server.</li>
<li><strong>Internal communication.</strong> Virtual machine coordination and communication with the hardware controller.</li>
</ul>
<p><strong>Reducing overhead</strong><br />
Different techniques are used to limit each type of overhead.</p>
<p><strong>Base</strong> They statically allocate CPU cores to the guest to ensure sufficient resources. Memory is also statically allocated to the VM to reduce translation overheads.</p>
<p><strong>External</strong> Device assignment is the highest-performing approach as it eliminates hypervisor intervention for physical events. This requires assigning the network device directly to the guest using an <a href="http://www.intel.com/content/www/us/en/pci-express/pci-sig-sr-iov-primer-sr-iov-technology-paper.html" target="_blank">SR-IOV</a> (single root I/O virtualization) enabled adapter which allows the guest to send requests directly to the device. </p>
<p><strong>Internal communications</strong> To reduce internal communication overhead, they modified KVM’s block driver to poll instead of interrupt. This gives a fast, exit-less, zero-copy transport.</p>
<p><strong>Results</strong></p>
<blockquote><p>
By using these techniques, we show no measurable difference in network latency between bare metal and virtualized I/O and under 5% difference in throughput. For internal communication, micro-benchmarks show 6.6μs latency overhead, read throughput of 357K IOPS, and write throughput of 284K IOPS; roughly seven times better than a base KVM implementation. In addition, an I/O intensive filer workload running in KVM incurs less than 0.4% runtime performance overhead compared to bare metal integration.
</p></blockquote>
<p>That sounds pretty good.</p>
<p><strong>The StorageMojo take</strong><br />
While the static assignments may reduce flexibility, the win is updating storage functionality on the fly. But are there viable use cases? The arc of controller history suggests there are.</p>
<p>The earliest disk drives were directly controlled by the host CPU. Over the decades that and much other functionality migrated to controllers and to disks. Lately that trend has slowed because of large investments in existing standards.</p>
<p>This paper shows that it is possible to migrate more functionality to controllers without lengthy development cycles, enabling architects to make different tradeoffs. </p>
<p>For example, big data requires big pipes, and big pipes are expensive. If volume-reducing preprocessors could be added to file servers, existing bandwidth could be optimized. </p>
<p>More importantly, it suggests that by virtualizing the controller&#8217;s applications, the underlying hardware can be updated more frequently. To be fair, that&#8217;s not what the authors suggested, but it certainly seems possible based on their work.</p>
<p><strong>Courteous comments welcome, of course.</strong> Jeff Darcy of Red Hat has his own list of favorite papers from FAST &#8217;12 <a href="http://hekafs.org/blog" target="_blank">here</a>.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2012/02/28/virtualizing-storage-controllers/&text=Virtualizing storage controllers" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2012/02/28/virtualizing-storage-controllers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Learning from customers</title>
		<link>http://storagemojo.com/2011/12/07/learning-from-customers/</link>
		<comments>http://storagemojo.com/2011/12/07/learning-from-customers/#comments</comments>
		<pubDate>Wed, 07 Dec 2011 20:05:07 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2563</guid>
		<description><![CDATA[EMC&#8217;s Chuck Hollis blogged about The Vendor Beating a couple of months ago. The unspoken question in the post is &#8220;how do we understand what customers are telling us?&#8221; He writes As an employee of a large IT vendor, I&#8217;ve been at the receiving end of a reasonable number of vendor beatings. Occasionally it&#8217;s richly [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>EMC&#8217;s Chuck Hollis <a href="http://chucksblog.emc.com/chucks_blog/2011/09/the-vendor-beating.html" target="_blank">blogged</a> about <i>The Vendor Beating</i> a couple of months ago. The unspoken question in the post is &#8220;how do we understand what customers are telling us?&#8221;</p>
<p>He writes</p>
<blockquote><p>
As an employee of a large IT vendor, I&#8217;ve been at the receiving end of a reasonable number of vendor beatings.</p>
<p><i>Occasionally it&#8217;s richly deserved</i>. But, sometimes, it&#8217;s masking a deeper set of issues that have very little to do any vendor whatsoever.
</p></blockquote>
<p>Unhappy customers, like unhappy families, are all unhappy in their own way. This customer appeared to be overstaffed, under-skilled and poorly managed.</p>
<p><strong>Interpretation</strong><br />
Interpreting customer complaints and behavior is hard. When companies can&#8217;t decipher what customers want &#8211; which is usually what the company <i>isn&#8217;t</i> selling &#8211; it is easy and dangerous to tune them out. </p>
<p>Customers can tell you things about your company and products that you can&#8217;t directly discover for yourself, but what customers say may be different from what they think. And both are influenced by the customer&#8217;s context, which can include company politics, prior vendor experiences, knowledge deficits and employee level.</p>
<p><strong>Diagnosis</strong><br />
Steve Jobs once said that customers don&#8217;t know what they want until you show it to them. Customers know what would improve the current product in the current use case, but they can&#8217;t imagine bringing multiple novel technologies to bear on a much broader problem.</p>
<p>Tablet computers flopped for years until the iPad crystalized the market. Everyone saw the tablet problems: thick; heavy; slow; clunky UI; poor battery life; and, thanks to low volumes, cost. Incremental improvements &#8211; faster processors, more RAM, larger disks &#8211; didn&#8217;t help.</p>
<p>Tablets required a deep rethinking and application of several novel technologies &#8211; flash, gestures, CNC case milling, an app store and an energy-efficient OS &#8211; to create a compelling user experience. </p>
<p>The iPad illustrates the problem of listening to customers: they described symptoms and suggest fixes, but couldn&#8217;t articulate the underlying problem: how the use case differs from desktop and notebook PCs. That requires an act of imagination, not transcription.</p>
<p><strong>The StorageMojo take</strong><br />
In Chuck&#8217;s post an EMC presales engineer identified the root cause of the customer&#8217;s pain:</p>
<blockquote><p>
. . . the database environment had grown willy-nilly over the years &#8212; it wasn&#8217;t laid out well, the queries weren&#8217;t particularly well written, and so on.</p>
<p>Sure, there were things we could do on the storage side (e.g. faster storage, better layouts, etc.), but it was a bigger issue than just storage performance.
</p></blockquote>
<p>But the larger question is: with high-speed and high-capacity SSDs, why isn&#8217;t this customer moving to an infrastructure that doesn&#8217;t need this fancy tuning? EMC can&#8217;t manage the fight between DBAs and storage admins, but they could be making it less contentious.</p>
<p>From within the EMC ecosystem the solution is clear: more training, professional services and faster gear. But from the outside the question is: who is building &#8220;it just works&#8221; high performance storage? </p>
<p><strong>Courteous comments welcome, of course.</strong> I admire Tucci&#8217;s innovative EMC business model: outbid everyone else for chasm-crossing companies; give them global distribution and support; and watch the bucks roll in. It may not be innovative <i>technically</i> but it is innovative.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2011/12/07/learning-from-customers/&text=Learning from customers" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2011/12/07/learning-from-customers/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>How fault tolerant are SANs?</title>
		<link>http://storagemojo.com/2011/11/07/how-fault-tolerant-are-sans/</link>
		<comments>http://storagemojo.com/2011/11/07/how-fault-tolerant-are-sans/#comments</comments>
		<pubDate>Mon, 07 Nov 2011 16:11:36 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[SAN, FC]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2548</guid>
		<description><![CDATA[Reader Kyle asks a good question: SANs are advertised up the wazoo as having lots of internal redundancy such as redundant power, redundant controllers, etc. I&#8217;ve spent enough time with redundancy to know that having two pieces of hardware often doesn&#8217;t cut it. I was wondering what the real story is from someone who has [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Reader Kyle asks a good question:</p>
<blockquote><p>
SANs are advertised up the wazoo as having lots of internal redundancy such as redundant power, redundant controllers, etc. I&#8217;ve spent enough time with redundancy to know that having two pieces of hardware often doesn&#8217;t cut it. I was wondering what the real story is from someone who has spent a lot of time in the storage space. Do complete SAN failures really pretty much *never* happen or are they just on the rare side? If so what are the common points of failure? Perhaps people, the OS, non-redundant hardware parts?
</p></blockquote>
<p>Please, SAN folks, tell StorageMojo readers your experience. In the meantime, here&#8217;s </p>
<p><strong>The StorageMojo take</strong><br />
Kyle asks 2 questions: how reliable and available are the individual <i>devices</i> that make up a SAN and how reliable and available is the <i>system</i> &#8211; the SAN as a whole.</p>
<p>Redundancy is aimed at ensuring availability. Because of the redundancy&#8217;s greater component count you also have more failures. </p>
<p>Failures of redundant components shouldn&#8217;t affect availability &#8211; assuming, that is, that failures are not correlated. That assumption turned out not to be true of RAID arrays, making them less available than advertised.</p>
<p>How much redundancy is enough? Customers generally prefer triple redundancy if they can afford it, partly for availability and partly for performance: losing ⅓rd of system performance is less painful than ½. But for the moonshots, NASA chose quintuple redundancy on critical systems.</p>
<p>Yet I&#8217;d guess that most are more concerned about SAN <i>system</i> availability &#8211; which includes not only what we consider SAN hardware, but also the server-side HBAs, drivers and management software. It is here that the nastiest bugs lurk: untestable interactions between applications, drivers, firmware and architecture that bite us hard &#8211; and crash entire SANs.</p>
<p>But what is <i>your</i> experience, gentle reader? Many of us would like to know. </p>
<p><strong>Courteous comments welcome, of course.</strong> <strong>Update</strong>: Bayesian analysis is the best tool to evaluate system-level availability, as noted in this <a href="http://www.youtube.com/watch?v=UzTlN5qwzco" target="_blank">StorageMojo video</a>. Sadly, the tool referred to is no longer online. Anyone want to take a whack at a new one?</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2011/11/07/how-fault-tolerant-are-sans/&text=How fault tolerant are SANs? " target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2011/11/07/how-fault-tolerant-are-sans/feed/</wfw:commentRss>
		<slash:comments>25</slash:comments>
		</item>
		<item>
		<title>Ask StorageMojo: 80,000 mailboxes need help</title>
		<link>http://storagemojo.com/2011/11/02/ask-storagemojo-80000-mailboxes-need-help/</link>
		<comments>http://storagemojo.com/2011/11/02/ask-storagemojo-80000-mailboxes-need-help/#comments</comments>
		<pubDate>Wed, 02 Nov 2011 16:00:28 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2543</guid>
		<description><![CDATA[A StorageMojo reader has a problem. Can you help? Our mail hub (80,000+ mailboxes) is virtualized with vSphere 4.1 with Red Hat Enterprise Linux 5 x64 and Dovecot 2.0 [an open source IMAP/POP3 email server for Linux/UNIX-like systems]. We are using HP LeftHand Networks P4300 iSCSI storage in a &#8220;network RAID10 setup of RAID10 storage&#8221; [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>A StorageMojo reader has a problem. Can you help?</p>
<blockquote><p>
Our mail hub (80,000+ mailboxes) is virtualized with vSphere 4.1 with Red Hat Enterprise Linux 5 x64 and <a href="http://dovecot.org/index.html" target="_blank">Dovecot 2.0</a> [an open source IMAP/POP3 email server for Linux/UNIX-like systems]. We are using HP LeftHand Networks P4300 iSCSI storage in a &#8220;network RAID10 setup of RAID10 storage&#8221; for Dovecot indexes and multiple &#8220;networks RAID1 of RAID5 storage&#8221; for actual mailboxes.</p>
<p>This is my take: our Dovecot indexes are getting hammered with lots of small I/O requests, about 8,000 IOPS continuous during 8-working-hour days, 75% write. Indexes are fairly small (50 GB) and expected to grow to 100-150 GB, but need a lot of random I/O. We need real-time replication in storage (LeftHand is ok for us) and we think that SSD should shine in this situation. Bandwidth is not a problem (200-300 megabits of indexes traffic, but we need more IOPs).</p>
<p>The problem is the indexes, but our total mailbox capacity is expected to grow to 6 TB compressed using zlib compression in Dovecot.</p>
<p>We want to buy a storage appliance with the following requirements:</p>
<ul>
<li>Vsphere 4.1 &#038; 5 certified storage, VAAI enabled (if possible)</li>
<li>iSCSI (1 gbps)</li>
<li>High number of IOPS (at least 12,000+, most of them writes)</li>
<li>Small size (200 GB)</li>
<li>Fault tolerant (RAID, battery-backed write cache, power supply, fans, multiple gigabit uplinks, synchronous replication)</li>
<li>Cheap (less than $30k the full setup)</li>
</ul>
<p>We want to buy at the beginning of 2012. Any product that fits?
</p></blockquote>
<p><strong>The StorageMojo take</strong><br />
Suspect price will be the most significant limiter. But the respondent only needs index storage not the whole shooting match. He&#8217;s pretty happy with LeftHand for mailbox storage.</p>
<p>But if we can solve both problems for him, why not? If he should relax some constraint, feel free to suggest it.</p>
<p>He&#8217;ll be watching the comments, so if you have questions please ask them. I&#8217;ll be following the comments as well.</p>
<p><strong>Courteous comments welcome, of course.</strong> His email was edited for clarity.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2011/11/02/ask-storagemojo-80000-mailboxes-need-help/&text=Ask StorageMojo: 80,000 mailboxes need help " target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2011/11/02/ask-storagemojo-80000-mailboxes-need-help/feed/</wfw:commentRss>
		<slash:comments>47</slash:comments>
		</item>
		<item>
		<title>The network is choking our storage</title>
		<link>http://storagemojo.com/2011/10/20/the-network-is-choking-our-storage/</link>
		<comments>http://storagemojo.com/2011/10/20/the-network-is-choking-our-storage/#comments</comments>
		<pubDate>Thu, 20 Oct 2011 17:03:08 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Cloud computing & storage]]></category>
		<category><![CDATA[Clusters]]></category>
		<category><![CDATA[Future Tech]]></category>
		<category><![CDATA[SAN, FC]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2533</guid>
		<description><![CDATA[Amazon Web Services architect James Hamilton has been posting on network issues for over a year and researching them much longer. As Ethernet becomes the de facto SAN technology, his views become more relevant to the larger storage market. Critique Part of Mr. Hamilton&#8217;s concern is the structure of the networking industry: the high margins; [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Amazon Web Services architect James Hamilton has been <a href="http://perspectives.mvdirona.com/2011/10/01/ChangesInNetworkingSystems.aspx" target="_blank">posting</a> on network issues for over a year and researching them much longer. As Ethernet becomes the <i>de facto</i> SAN technology, his views become more relevant to the larger storage market.</p>
<p><strong>Critique</strong><br />
Part of Mr. Hamilton&#8217;s concern is the structure of the networking industry: the high margins; the dominance of a single player, Cisco; the closed technology; and the heavy vertical integration. All antithetical to the dynamics that have driven server costs down so successfully in the last 20 years.</p>
<p>These are issues the storage industry knows too well. But Mr. Hamilton is more concerned about the waste the current high-cost industry structure causes.</p>
<p>Waste?</p>
<p><strong>Workload placement</strong><br />
The cost of network bandwidth leads to network over-subscription. Networks are configured as tree topologies: the further you move from end nodes the worse the over subscription. </p>
<p>As described in the 2009 Microsoft Research paper <a href="http://research.microsoft.com/pubs/80693/vl2-sigcomm09-final.pdf" target="_blank">VL2: A Scalable and Flexible Data Center Network</a>:</p>
<blockquote><p>
. . . the capacity between different branches of the tree is typically over- subscribed by factors of 1:5 or more, with paths through the highest levels of the tree oversubscribed by factors of 1:80 to 1:240. This limits communication between servers to the point that it fragments the server pool — congestion and computation hot-spots are prevalent even when spare capacity is available elsewhere.
</p></blockquote>
<p>This throttles data center performance by limiting server-to-server bandwidth, fragmenting resources and reducing network utilization. The latter reflects the redundant paths needed in case of switch failure: ≈50% or more of costly data center bandwidth goes unused.</p>
<p>As might be expected, big Internet data centers like Amazon&#8217;s have complex and unpredictable workloads. They need lots of bandwidth between all servers all the time.</p>
<p><strong>A solution</strong><br />
The VL2 paper describes an experimental solution to these problems that includes <i>location-specific</i> and <i>application-specific</i> addressing, multi-path traffic load balancing and a novel directory design that efficiently handles lookups and updates to network mappings.</p>
<p>In an 75-node test cluster the design moved 2.75TB of data in 395 seconds &#8211; 94% of maximum network bandwidth &#8211; at a fraction of the cost of current enterprise networks. The paper calculates that a cloud-service scale network with no over-subscription could be built with commodity switches at <strong>1/14th the cost</strong> of a traditional data center Ethernet.</p>
<p>Whoa!</p>
<p><strong>The StorageMojo take</strong><br />
VC and engineering dollars follow high-growth markets. What Google, Amazon and Microsoft want, they get. With the rapid growth of public cloud services the network over-subscription problem will get solved. </p>
<p>Merchant silicon from Broadcom, Intel and Marvell is making a tried-and-true Moore&#8217;s Law attack on hardware cost. The protocol stack is tougher, but several open-source industry initiatives are under way with support from major companies. Progress will be slower than hoped, but within 3 years we&#8217;ll have a viable stack to build on.</p>
<p>Where does this leave the networking industry? That depends on where you sit.</p>
<p>Cisco will be the biggest loser, because they&#8217;ve been the biggest winner with the current model. They may need to pull an IBM and move big into services if they want to stick around. Ironically, Cisco&#8217;s UCS product line &#8211; which bakes in the tree-structured network &#8211; has further motivated broader industry action.</p>
<p>The rest of the industry can go after this emerging market with a lower-GM business model. Not all of them will, but it will be a critical success factor. </p>
<p>The big winner will be storage. Scale-out storage relies on spraying data across multiple racks for maximum availability, utilization and performance. Cheaper, faster, better scale-out networks will only drive storage demand.</p>
<p>For most of us this is an academic problem today. Lightly used systems &#8211; such as for backup and archiving &#8211; don&#8217;t see Amazon&#8217;s problems. But in 5 years this will be common even outside the public cloud providers.</p>
<p>Just as IT users have benefited from Google&#8217;s push on energy efficiency and much more, they will also benefit from much lower cost and more scalable networks.</p>
<p><strong>Courteous comments welcome, of course.</strong> I can&#8217;t help but continue to marvel at how dumb Cisco&#8217;s UCS has turned out to be. It&#8217;s a gift that keeps on giving.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2011/10/20/the-network-is-choking-our-storage/&text=The network is choking our storage " target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2011/10/20/the-network-is-choking-our-storage/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>RAMCloud is the new flash</title>
		<link>http://storagemojo.com/2011/10/05/ramcloud-is-the-new-flash/</link>
		<comments>http://storagemojo.com/2011/10/05/ramcloud-is-the-new-flash/#comments</comments>
		<pubDate>Thu, 06 Oct 2011 01:03:30 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Clusters]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2529</guid>
		<description><![CDATA[Sometimes in the midst of the endless tweaking needed to maximize storage performance one just wants to say &#8220;screw it! Put everything in RAM!&#8221; And that&#8217;s just what RAMCloud does. Disk is the new tape, flash the new disk, DRAM the new flash. RAMCloud is a research paper (pdf) and an open software project. The [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Sometimes in the midst of the endless tweaking needed to maximize storage performance one just wants to say &#8220;screw it! Put everything in RAM!&#8221; And that&#8217;s just what RAMCloud does.</p>
<p><strong> Disk is the new tape, flash the new disk, DRAM the new flash.</strong><br />
RAMCloud is a <a href="http://www.stanford.edu/~ouster/cgi-bin/papers/ramcloud.pdf" target="_blank">research paper</a> (pdf) and an <a href="http://fiz.stanford.edu:8081/display/ramcloud/Home" target="_blank">open software project</a>. The goal is enterprise-class availability with every bit of active data stored in DRAM, not disk or flash, for maximum performance. It is a key-value object store today, though as pure software that could change.</p>
<p>It&#8217;s the brainchild of John Ousterhout, a Stanford prof who invented Tcl back in the 80s at Berkeley. </p>
<p><strong>Isn&#8217;t DRAM volatile and costly?</strong><br />
Right on both counts, grasshopper, so RAMCloud isn&#8217;t a 1 for 1 disk-style architecture. No Google FS-style triple replication here, or RAID-style erasure coding.</p>
<p>Instead RAMCloud uses <i>buffered logging</i>:</p>
<blockquote><p>
. . . a single copy of each object is stored in DRAM of a primary server and copies are kept on the disks of two or more backup servers; each server acts as both primary and backup. However, the disk copies are not updated synchronously during write operations. Instead, the primary server updates its DRAM and forwards log entries to the backup servers, where they are stored temporarily in DRAM.
</p></blockquote>
<p>Instead of working around crashes &#8211; using multiple object copies as scale-out storage does &#8211; RAMCloud recovers lost data from the DRAM logs or disk drives to replicate the lost data at high speed. That&#8217;s possible because all the log data is in DRAM or spread across many disks. </p>
<p>In a recent paper (<a href="http://www.stanford.edu/~ouster/cgi-bin/papers/ramcloud-recovery.pdf" target="_blank">Fast Crash Recovery in  RAMCloud</a>) (pdf) Diego Ongaro, Stephen M. Rumble, Ryan Stutsman, John Ousterhout, and Mendel Rosenblum (co-founder of VMware) go into more detail on this critical feature. </p>
<p>The key elements are:</p>
<ul>
<li><strong>Scale.</strong> Servers scatter their backup data across all other servers so thousands of disks can serve the recovery.</li>
<li><strong>Log-structure. </strong> Reduces complexity and offers high performance.</li>
<li><strong>Randomization.</strong> Many decisions need to be made in a large cluster. Rather than CPU, time and bandwidth consuming determinism, injecting randomization speeds decisions with less overhead.</li>
<li><strong>Dynamic tablets.</strong> The key-value store tracks resource usage within a single table and ensures that no single partition is too large for fast restores.</li>
</ul>
<p>DRAM is volatile so the log replication data is spread to other servers on other racks for redundancy before being committed to disk. Still, total system write throughput is limited by the disk write speed, whose limits are a key reason people are moving from disks. Flash drives may help, but other techniques, such as log truncation and sharding make it possible to get good performance from several thousand SATA drives.</p>
<p>How good? The team reports that in a 60 node cluster they recover 35GB in 1.6 seconds. With more nodes larger partitions should be restored even faster. Scale is good.</p>
<p><strong>Lights out!</strong><br />
Power failures wipe all the data in DRAM. The obvious defense is to avoid failures: combine battery backup with diesel generator sets. Power ride-through will handle interruptions into the hundreds of milliseconds.</p>
<p>But who is going to trust that? That&#8217;s why future commercial implementations will insist on logging to stable storage, such as the flash SSDs.</p>
<p>They&#8217;re getting cheaper fast &#8211; faster than DRAM &#8211; which will make this a common approach. </p>
<p><strong>Cost</strong><br />
Professor Ousterhout kindly sent a short note about cost, correctly noting that</p>
<blockquote><p>
. . . if you measure cost/operation, DRAM is roughly 100x cheaper than disk, since a disk can only perform about 100-200 operations/second.  This is why RAMCloud makes sense for data-intensive applications. . . .
</p></blockquote>
<p>While you and I might find that persuasive, too many enterprises don&#8217;t. The deep conservatism of the storage culture &#8211; both figuratively and literally &#8211; makes cost a good excuse to stay with the tried and true, and easy to explain to CFOs. </p>
<p>The good news for the company I hope he is starting is that the primacy of $/GB is slowly eroding as customers see the system level savings from fast storage. SSD vendors and companies like TMS and Kaminario are breaking trail for RAMCloud.</p>
<p><strong>The StorageMojo take</strong><br />
Make no mistake: RAMCloud is a research project, not a commercial product, years and million$ away from commercial application. But the concept is promising.</p>
<p>Imagine a world where data layout doesn&#8217;t matter, where apps are optimized for sub-millisecond storage, where 100 byte I/Os are faster and just as efficient as 8KB I/Os. The architectural implications are huge and would take a decade or more to absorb.</p>
<p>RAMCloud raises the thorny issue of tiering: getting hot data on the hot storage and everything else off to disk. There are OK answers for tiering but nothing insanely great. </p>
<p>RAMCloud shows we&#8217;re far from the end of the line in what storage can do. Faster, better, arguably cheaper: 2 out of 3 ain&#8217;t bad.</p>
<p><strong>Courteous comments welcome, of course.</strong> A shorter version of this post appeared on <a href="http://www.zdnet.com/blog/storage/ramcloud-puts-everything-in-dram/1546" target="_blank">ZDNet</a>.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2011/10/05/ramcloud-is-the-new-flash/&text=RAMCloud is the new flash" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2011/10/05/ramcloud-is-the-new-flash/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>NoSQL in the metadata engine room</title>
		<link>http://storagemojo.com/2011/10/03/nosql-in-the-metadata-engine-room/</link>
		<comments>http://storagemojo.com/2011/10/03/nosql-in-the-metadata-engine-room/#comments</comments>
		<pubDate>Mon, 03 Oct 2011 18:59:44 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Clusters]]></category>
		<category><![CDATA[Future Tech]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2525</guid>
		<description><![CDATA[One more datapoint and we&#8217;ll have a trend: NoSQL databases managing metadata. It&#8217;s obvious in retrospect: use a scalable big data tool to handle scale-out metadata. Maybe not a requirement today, but surely will be with even bigger data tomorrow. Metadata is a fraction of the user data set, but it gets hammered much more. [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>One more datapoint and we&#8217;ll have a trend: NoSQL databases managing metadata. It&#8217;s obvious in retrospect: use a scalable big data tool to handle scale-out metadata. Maybe not a requirement today, but surely will be with even bigger data tomorrow.</p>
<p>Metadata is a fraction of the user data set, but it gets hammered much more. As more metadata is found useful the hammering will get more insistent.</p>
<p><strong>Nutanix</strong><br />
<a href="http://www.nutanix.com/" target="_blank">Nutanix</a>, whose CTO and co-founder, Mohit Aron, was a developer of the Google File System, uses MapReduce. Nutanix achieves it scale due to its distributed metadata, masterless architecture &#8211; powered by MapReduce jobs that run in the background.</p>
<p><strong>Druva</strong><br />
<a href="http://www.druva.com/" target="_blank">Druva</a>, a backup company for mobile devices, also uses a NoSQL database to manage storage metadata. They say they&#8217;ve found that NoSQL scales over an order of magnitude better than relational in similar applications.</p>
<p><strong>Somebody else</strong><br />
A company that shall remain nameless is porting Hadoop to their backend. The customer won&#8217;t be able to access Hadoop for their work &#8211; it is strictly for the system&#8217;s internal use.</p>
<p>It is a proof of concept so it isn&#8217;t a 3rd data point, but they see the potential advantages. Call it data point 2½. </p>
<p><strong>The StorageMojo take</strong><br />
Small advances are the building blocks of disruption. RAID made it possible to build available storage using cheap disks. Consumer adoption of PCs made disks even cheaper. Moore&#8217;s Law made RAID controllers cheaper and faster, or faster and more capable. </p>
<p>A virtuous circle of disruption.</p>
<p>The basic architecture of scale-out storage systems &#8211; purpose-built software on clustered commodity hardware &#8211; has been stable. But this is the beginning of scale-out storage 2.0: taking scale-out technology developed for users and incorporating it into the storage infrastructure itself.</p>
<p>These ideas are bubbling up among the latest startups and among the establishment players. At some point the old RAID architectures will be well and truly broken, able to compete in smaller and smaller niches until the revenue can&#8217;t justify more investment. </p>
<p>Of course vendors have been making RAID controllers out of servers for years now, and those servers can run any software they want. But at some point the explicit and implicit assumptions in the old architecture crash into current realities &#8211; either in cost, development time, feature completeness or management overhead &#8211; and then we move on.</p>
<p><strong>Courteous comments welcome, of course.</strong> I learned about Nutanix at the last <a href="http://techfieldday.com/" target="_blank">Tech Field Day</a> &#8220;The Independent IT Influencer Event&#8221; which paid for my travel expenses to Silicon Valley.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2011/10/03/nosql-in-the-metadata-engine-room/&text=NoSQL in the metadata engine room " target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2011/10/03/nosql-in-the-metadata-engine-room/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

