<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>StorageMojo &#187; Architecture</title>
	<atom:link href="http://storagemojo.com/category/architecture/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com</link>
	<description>Data storage info &#38; analysis</description>
	<lastBuildDate>Thu, 13 Jun 2013 13:50:19 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>HP&#8217;s big transition</title>
		<link>http://storagemojo.com/2013/06/13/hps-big-transition/</link>
		<comments>http://storagemojo.com/2013/06/13/hps-big-transition/#comments</comments>
		<pubDate>Thu, 13 Jun 2013 13:50:19 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>
		<category><![CDATA[SAN, FC]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2975</guid>
		<description><![CDATA[IDC says HP is selling 17% less storage last quarter than a year ago. Was that because the high-end EVA and XP businesses were contracting faster than the new 3PAR converged storage business was growing? IDC definitions IDC defines a disk storage system broadly, including anything with a controller, cables and 3 or more drives [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>IDC says HP is <a href="http://www.idc.com/getdoc.jsp?containerId=prUS24152113&#038;pageType=PRINTFRIENDLY" target="_blank">selling 17% less storage</a> last quarter than a year ago. Was that because the high-end EVA and XP businesses were contracting faster than the new 3PAR converged storage business was growing?</p>
<p><strong>IDC definitions</strong><br />
IDC defines a disk storage system broadly, including anything with a controller, cables and 3 or more drives as external storage. That covers a lot of ground. They also have another segment, called open networked disk storage (NAS Combined with non-mainframe SAN) that kicks out direct-attach storage. </p>
<p>In either case IDC counts OEM revenue for the branding company, not the supplier. So their numbers overstate, say, IBM&#8217;s storage presence &#8211; which includes the NetApp FAS arrays branded as IBM&#8217;s N-series &#8211; while understating NetApp&#8217;s actual shipments. Hitachi, who also OEMs to HP and Sun, also suffers in the IDC rankings.</p>
<p><strong>The HP answer</strong><br />
At HP Discover I asked David Scott, formerly 3PAR&#8217;s CEO and now head of HP storage, reporting to Dave Donatelli, formerly of EMC, about this. He answered that the high-end HP storage business was growing and that the contraction was in the low-end MSA and JBOD business.</p>
<p>According to Mr. Scott, due to HP&#8217;s current troubles, the server group has been moving away from low-margin business, as well as building servers that have up to 62 drives internally (HP ProLiant SL4540 Gen8 Server). Double whammy: server sales are down, and with them, the easy add-on sales of low-end storage; and the servers they do sell have more storage internally and don&#8217;t count as external storage sales.</p>
<p>As the largest seller of servers and disk drives, it doesn&#8217;t take much margin growth for both to drop off. It&#8217;s an intensely competitive market with lots of low-margin business that can juice the top line while eviscerating the bottom line.</p>
<p><strong>The StorageMojo take</strong><br />
The high-end storage market is consolidating. EMC is winning big, and if traditional storage arrays are what you want, they should. </p>
<p>HP is the only major player with a tightly integrated next-gen product strategy that is a stark contrast to EMC&#8217;s product stovepipes. Any enterprise storage product/architecture eval needs to include HP.</p>
<p>Now that they&#8217;ve added the HP 3PAR StoreServ 7450 flash array HP has covered most of the waterfront in enterprise storage. Despite the inevitable headwinds during the largest product transition of any storage vendor, HP is well-positioned for the future.</p>
<p>Market dynamics are just that &#8211; dynamic &#8211; and quarterly numbers are only a snapshot. So while I&#8217;m having trouble making sense of the numbers, I know that big transitions are rarely smooth. All should become clear in the fullness of time.</p>
<p>HP storage has great technology. Their real challenge is getting the HP sales force revved up to compete better against EMC and NetApp. HP&#8217;s tech culture doesn&#8217;t appreciate what salesmen need to win, so I&#8217;d expect that&#8217;s where Scott and Donatelli are spending a lot of their time.</p>
<p><strong>Courteous comments welcome, of course.</strong> HP is putting me up here in Las Vegas &#8211; a favorite drive of mine from NoAZ &#8211; but they aren&#8217;t paying for my attendance. Bummer.</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2013/06/13/hps-big-transition/&text=HP's big transition" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2013/06/13/hps-big-transition/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Architecting &amp; integrating flash into enterprise storage</title>
		<link>http://storagemojo.com/2013/05/16/architecting-integrating-flash-into-enterprise-storage/</link>
		<comments>http://storagemojo.com/2013/05/16/architecting-integrating-flash-into-enterprise-storage/#comments</comments>
		<pubDate>Thu, 16 May 2013 18:04:05 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Disk]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2941</guid>
		<description><![CDATA[Have you ever noticed that it is difficult to get good information about how flash works? The vendors know but they&#8217;ve never been terribly forthcoming. For example, how does flash wear out? When most things break you lose their contents. But once flash stops working your data is still there. Huh? And the fact that [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>Have you ever noticed that it is difficult to get good information about how flash works? The vendors know but they&#8217;ve never been terribly forthcoming.</p>
<p>For example, how does flash wear out? When most things break you lose their contents. But once flash stops working your data is still there. Huh?</p>
<p>And the fact that flash is a wearing medium spooks many people. How should we think about flash? Can we live with a wearing medium?</p>
<p>Or write amplification? How does that work? What can be done to reduce it?</p>
<p>That&#8217;s why it was a pleasure to sit down with Rob Ober of LSI. Rob is an LSI Fellow and system architect with deep technical knowledge of flash and how it interacts with systems and applications.</p>
<p>Rob holds dozens of patents and is articulate and open. Plus he&#8217;s a very nice guy.</p>
<p>I distilled down what I learned and some of Rob&#8217;s key points into a StorageMojo video white paper that <a href="http://www.lsi.com/products/storagecomponents/Pages/SolidState.aspx" target="_blank">LSI</a> commissioned. If you are curious about flash, how it works, how it fails and how it can be turned into an enterprise class storage medium, you&#8217;ll find the video informative.</p>
<p>At least I did my level best to make it so, including video from Wilson Canyon, one of my favorite local hikes. Here&#8217;s the video:</p>
<p><iframe src="http://player.vimeo.com/video/65351106" width="500" height="281" frameborder="0" webkitAllowFullScreen mozallowfullscreen allowFullScreen></iframe></p>
<p><strong>The StorageMojo take</strong><br />
As a thought experiment I sometimes wonder about how storage would be different if IBM had invented flash back in 1956 instead of the RAMAC disk drive. What it reads were fast and free while writes were expensive?</p>
<p>That&#8217;s essentially the problem we&#8217;re trying to solve today. Except today we have an installed base of a couple billion disk drives and decades of driver, OS and application development all predicated on disk performance.</p>
<p>We&#8217;re still in the early days of flash integration, even though forward-leaning architects have been working on it for 6 years or more. Thanks to flash &#8211; and cloud &#8211; storage has never been more vibrant or exciting.</p>
<p><strong>Courteous comments welcome, of course.</strong> Feel free to ask about anything in the video that wasn&#8217;t clear or didn&#8217;t go deep enough. Your questions help me understand what you find valuable.</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2013/05/16/architecting-integrating-flash-into-enterprise-storage/&text=Architecting & integrating flash into enterprise storage" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2013/05/16/architecting-integrating-flash-into-enterprise-storage/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>EMC and the 7 dwarves &#8211; pt 2</title>
		<link>http://storagemojo.com/2013/05/03/emc-and-the-7-dwarves-pt-2/</link>
		<comments>http://storagemojo.com/2013/05/03/emc-and-the-7-dwarves-pt-2/#comments</comments>
		<pubDate>Fri, 03 May 2013 19:52:15 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Cloud computing & storage]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2926</guid>
		<description><![CDATA[Note: This post got so long it needed to be posted in 2 parts. Part 1 is here. And while I promised this 2nd part &#8220;tomorrow&#8221; the editing took much longer than expected. End note. HP has made the most dramatic bet with their 3PAR-based converged storage line. While the rapid growth of the new [...]]]></description>
				<content:encoded><![CDATA[<p></p><p><strong>Note:</strong> This post got so long it needed to be posted in 2 parts. Part 1 is <a href="http://storagemojo.com/2013/04/25/emc-and-the-7-dwarves-part-1/" target="_blank">here</a>. And while I promised this 2nd part &#8220;tomorrow&#8221; the editing took much longer than expected. <strong>End note.</strong></p>
<p><strong>HP</strong> has made the most dramatic bet with their 3PAR-based converged storage line. While the rapid growth of the new products is overwhelmed by the even quicker decline of older products like EVA, they&#8217;re off to a good start, claiming over 1200 new customeres.</p>
<p>The challenge for former EMC&#8217;er Dave Donatelli and 3PAR&#8217;s David Scott isn&#8217;t technology but sales. Do you have field storage sales specialists and, if so, how do you compensate everyone so account managers are comfortable bringing them into large deals? Or do you flog marketing to make selling converged storage so simple and remunerative that server guys do it themselves? </p>
<p>Ideally both, but the HP go-to marketing strategy is mind-numbing detail and then tossing it over the wall for sales to figure out. That&#8217;s not a winning strategy these days.</p>
<p>Bottom line: HP&#8217;s lead in physical servers makes them a threat for many add-on storage sales. But EMC&#8217;s storage-focused sales force keeps winning deals.</p>
<p><strong>IBM</strong>&#8216;s storage business is at risk. While they have great technology, none of their hardware products are even a strong #2. Two thirds of IBM&#8217;s Systems and Technology group business is servers, and it is clear that IBM management isn&#8217;t happy with how <a href="http://www.channelregister.co.uk/2013/04/23/ibm_storage_actions/" target="_blank">some product lines are performing</a>. </p>
<p>IBM top management is not sentimental. Given the disposal of once-core assets, such as disk drives, and IBM&#8217;s shrinking storage market share, it is clear that IBM will likely sell off or shutter some hardware product lines to focus on higher-margin storage. The Texas Memory Systems solid state arrays, storage software and mainframe storage are likely safe. </p>
<p>But at some point &#8211; and that point will come sooner rather than later due to the cloud&#8217;s competitive pressure &#8211; IBM&#8217;s management will decide that the overhead and investment required to support 3rd place and worse products isn&#8217;t worth it. That&#8217;s likely behind the <a href="http://searchstorage.techtarget.com/news/2240181402/Big-Blues-billion-dollar-bet-on-SSD-includes-all-flash-storage-array" target="_blank">**BILLION DOLLAR ALL FLASH**</a> data center initiative: a desperate attempt to stay relevant in a rapidly changing storage world.</p>
<p>Expect to see less-than-competitive hardware lines put on life-support with cheap-to-implement HW upgrades &#8211; new, bigger disks! now with SSD added! &#8211; and new OS quals, until most of the base has migrated or the profits are gone. It is a sad comedown for the company that created and dominated the storage business for decades and <a href="http://storagemojo.com/2005/03/31/daddy-tell-me-again-how-little-emc-beat-giant-ibm/" target="_blank">then lost it to an upstart named EMC</a>.</p>
<p><strong>Who is the final dwarf?</strong><br />
Well, that&#8217;s 6 companies. Who&#8217;s the final dwarf? Fujitsu? NEC? Cisco? Huawei? Yes, Huawei is in the <a href="http://www.huawei.com/en/products/storage-security/cloudstor/index.htm" target="_blank">storage business</a>. </p>
<p>I like Fujitsu and their <a href="http://www.fujitsu.com/global/services/computing/storage/eternus/" target="_blank">Eternus</a> system for the final spot, although NEC&#8217;s <a href="http://www.necam.com/hydrastor/" target="_blank">HYDRAstor</a> has some great scale-out technology. But is NEC getting traction? If they are it&#8217;s a well-kept secret.</p>
<p>Eternus is probably doing best in Japan, a large enough market to keep them going for years. But don&#8217;t expect any game-changers.</p>
<p><strong>The StorageMojo take</strong><br />
The most important impact of cloud infrastructure is that it gives CFOs a yardstick to <strike>beat their CIO with</strike> measure their internal IT against. And they aren&#8217;t happy with what they&#8217;re seeing.</p>
<p>For decades IT has been the wayward child in the corporate family. Cost overruns, service delays, unfathomable mumbo-jumbo, security lapses &#8211; IT is a never ending string of expensive problems &#8211; as the short life of CIOs attests.</p>
<p>But the need for IT infrastructure is only growing. It&#8217;s the means that are changing. That&#8217;s where the aggressive, new architecture storage companies like Fusion-io/NexGen, Tegile, Nimble, Avere, Kaminario, DDN, Tintri, Nutanix and now Exablox come in. </p>
<p>Since the advent of RAID systems in the early 90s we haven&#8217;t seen much turnover in the major players. That&#8217;s about to change.</p>
<p>While it will be wrenching for those involved, it will be good for the industry and for the myriad organizations that need reliable, cost-effective storage. People will buy large arrays for decades to come &#8211; just as they still buy mainframes &#8211; but they&#8217;ll have many more options for non-transactional storage.</p>
<p><strong>Courteous comments welcome, of course.</strong> Texas Memory Systems was a long-time advertiser on StorageMojo until recently. HP flew me to their corporate trade show in Europe last year to get the converged storage story direct from David Scott and others. I&#8217;ve done work for some of the other firms mentioned.</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2013/05/03/emc-and-the-7-dwarves-pt-2/&text=EMC and the 7 dwarves - pt 2" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2013/05/03/emc-and-the-7-dwarves-pt-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Is F5&#8242;s ARX file virtualization a success?</title>
		<link>http://storagemojo.com/2013/04/22/is-f5s-arx-file-virtualization-a-success/</link>
		<comments>http://storagemojo.com/2013/04/22/is-f5s-arx-file-virtualization-a-success/#comments</comments>
		<pubDate>Mon, 22 Apr 2013 17:35:55 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[Management]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2908</guid>
		<description><![CDATA[In response to the post on Avere&#8217;s architecture for fronting backend NAS filers &#8211; where StorageMojo said that no front-end to NAS boxes has succeeded &#8211; alert reader Jacob Marley asked &#8220;What about F5′s ARX to stitch/balance storage across multiple filers?&#8221; Good question! What can we deduce from publicly available sources? The F5 ARX product [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>In response to the <a href="http://storagemojo.com/2013/04/19/fronting-nas-for-fun-and-profit/" target="_blank">post on Avere&#8217;s architecture</a> for fronting backend NAS filers &#8211; where StorageMojo said that no front-end to NAS boxes has succeeded &#8211; alert reader Jacob Marley asked &#8220;What about F5′s ARX to stitch/balance storage across multiple filers?&#8221;</p>
<p><a href="http://storagemojo.com/wp-content/uploads//2013/04/f5-fullcolor-lg.jpgwad58b3e319a2f6d68.jpeg"><img src="http://storagemojo.com/wp-content/uploads//2013/04/f5-fullcolor-lg.jpgwad58b3e319a2f6d68-e1366651712877.jpeg" alt="f5-fullcolor-lg.jpg;wad58b3e319a2f6d68" width="125" height="112" class="alignleft size-full wp-image-2911" /></a></p>
<p>Good question! What can we deduce from publicly available sources?</p>
<p>The F5 <a href="http://www.f5.com/products/hardware/arx-hardware/" target="_blank">ARX product line</a> is billed as an &#8220;intelligent file virtualization solution&#8221; that<br />
&#8220;. . .preserves the logical access to files regardless of their current location on storage.&#8221; Like earlier file switches</p>
<blockquote><p>
The ARX device does not introduce a new file system; rather it acts as a proxy to the file systems that are already there.
</p></blockquote>
<p>ARX is not a storage device itself but a load-balancer for NAS filers. Then, per Mr Marley&#8217;s question, is ARX not a success?</p>
<p><strong>Competitive analysis</strong><br />
First up, let&#8217;s take a look at the latest quarterly 10-Q report, courtesy of the <a href="http://www.sec.gov/edgar.shtml" target="_blank">SEC&#8217;s EDGAR database</a>.</p>
<p>In &#8220;Management’s Discussion and Analysis of Financial Condition and Results of Operations&#8221; they describe their product revenues as</p>
<blockquote><p>
The majority of our revenues are derived from sales of our application delivery networking (ADN) products including our high end VIPRION chassis and related software modules; BIG-IP Local Traffic Manager, BIG-IP Global Traffic Manager, BIG-IP Link Controller, BIG-IP Application Security Manager, BIG-IP Edge Gateway, BIG-IP WAN Optimization module, BIG-IP Access Policy Manager, WebAccelerator; FirePass SSL VPN appliance; Traffix diameter signaling products; and <strong>ARX file virtualization products</strong>.
</p></blockquote>
<p>Unless this is a last-but-not-least ordering, it looks like management is not leading with the ARX products. But let&#8217;s look for more evidence of management&#8217;s priorities.</p>
<p>Combing through the <a href="http://www.f5.com/about/news/press/" target="_blank">F5 newsroom</a>, for instance, we find that the last <a href="http://www.f5.com/about/news/press/2011/20110711/" target="_blank">press release</a> on ARX is almost 2 years old. Titled &#8220;F5’s New ARX Platforms Help Organizations Reap the Benefits of File Virtualization&#8221; it is surprising that later press releases don&#8217;t call out other success stories.</p>
<p>The most recent ARX white papers, &#8220;Reducing Storage Costs with F5 ARX&#8221; and &#8220;Enabling Flexibility with Intelligent File Virtualization&#8221; are both dated 2011. The ARX data sheet is from 2013 though.</p>
<p><strong>The StorageMojo take</strong><br />
It&#8217;s clear that F5 has backed away from the ARX technology &#8211; which they acquired with Acopia in 2007 &#8211; in favor of the Application Delivery Controller market. But does that mean that the Acopia/F5 ARX didn&#8217;t succeed?</p>
<p>Clearly, ARX succeeded for a while: F5 bought them after all. And the F5 PR archives have several success stories from 2009. </p>
<p>But within the current F5 context &#8211; where they have several high-growth segments &#8211; ARX is getting little investment. At another company, perhaps, ARX would be a success, but at F5 it clearly is not.</p>
<p>If I were a customer I would certainly look at ARX if I wanted to virtualize disparate NAS filers. But I&#8217;d be sure to have some contingency plans in place if F5 decided to end-of-life the product in the next 2 years.</p>
<p><strong>Courteous comments welcome, of course.</strong> Fun fact: F5&#8242;s name was inspired by one of my favorite movies &#8211; and an AFI top 100 selection &#8211; <i>Twister</i>, that popularized the Fujita scale (now the Enhanced Fujita scale) for tornado intensity.</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2013/04/22/is-f5s-arx-file-virtualization-a-success/&text=Is F5's ARX file virtualization a success?" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2013/04/22/is-f5s-arx-file-virtualization-a-success/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Fronting NAS for fun and profit</title>
		<link>http://storagemojo.com/2013/04/19/fronting-nas-for-fun-and-profit/</link>
		<comments>http://storagemojo.com/2013/04/19/fronting-nas-for-fun-and-profit/#comments</comments>
		<pubDate>Fri, 19 Apr 2013 23:47:25 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Cloud computing & storage]]></category>
		<category><![CDATA[Clusters]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[Management]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2902</guid>
		<description><![CDATA[The traditional model of NAS filers is handy if you only have a few. But once you get to 8 or 10 NAs filers your life gets complicated. Your oldest data is on the oldest filer and your active data is on the newest. If that new filer bottlenecks your entire system slows down. Hence [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>The traditional model of NAS filers is handy if you only have a few. But once you get to 8 or 10 NAs filers your life gets complicated. </p>
<p>Your oldest data is on the oldest filer and your active data is on the newest. If that new filer bottlenecks your entire system slows down.</p>
<p>Hence the old saw &#8220;you&#8217;ll love your first filer and hate your tenth.&#8221; System administrators will load balance by moving data back and forth, an inherently wasteful and error-prone exercise.</p>
<p><strong>A history of failure</strong><br />
A number of start ups &#8211; such as Z-force and Zambeel &#8211; have attempted a fix. The general idea is a switch that virtualizes the backend filers to create a single pool. </p>
<p>While the concept sounds good, results have been dismal. No storage system whose primary function was to front-end existing NAS boxes has succeeded.</p>
<p><strong>Once more unto the breach</strong><br />
Now another entrant enters the fray. <a href="http://www.averesystems.com" target="_blank">Avere Systems</a> has raised $50 million and is on v3 of their tin-wrapped software. </p>
<p>At NAB 2013 they announced the FXT 3800 Edge Filer. The 3800 tiers across RAM, SSD, SAS, backend NAS <a href="http://www.averesystems.com/News_PressReleases.aspx?ID=65" target="_blank">and cloud</a> across one namespace. </p>
<p>They&#8217;re understandably proud of their <a href="http://www.spec.org/sfs2008/results/res2013q2/sfs2008-20130318-00218.html" target="_blank">new SPECsfs2008 NFS result</a> with a FXT 3800 32 Node Cluster that reached 1,592,334 Ops/Sec. That beats NetApp, Isilon, Hitachi/BlueArc and everybody else, except Huawei&#8217;s OceanStor 8500, which used 24 file systems and more than twice the number of SSDs get over 3 million Ops/Sec.</p>
<p>Oh, and they included a transcontinental latency in the network. As you might see using a cloud provider like Amazon Web Services, which was showing an Avere proto version in their NAB booth.</p>
<p><strong>The hard question</strong><br />
After the briefing by Avere I asked Ron Bianchini, the CEO and cofounder, why Avere would escape the fate of their erstwhile predecessors.</p>
<p>I boiled his answer down to 4 points:</p>
<ul>
<li>Avere&#8217;s appliance is a read and write cache, so hot data I/O is handled directly and not routed to the backend filers. Typically, he says, only 1 out of 50 I/Os leave Avere for backend NAS, and for some workloads it is as little as 1 out of 200.</li>
<li>Their file system is the client of the backend filers, so they know exactly where the data is at all times. Furthermore, they&#8217;ve certified vendors like NetApp, so they handle the inevitable corner cases.</li>
<li>The system moves data across 4 tiers &#8211; DRAM, SSD, SAS, SATA and the backend filers so it is capable of extremely high performance, unlike products that relied upon backend performance.</li>
<li>They also manage blocks within files, so a change in a file doesn&#8217;t require rewriting the entire file, a popular feature in large file applications.</li>
</ul>
<p><strong>The StorageMojo take</strong><br />
Rip and replace has never been popular. With today&#8217;s data volumes it is ever more unwieldy. </p>
<p>Avere&#8217;s performance and cost-effectiveness make it more than a simple pooling of NAS capacity: by reducing the load on current filers it extends their economic life while eliminating hot-spots and bottlenecks. You keep what you&#8217;ve got and make it faster and easier to manage. </p>
<p>Since most disk-based systems are way over-configured on capacity, this also means reduced CapEx and OpEx as fewer new filers are bought and less floor space, power and maintenance is needed. Given their scale-out architecture &#8211; minimum config is 3 nodes &#8211; you can add performance without adding more filers.</p>
<p>Bottom line: Avere, using 21st century technology, has built a new way to utilize existing resources while improving performance and reducing costs. That&#8217;s something no other NAS front-end ever managed.</p>
<p>They&#8217;ll do well.</p>
<p><strong>Courteous comments welcome, of course.</strong> Any Avere users want to comment on their experience? I haven&#8217;t done any work for Avere, but that could change.</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2013/04/19/fronting-nas-for-fun-and-profit/&text=Fronting NAS for fun and profit" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2013/04/19/fronting-nas-for-fun-and-profit/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Build a 3PB storage solution</title>
		<link>http://storagemojo.com/2013/04/01/build-a-3pb-storage-solution/</link>
		<comments>http://storagemojo.com/2013/04/01/build-a-3pb-storage-solution/#comments</comments>
		<pubDate>Mon, 01 Apr 2013 23:36:34 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Clusters]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>
		<category><![CDATA[SAN, FC]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2882</guid>
		<description><![CDATA[Choice is a great thing, unless there&#8217;s too much of it. And choice is what we have a lot of in today&#8217;s data storage market. A longtime StorageMojo reader has an interesting problem: architect a 3PB data storage facility. Can you help? Here&#8217;s what he wrote to StorageMojo. His email has been slightly edited for [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>Choice is a great thing, unless there&#8217;s too much of it. And choice is what we have a lot of in today&#8217;s data storage market. </p>
<p>A longtime StorageMojo reader has an interesting problem: architect a 3PB data storage facility. Can you help?</p>
<p>Here&#8217;s what he wrote to StorageMojo. His email has been slightly edited for clarity and length.</p>
<blockquote><p>
One of my current problems is to design one of the nodes for a large research data storage facility. I&#8217;ve had to do this stuff in varying degrees, varying modalities and varying tech in times gone by.</p>
<p>I&#8217;ve been given a number and &#8220;capacity&#8221; to look into – somewhere near or around 3PB to begin with. We won&#8217;t even go down the path of discussing workloads or disk technology fit for purpose at this stage, but, something has struck me as interesting.</p>
<p>There is this clear divergence in disk technologies at the moment and I&#8217;m finding it hard to resolve what is the &#8220;right&#8221; one of the task.</p>
<p>Currently, I see:</p>
<ul>
<li>Heavy-end storage virtualisation frames [VSP, Symmetrix et al]</li>
<li>Big grid-ish things [IBM XIV etc]</li>
<li>Weird &#8220;stacked&#8221; commodity LSI Silicon [NetApp E5400/5500, SGI IS5500/IS5600, Dell MD3660F etc – all the same silicon I think?!]</li>
<li>Quasi virtualisation arrays with modular form factors (Hitachi&#8217;s HUS-VM?)</li>
<li>High performance dense trays in modular form factors [DDN's SFA-12K Exa and Grid scaler tech?]</li>
<li>Bog-standard performance dense trays in modular form factors [Hitachi HUS, EMC VNX, HP EVA, Dell compellent etc etc]</li>
<li>That wild crazy pure flash/RAM/SSD/NAND world that guys like Violin inhabit.</li>
</ul>
<p>Currently I&#8217;m trying to rationalise what I should be using for a storage platform that needs to scale big, but do it in a sensible economic standpoint, with density, performance of interconnect and throughput with gross mixed workloads being all big factors. </p>
<p>Some folks suggest to me that I should be happy enough with the LSI horizontally stacked 60-drive trays, but I am not sure the technology is tracking too well in terms of performance or density (Hitachi, DDN and maybe some others can now do 84 drives in as little as 4-RU!).</p>
<p>I guess my question to you is – where do you see that dense high performance market heading? I know the guys at the LLNL over your way were crowing about the NetApp E5400 LSI stuff where they managed their &#8220;1TB/sec&#8221; file system (I think it was Lustre based?), but I have to wonder if that could have been more efficiently carried out using a DDN GridScaler/SFA-12K-E etc.
</p></blockquote>
<p><strong>The StorageMojo take</strong><br />
Two issues here: is the segmentation our correspondent offers realistic and helpful? And what are the core architectural issues he needs to think about?</p>
<p>For the first issue an object store or a highly parallel NFS &#8211; like Panasas &#8211; seems to be indicated.</p>
<p>Given that this is a general purpose high-performance system, the critical problem seems to be how the system &#8211; however architected &#8211; handles file creation/update/deletion metadata. String enough disks together &#8211; 1,000 to 2,000 &#8211; and you can get a reasonable # of IOPS and, if you need more, put some SSDs in front. </p>
<p>There are a number of scale-out storage systems that will credibly and economically grow to 3PB. Metadata is often the bottleneck, as Isilon buyers have found when creating many small files.</p>
<p>A maximum performance spec &#8211; including file creation etc. rates &#8211; will probably help eliminate likely laggards, while a budget $ per usable TB/PB will eliminate the uneconomic products. </p>
<p>Vendors are welcome to offer their perspectives. Please just identify your company so we know where you&#8217;re coming from.</p>
<p>Practitioners who&#8217;ve done this, or something similar, are encouraged to share their hard-earned wisdom. 3PB is non-trivial today.</p>
<p><strong>Courteous comments welcome, of course.</strong> I&#8217;m going to start offering almost-free consulting for end-users. Stay tuned!</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2013/04/01/build-a-3pb-storage-solution/&text=Build a 3PB storage solution" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2013/04/01/build-a-3pb-storage-solution/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Dear StorageMojo: should I go all SSD?</title>
		<link>http://storagemojo.com/2013/03/19/dear-storagemojo-should-i-go-all-ssd/</link>
		<comments>http://storagemojo.com/2013/03/19/dear-storagemojo-should-i-go-all-ssd/#comments</comments>
		<pubDate>Tue, 19 Mar 2013 16:33:03 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2874</guid>
		<description><![CDATA[This came in this morning&#8217;s email from a reader I&#8217;ll call Perplexed. How would you advise Perplexed? I&#8217;m looking at a new iSCSI storage system for two sites with ~ 20 servers each &#8211; 10TB each should do it. Picture two fairly usual manufacturing/mining sites, 200-500 users, email, file/Print, finance and production database services, MS [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>This came in this morning&#8217;s email from a reader I&#8217;ll call Perplexed. How would you advise Perplexed?</p>
<blockquote><p>
I&#8217;m looking at a new iSCSI storage system for two sites with ~ 20 servers each &#8211; 10TB each should do it. Picture two fairly usual manufacturing/mining sites, 200-500 users, email, file/Print, finance and production database services, MS Domain etc.</p>
<p>Looking at IOPS &#8211; we would be serviced by 24 x 2.5&#8243; SAS 10K disks in a RAID6 array.So &#8211; the thought occurs &#8211; that SSD would easily match that performance with far less devices. </p>
<p>Say 15 VM&#8217;s and 5 Servers per location. Requirement for about 5TB of data with limited growth &#8211; lets say 10TB storage and under 1000 IOPS. </p>
<p>Throughput is not an issue except for backup and DR. If we can saturate 2 or 3 Gigabit Ethernet links that is adequate.</p>
<p>This would be served comfortably by 24 x 10K 2.5&#8243; RAID6 arrays at each location. Two of them for redundancy.</p>
<p>But &#8211; a single Intel 710 SSD could meet that IOPS rate and probably throughput as well. One SSD disk replacing an entire 24 disk array!</p>
<p>I would then ask, why have RAID at all? RAID is based on spindles being the smallest block for failure. With SSD, that block could be much smaller. The controller is already doing some ECC for wear management with overprovisioning. </p>
<p>Is there a new paradigm the granularity is no longer a &#8220;spindle&#8221;? Should we simply over-provision by 50%? SSD generally comes with provisioned spare capacity &#8211; starting to sound like redundancy and error correction is built into the controllers to some degree already. </p>
<p>What would be ideal is a 1RU box full of 10TB solid state storage with 10G iSCSI &#8211; no separate disks.</p>
<p>Has SSD let us start to move beyond RAID? With the death of spindles and the huge IOPS available, is the entire R1, R5, R6, R10 debate finished? Does RAID have it&#8217;s place in a box full of chips, and if yes, does it look the same as what we know?</p>
<p>Has the world started to change in storage, or is SSD still just non-moving spindles?</p>
<p>10TB, 1000IOPS, 10G iSCSI &#8211; how would you buy it?
</p></blockquote>
<p><strong>Readers, what say you?</strong><br />
What suggestions do you have for Perplexed? The IOPS are low and he doesn&#8217;t suggest heavy bandwidth requirements either. But he does seem very interested in reliability.</p>
<p><strong>Update:</strong> Vendors are welcome to comment. I only ask that you identify yourself as such. <strong>End update.</strong></p>
<p><strong>The StorageMojo take</strong><br />
Aside from cost &#8211; I&#8217;d expect a minimum of $4-$5 per gigabyte or ≈$100k+ for the storage &#8211; the low IOPS requirement means SSD could be overkill. Perhaps a hybrid SSD/disk solution? SSDs can and do fail, so relying on a single SSD is as dangerous as relying on a single HDD.</p>
<p>A number of companies might be appropriate, including Nexsan, Nimble, TwinStrata, Nexenta, Nutanix, Tintri, Violin, Pure, Nimbus, Tegile and Avere among others. Some have features, such as WAN replication or cloud backup, that might prove useful. Others have VM support, but not with iSCSI.</p>
<p>Performance isn&#8217;t likely to be an issue with any of these vendors, so I&#8217;d focus on availability, management, support and then look at cost.</p>
<p><strong>Courteous comments welcome, of course.</strong> I&#8217;ve done work for some of the companies mentioned.</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2013/03/19/dear-storagemojo-should-i-go-all-ssd/&text=Dear StorageMojo: should I go all SSD?" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2013/03/19/dear-storagemojo-should-i-go-all-ssd/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>What is DSSD building?</title>
		<link>http://storagemojo.com/2013/03/13/what-is-dssd-building/</link>
		<comments>http://storagemojo.com/2013/03/13/what-is-dssd-building/#comments</comments>
		<pubDate>Wed, 13 Mar 2013 20:03:30 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[Future Tech]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2868</guid>
		<description><![CDATA[The brains behind the ZFS filesystem &#8211; including Jeff Bonwick and Bill Moore &#8211; have been hard at work for several years at start up DSSD. What are they doing with Andy Bechtolsheim&#8217;s money? Bill&#8217;s recent Usenix bio says that &#8220;. . . DSSD, Inc., [is] a stealth startup focused on creating the fastest and [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>The brains behind the ZFS filesystem &#8211; including Jeff Bonwick and Bill Moore &#8211; have been hard at work for several years at start up DSSD. What are they doing with Andy Bechtolsheim&#8217;s money?</p>
<p>Bill&#8217;s recent Usenix bio says that &#8220;. . . DSSD, Inc., [is] a stealth startup focused on creating the fastest and most reliable storage possible.&#8221;</p>
<p>That puts them in good company.</p>
<p>They also have a trademark on the term &#8220;Cubic RAID&#8221; which describes &#8220;Computer software for use in electronic storage of data.&#8221; That narrows it right down.</p>
<p><strong>Patent search</strong><br />
The good news is that while they are still in stealth mode their 6 patents are out there for all to see. What looks interesting?</p>
<p>The &#8220;Method And System For Multidimensional RAID&#8221; uses something called &#8220;RAID grids&#8221; to enable reliable and robust data storage and access.</p>
<p>The &#8220;Storage System With Self Describing Data&#8221; adds to the multidimensional RAID idea and adds object storage-like constructs into the storage system. </p>
<p>&#8220;Storage System With Incremental Multidimensional RAID&#8221; expands on this idea. By distributing the data and parity across the grid the design intends to enable data recovery despite multiple instances of data corruption or media failure.</p>
<p>The grid, of course, is three-dimensional, turning it into something they call a RAID cube. Given the focus on data recoverability perhaps the D in DSSD stands for durable.</p>
<p><strong>The StorageMojo take</strong><br />
Looking at the technology it seems that the folks at DSSD are focused on a very particular set of requirements.</p>
<p>Self-describing data sounds like object storage. The extreme durability and high performance suggests a system optimized for transaction processing.</p>
<p>The use of solid-state storage implied in the name also suggests a focus on high performance. But SSD costs mean a use case very different from the video storage that objects are commonly used for now.</p>
<p>So what are they building? They are taking a radically different approach to the problem of high-performance transaction processing storage. The use of flash is a given in TP, and the extra durability, scalability and guaranteed read latency would be very attractive in large TP applications.</p>
<p>The most surprising piece is the object storage-like characteristics suggested by the patents. But handling billions of small objects at high-speed in a flat namespace would make it easy to distribute object indexes among hundreds of users, reducing file system I/O latency. The 3D RAID could eliminate the encoding overhead inherent in advanced erasure codes while providing similar robustness, enabling way-beyond-RAID6 availability. </p>
<p>Going after the last stronghold of big iron RAID arrays may seem bold. But as Willy Sutton said of banks, that&#8217;s where the money is. </p>
<p>The key will be whether their technology enables them to come to market with a clear economic advantage, be it cost or performance. With some of the most brilliant and original minds in storage and Silicon Valley working the problem, whatever they come up with is bound to be innovative. </p>
<p><strong>Courteous comments welcome, of course.</strong> A launch this year feels about right.</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2013/03/13/what-is-dssd-building/&text=What is DSSD building?" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2013/03/13/what-is-dssd-building/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>StorageMojo&#8217;s best papers of FAST &#8217;13</title>
		<link>http://storagemojo.com/2013/02/15/storagemojos-best-papers-of-fast-13/</link>
		<comments>http://storagemojo.com/2013/02/15/storagemojos-best-papers-of-fast-13/#comments</comments>
		<pubDate>Fri, 15 Feb 2013 08:39:57 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Future Tech]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2863</guid>
		<description><![CDATA[StorageMojo is focussed on new storage technologies, products, companies and markets. And where do new technologies come from? From people researching at the limits of the known. That&#8217;s why StorageMojo attends the Usenix File And Storage Technology (FAST) conference every year. Top academics &#8211; many with corporate ties &#8211; and grad students present the latest [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>StorageMojo is focussed on new storage technologies, products, companies and markets. And where do new technologies come from? From people researching at the limits of the known.</p>
<p>That&#8217;s why StorageMojo attends the <a href="https://www.usenix.org/conference/fast13" target="_blank">Usenix File And Storage Technology</a> (FAST) conference every year. Top academics &#8211; many with corporate ties &#8211; and grad students present the latest research. </p>
<p>Many papers are submitted and reviewed before some are chosen for presentation at the conference over two and a half days. Here are StorageMojo&#8217;s favorites from FAST 13, in no particular order:</p>
<p><i><strong>SD Codes: Erasure Codes Designed for How Storage Systems Really Fail</strong></i> by James S. Plank, U of Tennessee, and Mario Blaum and James L. Hafner of IBM Research. RAID systems are vulnerable to a disk failures and unrecoverable read errors, but RAID 6 is overkill for UREs. The paper investigates lighter-weight erasure codes &#8211; disk plus sector, instead of 2 disks &#8211; to reclaim capacity for user data. </p>
<p><strong>The StorageMojo take</strong>: high update costs make this most attractive for active archives, not primary storage. The capacity savings could extend the economic life of current RAID strategies vs newer erasure codes.</p>
<p><i><strong>Gecko: Contention-Oblivious Disk Arrays for Cloud Storage</strong></i> by Ji-Yong Shin and Hakim Weatherspoon of Cornell, Mahesh Balakrishnan of Microsoft Research and Tudor Marian of Google. The limited I/O performance of disks makes contention a persistant problem on shared systems. The authors propose a novel log structured disk/SSD configuration and show that it virtually eliminates contention between writes, reads and garbage collection.</p>
<p><strong>The StorageMojo take</strong>: SSDs help with contention, but they aren&#8217;t affordable for large-scale deployments. Gecko offers a way to leverage SSDs for log-structured block storage that significantly improves performance at a reasonable hardware cost.</p>
<p><i><strong>Write Policies for Host-side Flash Caches</strong></i> by Leonardo Marmol, Raju Rangaswami and Ming Zhao of Florida International U., Swaminathan Sundararaman and Nisha Talagala of Fusion-io and Ricardo Koller of FIU and VMware. Write-through caching is safe but expensive. NAND&#8217;s non-volatile nature enables novel write-back cache strategies that preserve data integrity while improving performance. Thanks to large DRAM caches, read-only flash caches aren&#8217;t the performance booster they would have been even 5 years ago.</p>
<p><strong>The StorageMojo take</strong>: Using flash only for reads mean ignoring half &#8211; or more &#8211; of the I/O problem. This needs to be fixed and this paper points the way.</p>
<p><i><strong>Understanding the Robustness of SSDs under Power Fault</strong></i> by Mai Zheng and Feng Qin of Ohio State and Joseph Tucek and Mark Lillibridge of HP Labs. The authors tested 15 SSDs from 5 vendors by injecting power faults. 13 of the 15 lost data that should have been written and 2 of the 13 suffered massive corruption. </p>
<p><strong>The StorageMojo take</strong>: We may be trusting SSDs more than they deserve. This research points out problems with still immature SSD technology.</p>
<p><i><strong>A Study of Linux File System Evolution</strong></i> by Lanyue Lu, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau and Shan Lu of the University of Wisconsin. The authors analyzed 8 years of Linux file system patches &#8211; 5079 of them &#8211; and discovered, for instance, that </p>
<blockquote><p>
. . . semantic bugs, which require an understanding of file-system semantics to find or fix, are the dominant bug category (over 50% of all bugs). These types of bugs are vexing, as most of them are hard to detect via generic bug detection tools; more complex model checking or formal specification may be needed.
</p></blockquote>
<p><strong>The StorageMojo take</strong>: Anyone building or maintaining a file system should read this paper to get a handle on how and why file systems fail. Tool builders will find some likely projects as well.</p>
<p><strong>Courteous comments welcome, of course.</strong> There were some great posters and WIP presentations as well that I hope to write about soon.</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2013/02/15/storagemojos-best-papers-of-fast-13/&text=StorageMojo's best papers of FAST '13" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2013/02/15/storagemojos-best-papers-of-fast-13/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Why virtualization is a feature and VMware will lose</title>
		<link>http://storagemojo.com/2013/02/07/why-virtualization-is-a-feature-and-vmware-will-lose/</link>
		<comments>http://storagemojo.com/2013/02/07/why-virtualization-is-a-feature-and-vmware-will-lose/#comments</comments>
		<pubDate>Thu, 07 Feb 2013 18:43:18 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Cloud computing & storage]]></category>
		<category><![CDATA[Clusters]]></category>
		<category><![CDATA[Future Tech]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2855</guid>
		<description><![CDATA[I was asked at the SNIA nonvolatile memory conference why I did not include virtualization as a major driver for the use of nonvolatile memory. Flash helps with the multiple virtual machine I/O blender problem. But we also had that problem when we were running multiple applications on a single machine. Yes, performance requirements were [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>I was asked at the <a href="http://www.snia.org/nvmsummit" target="_blank">SNIA nonvolatile memory conference</a> why I did not include virtualization as a major driver for the use of nonvolatile memory. Flash helps with the <a href="http://storagemojo.com/2008/07/23/the-virtual-machine-io-blender/" target="_blank">multiple virtual machine I/O blender</a> problem. </p>
<p>But we also had that problem when we were running multiple applications on a single machine. Yes, performance requirements were lower, but we managed.</p>
<p>I consider virtualization a feature and not a market. Why is virtualization a feature rather than a long-term product as many keen observers believe?</p>
<p><strong>VM history</strong><br />
In the 1970s we had the virtual memory operating system wars. A number of companies, including DEC, Prime and IBM, developed virtual memory operating systems.</p>
<p>The key enabler of the virtual memory operating systems was the advent of 32-bit processors with true 32-bit address spaces. This is back when people programmed on minicomputers with a quarter of a megabyte to a maximum 4 MB of physical memory.</p>
<p>The VAX/VMS (Virtual Address eXtension/Virtual Memory System) OS, in contrast, offered a massive 4GB address space, 2 of which were reserved for the system and 2 for users.</p>
<p>Virtual memory operating systems enabled important changes for the industry. Software developers could focus on making their software as functional as possible without worrying about the underlying memory architecture. </p>
<p>The virtual memory wars continued into the PC era, when MS-DOS was limited to a 640KB address space and Quarterdeck systems came in with QEMM. But eventually Intel and Microsoft got their act together and solved the problem of virtual memory for the masses.</p>
<p>No one pays extra for a virtual memory operating system today. It is expected functionality that most don&#8217;t even know exists.</p>
<p>Like 32-bit processors the advent of microprocessors powerful enough to run multiple applications led to the virtual machine opportunity. If Microsoft had not been intent on selling as many server licenses as possible, they could have improved their multitasking so that the problem of server sprawl might never have occurred.</p>
<p>Be that as it may, there is nothing in today&#8217;s virtualization technology that could not and should not be incorporated into server operating systems. In 20 years new CompSci grads won&#8217;t know that virtual machines weren&#8217;t always built into the OS. </p>
<p><strong>Feature vs product</strong><br />
Now just because something is at bottom a feature rather than a product doesn&#8217;t mean that you can&#8217;t make gobs of money before it becomes common. VMware is one example.</p>
<p>Data deduplication, for example, is clearly a feature. But the founders of Data Domain were able to make a lot of money by exploiting that feature with a high-quality application-focused implementation and being first to market.</p>
<p>Now deduplication is being built into new products. While debates over implementation details will continue among engineers, in a few years most users will see deduplication as a checkbox item. </p>
<p><strong>The transient and the permanent</strong><br />
How do we distinguish between a feature and a product? It is the difference between the transient and the permanent.</p>
<p>Transient problems can be resolved. Permanent problems can only be managed.</p>
<p>Processor virtual memory management is a problem that has been solved for the majority of the world&#8217;s computers. Data de-duplication can be added over time to storage systems as computing power increases and the cost of bandwidth and random I/Os &#8211; thanks NVM! &#8211; drops.</p>
<p>But some problems can only be managed, not solved. The issues of scale, aggregation and metadata &#8211; among others &#8211; will always be with us.</p>
<p>Like gas-rich regions of galactic star formation, these manageable-but-not-solvable issues will continue to be areas rich in startup formation.</p>
<p><strong>The StorageMojo take</strong><br />
Applying this theory to current markets yields some predictions:</p>
<ul>
<li>VMware, despite its feature-rich ecosystem and early lead, will lose to vendors, such as Microsoft and Red Hat, who can incorporate the most important virtualization features into their OS. VMware has no OS to fall back on and thus has no long-term future.</li>
<li>Data Domain is a wasting asset. As others add dedup to their products, DD&#8217;s differentiation will decline along with its market value.</li>
<li>Scale-out storage, like Isilon, will remain a lively market segment as the economics of disk, NVM, software, aggregation and metadata keep changing the calculus of efficient and durable storage.</li>
<li>The improving economics of erasure coding will enable more efficient approaches to backup and archiving &#8211; as long as Moore&#8217;s Law continues to hold.</li>
<li>System management is a permanent problem. When we get autonomic management at one level, the problem just kicks up a level with increasing scale. </li>
</ul>
<p>Just as no one remembers the critical register and segment management skills required for 16-bit minicomputers, in a decade or so all the painfully acquired knowledge required to manage VMs will lose value as it gets built into the OS infrastructure. But there will always be new worlds to conquer.</p>
<p><strong>Courteous comments welcome, of course.</strong> How would you define a feature vs a product?</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2013/02/07/why-virtualization-is-a-feature-and-vmware-will-lose/&text=Why virtualization is a feature and VMware will lose" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2013/02/07/why-virtualization-is-a-feature-and-vmware-will-lose/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>The stovepipe vs the grid</title>
		<link>http://storagemojo.com/2013/01/25/the-stovepipe-vs-the-grid/</link>
		<comments>http://storagemojo.com/2013/01/25/the-stovepipe-vs-the-grid/#comments</comments>
		<pubDate>Fri, 25 Jan 2013 18:34:24 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Cloud computing & storage]]></category>
		<category><![CDATA[Clusters]]></category>
		<category><![CDATA[Enterprise]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2850</guid>
		<description><![CDATA[Hewlett-Packard made a major announcement in December: their grid-based StoreServ 7000 based on 3PAR software. 3PAR has been selling enterprise-class storage for almost a decade and HP is building on that legacy with a radical take on enterprise storage. HP calls it converged storage. The concept is simple: enterprise-class scale-out storage that is configurable to [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>Hewlett-Packard made a major announcement in December: their grid-based <a href="http://h17007.www1.hp.com/us/en/storage/nextera/index.aspx" target="_blank">StoreServ 7000</a> based on 3PAR software. 3PAR has been selling enterprise-class storage for almost a decade and HP is building on that legacy with a radical take on enterprise storage.</p>
<p>HP calls it converged storage. The concept is simple: enterprise-class scale-out storage that is configurable to meet a wide range of application requirements. Flexible commodity hardware to mix-and-match to meet requirements, while configuring SSDs, hard drives or tape makes it easy to meet a wide range of price-performance ratios.</p>
<p>It is the enterprise analog of Google&#8217;s and Amazon&#8217;s cloud infrastructures whose scale, flexibility and cost are up-ending traditional IT. I&#8217;ve been looking for this from a major vendor since I <a href="http://storagemojo.com/google-file-system-eval-part-ii/" target="_blank">wrote about the Google File System 7 years ago</a>.</p>
<blockquote><p>
GFS breaks that model and shows us what can be done when the entire storage paradigm is rethought. Build the availability around the devices, not in them, treat the storage infrastructure as a single system, not a collection of parts, extend the file system paradigm to include much of what we now consider storage management, including virtualization, continuous data protection, load balancing and capacity management.</p>
<p>GFS is not the future. But it shows us what the future can be.
</p></blockquote>
<p><strong>A choice, not an echo</strong><br />
The prevailing vendor model is the stovepipe: for every function within the enterprise there is a specialized product or set of products. These stovepipes are then glued together with a management layer that is supposed to manage everything from a single pane of glass &#8211; but never does.</p>
<p>Competition is stovepipe vs. stovepipe. The big iron arrays from EMC, Hewlett-Packard and Hitachi all compete, in theory, on availability, performance, support and cost. </p>
<p>The reality? Account control is often the deciding factor. Competition has stagnated over the last 10 years.</p>
<p>EMC is the most successful stovepipe peddler. Their technology publishing model &#8211; buy young companies that have crossed the chasm and sell the hell out of them &#8211; reduces their engineering risk at the cost of grievous incompatibility.</p>
<p><strong>Storage as a layer, not a pipe</strong><br />
But as long as EMC can afford to buy the best &#8211; and they can &#8211; competitors are always at a disadvantage. But HP&#8217;s converged storage model changes that &#8211; just as Amazon has forever changed enterprise IT.</p>
<p>HP says they&#8217;ve already won 1200 customers for their converged storage line. </p>
<p><strong>The StorageMojo take</strong><br />
No one is going to out-EMC EMC. They have the largest collection of best-of-breed point products in the industry.</p>
<p>Dave Donatelli, formerly of EMC, knows this well. Thus HP&#8217;s flanking move with converged storage.</p>
<p>Assuming HP can deliver a substantial fraction of Amazon&#8217;s benefits inside the data center, they will do very well. But the real magic will come when customers start interconnecting islands of converged storage to create a storage layer &#8211; like the network layer &#8211; in their infrastructure.</p>
<p>There are substantial customer political obstacles, but Amazon&#8217;s success has changed how CFOs look at IT. Smart CIOs are listening and responding.</p>
<p>Congratulations to HP and the 3PAR team for coming up with the only credible challenge to EMC in the last 15 years. I wish them every success.</p>
<p><strong>Courteous comments welcome, of course.</strong> The friendly folks at HP flew me to Europe last year to get the announcement first hand. But I&#8217;d like a technical briefing from a 3PAR architect.</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2013/01/25/the-stovepipe-vs-the-grid/&text=The stovepipe vs the grid" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2013/01/25/the-stovepipe-vs-the-grid/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>2013: Year of the slog</title>
		<link>http://storagemojo.com/2013/01/02/2013-year-of-the-slog/</link>
		<comments>http://storagemojo.com/2013/01/02/2013-year-of-the-slog/#comments</comments>
		<pubDate>Wed, 02 Jan 2013 20:29:12 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Cloud computing & storage]]></category>
		<category><![CDATA[Disk]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[Information Management]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2837</guid>
		<description><![CDATA[There are years where new ideas and concepts explode. And there are years of consolidation. 2013 will be the latter. The storage industry has a lot to digest. Here are some of the issues. ReRAM Would-be vendors of the enterprise NAND flash replacement technology, Resistance RAM or ReRAM, will be hunting for launch customers. While [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>There are years where new ideas and concepts explode. And there are years of consolidation. 2013 will be the latter.</p>
<p>The storage industry has a lot to digest. Here are some of the issues. </p>
<p><strong>ReRAM</strong><br />
Would-be vendors of the enterprise NAND flash replacement technology, Resistance RAM or ReRAM, will be hunting for launch customers. While their new product won&#8217;t be as cheap as NAND flash, it doesn&#8217;t need to be because enterprise customers will pay for greater speed, reliability and durability. The medium isn&#8217;t a big cost driver.</p>
<p>If ReRAM launches this year, who will lead the charge? I&#8217;d bet on either Intel, Violin Memory, Kaminario or Samsung. Intel and Samsung would incorporate ReRAM into their SSDs, while Kaminario and Violin would build it into their high-end solid-state arrays.</p>
<p><strong>Disks</strong><br />
Disk drive vendors are trying to figure out how to survive the deflation of their largest market due to tablets and smart phones. They&#8217;re counting on hybrid drives that blunt the performance advantages of pure SSDs. But they&#8217;ll need a higher level of commitment to architecting the right products than I&#8217;ve seen so far.</p>
<p>We&#8217;ll also see if Hitachi&#8217;s helium drives can make it in the marketplace. Hitachi/WD must have a major data center launch customer &#8211; Google, Facebook or Amazon &#8211; since no major server vendor would offer them without a 2nd source.</p>
<p><strong>Next-gen systems</strong><br />
Architects from next-gen storage companies such as Avere, Nimble Storage, Amplidata, Nimbus Data, Fusion-io, Starboard, Nutanix, Violin Memory, Tegile and many others will be tuning their systems and software &#8211; mostly software these days &#8211; based on real customer workloads instead of guesstimates. </p>
<p>HP&#8217;s converged storage is a special case. Will Donatelli be able to motivate HP&#8217;s enterprise sales force to move the product? </p>
<p><strong>Cloud</strong><br />
Cloud storage and computing will continue to bend the arc of enterprise new technology adoption. CFOs will become tougher in their questioning of CIOs given the obvious CapEx advantages of cloud infrastructure.</p>
<p><strong>Big Data</strong><br />
Multiple slogs for Big Data. Privacy concerns and regulation. The plethora of infrastructure options slows decision making and PO writing. Dawning awareness that the output is only as good as the questions you ask &#8211; technology is secondary. </p>
<p>But we&#8217;re only at the beginning of the Big Data revolution. </p>
<p><strong>The StorageMojo take</strong><br />
The last decade has seen an explosion of new storage architectures and technologies. That&#8217;s as it should be, since storage is the hardest part of information infrastructure &#8211; and the most critical.</p>
<p>But while innovation won&#8217;t halt this year, it&#8217;s time for the market to sort things out. CIOs who value their jobs will be looking beyond their usual suppliers to better compete with cloud IaaS vendors.</p>
<p>It will be an interesting year.</p>
<p><strong>Courteous comments welcome, of course.</strong> I&#8217;ve done work for several of the mentioned vendors, but for most I haven&#8217;t.</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2013/01/02/2013-year-of-the-slog/&text=2013: Year of the slog" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2013/01/02/2013-year-of-the-slog/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>EMC&#8217;s Xtreme embarrassment</title>
		<link>http://storagemojo.com/2012/12/05/emcs-xtreme-embarrassment/</link>
		<comments>http://storagemojo.com/2012/12/05/emcs-xtreme-embarrassment/#comments</comments>
		<pubDate>Wed, 05 Dec 2012 07:56:46 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2829</guid>
		<description><![CDATA[EMC&#8217;s Pravda, Chuck Hollis, chats in an interview about XtremIO&#8217;s problems. This quote from a Storage Newsletter interview may have some transcription errors, since SN is a French publication, but let&#8217;s not split hairs. Read it all to learn more. When will you have a dedicated all-flash array? We announced our XtremIO acquisition last year. [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>EMC&#8217;s <i>Pravda</i>, Chuck Hollis, <a href="http://www.storagenewsletter.com/news/people/interview-chuck-hollis-global-marketing-cto-emc" target="_blank">chats in an interview</a> about XtremIO&#8217;s problems. This quote from a <i>Storage Newsletter</i> interview may have some transcription errors, since SN is a French publication, but let&#8217;s not split hairs. Read it all to learn more.</p>
<blockquote><p>
When will you have a dedicated all-flash array?<br />
<i>We announced our XtremIO acquisition last year. We demonstrated it at VMworld and EMC World. It&#8217;s an all-flash design. Made of de-dupe, scale-out architecture, all flash. It fits very precise parts of the market. </i></p>
<p>Do you sell it?<br />
<i>No, right now we&#8217;re in what&#8217;s called &#8216;early access&#8217;. Test with the customers, make sure it works right. You&#8217;ll probably see it in the first half of 2013 as a general product. It&#8217;s got a very specific profile. Random IO/s, extreme performance and tiering do not work. We have a data set where everything has to run fast.</i>
</p></blockquote>
<p><strong>Signal intelligence</strong><br />
Mr. Hollis is paid well to present the EMC party line, and he&#8217;s good at it. There are reasons for everything he says.</p>
<ol>
<li>Pre-announcing the new flash array is intended to freeze the market, especially <a href="www.violin-memory.com/" target="_blank">Violin Memory</a>, <a href="www.kaminario.com" target="_blank">Kaminario</a>, <a href="www.nimbusdata.com" target="_blank">Nimbus Data</a> and <a href="www.purestorage.com" target="_blank">Pure Storage</a> among others. </li>
<li>&#8220;Precise parts of the market&#8221; means that EMC doesn&#8217;t want customers to see it as a Symm replacement &#8211; which given their likely Xtreme pricing and need to keep VMAX billions rolling in &#8211; is simple self-preservation.</li>
<li>It won&#8217;t ship until the 2nd half of 2013 at the earliest and may slip further. But an H1/2013 <i>announcement</i> is likely.</li>
</ol>
<p><strong>Unpacking the ship date</strong><br />
EMC is very good at keeping secrets when they want to. In this case they don&#8217;t want to. </p>
<p>Why?</p>
<p>By telling us that they&#8217;re intending to enter the market for pure flash arrays they are hoping to slow the competitors kicking VMAX to the curb. EMC sales hates losing deals, but since they don&#8217;t have a competitive flash array they are stymied.</p>
<p>If the product were shippable in Q1/2013, EMC would deploy evangelists, including Mr. Hollis, to give NDA presentations, offer test lab visits, early delivery units and so on. But the &#8220;early access&#8221; program &#8211; &#8220;test with the customers, make sure it works right&#8221; &#8211; is a beta test. Normally that would come late in the development process when announce and ship dates are reasonably firm.</p>
<p>The dates aren&#8217;t firm if &#8220;probably&#8221; H1/2013 is the best EMC can offer. When XtremIO was bought, EMC <a href="http://www.computerweekly.com/news/2240150763/XtremIO-A-costly-mistake-or-genius-deal-for-EMC" target="_blank">predicted</a> a Q1/2013 product launch. </p>
<p><strong>The StorageMojo take</strong><br />
XtremIO hadn&#8217;t shipped a beta when EMC bought them. It&#8217;s clear that XtremIO&#8217;s &#8220;great technology&#8221; is further from a &#8220;great product&#8221; than they&#8217;d hoped.</p>
<p>Thus the damage control effort.</p>
<p>To limit embarrassment EMC may announce at EMC world in May with &#8220;planned availability&#8221; for Q3. But given the dynamics &#8211; flash arrays have a lot of interesting wrinkles &#8211; Q4 is my forecast for shipments of a de-featured product. </p>
<p>What went wrong? Under-estimating market demand for all-flash arrays plus over-estimating VMAX with flash cache appeal, resulting in a reluctance to pay $2B+ for a less-mature-than-they&#8217;d-like, but installed product company like Kaminario or Violin.</p>
<p>The year&#8217;s delay will be costly for EMC as flash array competitors become firmly entrenched in major accounts. Fixing that will take more than spin.</p>
<p><strong>Courteous comments welcome, of course.</strong> I&#8217;ve done work for Kaminario and Violin, and I&#8217;m still a little peeved over <a href="http://storagemojo.com/2007/03/20/emc-threatens-storagemojo/" target="_blank">EMC threatening StorageMojo</a>.</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2012/12/05/emcs-xtreme-embarrassment/&text=EMC's Xtreme embarrassment" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2012/12/05/emcs-xtreme-embarrassment/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Big Data and object storage: lessons learned</title>
		<link>http://storagemojo.com/2012/11/08/big-data-and-object-storage/</link>
		<comments>http://storagemojo.com/2012/11/08/big-data-and-object-storage/#comments</comments>
		<pubDate>Thu, 08 Nov 2012 23:01:38 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Cloud computing & storage]]></category>
		<category><![CDATA[Information Management]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2813</guid>
		<description><![CDATA[I&#8217;ve had the pleasure of moderating a half-dozen panel discussions on Big Data and object storage in the last few months. It&#8217;s been a learning experience. Big Data has always been as big as we could afford, be it block, file or object. Google&#8217;s MapReduce, running on massive commodity-based scale-out GFS object storage, inspired the [...]]]></description>
				<content:encoded><![CDATA[<p></p><p>I&#8217;ve had the pleasure of moderating a half-dozen panel discussions on Big Data and object storage in the last few months. It&#8217;s been a learning experience.</p>
<p>Big Data has always been as big as we could afford, be it block, file or object. Google&#8217;s MapReduce, running on massive commodity-based scale-out GFS object storage, inspired the current Big Data craze. </p>
<p>If storage were the only problem we could stop with a discussion of <a href="http://www.amplidata.com" target="_blank">Amplidata</a>, the object storage company that sponsored StorageMojo&#8217;s participation (check out the <a href="" target="_blank">StorageMojo object storage video</a> to get a quick overview). But there&#8217;s much more to it than that.</p>
<p><strong>Defining Big Data</strong><br />
Ever notice that the number of disk drives in most infrastructures stays fairly constant? The growth in drive capacity parallels the growth in data, so disk drive populations should be stable.</p>
<p>Thus a definition of Big Data based on the number of drives is simple and flexible over time. Defining the lower bound of Big Data as an application that uses 100 large capacity SATA drives means that with 4TB drives you aren&#8217;t doing Big Data unless you have 400TB of raw storage.</p>
<p>Smaller data sets can be vital to business or research; this isn&#8217;t a &#8220;mine is bigger than yours&#8221; competition. But as a rule of thumb you can probably live with filers if you&#8217;re certain your application won&#8217;t need more than 100 drives ever and, if you will, you should consider object storage.</p>
<p><strong>Technology</strong><br />
There is no doubt that advanced erasure codes &#8211; variously referred to as rateless or fountain codes &#8211; are technically and financially superior to the older and much more common Reed-Solomon erasure codes for Big Data. </p>
<p>They are lower overhead, offer higher redundancy and better security than RS codes. Want to survive 4 failures with ≈50% overhead? Rateless codes are the answer.</p>
<p><strong>Moving Big Data</strong><br />
Shipping boxes of big SATA drives overnight is still the high bandwidth solution &#8211; assuming the drives survive &#8211; but the physical hassle is prohibitive for most operations. Assuming you need to move gigabytes regularly, you need 2 things: a fast transport; and a storage system that can manage sustained write performance &#8211; which not all can. </p>
<p>The friendly folks at <a href="http://asperasoft.com" target="_blank">Aspera software</a> have figured out how to achieve line speed data rates across long distances, a trick TCP has never learned. They can feed the data directly to an object storage system.</p>
<p><strong>Integration</strong><br />
Object storage isn&#8217;t a drop-in replacement for NAS: the common REST interface is much simpler than NFS 4.1 or SMB 3.0. Many new apps are now written to use REST, but what about legacy apps that expect NAS?</p>
<p>At least one vendor,<a href="http://panzura.com" target="_blank">Panzura</a> offers a NAS front end to object storage. Their Global Cloud Storage System serves up NAS protocols while using object storage on the back end.</p>
<p>But they only support a few back-end options, so you better like their choices. There are options beside Panzura though.</p>
<p><strong>The business problem</strong><br />
The technology is there, but <a href="http://searchstorage.techtarget.com/news/2240167974/Object-storage-system-success-examined-at-Storage-Networking-World" target="_blank">as I asked</a> at one of the panels &#8220;if this stuff is so freakin&#8217; great, why isn&#8217;t everyone using it?&#8221; It isn&#8217;t only enterprise IT inertia.</p>
<p>There&#8217;s another problem: business units and enterprises aren&#8217;t used to seeing storage as horizontal infrastructure, where scale-out storage wins. Even major adopters of object storage typically use a single app to justify an investment. </p>
<p>Networks are a horizontal infrastructure. We buy them anyway. It is a solvable problem that I&#8217;ll be writing more about more later.</p>
<p><strong>The StorageMojo take</strong><br />
A few years ago there was debate about public vs private clouds. I continue to think there are business cases to be made for both. </p>
<p>But they aren&#8217;t the same cases.</p>
<p>As the commoditization of scale-out architectures grows and the cost of public cloud storage becomes an issue, the advantage that public providers offer today will decline. AWS is driving app development to the S3 API, and those apps will migrate easily to private clouds.</p>
<p>The growth of private clouds won&#8217;t crater AWS. Yet AWS will enable private clouds. </p>
<p><strong>Courteous comments welcome, of course.</strong> How do you define Big Data?</p>
<div class="twttr_button">
					<a href="http://twitter.com/share?url=http://storagemojo.com/2012/11/08/big-data-and-object-storage/&text=Big Data and object storage: lessons learned" target="_blank" title="Click here if you like this article.">
						<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
					</a>
				</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2012/11/08/big-data-and-object-storage/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>
