<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>StorageMojo &#187; NAS, IP, iSCSI</title>
	<atom:link href="http://storagemojo.com/category/nas/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com</link>
	<description>Data storage info &#38; analysis</description>
	<lastBuildDate>Mon, 21 May 2012 22:16:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Amplidata&#8217;s distributed object store</title>
		<link>http://storagemojo.com/2012/04/17/amplidatas-distributed-object-store/</link>
		<comments>http://storagemojo.com/2012/04/17/amplidatas-distributed-object-store/#comments</comments>
		<pubDate>Tue, 17 Apr 2012 18:19:05 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Clusters]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2647</guid>
		<description><![CDATA[Our digital civilization requires data integrity and long-term preservation, and neither is assured by our current storage infrastructure. But progress continues. Latest case in point: Amplidata. This 4 year old company, based in Belgium with a growing US footprint, brings a new level of erasure code goodness to the both problems with a cluster-based object [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Our digital civilization requires data integrity and long-term preservation, and neither is assured by our current storage infrastructure. But progress continues.</p>
<p>Latest case in point: Amplidata. This 4 year old company, based in Belgium with a growing US footprint, brings a new level of erasure code goodness to the both problems with a cluster-based object store.</p>
<p>What erasure code goodness, you ask? The first &#8211; AFAIK &#8211; rateless erasure code, AKA fountain code, storage system in production use.</p>
<p><strong>And that is a good thing because?</strong><br />
Robustness and efficiency. </p>
<p>Amplidata claims storage durability well beyond RAID 6: 10 9&#8242;s (spread across 16 drives with up to 4 failures) durability &#8211; though the spreads can be much larger logically and geographically. They do this by breaking the data object into segments and adding redundancy data. </p>
<p>The redundancy data adds about 50% to the object size &#8211; more efficient than mirroring or triple replication. The benefit is that the system can lose hundreds of segments and still reconstruct the data.</p>
<p>Each object is protected by checksums that can protect against more than 1000 simultaneous bit errors per object. And each write goes to at least to controllers before it is committed.</p>
<p>What kind of monster controller is able to perform all this magic? The minimum configuration is 3 Xeon-based commodity controller nodes with as many 10-drive Atom-based storage nodes as you need.</p>
<p>Amplidata is optimized for bandwidth, not IOPS. With their latest software update they now spec each controller at 750MB/sec, and you can have as many controllers as you can afford.</p>
<p><strong>Sounds like Cleversafe</strong><br />
Cleversafe thought so too, and they&#8217;ve sued Amplidata for patent infringement. But Intel &#8211; who knows about patents and due diligence &#8211; invested after the suit. </p>
<p>Like NetApp&#8217;s suit against ZFS, this seems like a vanity project. Surely Cleversafe has more important things to invest in. If they don&#8217;t they&#8217;re in bigger trouble than we know.</p>
<p><strong>The StorageMojo take</strong><br />
The need for robust, inexpensive and massive storage has been a theme of StorageMojo&#8217;s for years. Object storage is the best solution to the problem of scale, while the kind of redundancy and end-to-end checksumming that Amplidata uses seems as robust as anything on the market today.</p>
<p>As for inexpensive, that is in the eye of the beholder, but Amplidata tells me that their newest storage node lists for less than $0.60/GB while consuming only 60 watts. That should be attractive to people running tape silos who want faster access and better redundancy.</p>
<p><strong>Courteous comments welcome, of course.</strong> I&#8217;m working with Amplidata to produce a video white paper on their technology, so stay tuned for more info on a promising company.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2012/04/17/amplidatas-distributed-object-store/&text=Amplidata's distributed object store" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2012/04/17/amplidatas-distributed-object-store/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Gridstore snags Geoff Barrall</title>
		<link>http://storagemojo.com/2012/01/10/gridstore-snags-geoff-barrall/</link>
		<comments>http://storagemojo.com/2012/01/10/gridstore-snags-geoff-barrall/#comments</comments>
		<pubDate>Tue, 10 Jan 2012 17:09:37 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Clusters]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>
		<category><![CDATA[SOHO/SMB]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2568</guid>
		<description><![CDATA[BlueArc and Drobo founder Geoff Barrall has a new perch: Gridstore, one of the companies I&#8217;ve been following for almost 3 years. Geoff is the new executive chairman. Formal announcement is expected this week. Gridstore&#8217;s concept is a low-cost scale-out NAS appliance designed for office environments. Each box is a small, low-power node with a [...]]]></description>
			<content:encoded><![CDATA[<p></p><p><a href="http://www.bluearc.com/" target="_blank">BlueArc</a> and <a href="http://www.drobo.com/" target="_blank">Drobo</a> founder Geoff Barrall has a new perch: <a href="http://gridstore.com/" target="_blank">Gridstore</a>, one of the companies I&#8217;ve been <a href="http://www.zdnet.com/blog/storage/google-style-storage-comes-to-the-smb/1323" target="_blank">following</a> for almost 3 years. Geoff is the new executive chairman. Formal announcement is expected this week.</p>
<p>Gridstore&#8217;s concept is a low-cost scale-out NAS appliance designed for office environments. Each box is a small, low-power node with a couple of TB. Stack &#8216;em for as much redundancy, capacity and performance you want.</p>
<p>Think of it as the consumerization of hyper-scale technology. <a href="http://www.nutanix.com/" target="_blank">Nutanix</a> writ small.</p>
<p><strong>Gridstore details</strong><br />
Gridstore is offering a low-cost, scale-out network file server for $500 a node. That is too cheap for the enterprise storage companies to sell directly.</p>
<p>Founded 5 years ago, Gridstore got a beta out in 2010, and have been shipping for well over a year. They are a Microsoft CIFS protocol file server, using Microsoft’s storage server software. Running on small, 25 watt Atom-based boxes, a 6 node configuration is the size of a bread box.</p>
<p> Like other scale-out NAS systems, the Gridstore NAS has no single point of failure and can survive multiple node failures without going down or losing data.</p>
<p>They call their redundancy scheme RAIDg. When you set up a volume you dial in how many faults you want to survive and the software handles the rest.</p>
<p>Today the number of faults they can handle is limited to half the number of nodes minus one. If you have a 6 node configuration it can handle the loss of 2 nodes. They expect to relax that requirement in the future.</p>
<p><strong>The StorageMojo take</strong><br />
Haven&#8217;t spoken to Geoff about this, but Gridstore seems like a natural for him. If there&#8217;s a theme to his many endeavors, its making advanced NAS technology more accessible.</p>
<p>Gridstore fits the bill nicely. If there&#8217;s one complaint about Drobo, its the lack of box-level redundancy. Gridstore answers this objection, at a higher price point.</p>
<p>Drobo &#8211; over 200,000 units sold &#8211; has blazed a trail for bringing advanced storage technology to the masses at affordable prices. They may be the first, but as Gridstore and others demonstrate, they won&#8217;t be the last.</p>
<p><strong>Courteous comments welcome, of course.</strong> Hoping to make it to CES later this week. Readers: anyone I should make a point to see?</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2012/01/10/gridstore-snags-geoff-barrall/&text=Gridstore snags Geoff Barrall" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2012/01/10/gridstore-snags-geoff-barrall/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Ask StorageMojo: 80,000 mailboxes need help</title>
		<link>http://storagemojo.com/2011/11/02/ask-storagemojo-80000-mailboxes-need-help/</link>
		<comments>http://storagemojo.com/2011/11/02/ask-storagemojo-80000-mailboxes-need-help/#comments</comments>
		<pubDate>Wed, 02 Nov 2011 16:00:28 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>
		<category><![CDATA[SSD/Flash Disk]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2543</guid>
		<description><![CDATA[A StorageMojo reader has a problem. Can you help? Our mail hub (80,000+ mailboxes) is virtualized with vSphere 4.1 with Red Hat Enterprise Linux 5 x64 and Dovecot 2.0 [an open source IMAP/POP3 email server for Linux/UNIX-like systems]. We are using HP LeftHand Networks P4300 iSCSI storage in a &#8220;network RAID10 setup of RAID10 storage&#8221; [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>A StorageMojo reader has a problem. Can you help?</p>
<blockquote><p>
Our mail hub (80,000+ mailboxes) is virtualized with vSphere 4.1 with Red Hat Enterprise Linux 5 x64 and <a href="http://dovecot.org/index.html" target="_blank">Dovecot 2.0</a> [an open source IMAP/POP3 email server for Linux/UNIX-like systems]. We are using HP LeftHand Networks P4300 iSCSI storage in a &#8220;network RAID10 setup of RAID10 storage&#8221; for Dovecot indexes and multiple &#8220;networks RAID1 of RAID5 storage&#8221; for actual mailboxes.</p>
<p>This is my take: our Dovecot indexes are getting hammered with lots of small I/O requests, about 8,000 IOPS continuous during 8-working-hour days, 75% write. Indexes are fairly small (50 GB) and expected to grow to 100-150 GB, but need a lot of random I/O. We need real-time replication in storage (LeftHand is ok for us) and we think that SSD should shine in this situation. Bandwidth is not a problem (200-300 megabits of indexes traffic, but we need more IOPs).</p>
<p>The problem is the indexes, but our total mailbox capacity is expected to grow to 6 TB compressed using zlib compression in Dovecot.</p>
<p>We want to buy a storage appliance with the following requirements:</p>
<ul>
<li>Vsphere 4.1 &#038; 5 certified storage, VAAI enabled (if possible)</li>
<li>iSCSI (1 gbps)</li>
<li>High number of IOPS (at least 12,000+, most of them writes)</li>
<li>Small size (200 GB)</li>
<li>Fault tolerant (RAID, battery-backed write cache, power supply, fans, multiple gigabit uplinks, synchronous replication)</li>
<li>Cheap (less than $30k the full setup)</li>
</ul>
<p>We want to buy at the beginning of 2012. Any product that fits?
</p></blockquote>
<p><strong>The StorageMojo take</strong><br />
Suspect price will be the most significant limiter. But the respondent only needs index storage not the whole shooting match. He&#8217;s pretty happy with LeftHand for mailbox storage.</p>
<p>But if we can solve both problems for him, why not? If he should relax some constraint, feel free to suggest it.</p>
<p>He&#8217;ll be watching the comments, so if you have questions please ask them. I&#8217;ll be following the comments as well.</p>
<p><strong>Courteous comments welcome, of course.</strong> His email was edited for clarity.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2011/11/02/ask-storagemojo-80000-mailboxes-need-help/&text=Ask StorageMojo: 80,000 mailboxes need help " target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2011/11/02/ask-storagemojo-80000-mailboxes-need-help/feed/</wfw:commentRss>
		<slash:comments>47</slash:comments>
		</item>
		<item>
		<title>Dear StorageMojo: make NFS go fast!</title>
		<link>http://storagemojo.com/2010/12/10/dear-storagemojo-make-nfs-go-fast/</link>
		<comments>http://storagemojo.com/2010/12/10/dear-storagemojo-make-nfs-go-fast/#comments</comments>
		<pubDate>Fri, 10 Dec 2010 15:26:04 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2226</guid>
		<description><![CDATA[Most of us know what it is like when a relationship goes bad: the sinking feeling that this just isn&#8217;t going to work. Can this configuration be saved? Dear StorageMojo: I joined a company last year that is running Oracle 10g on a NetApp NAS/SAN. Immediately I asked why they were not using Clustering, Oracle [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Most of us know what it is like when a relationship goes bad: the sinking feeling that this just isn&#8217;t going to work. </p>
<p><strong>Can this configuration be saved?</strong><br />
Dear StorageMojo:</p>
<blockquote><p>
I joined a company last year that is running Oracle 10g on a NetApp NAS/SAN.</p>
<p>Immediately I asked why they were not using Clustering, Oracle RAC, Oracle ASM or Fiber Channel. No answer.</p>
<p>Fast fwd to a year later and they are asking me to deploy this to an I/O bound customer with hundreds of connections and lots of transactions to their DB over NFS.</p>
<p>Long story short it&#8217;s slow-w-w-w. They tried trunking multiple network connections. They tried tuning. They tried a bunch of stuff. And it&#8217;s still a dog.</p>
<p>How slow?</p>
<p>I have a screaming Dell r710 running a 7TB database attached over SAS to a set of MD3000 storage arrays. I am getting 450MBs&#8230;..and this barely suffices&#8230;..</p>
<p>The &#8220;new&#8221; system they showed me gets 50MBsec&#8230;the same screaming Dellr710 but connected over NFS (instead of SAS) to the NetApp NAS.</p>
<p>Do you have any suggestions?</p>
<p>Thank you for reading this nightmare.<br />
Bob
</p></blockquote>
<p>Poor Bob! He&#8217;ll be getting grief from the client for months, maybe years to come, unless this gets fixed.</p>
<p><strong>The StorageMojo take</strong><br />
Maybe Bob could have been better about developing a relationship with the guys configuring the systems. More questions, fewer conclusions, at first.</p>
<p>Suggestions to the customer for acceptance testing might be in order. </p>
<p>But there are 2 problems here:</p>
<ol>
<li>What to do now.</li>
<li>How to keep this from happening again.</li>
</ol>
<p>What would you suggest to Bob on either or both topics? I&#8217;ve asked him to watch the comments, so if more info would be useful, I hope he&#8217;ll provide it.</p>
<p><strong>Courteous comments <strike>welcome</strike> needed.</strong> When a multi-billion dollar near-sighted telescope can get sent into orbit, it is surprising more IT projects don&#8217;t go wrong.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2010/12/10/dear-storagemojo-make-nfs-go-fast/&text=Dear StorageMojo: make NFS go fast!" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2010/12/10/dear-storagemojo-make-nfs-go-fast/feed/</wfw:commentRss>
		<slash:comments>26</slash:comments>
		</item>
		<item>
		<title>Jack be Nimble</title>
		<link>http://storagemojo.com/2010/11/08/jack-be-nimble/</link>
		<comments>http://storagemojo.com/2010/11/08/jack-be-nimble/#comments</comments>
		<pubDate>Mon, 08 Nov 2010 21:43:03 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Backup]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=2197</guid>
		<description><![CDATA[Talked to Nimble Storage a few months ago. The 1st time they sounded cool and now I know why. What they do Nimble builds a converged storage appliance out of commodity hard drives and SSDs that offers high performance &#8211; is there any other kind? &#8211; and iSCSI, backup, a form of dedup and WAN [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Talked to Nimble Storage a few months ago. The 1st time they sounded cool and now I know why.</p>
<p><strong>What they do</strong><br />
Nimble builds a converged storage appliance out of commodity hard drives and SSDs that offers high performance &#8211; is there any other kind? &#8211; and iSCSI, backup, a form of dedup and WAN replication. The pitch is EqualLogic &#038; Data Domain merged into a single low-cost appliance. Only better.</p>
<ul>
<li>iSCSI + dedup</li>
<li>Capacity-optimized snapshots</li>
<li>SATA + flash instead of high-rpm drives</li>
<li>Can run off a remote snapshot</li>
</ul>
<p>EL &#038; DD sell a lot of kit, so this could work.</p>
<p><strong>Claim to fame</strong><br />
Cache Accelerated Sequential Layout is what Nimble calls their secret sauce.</p>
<p>CASL combines a variable block size, in-line compression, application-specific block sizes and checksum and compression data kept in the block header. They coalesce the blocks and only write in full stripes to disk.</p>
<p>The box has a large flash-based cache where the full stripe writes are also written, overcoming the small write performance hit that flash shares with parity raid. This also insures a high percentage of cache hits on the first read.</p>
<p>The system maintains an index of where all the blocks are written. Typically, this index is also held in flash for maximum lookup performance.</p>
<p><strong>App-specific block sizes</strong><br />
Nimble uses of variable block sizes to improve performance. For example, the last three versions of exchange have all used different block sizes. CASL recognizes the different versions of Exchange and dynamically adjusts its block size to the best fit.</p>
<p>They claim a 2x performance advantage on Exchange databases.</p>
<p><strong>Coalesce</strong><br />
They take the variable size blocks then coalesce those blocks into big chunks and write to flash. They write in large blocks &#8211; full block writes to flash and in full stripe writes to disk. Result: fast reads &#038; writes across both media</p>
<p>Their page sizes are variable but small, ranging from 4KB to 64KB. The greater granularity means that frequent snapshots are much smaller than large page size systems like EqualLogic.</p>
<p><strong>The StorageMojo take</strong><br />
There&#8217;s no reason that data protection should be separate from data storage. We&#8217;ve been moving towards integration since the CDP craze. </p>
<p>The average business wants to store and protect their data and they don&#8217;t want to spend much time or money on it. Nor should they. </p>
<p>With powerful commodity processors and nickel-per-GB storage there&#8217;s a huge market for a box that &#8211; or 2 or 3 boxes &#8211; that </p>
<ul>
<li>Stores terabytes of data</li>
<li>Protects that data with local replication and frequent snapshots</li>
<li>Auto-connects to cloud storage for DR and archiving</li>
<li>Doesn&#8217;t confuse users with LUNs and stripes</li>
<li>Offers Time Machine like data recovery to end users</li>
</ul>
<p>It will look like magic &#8211; as any sophisticated technology should &#8211; and you&#8217;ll buy it at Office Max. As with any volume product the key will be architecting to maximize the user experience at an affordable price point.</p>
<p>Nimble certainly has the right idea.</p>
<p><strong>Courteous comments welcome, of course.</strong> I don&#8217;t know which analyst the Nimble guys are blowing their money on, but it isn&#8217;t me.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2010/11/08/jack-be-nimble/&text=Jack be Nimble" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2010/11/08/jack-be-nimble/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Ask StorageMojo: EqualLogic vs LeftHand &amp; more</title>
		<link>http://storagemojo.com/2009/10/21/ask-storagemojo-equallogic-vs-lefthand-more/</link>
		<comments>http://storagemojo.com/2009/10/21/ask-storagemojo-equallogic-vs-lefthand-more/#comments</comments>
		<pubDate>Wed, 21 Oct 2009 20:29:11 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[NAS, IP, iSCSI]]></category>
		<category><![CDATA[SOHO/SMB]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=1658</guid>
		<description><![CDATA[These requests came in over the transom in the last couple of days. Maybe some StorageMojo readers have wisdom to share. I have a question I hope you can help me with. My boss asked me . . . to research HP Left-hand SANs and Dell Equallogic SANs. Do you have any special knowledge of [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>These requests came in over the transom in the last couple of days. Maybe some StorageMojo readers have wisdom to share. </p>
<blockquote><p>
I have a question I hope you can help me with.  My boss asked me . . . to research HP Left-hand SANs and Dell Equallogic SANs.  Do you have any special knowledge of these products and, if so, would you make an informal recommendation?
</p></blockquote>
<p>What say you, StorageMojo readers? If you evaluated both, why did you make the choice you did? Vendors welcome to comment, but please identify yourself as such. </p>
<p><strong>The StorageMojo take</strong><br />
AFAIK, both products are good iSCSI systems. Both are backed by major corporations. EqualLogic may be stronger in the channel today, but HP has channel chops as well. HP&#8217;s blade servers may be a more expandable platform, but EqualLogic&#8217;s software portfolio may be more affordable.</p>
<p>Translation: you could do worse than either of these. </p>
<p><strong>Part II</strong><br />
Another customer perplexity: service.</p>
<blockquote><p>
We have a pair of HP disk arrays, EVA 8000 and 6000 and I am looking for a consultant to help up with storage planning.  Do you do such work or could you recommend someone to me.  I am looking for someone who goes beyond just being a seller, I have plenty of potential sellers already.
</p></blockquote>
<p>The writer is in a small city in the Mountain West, so you should be used to working remotely with clients. No, not in Arizona.</p>
<p><strong>The StorageMojo take</strong><br />
HP folks may be wondering: why doesn&#8217;t he call HP? My guess: not big enough  for a direct engagement.</p>
<p><strong>Courteous comments welcome, of course.</strong>  </p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2009/10/21/ask-storagemojo-equallogic-vs-lefthand-more/&text=Ask StorageMojo: EqualLogic vs LeftHand & more" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2009/10/21/ask-storagemojo-equallogic-vs-lefthand-more/feed/</wfw:commentRss>
		<slash:comments>43</slash:comments>
		</item>
		<item>
		<title>Cloud storage for $100 a terabyte</title>
		<link>http://storagemojo.com/2009/09/01/cloud-storage-for-100-a-terabyte/</link>
		<comments>http://storagemojo.com/2009/09/01/cloud-storage-for-100-a-terabyte/#comments</comments>
		<pubDate>Tue, 01 Sep 2009 12:50:37 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Cloud computing & storage]]></category>
		<category><![CDATA[Future Tech]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=1555</guid>
		<description><![CDATA[Imagine cloud storage that didn&#8217;t cost much more than bare drives. High density storage with RAID 6 protection, reasonable bandwidth and web-friendly HTTPS access. And really, really cheap. Raw disk cost is only 5-10% of a RAID systems cost. The rest goes for corporate jets, sales commissions, 3 martini lunches, tradeshows, sheetmetal, 2 Intel x86 [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Imagine cloud storage that didn&#8217;t cost much more than bare drives. High density storage with RAID 6 protection, reasonable bandwidth and web-friendly HTTPS access.</p>
<p>And really, really cheap. </p>
<p>Raw disk cost is only 5-10% of a RAID systems cost. The rest goes for corporate jets, sales commissions, 3 martini lunches, tradeshows, sheetmetal, 2 Intel x86 mobos, obscene profits and a few pale and blinking engineers in a windowless lab who make the whole thing work. </p>
<p><strong>Storage for ascetics</strong><br />
But let&#8217;s say you didn&#8217;t want the 3 martini lunch or the barely-clad booth babes. All you want is really <strike>cheap</strike> economical, reasonably reliable storage.</p>
<p>You aren&#8217;t running the global financial system &#8211; what&#8217;s left of it anyway &#8211; and you don&#8217;t have a 2500 person call center hammering on a few dozen Oracle databases 7 x 24. No, you&#8217;re thinking a quiet cloud storage business for SMB&#8217;s, maybe backup and some light file sharing, that will give you a nifty little revenue stream with annual renewals so you can see trouble coming 12 months in advance.</p>
<p>Enough redundancy so when something breaks you can wait until morning to fix it instead of an 0300 pajama run to the data center. Easy connectivity so you aren&#8217;t blowing the savings on Cisco switches. </p>
<p><strong>Bliss</strong><br />
Well, you aren&#8217;t the only one. <a href="https://www.backblaze.com/" target="_blank">Backblaze</a>, a new online backup provider, designed the Storage Pod for their own use and are sharing it with everyone. They aren&#8217;t in the hardware business and I think they figured sharing it would be a nice little attention-getting device.</p>
<p>It worked for me.  Here&#8217;s the box &#8211; which they are using in production.</p>
<p><a href="http://storagemojo.com/wp-content/uploads//2009/08/backblaze_box.jpg"><img src="http://storagemojo.com/wp-content/uploads//2009/08/backblaze_box.jpg" alt="backblaze_box" title="backblaze_box" width="480" height="324" class="aligncenter size-full wp-image-1562" /></a></p>
<p>Here&#8217;s an exploded diagram with a simplified BOM:</p>
<p><a href="http://storagemojo.com/wp-content/uploads//2009/08/backblaze_storage_pod_bom.jpg"><img src="http://storagemojo.com/wp-content/uploads//2009/08/backblaze_storage_pod_bom.jpg" alt="backblaze_storage_pod_bom" title="backblaze_storage_pod_bom" width="480" height="670" class="aligncenter size-full wp-image-1563" /></a></p>
<p>And then there&#8217;s the (free) software. 64-bit Debian Linux, IBM&#8217;s open source JFS file system and HTTPS access. Put a stateless webserving front end on it and you&#8217;re good to go. Scale out the webserver and add storagepods to grow the system.</p>
<p><strong>The StorageMojo take</strong><br />
This isn&#8217;t general purpose or high-performance storage.  Nor is it backed by global network of 7 x 24 service professionals. But there are a lot of applications out there that just need a big bit bucket. This is it.</p>
<p>No one is manufacturing this for you either &#8211; which is a good thing. If you don&#8217;t know what you&#8217;re doing you can get in a lot of trouble with a lot of data real fast. Want to be the Bernie Madoff of cloud storage? </p>
<p>But the density is good, the performance is reasonable, the availability is decent and the price is right. This is a DC-3, not a 747. It is all you need for the right application.</p>
<p><strong>Courteous comments welcome, of course.</strong>  See the plans and get the box model in the  <a href="https://www.backblaze.com/petabytes-on-a-budget-how-to-build-cheap-cloud-storage.html" target="_blank">paper </a> Backblaze put together.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2009/09/01/cloud-storage-for-100-a-terabyte/&text=Cloud storage for $100 a terabyte" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2009/09/01/cloud-storage-for-100-a-terabyte/feed/</wfw:commentRss>
		<slash:comments>49</slash:comments>
		</item>
		<item>
		<title>Configure a 100 TB HD video infrastructure</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/</link>
		<comments>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/#comments</comments>
		<pubDate>Mon, 08 Jun 2009 01:20:37 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Clusters]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>
		<category><![CDATA[Video]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=1409</guid>
		<description><![CDATA[The video folks have an interesting set of problems: large needs; major bandwidth; time-critical collaboration; lots of metadata; and more. Like budgets. I do some video production myself and empathize. They are today where most of us will be in 10 years: lots of large files; local and remote sharing; processor and bandwidth intensive operations; [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>The video folks have an interesting set of problems: large needs; major bandwidth; time-critical collaboration; lots of metadata; and more. Like budgets. I do some <a href="http://www.youtube.com/user/StorageMojo" target="_blank">video production</a> myself and empathize.</p>
<p>They are today where most of us will be in 10 years: lots of large files; local and remote sharing; processor and bandwidth intensive operations; large archives of wanted and rarely accessed files.  Today high-end video folks are working at 2k, 4k and, sometimes, 8k video resolutions &#8211; and 10 years from now I wouldn&#8217;t be surprised if home users weren&#8217;t too.</p>
<p>What prompts this is a note I received from, well, I&#8217;ll let him introduce himself.</p>
<blockquote><p>
I have a boutique post-production company and I&#8217;m a filmmaker. We are small, under a dozen, but swell to a few times that size with freelancers on a project-by-project basis. Because we work with very high resolution media, we need a lot of space, and very high throughput to each user.  . . . [W]e&#8217;re all working with 2K and 4K media (300 and 1200MBps respectively to EACH user) and 3D animation rendering. . . . We use a mix of Linux, Windows, and OS X clients. In total, we could easily make use of 100TB+ right now, and prefer to stop archiving everything to tape and deleting it, but rather migrate to another tier of storage but keep in one global namespace with the tape just for disaster recovery. We also need security administration.</p>
<p>I can&#8217;t find a storage system that does all this. DataDirect Networks seems to be the du jour high-end storage for my industry, and supposing I&#8217;m willing to finance that big-ticket brand, they still don&#8217;t have a filing system answer. They&#8217;re suggesting StorNext or CXFS, and I know the multi-user scalability and expansion limitations well (can anybody say &#8220;forklift&#8221;?). </p>
<p>The closest I&#8217;ve come is Lustre. It seems like it would fit the bill nicely, especially since we&#8217;re savvy to integrate in-house, except that it is Linux only, and NFS/CIFS gateways don&#8217;t seem like a great idea. I keep hearing they&#8217;re working on at least a Windows client, but who knows when it will be ready?</p>
<p>Can you help at all? What have I overlooked? Doesn&#8217;t anyone make what I&#8217;m looking for?
</p></blockquote>
<p><strong>Short answer to last question:</strong><br />
No.</p>
<p><strong>Longer answer:</strong><br />
No. But there are workarounds.</p>
<p>For those new to video, here&#8217;s an abbreviated chart of some video rates in megabytes per second:<br />
<a href="http://storagemojo.com/wp-content/uploads//2009/06/video_data_rates1.png"><img src="http://storagemojo.com/wp-content/uploads//2009/06/video_data_rates1.png" alt="video_data_rates1" title="video_data_rates1" width="471" height="268" class="aligncenter size-full wp-image-1420" /></a> [Adapted from <a href="http://www.integritydatasystems.net/Video_Data_Rates.htm" target="_blank">Integrity Data Systems</a> which offers the whole chart. Aspect ratios and frame rates left out.]<br />
<strong>Update:</strong> Larry Jordan, a writer and trainer in video editing, graciously wrote to let me know that the above data rates are uncompressed &#8211; and that most production houses would use compressed data. The amount of compression varies based on the codec as Larry explains in this <a href="http://www.larryjordan.biz/articles/lj_video_data_rates.html" target="_blank">informative post</a>.<strong> End update.</strong></p>
<p><strong>Issue 1: Interconnects</strong><br />
GigE won&#8217;t even handle 32-bit RGB standard def video. And when you get into HD video it gets hairier fast. Trunk multiple GigE&#8217;s? 10GbE? 4x Infiniband? FC? eSATA or PCI-e direct attached storage? </p>
<p><strong>Issue 2: Virtualization</strong><br />
A single address space is a wonderful thing. You&#8217;ll need a software layer that clusters multiple boxes. You&#8217;ll also probably want to build an archive infrastructure that is distinct from your higher performance working set storage, but some vendors will disagree.</p>
<p>Likely software suspects include <a href="http://www.ibrix.com/" target="_blank">IBRIX</a>, <a href="http://www.parascale.com/" target="_blank">Parascale</a>, <a href="http://www.caringo.com/" target="_blank">Caringo</a>,  <a href="http://www.object-matrix.com/" target="_blank">MatrixStore</a>, <a href="http://www.bycast.com/" target="_blank">Bycast</a> and <a href="http://www.permabit.com/" target="_blank">Permabit</a>.</p>
<p>On the combined HW/SW side there&#8217;s <a href="http://www.panasas.com/" target="_blank">Panasas</a> and <a href="http://www.isilon.com/" target="_blank">Isilon</a>.  Something tells me there are some other options, like HP&#8217;s Extreme Data Storage 9100, that are also applicable. </p>
<p>Lustre is not a product I would recommend since it was designed for HPC, a market where PhDs work as sysadmins. Sun may have tamed it since they bought it, but it is a non-trivial piece of software. </p>
<p><strong>Come one, come all</strong><br />
StorageMojo readers are invited to offer their 2¢ worth. Architecting is non-trivial, especially if money is an object. </p>
<p><strong>Update:</strong><br />
Our interlocutor wrote in to add some detail:</p>
<blockquote>
<p>thanks for the response. Here&#8217;s some answers:</p>
<p> &#8211; We can manage expensive interfaces like 10GigE and Infiniband QDR. We&#8217;ve been paying for dual-channel 4Gb FC for the past few years, after all. I just want to also allow standard Gigabit connections to the cheap seats without a lot of complexity. So I guess the jargon for that would be &#8220;multiprotocol&#8221; switching?</p>
<p> &#8211; The large naming space might be a luxury. The fact is that jobs come in one of three general sizes, and we could have volumes of that size waiting to take on new jobs as they come in, so at least there is one namespace per job. As you said, capacity is cheap&#8230;</p>
<p> &#8211; Truth is I am pretty savvy, but other than that we have a lot of power desktop users but not sysadmin types. I contract some people with steady part-time work, but it has been our business model to try to keep as many of our full-time people on the creative and producing side as possible, and not in support/administration. </p>
<p>The one thing I don&#8217;t understand is what you say about Infiniband not being so great when there&#8217;s lots of node churn?</p>
<p>I know what you mean about DAS, but I think I&#8217;ve ruled out distributing the data through push/pull from a central repository. The fact is jobs just move to fast through here for that, and we often have about two seconds notice that we need to bring a certain job&#8217;s data to System X, Y or Z to do work on it. It&#8217;s very dynamic.</p>
<p>I see some brands in your blog post I haven&#8217;t checked on yet.</p>
<p>What turned me onto Lustre is that Frantic Films in London has deployed it. They&#8217;re the only ones AFAIK.<br />
<strong>End update.</strong></p>
</blockquote>
<p><strong>The StorageMojo take</strong><br />
Some thoughts on the infrastructure issues.</p>
<p>Capacity is cheap, network bandwidth is expensive. Raw SATA disk is less than $0.10/GB. 10GbE switch ports are over a grand apiece. Infiniband is better from a price/performance perspective, but not as friendly for networks where there is much node churn &#8211; unless that&#8217;s been fixed in the last few years.</p>
<p>Direct attached storage will give you the best performance &#8211; especially with 4k. The new PCI-e attached arrays from <a href="http://www.jmr.com/" target="_blank">JMR</a> and others can offer up to 4,000 MB/sec bandwidth. Stripe across 4 of those and you&#8217;ll be able to handle 8k.</p>
<p>Transaction processing is well on its way to niche status, like mainframes and hierarchical databases that once ruled the earth. It is a big file world out there and the files are getting bigger every year.</p>
<p><strong>Courteous comments welcome, of course.</strong>  I&#8217;ve done work for many of these folks &#8211; but not all &#8211; at one time or another. </p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/&text=Configure a 100 TB HD video infrastructure" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/feed/</wfw:commentRss>
		<slash:comments>28</slash:comments>
		</item>
		<item>
		<title>HP/LeftHand: cluster market shapes up</title>
		<link>http://storagemojo.com/2008/10/08/hplefthand-cluster-market-shapes-up/</link>
		<comments>http://storagemojo.com/2008/10/08/hplefthand-cluster-market-shapes-up/#comments</comments>
		<pubDate>Thu, 09 Oct 2008 01:33:04 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Clusters]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>
		<category><![CDATA[SOHO/SMB]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=966</guid>
		<description><![CDATA[Hewlett-Packard&#8217;s acquisition of the LeftHand Networks shows how cluster storage is going mainstream &#8211; and how HP plans to be right in the middle of it. First PolyServe and now LeftHand. This is about commodity-based clusters Not iSCSI or GigE or 10 GigE as a storage interconnect. Fibre Channel&#8217;s failure to move downmarket &#8211; and [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Hewlett-Packard&#8217;s acquisition of the LeftHand Networks shows how cluster storage is going mainstream &#8211; and how HP plans to be right in the middle of it. First <a href="http://storagemojo.com/2007/03/12/hps-bold-move-into-storage-clusters/" target="_blank">PolyServe</a> and now LeftHand. </p>
<p><strong>This is about commodity-based clusters</strong><br />
Not iSCSI or GigE or 10 GigE as a storage interconnect. Fibre Channel&#8217;s failure to move downmarket &#8211; and Infiniband&#8217;s similar problem &#8211; means GigE is the only game in town. </p>
<p>Reaching the huge, not currently imploding, SMB market requires meeting people where they live. SMBs don&#8217;t live in Fibre Channel glass houses. GigE isn&#8217;t ideal, but it&#8217;s cheap and it works.</p>
<p><strong>Did HP overpay?</strong><br />
$360 million isn&#8217;t pocket change, but it is only about 4x the $86 million investors put in. The investors get some nice coin, but it isn&#8217;t the 10-bagger they were hoping for. </p>
<p>Once the Lefties go through the interminable internal HP meat grinder, sales will grow rapidly. I suspect they weren&#8217;t up to Isilon&#8217;s $100M in sales &#8211; maybe $70M &#8211; but LeftHand was much closer to profitability. Net net: the price looks fair for a market leader in a high-growth market.</p>
<p><strong>HP vs EMC</strong><br />
Battle of the competing cluster storage visions. Polyserve handles files; LeftHand blocks. EMC&#8217;s Maui is aimed at large-scale distributed file storage, a utility that ISP&#8217;s might resell to SMBs, but nothing an SMB would implement on their own.</p>
<p>Which will win &#8211; and there&#8217;s room for both &#8211; rests on the answer to the question <a href="http://storagemojo.com/2008/09/18/are-there-economies-of-scale-in-storage/" target="_blank">Are there economies of scale in storage?</a>. If there are, small-scale clusters sales will suffer and Maui should win. </p>
<p><strong>The StorageMojo take</strong><br />
This is cluster storage market skirmishing, not a pitched battle. That will come but right now everyone is feeling their way, coming into the market from different directions, waiting to see what clicks. </p>
<p>Right now though, HP seems to have the strongest position. XIV is too new; Maui even newer; Lustre too complex; Isilon is digging out of a big hole. HP has the pole position with implementable products today and the services to back them up. Should be a powerful combination.</p>
<p><strong>Courteous comments welcome, of course.</strong> Disclosure: I&#8217;ve done some work for HP, Isilon and Sun.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2008/10/08/hplefthand-cluster-market-shapes-up/&text=HP/LeftHand: cluster market shapes up" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2008/10/08/hplefthand-cluster-market-shapes-up/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Our changing file workloads</title>
		<link>http://storagemojo.com/2008/09/09/our-changing-file-workloads/</link>
		<comments>http://storagemojo.com/2008/09/09/our-changing-file-workloads/#comments</comments>
		<pubDate>Wed, 10 Sep 2008 04:34:49 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=928</guid>
		<description><![CDATA[StorageMojo has long held the view that our storage workloads are changing: more file storage, less block storage; larger file sizes; and cooler data. While all the indicators said this was happening it&#8217;s good to find a study that confirmed this intuition. In the Measurement And Analysis Of Large-Scale Network File System Workloads (pdf) researchers [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>StorageMojo has long held the view that our storage workloads are changing: more file storage, less block storage; larger file sizes; and cooler data. While all the indicators said this was happening it&#8217;s good to find a study that confirmed this intuition.</p>
<p>In the <a href="http://www.ssrc.ucsc.edu/Papers/leung-usenix08.pdf" target="_blank">Measurement And Analysis Of Large-Scale Network File System Workloads</a> (pdf) researchers Andrew W. Leung and Ethan L. Miller from UC Santa Cruz and Shankar Pasupathy and Garth Goodson of Netapp measured 2 large file servers for 4 months. Their results are worth reviewing, since so many of the optimizations in storage infrastructures rely on workload assumptions. </p>
<p><strong>Unstudied CIFS</strong><br />
The authors point out that there have been no major studies of the CIFS protocol, odd since it is the default on Windows systems. Furthermore, the last major study of network file loads was performed in 2001 &#8211; seven years ago &#8211; an interval in which average this drive sizes have gone from 20 GB to 500 and network speeds from 100 MB to 1 GB. </p>
<p>Most surprising, however is that no published study has ever analyzed large-scale enterprise file system workloads. Researchers have studied workloads closer to home: university and engineering workloads. </p>
<p><strong>Enterprise workloads</strong><br />
One was a midrange file server with 3 TB of capacity with almost 3 TB used by over 1000 marketing sales and finance employees. The second server was a high end Netapp filer with 28 TB capacity &#8211; 19 TB used &#8211; supporting 500 engineering employees. </p>
<p>Yes, marketers, engineers get the good toys. You can cry about it over your next 3 martini lunch.</p>
<p>Some significant differences from prior studies:</p>
<ul>
<li><strong>Workloads more write oriented.</strong> Read/write byte ratios and are now only 2 to 1 compared to the 4-1 or higher ratios reported earlier.</li>
<li><strong>Workloads less read-centric.</strong> Read/write workloads are now 30x more common.</li>
<li><strong>Most bytes transferred sequentially.</strong> These runs are 10x the length found in the old studies.</li>
<li><strong>Files 10x bigger.</strong></li>
<li><strong>Files live 10x longer.</strong> Less than half are deleted within a day of creation.</li>
</ul>
<p><strong>Cool new findings</strong></p>
<ul>
<li><strong>Files rarely re-opened. </strong>Over 66% are re-opened once and 95% fewer than 5 times.</li>
<li><strong>Over 60% of file re-opens are within a minute of the first open.</strong></li>
<li><strong>Less than 1% of clients account for 50% of requests.</strong></li>
<li><strong>Infrequent file sharing.</strong> Over 76% of files are opened by just 1 client.</li>
<li><strong>Concurrent file sharing very rare.</strong> As the prior point suggests, only 5% of files are opened by multiple clients and 90% of those are read only.</li>
<li><strong>Most file types have no common access pattern.</strong></li>
</ul>
<p>And there&#8217;s this: <strong>over 90% of the active storage was untouched during the study.</strong> That makes it official: data is getting cooler.</p>
<p>Another interesting finding: 91% of VMWare Virtual Disk (vmdk) files accesses were small sequential reads &#8211; not the larger sequential accesses I&#8217;d expect.</p>
<p><strong>The StorageMojo take</strong><br />
The writers rightly suggest that given the rarity of file reads after creation it makes sense to migrate files to cheap storage sooner than later.</p>
<p>Perhaps primary file storage should be thought of as a large FIFO buffer &#8211; tossing 3 month old files to an archive for long-term storage. A data flow architecture instead of a series ever-larger buckets.</p>
<p>Kudos to NetApp and UCSC for this work. It seems like NetApp has been doing the best job of leveraging academic researchers lately. I&#8217;d like to see them get more marketing mileage out of their good work.</p>
<p><strong>Courteous comments welcome, of course.</strong>  </p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2008/09/09/our-changing-file-workloads/&text=Our changing file workloads" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2008/09/09/our-changing-file-workloads/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
		</item>
		<item>
		<title>Roadrunner&#8217;s backing store</title>
		<link>http://storagemojo.com/2008/06/11/roadrunners-backing-store/</link>
		<comments>http://storagemojo.com/2008/06/11/roadrunners-backing-store/#comments</comments>
		<pubDate>Wed, 11 Jun 2008 22:40:46 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Clusters]]></category>
		<category><![CDATA[Disk]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>
		<category><![CDATA[SAN, FC]]></category>

		<guid isPermaLink="false">http://storagemojo.com/?p=719</guid>
		<description><![CDATA[I wrote a short piece on ZDnet about Los Alamos National Labs new Cell Broadband Engine based supercomputer, Roadrunner. With ~14k v.3 Cell processors &#8211; an earlier version powers the PS3 game console &#8211; and another ~7k dual core Opterons, the Roadrunner&#8217;s ~3,250 compute nodes pack a lot of compute cycles. The key compute element [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>I wrote a <a href="http://blogs.zdnet.com/storage/?p=332" target="_blank">short piece</a> on ZDnet about Los Alamos National Labs new Cell Broadband Engine based supercomputer, Roadrunner. With ~14k v.3 Cell processors &#8211; an earlier version powers the PS3 game console &#8211;  and another ~7k dual core Opterons, the Roadrunner&#8217;s ~3,250 compute nodes pack a lot of compute cycles.</p>
<p>The key compute element is the new version of the PS3 chip &#8211; called a PowerXCell 8i Processor &#8211; features 8x faster double-precision floating point and over 25 GB/sec of memory bandwidth. And it can address 64 GB RAM. There are 4 8i&#8217;s per compute node.</p>
<p>Nothing I read mentioned the disk storage &#8211; until the friendly Panasas PR person suggested I talk to Larry Jones, VP Product Marketing. Panasas is providing the back end storage for Roadrunner.</p>
<p>I did, and here&#8217;s what I learned.</p>
<p><strong>LANL storage infrastructure</strong><br />
LANL&#8217;s 6 supercomputers + Roadrunner share the Panasas storage through LANL-developed IO nodes. While Roadrunner itself uses dual-data-rate 4x Infiniband for internode communication, the I/O nodes attach to Panasas through trunked GigE.</p>
<p>The advantage of the I/O nodes is that the entire Panasas storage pool is available to each supercomputer. Lots of bandwidth.</p>
<p>Roadrunner currently has about 80TB of RAM, roughly 24 GB per compute node. That works out to about 4 GB RAM per processor. </p>
<p>The jobs these machines run are huge. A simulation can run 6 months or more. Depending on criticality a job gets checkpointed every hour or maybe once a day. </p>
<p>The Panasas installation at LANL, begun in 2003, is currently 2 PB. Assuming an average of 500 GB drives, that means 4,000 disk drives.</p>
<p>Panasas uses 5 trunked GigE links to each of the 8 controllers in a single rack. They are now in beta for 10 GigE, which reduce link count from 40 to 8 per rack while doubling bandwidth.</p>
<p>The hot rodders at LANL should like that.</p>
<p><strong>The StorageMojo take</strong><br />
Roadrunner&#8217;s 80 TB RAM is a sizable storage infrastructure in its own right. Keeping it fed and backed up is a major job. </p>
<p>Consumerization of IT is a common concept &#8211; but what we see here is the consumerization of HPC: Playstation CPUs; SATA drives; Linux OS; air cooling. The old model of highly customized kit for HPC is dead.</p>
<p>Which is a good thing for the rest of us. We get some of the smartest people in computing working on platforms that we might also use, developing applications that otherwise would never be available to the consumer market. </p>
<p>I&#8217;ll never run molecular dynamics codes, but maybe my kids will. After  all, I can now edit feature length movies on my desktop. Who would have believed that just 20 years ago?</p>
<p><strong>Comments welcome, of course.</strong> Disclosure: I did some work for Panasas last year and &#8211; who knows? &#8211; might do some more in the future. I like the team and the way they are pushing pNFS.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2008/06/11/roadrunners-backing-store/&text=Roadrunner's backing store" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2008/06/11/roadrunners-backing-store/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Cleversafe&#8217;s dispersed storage network</title>
		<link>http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/</link>
		<comments>http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/#comments</comments>
		<pubDate>Mon, 03 Mar 2008 20:10:36 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[Future Tech]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>
		<category><![CDATA[Security & Public Policy]]></category>

		<guid isPermaLink="false">http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/</guid>
		<description><![CDATA[I had a con call with Chris Gladwin and Russ Kennedy of Cleversafe a couple of weeks ago. They&#8217;ve come to market with a product line that seeks to deliver: Massive scalability to meet growing digital content requirements Unprecedented Security and Privacy for critical digital assets Survivability against disasters, dishonesty and time Extremely cost-effective infrastructure [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>I had a con call with Chris Gladwin and Russ Kennedy of Cleversafe a couple of weeks ago. They&#8217;ve come to market with a product line that seeks to deliver:</p>
<ul>
<li>Massive scalability to meet growing digital content requirements</li>
<li>Unprecedented Security and Privacy for critical digital assets</li>
<li>Survivability against disasters, dishonesty and time</li>
<li>Extremely cost-effective infrastructure compared to traditional methods</li>
</ul>
<p>That&#8217;s a quote from their pitch.</p>
<p><strong>Cleversafe&#8217;s product line</strong><br />
Cleversafe, IIRC, started as a software company, but their announced products come in nice rack-mountable boxes. There are 3 of them:</p>
<ul>
<li>CS Slicestor &#8211; Dispersed Storage server &#8211; $11.3k</li>
<li>CS Accesser &#8211; Dispersed Storage router &#8211; $12.3k</li>
<li>CS Manager &#8211; Dispersed Storage network manager &#8211; $12.3k</li>
</ul>
<p>The Slicestor is a 1U storage server containing 4 disks. The Accessor slices up the data and distributes it &#8211; think slice router. The Manager works out of band to monitor and manage the storage network components.</p>
<p>I assume the pricing includes some room for volume discounts. There is an open-source version (c. 2006) of the software. The company intends to offer a software-only version as well.</p>
<p><strong>Why hardware?</strong><br />
The Conventional Wisdom in VC circles is that tin-wrapped software ramps revenues faster &#8211; hey, you&#8217;re selling tin + bits &#8211; at the cost of lower margins and loss of focus. </p>
<p>Qualifying hardware is non-trivial; so you tend to stay on one platform longer than you should. At liquidity event time, software companies fetch higher multiples, so it may be a net loss. VCs live by the Golden Rule: he who has the gold makes the rules.</p>
<p><strong>What it does</strong><br />
Cleversafe has an iSCSI or block storage interface. It takes the data, slices it into small pieces using <a href="http://www.cleversafe.org/dispersed-storage/idas" target="_blank">Information Dispersal Algorithms</a> and then ships the slices off to storage either locally or around the world.</p>
<p>In the latest version you can specify how many slices the system makes and how many slices are required to rebuild the data. If you have 11 data centers around the world, you can specify that, say, 6 are required to recreate the data. </p>
<p>You could lose access to 5 data centers and still recover. If the local controlling authority busts into 3 or 4 data centers, they get nothing. Pretty cool if you worry about corrupt government officials getting hold of your company secrets.</p>
<p>The company is planning on adding FTP, CIFS and NFS in the fullness of time.</p>
<p><strong>How well it works</strong><br />
Cleversafe claims that given sufficient low-latency bandwidth the dispersed storage is as fast as a local disk. That&#8217;s a tall order, but for now I&#8217;ll take their word for it. </p>
<p><strong>Who should buy it?</strong><br />
The company is aiming the Dispersed Storage Network at ISPs to offer as a service and multinationals with round the clock operations and critical data.</p>
<p><strong>How it works</strong><br />
Cleversafe uses Cauchy Reed Solomon erasure codes to slice and dice the data. These codes have several advantages:</p>
<ul>
<li>More capacity efficient and failure tolerant than parity codes</li>
<li>Doesn&#8217;t require a license</li>
<li>Code and decode are faster than other stack operations</li>
</ul>
<p>If you&#8217;d like to play with Cauchy Reed Solomon, check out Dr. Jim Plank&#8217;s software <a href="http://www.cs.utk.edu/~plank/plank/www/software.html" target="_blank">page</a> which includes </p>
<blockquote><p>
. . . Reed-Solomon coding, Cauchy Reed-Solomon coding, general bit-matrix coding, Reed-Solomon coding optimized for RAID-6, and Liberation coding. The documentation provides some tutorial material on matrix and bit-matrix based erasure coding.
</p></blockquote>
<p>I met the good doctor at FAST, where he was delighted to find that Clevesafe &#8211; also a FAST presenter &#8211; was using techniques he&#8217;d worked on a decade ago.</p>
<p><strong>The StorageMojo take</strong><br />
I&#8217;m impressed with what Cleversafe has done. They will look even smarter after EMC&#8217;s Hulk/Maui announcement this spring. I suspect they&#8217;ll be bought by year&#8217;s end.</p>
<p>Kudos to the Cleversafe team.</p>
<p><strong>Comments welcome, of course.</strong></p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/&text=Cleversafe's dispersed storage network" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2008/03/03/cleversafes-dispersed-storage-network/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>What&#8217;s with Isilon?</title>
		<link>http://storagemojo.com/2008/02/21/whats-with-isilon/</link>
		<comments>http://storagemojo.com/2008/02/21/whats-with-isilon/#comments</comments>
		<pubDate>Thu, 21 Feb 2008 22:27:26 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Clusters]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>

		<guid isPermaLink="false">http://storagemojo.com/2008/02/21/whats-with-isilon/</guid>
		<description><![CDATA[They haven&#8217;t reported financials for almost 3 quarters. Their stock is trading at about 20% of its peak. They fired their CEO and put founder Sujal Patel in his place. And NetApp was trying to strangle baby Isilon (see NetApp filers for $1/GB?) in its crib. Are they goners? I don&#8217;t think so. I&#8217;ve been [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>They haven&#8217;t reported financials for almost 3 quarters. Their stock is trading at about 20% of its peak. They fired their CEO and put founder Sujal Patel in his place. And NetApp was trying to strangle baby Isilon (see <a href="http://storagemojo.com/2007/10/22/netapp-filers-for-1gb/" target="_blank">NetApp filers for $1/GB?</a>) in its crib.</p>
<p>Are they goners?</p>
<p><strong>I don&#8217;t think so.</strong><br />
I&#8217;ve been trying to read the tea leaves on the Peter van Oppen&#8217;s decision to join the board earlier this month.</p>
<p>Peter led the tape library company ADIC, also based in the Seattle area, for 12 years until its sale to Quantum. ADIC out-innovated Quantum &#8211; saddled with a cranky and slow DLT development group &#8211; in libraries and software as well.</p>
<p>If you think the folks who buy storage arrays are conservative, you haven&#8217;t sold any tape libraries. It is a tough market and ADIC did well.</p>
<p><strong>So why would van Oppen join a sinking ship?</strong><br />
That&#8217;s why I don&#8217;t think Isilon is sinking. An external audit team is reviewing Isilon&#8217;s accounting to ensure that any financial dirty laundry &#8211; say, hypothetically, channel stuffing &#8211; gets cleaned up. They&#8217;ve been at it for months and must be about done. </p>
<p><strong>The StorageMojo take</strong><br />
Based on the Isilon <a href="http://www.isilon.com/news/?sub=press&#038;page=press&#038;release=159" target="_blank">press release</a> and pure speculation, here&#8217;s what I think is going down:</p>
<ul>
<li>Peter exercised some due diligence before accepting the directorship and isn&#8217;t terribly worried about the basic health of the company</li>
<li>After he gets up to speed on company operations, he assumes the CEO role by July</li>
<li>Sujal happily goes back to one of the best jobs in any company: CTO and Founder while the stock climbs in value</li>
</ul>
<p>However it goes down, getting Peter on board is a real plus. Storage experience is thin in Seattle. Isilon has lots of smart people, but the storage market has many unique wrinkles that networking or software folks take a long time to learn.</p>
<p><strong>Comments welcome, as always.</strong> Disclosure: I met Sujal 7 years ago and I&#8217;ve done some work for Isilon. I hope they do well.</p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2008/02/21/whats-with-isilon/&text=What's with Isilon?" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2008/02/21/whats-with-isilon/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Isilon increases their IQ</title>
		<link>http://storagemojo.com/2008/01/28/isilon-increases-their-iq/</link>
		<comments>http://storagemojo.com/2008/01/28/isilon-increases-their-iq/#comments</comments>
		<pubDate>Tue, 29 Jan 2008 01:09:02 +0000</pubDate>
		<dc:creator>Robin Harris</dc:creator>
				<category><![CDATA[Clusters]]></category>
		<category><![CDATA[NAS, IP, iSCSI]]></category>

		<guid isPermaLink="false">http://storagemojo.com/2008/01/28/isilon-increases-their-iq/</guid>
		<description><![CDATA[Despite being written off for dead . . . Isilon&#8217;s been putting their IPO money to good use: engineering the next gen of their platform that they&#8217;ve named the X-series. In the meantime they&#8217;ve been adding customers &#8211; over 600 so far &#8211; and they have 60 customers running the new kit. Moving from an [...]]]></description>
			<content:encoded><![CDATA[<p></p><p><strong>Despite being written off for dead . . . </strong><br />
<a href="http://www.isilon.com/products/index.php?sub=platforms&#038;page=platform_overview" target="_blank">Isilon&#8217;s</a> been putting their IPO money to good use: engineering the next gen of their platform that they&#8217;ve named the X-series. In the meantime they&#8217;ve been adding customers &#8211; over 600 so far &#8211; and they have 60 customers running the new kit.</p>
<p>Moving from an aging single-core Xeon to a dual-core Xeon &#8211; the second core isn&#8217;t turned on yet &#8211; with faster busses and more cache speeds things up. They claim up to 60% faster performance, 20% less power and heat and 10 GigE readiness. Once they get their software dual-core aware they&#8217;ll have another nice boost to offer.</p>
<p><strong>The StorageMojo take</strong><br />
Turning over the platform more rapidly than traditional array vendors do is a good strategy. It keeps the competition off-balance and gives you something new to tell customers. What good is commodity hardware if you don&#8217;t follow Moore&#8217;s law?</p>
<p>That said, Isilon&#8217;s scale out architecture is the real differentiator vs NetApp and other traditional filers. More bang for the buck just underscores the differences.</p>
<p><strong>Comments welcome.</strong></p>
<div class="twttr_button">
				<a href="http://twitter.com/share?url=http://storagemojo.com/2008/01/28/isilon-increases-their-iq/&text=Isilon increases their IQ" target="_blank" title="Click here if you liked this article.">
					<img src="http://storagemojo.com/wp-content/plugins/twitter-plugin/images/twitt.gif" alt="Twitt" />
				</a>
			</div>]]></content:encoded>
			<wfw:commentRss>http://storagemojo.com/2008/01/28/isilon-increases-their-iq/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
	</channel>
</rss>

