<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Configure a 100 TB HD video infrastructure</title>
	<atom:link href="http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/</link>
	<description>Data storage info &#38; analysis</description>
	<lastBuildDate>Tue, 07 Feb 2012 16:02:02 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Rafael Morbeck</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-207725</link>
		<dc:creator>Rafael Morbeck</dc:creator>
		<pubDate>Sat, 23 Jan 2010 02:48:16 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-207725</guid>
		<description>I&#039;m a cinema student from Brazil, 24yo. I think u should see this:
http://www.ramsan.com/products/ramsan-6200.htm

I/Os Per Second
5,000,000+
 
Capacity
100 TB of Flash
 
Bandwidth
60 GB per second
 
Latency
Writes: 80 microseconds
Reads: 250 microseconds</description>
		<content:encoded><![CDATA[<p>I&#8217;m a cinema student from Brazil, 24yo. I think u should see this:<br />
<a href="http://www.ramsan.com/products/ramsan-6200.htm" rel="nofollow">http://www.ramsan.com/products/ramsan-6200.htm</a></p>
<p>I/Os Per Second<br />
5,000,000+</p>
<p>Capacity<br />
100 TB of Flash</p>
<p>Bandwidth<br />
60 GB per second</p>
<p>Latency<br />
Writes: 80 microseconds<br />
Reads: 250 microseconds</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jean</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-204065</link>
		<dc:creator>Jean</dc:creator>
		<pubDate>Thu, 09 Jul 2009 18:56:20 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-204065</guid>
		<description>I would go with Sun QFS servers over standard 10Gbits links switches.  This is similar to what Sun have done at few places in this market segment.

Sun has won few TV stations with their Unified Storage into editing sections with Appel FCP or Avid.  A single 7410 with 128GB ram with few SSD head can do 1.2 to 5.5GB/sec depending file size and number of disks behind.  No complex setup like QFS or any similar disk sharing here.  Huge saving.</description>
		<content:encoded><![CDATA[<p>I would go with Sun QFS servers over standard 10Gbits links switches.  This is similar to what Sun have done at few places in this market segment.</p>
<p>Sun has won few TV stations with their Unified Storage into editing sections with Appel FCP or Avid.  A single 7410 with 128GB ram with few SSD head can do 1.2 to 5.5GB/sec depending file size and number of disks behind.  No complex setup like QFS or any similar disk sharing here.  Huge saving.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Brue</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-203945</link>
		<dc:creator>Jeff Brue</dc:creator>
		<pubDate>Wed, 08 Jul 2009 02:34:30 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-203945</guid>
		<description>I technically manage a 40 person post facility in socal. I&#039;ve given up on storage vendors actually doing what makes sense for this space. Thte bigger problem though as usual is knowledge about the medium. IE those data rates are great.... but what about when they&#039;re DPX files and you have to worry not only about throughput but IOPS...

So with that said I&#039;m about to embark on a dyi with SSD. I&#039;m projecting 6 workstations at 10 bit 4K with intel SSD&#039;s. One storage vendor put a price tag on that throughput at north of half a million... the DYI at under 40.</description>
		<content:encoded><![CDATA[<p>I technically manage a 40 person post facility in socal. I&#8217;ve given up on storage vendors actually doing what makes sense for this space. Thte bigger problem though as usual is knowledge about the medium. IE those data rates are great&#8230;. but what about when they&#8217;re DPX files and you have to worry not only about throughput but IOPS&#8230;</p>
<p>So with that said I&#8217;m about to embark on a dyi with SSD. I&#8217;m projecting 6 workstations at 10 bit 4K with intel SSD&#8217;s. One storage vendor put a price tag on that throughput at north of half a million&#8230; the DYI at under 40.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Chuck</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-203459</link>
		<dc:creator>Chuck</dc:creator>
		<pubDate>Thu, 25 Jun 2009 21:00:34 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-203459</guid>
		<description>For years (literally) I&#039;ve whined to folks that its all about the Interconnect. Not that I&#039;m a prophet, on the contrary I&#039;m a systems guy (very un-prophet like). However when you start talking about tera-byte disk drives behind a channel the manufacturer claims will run flat out at 105MB/sec (see Disk Drive data sheets) but you only get to ask for maybe 100 or so different pieces of data from that huge pool every second.  The problem isn&#039;t storage any more, the problem is making sure what you need is already in RAM (or at least a whole lot closer to the interconnect than sitting dead on a platter 180 degrees away from being visible.)</description>
		<content:encoded><![CDATA[<p>For years (literally) I&#8217;ve whined to folks that its all about the Interconnect. Not that I&#8217;m a prophet, on the contrary I&#8217;m a systems guy (very un-prophet like). However when you start talking about tera-byte disk drives behind a channel the manufacturer claims will run flat out at 105MB/sec (see Disk Drive data sheets) but you only get to ask for maybe 100 or so different pieces of data from that huge pool every second.  The problem isn&#8217;t storage any more, the problem is making sure what you need is already in RAM (or at least a whole lot closer to the interconnect than sitting dead on a platter 180 degrees away from being visible.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dustin</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-203064</link>
		<dc:creator>Dustin</dc:creator>
		<pubDate>Tue, 16 Jun 2009 14:40:52 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-203064</guid>
		<description>Part of what is so daunting about this is understanding the jargon and what all of these things do.  I was interested in this article because I&#039;m in a situation similar to your sample subject, but I got lost when I saw &quot;Issue 1: Interconnects&quot; because I have no idea what that is.  Ugh...</description>
		<content:encoded><![CDATA[<p>Part of what is so daunting about this is understanding the jargon and what all of these things do.  I was interested in this article because I&#8217;m in a situation similar to your sample subject, but I got lost when I saw &#8220;Issue 1: Interconnects&#8221; because I have no idea what that is.  Ugh&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joe Kraska</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-202907</link>
		<dc:creator>Joe Kraska</dc:creator>
		<pubDate>Fri, 12 Jun 2009 22:30:43 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-202907</guid>
		<description>Well,

Panasas is remarkably affordable these days, offering most of the benefits of Lustre in a turnkey appliance form factor, without the baggage. At the prices Panasas is able to offer, I&#039;d suggest you consider only tiering off old data, and do it in a separate silo: i.e, don&#039;t try to get fancy. Caveat, they have no Mac driver, and their Windows driver is not QUITE out yet.

DDN is entering this market with a clustered storage appliance also.  It, too, will be an easy to use turnkey clustered storage appliance...

Others have pointed out Isilon, which is the easiest to use of the bunch, but cannot approach the performance requirements that the first two can. Note that if you don&#039;t need the highest performance that Panasas and Lustre offer you, you should do some thinkin&#039;: when Isilon says they are the easiest enterprise storage to deploy and use, they ain&#039;t lyin&#039;.

As for ILM, I might suggest you just go for some software that moves older files over to something cheap when the time is right. I think IBM will happily sell you TSM for that... and it&#039;s cheap.

So, make your own:

Put Panasas, Isilon, DDN in Tier 1
Put a Tape Robot (Sun, IBM, Quantum/ADIC) in Tier 3

Notice I do not list a Tier 2. I do NOT think this is needed. I believe you can negotiate Panasas, Isilon, DDN, and the like, to be close enough to Tier 2 prices so as to not need a classical Tier 2 at all...

Joe.</description>
		<content:encoded><![CDATA[<p>Well,</p>
<p>Panasas is remarkably affordable these days, offering most of the benefits of Lustre in a turnkey appliance form factor, without the baggage. At the prices Panasas is able to offer, I&#8217;d suggest you consider only tiering off old data, and do it in a separate silo: i.e, don&#8217;t try to get fancy. Caveat, they have no Mac driver, and their Windows driver is not QUITE out yet.</p>
<p>DDN is entering this market with a clustered storage appliance also.  It, too, will be an easy to use turnkey clustered storage appliance&#8230;</p>
<p>Others have pointed out Isilon, which is the easiest to use of the bunch, but cannot approach the performance requirements that the first two can. Note that if you don&#8217;t need the highest performance that Panasas and Lustre offer you, you should do some thinkin&#8217;: when Isilon says they are the easiest enterprise storage to deploy and use, they ain&#8217;t lyin&#8217;.</p>
<p>As for ILM, I might suggest you just go for some software that moves older files over to something cheap when the time is right. I think IBM will happily sell you TSM for that&#8230; and it&#8217;s cheap.</p>
<p>So, make your own:</p>
<p>Put Panasas, Isilon, DDN in Tier 1<br />
Put a Tape Robot (Sun, IBM, Quantum/ADIC) in Tier 3</p>
<p>Notice I do not list a Tier 2. I do NOT think this is needed. I believe you can negotiate Panasas, Isilon, DDN, and the like, to be close enough to Tier 2 prices so as to not need a classical Tier 2 at all&#8230;</p>
<p>Joe.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Thierry</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-202876</link>
		<dc:creator>Thierry</dc:creator>
		<pubDate>Thu, 11 Jun 2009 12:49:23 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-202876</guid>
		<description>There are 3 well known companies that provide really good solutions tailored for broadcast/film industries.
1) You&#039;ve got Quantel with its Genetic Engineering solution.
2) You&#039;ve got Omneon with its MediaGrid solution already mentionned by Steve.
3) you&#039;ve got Avid with Unity ISIS solution.</description>
		<content:encoded><![CDATA[<p>There are 3 well known companies that provide really good solutions tailored for broadcast/film industries.<br />
1) You&#8217;ve got Quantel with its Genetic Engineering solution.<br />
2) You&#8217;ve got Omneon with its MediaGrid solution already mentionned by Steve.<br />
3) you&#8217;ve got Avid with Unity ISIS solution.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Jones</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-202874</link>
		<dc:creator>Steve Jones</dc:creator>
		<pubDate>Thu, 11 Jun 2009 12:43:39 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-202874</guid>
		<description>Having looked again I&#039;d agree -0 that 4K stuff is immense. maybe you can aggregate 10GBe ports but I doubt any software stack would handle the data volumes in a fully shared system. The performacne figures available for NFS streaming appear to be for the combined output for multiple clients and not single streaming speed. 

The data volumes here are seriously crazy - like my 12MP DSLR taking 30 frames a second. At 1.2GBps per second then that&#039;s over 1TB just for 15 minutes.

But this also shows the problem with spinning disks - 1.2GBps would take perhaps 20 of them in parallel to have a chance of sustaining just one stream. Pulling 60MBps reliably off of 20 devices all at the same time is difficult enough. Add into that contention through multiple streams, small random I/Os for file system, meta-data and so on and there would have to be many dozens of disks. I suspect that the strain of multiple streams and all the buffering going on could also saturate the memory bandwidth available.

Ahead of pNFS if somebody can invent some form of layer that allows for the interleaving of files over multiple, independent links and storage devices then I suppose the problem might be soluble using several commodity NAS units whilst allowing for full sharing. Using a file-based protocol rather than block-based at least makes the shared access easier. I suspect it is fairly easy to kludge together, but whether there is enough CPU horsepower in a client machine to do all the buffer shuffling at that speed, I don&#039;t know. Also there are probably single threaded parts of client IP stacks that could throttle the whole thing, even if there were multiple targets.</description>
		<content:encoded><![CDATA[<p>Having looked again I&#8217;d agree -0 that 4K stuff is immense. maybe you can aggregate 10GBe ports but I doubt any software stack would handle the data volumes in a fully shared system. The performacne figures available for NFS streaming appear to be for the combined output for multiple clients and not single streaming speed. </p>
<p>The data volumes here are seriously crazy &#8211; like my 12MP DSLR taking 30 frames a second. At 1.2GBps per second then that&#8217;s over 1TB just for 15 minutes.</p>
<p>But this also shows the problem with spinning disks &#8211; 1.2GBps would take perhaps 20 of them in parallel to have a chance of sustaining just one stream. Pulling 60MBps reliably off of 20 devices all at the same time is difficult enough. Add into that contention through multiple streams, small random I/Os for file system, meta-data and so on and there would have to be many dozens of disks. I suspect that the strain of multiple streams and all the buffering going on could also saturate the memory bandwidth available.</p>
<p>Ahead of pNFS if somebody can invent some form of layer that allows for the interleaving of files over multiple, independent links and storage devices then I suppose the problem might be soluble using several commodity NAS units whilst allowing for full sharing. Using a file-based protocol rather than block-based at least makes the shared access easier. I suspect it is fairly easy to kludge together, but whether there is enough CPU horsepower in a client machine to do all the buffer shuffling at that speed, I don&#8217;t know. Also there are probably single threaded parts of client IP stacks that could throttle the whole thing, even if there were multiple targets.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Wes Felter</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-202854</link>
		<dc:creator>Wes Felter</dc:creator>
		<pubDate>Wed, 10 Jun 2009 23:26:33 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-202854</guid>
		<description>Joe, I think a better Windows solution would be for a few nodes to re-export the Gluster/Lustre filesystem using Samba w/ CTDB. No VMs needed.

Steve, I don&#039;t see how you&#039;d get multiple 1.2GB/s streams out of Sun Unified Storage (short of time-traveling into the future to get pNFS).</description>
		<content:encoded><![CDATA[<p>Joe, I think a better Windows solution would be for a few nodes to re-export the Gluster/Lustre filesystem using Samba w/ CTDB. No VMs needed.</p>
<p>Steve, I don&#8217;t see how you&#8217;d get multiple 1.2GB/s streams out of Sun Unified Storage (short of time-traveling into the future to get pNFS).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Uday</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-202849</link>
		<dc:creator>Uday</dc:creator>
		<pubDate>Wed, 10 Jun 2009 22:19:01 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-202849</guid>
		<description>As Robin mentioned, you should build a ParaScale cloud to solve your storage problems. ParaScale cloud storage software aggregates storage from heterogeneous commodity servers to form a massively scalable storage pool that can be viewed, accessed and managed from a single point.

- Trying it out is a simple software download and install on any commodity hardware running Linux. You can try us out by downloading a free 4 TB cloud at www.parascale.com, and grow it to PB scale on demand. You choose your favorite hardware vendor and can select any hw configuration and SATA/SAS/FC based on your performance requirements/cost. You purchase any server that can run RHEL or CentOS (e.g. SuperMicro xxx, white boxes). You can also mix and match servers, repurposing servers into the cloud if needed.
   
- The cloud offers a single namespace and every node within the cloud can serve client requests independently. This is not a gateway implementation and there is no chokepoint.
- The cloud software performs load balancing automatically, eliminating the need for an external load balancer. Each user who uses NFS will get redirected to a different part of the cloud and the load will be spread across the cloud.
- We work over regular bonded interfaces, and so the network will also not be a bottleneck with our solution. GE, 10gE, IB etc - whatever Linux supports we support
- We simplify managing PBs of data by supporting policy based management operations like automatic data migration, capacity balancing and replication (if required). All this is included in the free download, and you can try it without having to install any add-ons. If you are familiar with Redhat Linux or CentOS admin, that’s all that is required.
- We are optimized for large file reads/writes
- We do not support CIFS today – but windows clients can access the cloud via WebDav quite effectively. We have also found Windows NFS clients to be quite effective and with good performance.
- You can eliminate “forklift” from your vocabulary. The ParaScale software provides a layer of persistence, permanence. Underneath this software layer you can add new servers as old ones die. The cloud continues with the latest and greatest hardware out there.</description>
		<content:encoded><![CDATA[<p>As Robin mentioned, you should build a ParaScale cloud to solve your storage problems. ParaScale cloud storage software aggregates storage from heterogeneous commodity servers to form a massively scalable storage pool that can be viewed, accessed and managed from a single point.</p>
<p>- Trying it out is a simple software download and install on any commodity hardware running Linux. You can try us out by downloading a free 4 TB cloud at <a href="http://www.parascale.com" rel="nofollow">http://www.parascale.com</a>, and grow it to PB scale on demand. You choose your favorite hardware vendor and can select any hw configuration and SATA/SAS/FC based on your performance requirements/cost. You purchase any server that can run RHEL or CentOS (e.g. SuperMicro xxx, white boxes). You can also mix and match servers, repurposing servers into the cloud if needed.</p>
<p>- The cloud offers a single namespace and every node within the cloud can serve client requests independently. This is not a gateway implementation and there is no chokepoint.<br />
- The cloud software performs load balancing automatically, eliminating the need for an external load balancer. Each user who uses NFS will get redirected to a different part of the cloud and the load will be spread across the cloud.<br />
- We work over regular bonded interfaces, and so the network will also not be a bottleneck with our solution. GE, 10gE, IB etc &#8211; whatever Linux supports we support<br />
- We simplify managing PBs of data by supporting policy based management operations like automatic data migration, capacity balancing and replication (if required). All this is included in the free download, and you can try it without having to install any add-ons. If you are familiar with Redhat Linux or CentOS admin, that’s all that is required.<br />
- We are optimized for large file reads/writes<br />
- We do not support CIFS today – but windows clients can access the cloud via WebDav quite effectively. We have also found Windows NFS clients to be quite effective and with good performance.<br />
- You can eliminate “forklift” from your vocabulary. The ParaScale software provides a layer of persistence, permanence. Underneath this software layer you can add new servers as old ones die. The cloud continues with the latest and greatest hardware out there.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Jones</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-202825</link>
		<dc:creator>Steve Jones</dc:creator>
		<pubDate>Wed, 10 Jun 2009 07:01:07 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-202825</guid>
		<description>I too would look at the SUN unified storage offerings as the basic building block using 10Gbe. There is always the option of aggregating 100BaseT foir a cheaper option. The considerable advantage of it is that it should be very easy to manage and long term support should be guaranteed with Oracle&#039;s interests in SUN and open storage. With Open Solaris &amp; ZFS there is a good software base. There are other vendors who will be using the same software stack.  It scales fairly large in a single box (and should be higher yet when 2TB disks are available).  

It might also be that some of the built-in functionality like snapshoting and will be useful too and the HA clustering might be important too.

Archival software is a different issue - I&#039;m old fashioned enough to like my layering approaches clean and tend to want my basic storage devices do that well rather than try and perform too many functions. When it comes down to it, a storage device offering a standardised interface can be swapped out - something which has lots of added functionality can lock you in.</description>
		<content:encoded><![CDATA[<p>I too would look at the SUN unified storage offerings as the basic building block using 10Gbe. There is always the option of aggregating 100BaseT foir a cheaper option. The considerable advantage of it is that it should be very easy to manage and long term support should be guaranteed with Oracle&#8217;s interests in SUN and open storage. With Open Solaris &amp; ZFS there is a good software base. There are other vendors who will be using the same software stack.  It scales fairly large in a single box (and should be higher yet when 2TB disks are available).  </p>
<p>It might also be that some of the built-in functionality like snapshoting and will be useful too and the HA clustering might be important too.</p>
<p>Archival software is a different issue &#8211; I&#8217;m old fashioned enough to like my layering approaches clean and tend to want my basic storage devices do that well rather than try and perform too many functions. When it comes down to it, a storage device offering a standardised interface can be swapped out &#8211; something which has lots of added functionality can lock you in.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joe Landman</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-202819</link>
		<dc:creator>Joe Landman</dc:creator>
		<pubDate>Wed, 10 Jun 2009 01:05:40 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-202819</guid>
		<description>Reading over some of the checklist and the comments was interesting.  First on the checklist

First, they need up to 1200 MB/s per user, using Linux, Windows, and OS-X.  Up to a dozen people now (potentially 14.4GB/s, not 1.2GB/s), possibly larger.  So smaller bandwidth/non-scalable bandwidth really need not apply.  This (strongly) suggests a cluster storage system, not a single box.

Second, they need 100TB+ now.  

Third, they need global namespace ... and direct access for Windows ... which is the odd one out in terms of client support.  Lustre is mentioned.

Ok, first, the only way you are going to hit these bandwidths is with a distributed model.  I am talking the 14GB/s with 12 users simultaneously slamming on the 1200 MB/s.  

Second, don&#039;t disparage Infiniband, just because your preferred vendor/solution doesn&#039;t include it.  QDR IB can handle these rates, and you can put it on desktop units.  We do.  You can do longer runs with DDR IB and fibre ... we quoted this out for a customer.  A bit more expensive than CX4, but if you need it, you can have it.  Moreover, you can channel bond the IB and do multi-rail stuff.  IB is the indicated solution, likely with the fibre links.  ConnectX or Qlogic cards in the desktops.  Nice Mellanox/Qlogic based switch doing QDR on the back end.  Add in a 10GbE blade into the frame for multi-protocol switching if you want to tie this into your gigabit ethernet network (don&#039;t go there, 110 MB/s won&#039;t cut it).

Third, global namespace (e.g. single file system) suggests a distributed or centralized metadata service.  One may be better than others for this use case.  I&#039;d argue for distributed in this scenario, and that suggests solutions like GlusterFS.  No current windows client, ask Anand if one is planned.   Lustre is possible, but again, that windows client gets in the way.  And I should point out that Lustre does not require a team of Ph.D.s to administer it.  

Ok, hardware side.  FWIW, we have designed a number of storage clusters as of late, including smaller sustained 4 and 8 GB/s performance using Lustre and GlusterFS respectively.  There is nothing magical about this, other than using good hardware as the basis.  We have units that will provide 96TB raw in 5U at ~2GB/s per unit, that you can use as the basis for this (see &lt;a href=&quot;http://scalableinformatics.com/jackrabbit&quot; rel=&quot;nofollow&quot;&gt;this link&lt;/a&gt; for details), without breaking budgets, or causing CFOs to go catatonic.

All told, if this person wants to solve this, their best solution would likely look a great deal like a GlusterFS cluster, followed closely by a Lustre cluster.  Its just that windows issue.  Thats going to be the hard aspect to this.  There is no simple solution ... possibly apart from running windows in a VM on &lt;a href=&quot;http://scalableinformatics.com/pegasus-gpu&quot; rel=&quot;nofollow&quot;&gt;a muscular Linux desktop with an IB card in it&lt;/a&gt;.  Not perfect, but much better than &quot;No&quot;.  And for the highly reluctant pure windows users, you can have the bootup log in to a VM user, and fire up VMWare workstation right away, so it looks simply like a slow booting, but otherwise, very fast windows box.

Just my thoughts, and of course I am biased given who I work for ...</description>
		<content:encoded><![CDATA[<p>Reading over some of the checklist and the comments was interesting.  First on the checklist</p>
<p>First, they need up to 1200 MB/s per user, using Linux, Windows, and OS-X.  Up to a dozen people now (potentially 14.4GB/s, not 1.2GB/s), possibly larger.  So smaller bandwidth/non-scalable bandwidth really need not apply.  This (strongly) suggests a cluster storage system, not a single box.</p>
<p>Second, they need 100TB+ now.  </p>
<p>Third, they need global namespace &#8230; and direct access for Windows &#8230; which is the odd one out in terms of client support.  Lustre is mentioned.</p>
<p>Ok, first, the only way you are going to hit these bandwidths is with a distributed model.  I am talking the 14GB/s with 12 users simultaneously slamming on the 1200 MB/s.  </p>
<p>Second, don&#8217;t disparage Infiniband, just because your preferred vendor/solution doesn&#8217;t include it.  QDR IB can handle these rates, and you can put it on desktop units.  We do.  You can do longer runs with DDR IB and fibre &#8230; we quoted this out for a customer.  A bit more expensive than CX4, but if you need it, you can have it.  Moreover, you can channel bond the IB and do multi-rail stuff.  IB is the indicated solution, likely with the fibre links.  ConnectX or Qlogic cards in the desktops.  Nice Mellanox/Qlogic based switch doing QDR on the back end.  Add in a 10GbE blade into the frame for multi-protocol switching if you want to tie this into your gigabit ethernet network (don&#8217;t go there, 110 MB/s won&#8217;t cut it).</p>
<p>Third, global namespace (e.g. single file system) suggests a distributed or centralized metadata service.  One may be better than others for this use case.  I&#8217;d argue for distributed in this scenario, and that suggests solutions like GlusterFS.  No current windows client, ask Anand if one is planned.   Lustre is possible, but again, that windows client gets in the way.  And I should point out that Lustre does not require a team of Ph.D.s to administer it.  </p>
<p>Ok, hardware side.  FWIW, we have designed a number of storage clusters as of late, including smaller sustained 4 and 8 GB/s performance using Lustre and GlusterFS respectively.  There is nothing magical about this, other than using good hardware as the basis.  We have units that will provide 96TB raw in 5U at ~2GB/s per unit, that you can use as the basis for this (see <a href="http://scalableinformatics.com/jackrabbit" rel="nofollow">this link</a> for details), without breaking budgets, or causing CFOs to go catatonic.</p>
<p>All told, if this person wants to solve this, their best solution would likely look a great deal like a GlusterFS cluster, followed closely by a Lustre cluster.  Its just that windows issue.  Thats going to be the hard aspect to this.  There is no simple solution &#8230; possibly apart from running windows in a VM on <a href="http://scalableinformatics.com/pegasus-gpu" rel="nofollow">a muscular Linux desktop with an IB card in it</a>.  Not perfect, but much better than &#8220;No&#8221;.  And for the highly reluctant pure windows users, you can have the bootup log in to a VM user, and fire up VMWare workstation right away, so it looks simply like a slow booting, but otherwise, very fast windows box.</p>
<p>Just my thoughts, and of course I am biased given who I work for &#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-202816</link>
		<dc:creator>Steve</dc:creator>
		<pubDate>Tue, 09 Jun 2009 23:31:39 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-202816</guid>
		<description>You should check out the Omneon MediaGrid.  It can easily handle over 100 TB of replicated storage.  There are several large broadcasters using it for video editing with similar bandwidth requirements.</description>
		<content:encoded><![CDATA[<p>You should check out the Omneon MediaGrid.  It can easily handle over 100 TB of replicated storage.  There are several large broadcasters using it for video editing with similar bandwidth requirements.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Blake Golliher</title>
		<link>http://storagemojo.com/2009/06/07/configure-a-100-tb-hd-video-infrastructure/comment-page-1/#comment-202804</link>
		<dc:creator>Blake Golliher</dc:creator>
		<pubDate>Tue, 09 Jun 2009 16:40:02 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1409#comment-202804</guid>
		<description>I&#039;ll toss GPFS from IBM into the ring.  DDN is an OEM of it.  I foresee this company having to support some element of partitioning out filesystems, but probably by workload.  I&#039;ve seen companies try the &#039;single filesystem, it would be soooo easy!&#039; thing and it&#039;s not pretty when it fails.  Less eggs per basket is better. The random small file IO stuff can hit flash on a jbod (equallogic maybe?) or 15k disk from DDN, while the highly sequential throughput stuff can pound away on a 9900 or something.  All of this IO can happen under the GPFS umbrella in a small number of filesystems.  It scales to many petabytes and large numbers of nodes, like 32k in the latest version.  (I don&#039;t work for DDN, but I&#039;m a fan.)</description>
		<content:encoded><![CDATA[<p>I&#8217;ll toss GPFS from IBM into the ring.  DDN is an OEM of it.  I foresee this company having to support some element of partitioning out filesystems, but probably by workload.  I&#8217;ve seen companies try the &#8216;single filesystem, it would be soooo easy!&#8217; thing and it&#8217;s not pretty when it fails.  Less eggs per basket is better. The random small file IO stuff can hit flash on a jbod (equallogic maybe?) or 15k disk from DDN, while the highly sequential throughput stuff can pound away on a 9900 or something.  All of this IO can happen under the GPFS umbrella in a small number of filesystems.  It scales to many petabytes and large numbers of nodes, like 32k in the latest version.  (I don&#8217;t work for DDN, but I&#8217;m a fan.)</p>
]]></content:encoded>
	</item>
</channel>
</rss>

