<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Why do storage systems fail?</title>
	<atom:link href="http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/</link>
	<description>Data storage info &#38; analysis</description>
	<pubDate>Mon, 12 May 2008 08:47:45 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
		<item>
		<title>By: You get what you pay for &#171; Storage Effect</title>
		<link>http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-178816</link>
		<dc:creator>You get what you pay for &#171; Storage Effect</dc:creator>
		<pubDate>Fri, 07 Mar 2008 19:15:38 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-178816</guid>
		<description>[...] One failure in a million hours?  It&#8217;s claims like these that seem extreme to some people when they look at enterprise disk drives.  Yet a study of 39,000 NetApp systems by a researcher have found that these drives fail at a 1% annual failure rate (AFR).  Robin Harris summarizes the study in his blog. [...]</description>
		<content:encoded><![CDATA[<p>[...] One failure in a million hours?  It&#8217;s claims like these that seem extreme to some people when they look at enterprise disk drives.  Yet a study of 39,000 NetApp systems by a researcher have found that these drives fail at a 1% annual failure rate (AFR).  Robin Harris summarizes the study in his blog. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Richard</title>
		<link>http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-176723</link>
		<dc:creator>Richard</dc:creator>
		<pubDate>Sat, 01 Mar 2008 09:48:46 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-176723</guid>
		<description>Robin,
Yes... cabling plus connectors plus poor airflow plus power .... this is why the 'commodity' motherboard hardware apporoach does not work well. 
NetApp controllers seem to be built on standard motherboard technology... right?</description>
		<content:encoded><![CDATA[<p>Robin,<br />
Yes&#8230; cabling plus connectors plus poor airflow plus power &#8230;. this is why the &#8216;commodity&#8217; motherboard hardware apporoach does not work well.<br />
NetApp controllers seem to be built on standard motherboard technology&#8230; right?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: pmwut5</title>
		<link>http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-175790</link>
		<dc:creator>pmwut5</dc:creator>
		<pubDate>Wed, 27 Feb 2008 10:12:53 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-175790</guid>
		<description>I still find it interesting that a Netapp report into storage failure rates make no mention on out dated drive firmware as a cause of storage subsystem failure. Some disk drives being used have N14 releases of firmware - mostly to handle disk error handling which prevents the drive being marked as failed. If I was going to produce a report into 1.8 million disks the first thing I would look at is the firmware on those drives before making  any conclusion</description>
		<content:encoded><![CDATA[<p>I still find it interesting that a Netapp report into storage failure rates make no mention on out dated drive firmware as a cause of storage subsystem failure. Some disk drives being used have N14 releases of firmware - mostly to handle disk error handling which prevents the drive being marked as failed. If I was going to produce a report into 1.8 million disks the first thing I would look at is the firmware on those drives before making  any conclusion</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pete Steege</title>
		<link>http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-175525</link>
		<dc:creator>Pete Steege</dc:creator>
		<pubDate>Tue, 26 Feb 2008 16:35:39 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-175525</guid>
		<description>It's probably not that NetApp systems are better, but that their application environment is different than other systems on average.  I have no facts to support this, but my sense is the NetApp sweet spot is very high capacity apps that tend to stress individual drives less than other apps.

My observation is that the higher the capacity in a system, the lower the IOPS per Gigabyte (or drive).</description>
		<content:encoded><![CDATA[<p>It&#8217;s probably not that NetApp systems are better, but that their application environment is different than other systems on average.  I have no facts to support this, but my sense is the NetApp sweet spot is very high capacity apps that tend to stress individual drives less than other apps.</p>
<p>My observation is that the higher the capacity in a system, the lower the IOPS per Gigabyte (or drive).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pete Steege</title>
		<link>http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-175520</link>
		<dc:creator>Pete Steege</dc:creator>
		<pubDate>Tue, 26 Feb 2008 16:32:03 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-175520</guid>
		<description>Great perspective on disk failure.  The fact that enterprise drives actually achieve their reliability numbers will surprise some people, given that they are so extreme (1.6 million hours between failures, for example).

Goes to show you get what you pay for.</description>
		<content:encoded><![CDATA[<p>Great perspective on disk failure.  The fact that enterprise drives actually achieve their reliability numbers will surprise some people, given that they are so extreme (1.6 million hours between failures, for example).</p>
<p>Goes to show you get what you pay for.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mad Morf</title>
		<link>http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-175461</link>
		<dc:creator>Mad Morf</dc:creator>
		<pubDate>Tue, 26 Feb 2008 12:56:13 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-175461</guid>
		<description>pmwut5...
You're mixing apples and oranges here.
Drive firmware is different from shelf module firmware.
Keeping drive and shelf firmware updated is the customer's responsibility, as it is with any vendor.
LRCs provide no protection from LIP storms, so replace them if you don't like the results you are getting. ESH2s and ESH4s do provide a lot of protection from LIP events, but no solution is perfect!
Dual connecting shelves requires no downtime if you have sufficient HBA ports available in the appliance.</description>
		<content:encoded><![CDATA[<p>pmwut5&#8230;<br />
You&#8217;re mixing apples and oranges here.<br />
Drive firmware is different from shelf module firmware.<br />
Keeping drive and shelf firmware updated is the customer&#8217;s responsibility, as it is with any vendor.<br />
LRCs provide no protection from LIP storms, so replace them if you don&#8217;t like the results you are getting. ESH2s and ESH4s do provide a lot of protection from LIP events, but no solution is perfect!<br />
Dual connecting shelves requires no downtime if you have sufficient HBA ports available in the appliance.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: pmwut5</title>
		<link>http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-175424</link>
		<dc:creator>pmwut5</dc:creator>
		<pubDate>Tue, 26 Feb 2008 10:18:11 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-175424</guid>
		<description>Cannot believe Netapp made no mention on out dated firmware running on their drives as a cause of storage failure. I have experienced several LIP storms from failed drives on LRC ESH2 and ESH4 modules storage interconnects on production Netapp Arrays. Netapp Solution is to dual connect all storage trays - great if you have enough I/O slots and downtime available.</description>
		<content:encoded><![CDATA[<p>Cannot believe Netapp made no mention on out dated firmware running on their drives as a cause of storage failure. I have experienced several LIP storms from failed drives on LRC ESH2 and ESH4 modules storage interconnects on production Netapp Arrays. Netapp Solution is to dual connect all storage trays - great if you have enough I/O slots and downtime available.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tony</title>
		<link>http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-175287</link>
		<dc:creator>Tony</dc:creator>
		<pubDate>Mon, 25 Feb 2008 17:33:24 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/02/24/why-do-storage-systems-fail/#comment-175287</guid>
		<description>Yes, connectors really suck (as I figure out where and what type of connectors to use in a new design) - but no connectors (hard-wired) is even worse!

There's a lot more science and technology in good connectors than most people realize.</description>
		<content:encoded><![CDATA[<p>Yes, connectors really suck (as I figure out where and what type of connectors to use in a new design) - but no connectors (hard-wired) is even worse!</p>
<p>There&#8217;s a lot more science and technology in good connectors than most people realize.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in -0.979 seconds -->
