<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: ZFS: Threat or Menace? Pt. I</title>
	<atom:link href="http://storagemojo.com/2006/05/26/zfs-threat-or-menace-pt-i/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com/2006/05/26/zfs-threat-or-menace-pt-i/</link>
	<description>Data storage info &#38; analysis</description>
	<pubDate>Fri, 21 Nov 2008 20:09:56 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.3</generator>
		<item>
		<title>By: Macwintel &#187; ZFS in new Leopard build</title>
		<link>http://storagemojo.com/2006/05/26/zfs-threat-or-menace-pt-i/#comment-12101</link>
		<dc:creator>Macwintel &#187; ZFS in new Leopard build</dc:creator>
		<pubDate>Tue, 19 Dec 2006 10:29:10 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=108#comment-12101</guid>
		<description>[...] Also for you ubergeeks (is there any other kind?), who like me, might have noticed a similarity between google&#8217;s famed GFS and ZFS there is a discussion of it here [...]</description>
		<content:encoded><![CDATA[<p>[...] Also for you ubergeeks (is there any other kind?), who like me, might have noticed a similarity between google&#8217;s famed GFS and ZFS there is a discussion of it here [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: StorageMojo &#187; ZFS On Leopard: How Cool Is That?</title>
		<link>http://storagemojo.com/2006/05/26/zfs-threat-or-menace-pt-i/#comment-2381</link>
		<dc:creator>StorageMojo &#187; ZFS On Leopard: How Cool Is That?</dc:creator>
		<pubDate>Thu, 29 Jun 2006 12:15:24 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=108#comment-2381</guid>
		<description>[...] But That&#8217;s Not All! For in-depth treatment of ZFS see here and here. Includes links to more technical info and benchmarks. [...]</description>
		<content:encoded><![CDATA[<p>[...] But That&#8217;s Not All! For in-depth treatment of ZFS see here and here. Includes links to more technical info and benchmarks. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robin Harris</title>
		<link>http://storagemojo.com/2006/05/26/zfs-threat-or-menace-pt-i/#comment-528</link>
		<dc:creator>Robin Harris</dc:creator>
		<pubDate>Tue, 13 Jun 2006 12:53:45 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=108#comment-528</guid>
		<description>No cluster support today. I'm told it is high on the list though, with implementation of a global namespace.</description>
		<content:encoded><![CDATA[<p>No cluster support today. I&#8217;m told it is high on the list though, with implementation of a global namespace.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dmitry Kulagin</title>
		<link>http://storagemojo.com/2006/05/26/zfs-threat-or-menace-pt-i/#comment-522</link>
		<dc:creator>Dmitry Kulagin</dc:creator>
		<pubDate>Tue, 13 Jun 2006 05:41:40 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=108#comment-522</guid>
		<description>Interesting article. But there is a question, as soon as you compared ZFS with Google's GFS: Does ZFS support distribution over a cluster? Reading the docs, I have no indications of that. Effective and fault tolerant clustering is a major feature of GFS.</description>
		<content:encoded><![CDATA[<p>Interesting article. But there is a question, as soon as you compared ZFS with Google&#8217;s GFS: Does ZFS support distribution over a cluster? Reading the docs, I have no indications of that. Effective and fault tolerant clustering is a major feature of GFS.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Bonwick</title>
		<link>http://storagemojo.com/2006/05/26/zfs-threat-or-menace-pt-i/#comment-108</link>
		<dc:creator>Jeff Bonwick</dc:creator>
		<pubDate>Sun, 28 May 2006 08:15:17 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=108#comment-108</guid>
		<description>Sure.  I've explained this to the best of my ability &lt;a href="http://blogs.sun.com/roller/page/bonwick?entry=zfs_end_to_end_data" rel="nofollow"&gt;here&lt;/a&gt;.  If you find this unsatisfactory or unclear in any way, please let me know so I can improve it.</description>
		<content:encoded><![CDATA[<p>Sure.  I&#8217;ve explained this to the best of my ability <a href="http://blogs.sun.com/roller/page/bonwick?entry=zfs_end_to_end_data" rel="nofollow">here</a>.  If you find this unsatisfactory or unclear in any way, please let me know so I can improve it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robert Pearson</title>
		<link>http://storagemojo.com/2006/05/26/zfs-threat-or-menace-pt-i/#comment-107</link>
		<dc:creator>Robert Pearson</dc:creator>
		<pubDate>Sun, 28 May 2006 07:51:18 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=108#comment-107</guid>
		<description>Are you referencing Jeff Bonwick's post of---

"Friday December 09, 2005

ZFS End-to-End Data Integrity "

in your reply?

If so, this issue is at the core of developing the definitive Search, Find, and Obtain (SFO) "killer app" (function) that is the cornerstone of Peter Morville's Findability. [My words, not Peter's.]
Findability is the key element in determining the User Experience (UX) defined by Peter in "Ambient Findability". True, everything above the SFO is way above the file system level. However, a big stumbling block for developing a working SFO has been Information Integrity (II [my acronym]). In my concept of End-to-end Information on Demand (E2EIoD) SFO is mission critical and depends totally on Information Integrity (II). Particularly with regard to returning useful Information in a manner and time frame that generates a pleasing User Experience (UX). 
The "Operational Definition" of "pleasing User Experience (UX)" is, "Did you get what you asked for"? Was it fast enough to be of use? Other requirements to this definition are:
1) Did you get "value add" Information that was helpful? 
2) If the Information was not available did the reply contain meaningful information as to when it would be available? 
3) If the "exact" Information was not available was any meaningful "fuzzy search' or "sounds like" Information returned?
I have been working for years on ways to overcome the very problems addressed by the ZFS developers. EMC has been particularly obtuse when I raised these issues with their equipment. I raised these issues with other Storage vendors as well but I made the mistake of thinking EMC understood the problem. They talked like they did. Turns out it was EMC"Show-biz" talk.</description>
		<content:encoded><![CDATA[<p>Are you referencing Jeff Bonwick&#8217;s post of&#8212;</p>
<p>&#8220;Friday December 09, 2005</p>
<p>ZFS End-to-End Data Integrity &#8221;</p>
<p>in your reply?</p>
<p>If so, this issue is at the core of developing the definitive Search, Find, and Obtain (SFO) &#8220;killer app&#8221; (function) that is the cornerstone of Peter Morville&#8217;s Findability. [My words, not Peter's.]<br />
Findability is the key element in determining the User Experience (UX) defined by Peter in &#8220;Ambient Findability&#8221;. True, everything above the SFO is way above the file system level. However, a big stumbling block for developing a working SFO has been Information Integrity (II [my acronym]). In my concept of End-to-end Information on Demand (E2EIoD) SFO is mission critical and depends totally on Information Integrity (II). Particularly with regard to returning useful Information in a manner and time frame that generates a pleasing User Experience (UX).<br />
The &#8220;Operational Definition&#8221; of &#8220;pleasing User Experience (UX)&#8221; is, &#8220;Did you get what you asked for&#8221;? Was it fast enough to be of use? Other requirements to this definition are:<br />
1) Did you get &#8220;value add&#8221; Information that was helpful?<br />
2) If the Information was not available did the reply contain meaningful information as to when it would be available?<br />
3) If the &#8220;exact&#8221; Information was not available was any meaningful &#8220;fuzzy search&#8217; or &#8220;sounds like&#8221; Information returned?<br />
I have been working for years on ways to overcome the very problems addressed by the ZFS developers. EMC has been particularly obtuse when I raised these issues with their equipment. I raised these issues with other Storage vendors as well but I made the mistake of thinking EMC understood the problem. They talked like they did. Turns out it was EMC&#8221;Show-biz&#8221; talk.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robin Harris</title>
		<link>http://storagemojo.com/2006/05/26/zfs-threat-or-menace-pt-i/#comment-102</link>
		<dc:creator>Robin Harris</dc:creator>
		<pubDate>Sun, 28 May 2006 02:16:05 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=108#comment-102</guid>
		<description>Good question, Jonathan. The basic issue is if the checksum is in the packet with the data, all it tells you is that the data is correct. But it may not be the data you want. &lt;a href=http://blogs.sun.com/roller/page/bonwick?catname=%2FZFS rel="nofollow"&gt;Jeff Bonwick&lt;/a&gt;, the ZFS architect cites phantom writes and misdirected reads (corruption in a pointer file?) as reasons that the wrong block might be returned. Since ZFS keeps the checksum seperate from the data, it knows if the wrong data has been returned. 

Since the checksum is calculated while the data is in the host's RAM, I suppose it could be corrupt, but if that were a regular occurence the host would crash pretty quickly.

How often does this happen? I have no idea. Since this is a statistical universe, I'd assume that as data stores continue to grow the incidence will cross a cognitive threshold and boom! all the CIOs will get excited.

From a marketing perspective, no storage vendor has gone broke scaring the hell out of customers. Someone is going to pick this up and run with it. Sun? Not likely -- they've got a lot of Storagetek to sell. More likely some hungry mid-tier vendor will start the ball rolling.

Hey, ZFS guys, any comment?</description>
		<content:encoded><![CDATA[<p>Good question, Jonathan. The basic issue is if the checksum is in the packet with the data, all it tells you is that the data is correct. But it may not be the data you want. <a href=http://blogs.sun.com/roller/page/bonwick?catname=%2FZFS rel="nofollow">Jeff Bonwick</a>, the ZFS architect cites phantom writes and misdirected reads (corruption in a pointer file?) as reasons that the wrong block might be returned. Since ZFS keeps the checksum seperate from the data, it knows if the wrong data has been returned. </p>
<p>Since the checksum is calculated while the data is in the host&#8217;s RAM, I suppose it could be corrupt, but if that were a regular occurence the host would crash pretty quickly.</p>
<p>How often does this happen? I have no idea. Since this is a statistical universe, I&#8217;d assume that as data stores continue to grow the incidence will cross a cognitive threshold and boom! all the CIOs will get excited.</p>
<p>From a marketing perspective, no storage vendor has gone broke scaring the hell out of customers. Someone is going to pick this up and run with it. Sun? Not likely &#8212; they&#8217;ve got a lot of Storagetek to sell. More likely some hungry mid-tier vendor will start the ball rolling.</p>
<p>Hey, ZFS guys, any comment?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jonathan</title>
		<link>http://storagemojo.com/2006/05/26/zfs-threat-or-menace-pt-i/#comment-99</link>
		<dc:creator>Jonathan</dc:creator>
		<pubDate>Sat, 27 May 2006 22:42:04 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=108#comment-99</guid>
		<description>Great article - maybe im missing something fundamental, but when you say,

"Any checksum stored with the data it is supporting can only tell you that this data is uncorrupted. It could be the wrong data"

How is this different than ZFS?  Is the checksum calculated at a different time?  If the checksum is calculated on corrupt data, then you'd have the same problem of a valid checksme with invalid data.  

Garbage In, Garbage out seems to continue to apply either way.  Can you give an example of a failture that ZFS would recover from, that a traditional system would not?</description>
		<content:encoded><![CDATA[<p>Great article - maybe im missing something fundamental, but when you say,</p>
<p>&#8220;Any checksum stored with the data it is supporting can only tell you that this data is uncorrupted. It could be the wrong data&#8221;</p>
<p>How is this different than ZFS?  Is the checksum calculated at a different time?  If the checksum is calculated on corrupt data, then you&#8217;d have the same problem of a valid checksme with invalid data.  </p>
<p>Garbage In, Garbage out seems to continue to apply either way.  Can you give an example of a failture that ZFS would recover from, that a traditional system would not?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in 1.866 seconds -->
