<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: P4P: smart, fast and easy P2P</title>
	<atom:link href="http://storagemojo.com/2008/03/16/p4p-smart-fast-and-easy-p2p/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com/2008/03/16/p4p-smart-fast-and-easy-p2p/</link>
	<description>Data storage info &#38; analysis</description>
	<pubDate>Tue, 20 May 2008 19:14:28 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
		<item>
		<title>By: Paul</title>
		<link>http://storagemojo.com/2008/03/16/p4p-smart-fast-and-easy-p2p/#comment-181496</link>
		<dc:creator>Paul</dc:creator>
		<pubDate>Tue, 18 Mar 2008 01:21:28 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/03/16/p4p-smart-fast-and-easy-p2p/#comment-181496</guid>
		<description>Ars Technica links to one paper (tech enough to have equations, at least):

http://www.dcia.info/documents/P4P_Overview.pdf</description>
		<content:encoded><![CDATA[<p>Ars Technica links to one paper (tech enough to have equations, at least):</p>
<p><a href="http://www.dcia.info/documents/P4P_Overview.pdf" rel="nofollow">http://www.dcia.info/documents/P4P_Overview.pdf</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Darcy</title>
		<link>http://storagemojo.com/2008/03/16/p4p-smart-fast-and-easy-p2p/#comment-181234</link>
		<dc:creator>Jeff Darcy</dc:creator>
		<pubDate>Mon, 17 Mar 2008 10:55:33 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/2008/03/16/p4p-smart-fast-and-easy-p2p/#comment-181234</guid>
		<description>This is not too dissimilar from what I was doing for the second half of my tenure at EMC.  On the one hand P4P is more fully distributed, on the other it lacks support for writes with full coherency and authentication, but those are all kind of beside the point.  What matters is that the idea of setting up a dynamic tree/graph/whatever based on the actual network topology and then propagating data through that network instead of between peers at random really can produce dramatic efficiency gains.  In one of my specs, which IIRC got reproduced almost verbatim in the associated patent, I refer to the two cardinal laws of data distribution:

* Never transmit data over the same link multiple times when once could have sufficed.

* Never transmit data over a link once when zero could have sufficed.

Most systems tend to violate one rule or the other, most often the first - and since infinity minus one is greater than one minus zero that can be far worse.  Because of my work I was involved with some mostly-academic projects in or near the P2P space at the time.  There was a distinct pattern of starting out with DHTs and such that completely ignore network topology, then trying to bolt on some limited form of topology awareness later as latency (and sometimes reliability but rarely bandwidth) turned out to be a problem after all.  Few had the foresight to take topology into account right at the start.

What I really wonder about in P4P is how - and how well - the nodes cooperate wrt placement of data copies.  On the one hand, it can be good to have one piece of data replicated as many places as possible.  On the other, it can be good to replicate as many pieces of data as possible.  Total storage space being finite even in a P2P network, balancing these needs is a much-studied problem and I'm curious what approach the P4P team took.</description>
		<content:encoded><![CDATA[<p>This is not too dissimilar from what I was doing for the second half of my tenure at EMC.  On the one hand P4P is more fully distributed, on the other it lacks support for writes with full coherency and authentication, but those are all kind of beside the point.  What matters is that the idea of setting up a dynamic tree/graph/whatever based on the actual network topology and then propagating data through that network instead of between peers at random really can produce dramatic efficiency gains.  In one of my specs, which IIRC got reproduced almost verbatim in the associated patent, I refer to the two cardinal laws of data distribution:</p>
<p>* Never transmit data over the same link multiple times when once could have sufficed.</p>
<p>* Never transmit data over a link once when zero could have sufficed.</p>
<p>Most systems tend to violate one rule or the other, most often the first - and since infinity minus one is greater than one minus zero that can be far worse.  Because of my work I was involved with some mostly-academic projects in or near the P2P space at the time.  There was a distinct pattern of starting out with DHTs and such that completely ignore network topology, then trying to bolt on some limited form of topology awareness later as latency (and sometimes reliability but rarely bandwidth) turned out to be a problem after all.  Few had the foresight to take topology into account right at the start.</p>
<p>What I really wonder about in P4P is how - and how well - the nodes cooperate wrt placement of data copies.  On the one hand, it can be good to have one piece of data replicated as many places as possible.  On the other, it can be good to replicate as many pieces of data as possible.  Total storage space being finite even in a P2P network, balancing these needs is a much-studied problem and I&#8217;m curious what approach the P4P team took.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.553 seconds -->
