<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Google&#8217;s Bigtable Distributed Storage System, Pt. I</title>
	<atom:link href="http://storagemojo.com/2006/09/07/googles-bigtable-distributed-storage-system-pt-i/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com/2006/09/07/googles-bigtable-distributed-storage-system-pt-i/</link>
	<description>Data storage info &#38; analysis</description>
	<lastBuildDate>Sun, 01 Aug 2010 02:16:15 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<item>
		<title>By: jones</title>
		<link>http://storagemojo.com/2006/09/07/googles-bigtable-distributed-storage-system-pt-i/comment-page-1/#comment-28020</link>
		<dc:creator>jones</dc:creator>
		<pubDate>Tue, 20 Feb 2007 22:28:34 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=239#comment-28020</guid>
		<description>gosh i wish this fiilesystem could keep up with novell nss</description>
		<content:encoded><![CDATA[<p>gosh i wish this fiilesystem could keep up with novell nss</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robin Harris</title>
		<link>http://storagemojo.com/2006/09/07/googles-bigtable-distributed-storage-system-pt-i/comment-page-1/#comment-21324</link>
		<dc:creator>Robin Harris</dc:creator>
		<pubDate>Thu, 08 Feb 2007 06:25:36 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=239#comment-21324</guid>
		<description>Warren Beatty not Steve Martin?
Halle Berry not Heather Graham?

Uh-huh.

But thanks for reading StorageMojo.

Robin</description>
		<content:encoded><![CDATA[<p>Warren Beatty not Steve Martin?<br />
Halle Berry not Heather Graham?</p>
<p>Uh-huh.</p>
<p>But thanks for reading StorageMojo.</p>
<p>Robin</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: DAR</title>
		<link>http://storagemojo.com/2006/09/07/googles-bigtable-distributed-storage-system-pt-i/comment-page-1/#comment-21302</link>
		<dc:creator>DAR</dc:creator>
		<pubDate>Thu, 08 Feb 2007 05:32:37 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=239#comment-21302</guid>
		<description>Sorry - that&#039;s what I meant.  Steve Martin, Eddie Murphy.  I&#039;ve seen it.  Funny flick!</description>
		<content:encoded><![CDATA[<p>Sorry &#8211; that&#8217;s what I meant.  Steve Martin, Eddie Murphy.  I&#8217;ve seen it.  Funny flick!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robin Harris</title>
		<link>http://storagemojo.com/2006/09/07/googles-bigtable-distributed-storage-system-pt-i/comment-page-1/#comment-21300</link>
		<dc:creator>Robin Harris</dc:creator>
		<pubDate>Thu, 08 Feb 2007 05:25:43 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=239#comment-21300</guid>
		<description>Bowfinger.

Robin</description>
		<content:encoded><![CDATA[<p>Bowfinger.</p>
<p>Robin</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: DAR</title>
		<link>http://storagemojo.com/2006/09/07/googles-bigtable-distributed-storage-system-pt-i/comment-page-1/#comment-21295</link>
		<dc:creator>DAR</dc:creator>
		<pubDate>Thu, 08 Feb 2007 05:11:27 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=239#comment-21295</guid>
		<description>Great read.  Thanks for the review!

Re: your trivia test:  Bulworth?</description>
		<content:encoded><![CDATA[<p>Great read.  Thanks for the review!</p>
<p>Re: your trivia test:  Bulworth?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robin Harris</title>
		<link>http://storagemojo.com/2006/09/07/googles-bigtable-distributed-storage-system-pt-i/comment-page-1/#comment-6268</link>
		<dc:creator>Robin Harris</dc:creator>
		<pubDate>Sat, 09 Sep 2006 21:38:03 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=239#comment-6268</guid>
		<description>Thanks for the links.

Sawzall - a great name - looks really interesting. Not sure I&#039;ll dive into it in depth though. Here are a few thoughts while the topic is still fresh.

Sawzall works directly on the data in a Bigtable. Bigtable data is structured into tables, but is not typed. So it is up to Sawzall or any other app to know what it is looking at.

Ad hoc refers, I think, to a data set that is the product of a query or a Sawzall data reduction exercise. Once it&#039;s created you can search it or perform more opearations on it.

I&#039;m not quite sure what is meant by structuring search results on the input side. Page rank does just that, and there are allegations that Google plays with Page rank to provide results more to its liking, what ever that may mean.

A topic for a future post is &quot;Finding meaning in massive data&quot;. Search is the first step. Social networking, i.e. human-powered search is the second. Automating the discovery of meaning in massive data is, at least, the third.</description>
		<content:encoded><![CDATA[<p>Thanks for the links.</p>
<p>Sawzall &#8211; a great name &#8211; looks really interesting. Not sure I&#8217;ll dive into it in depth though. Here are a few thoughts while the topic is still fresh.</p>
<p>Sawzall works directly on the data in a Bigtable. Bigtable data is structured into tables, but is not typed. So it is up to Sawzall or any other app to know what it is looking at.</p>
<p>Ad hoc refers, I think, to a data set that is the product of a query or a Sawzall data reduction exercise. Once it&#8217;s created you can search it or perform more opearations on it.</p>
<p>I&#8217;m not quite sure what is meant by structuring search results on the input side. Page rank does just that, and there are allegations that Google plays with Page rank to provide results more to its liking, what ever that may mean.</p>
<p>A topic for a future post is &#8220;Finding meaning in massive data&#8221;. Search is the first step. Social networking, i.e. human-powered search is the second. Automating the discovery of meaning in massive data is, at least, the third.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: e2eiod</title>
		<link>http://storagemojo.com/2006/09/07/googles-bigtable-distributed-storage-system-pt-i/comment-page-1/#comment-6267</link>
		<dc:creator>e2eiod</dc:creator>
		<pubDate>Sat, 09 Sep 2006 17:36:02 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=239#comment-6267</guid>
		<description>Interesting read on &quot;sawzall&quot; at:
[Begin long URL]
http://www.networkworld.com/community/?q=node/8336&amp;rlt=0904gibbs1&amp;code=nlgibbs146124
[End long URL]

From the article:
Sawzall White Paper
http://labs.google.com/papers/sawzall-sciprog.pdf

And the movie mentioned in the article:
http://labs.google.com/papers/sawzall-20030814.gif

Structured, semi-structured and ad hoc Information spaces:

Sawzall on&quot;ad hoc&quot; 
Traditional data processing is done by storing the information in a relational database and processing it with SQL queries. Our system has many differences. First, the data sets are usually too large to fit in a relational database; files are processed in situ rather than being imported into a database server. Also, there are no pre-computed tables or indices; instead the purpose of the system is to construct ad hoc tables and indices appropriate to the computation.

The words &quot;ad hoc&quot; do not appear in the Bigtable white paper.

Bigtable on structured, semi-structured Information:

&quot;Bigtable is a distributed storage system for managing structured data that is designed to scale to a very large size:&quot;

&quot;Bigtable also treats data as uninterpreted strings, although clients often serialize various forms of structured and semi-structured data into these strings. Clients can control the locality of their data through careful
choices in their schemas.&quot;

Does the Google search engine &quot;structure&quot; search results on the input side?
Is an effective search of &quot;ad hoc&quot; Information space not possible?
Is this search area size dependent?
Or is it resource limited?
Interesting...</description>
		<content:encoded><![CDATA[<p>Interesting read on &#8220;sawzall&#8221; at:<br />
[Begin long URL]<br />
<a href="http://www.networkworld.com/community/?q=node/8336&amp;rlt=0904gibbs1&amp;code=nlgibbs146124" rel="nofollow">http://www.networkworld.com/community/?q=node/8336&amp;rlt=0904gibbs1&amp;code=nlgibbs146124</a><br />
[End long URL]</p>
<p>From the article:<br />
Sawzall White Paper<br />
<a href="http://labs.google.com/papers/sawzall-sciprog.pdf" rel="nofollow">http://labs.google.com/papers/sawzall-sciprog.pdf</a></p>
<p>And the movie mentioned in the article:<br />
<a href="http://labs.google.com/papers/sawzall-20030814.gif" rel="nofollow">http://labs.google.com/papers/sawzall-20030814.gif</a></p>
<p>Structured, semi-structured and ad hoc Information spaces:</p>
<p>Sawzall on&#8221;ad hoc&#8221;<br />
Traditional data processing is done by storing the information in a relational database and processing it with SQL queries. Our system has many differences. First, the data sets are usually too large to fit in a relational database; files are processed in situ rather than being imported into a database server. Also, there are no pre-computed tables or indices; instead the purpose of the system is to construct ad hoc tables and indices appropriate to the computation.</p>
<p>The words &#8220;ad hoc&#8221; do not appear in the Bigtable white paper.</p>
<p>Bigtable on structured, semi-structured Information:</p>
<p>&#8220;Bigtable is a distributed storage system for managing structured data that is designed to scale to a very large size:&#8221;</p>
<p>&#8220;Bigtable also treats data as uninterpreted strings, although clients often serialize various forms of structured and semi-structured data into these strings. Clients can control the locality of their data through careful<br />
choices in their schemas.&#8221;</p>
<p>Does the Google search engine &#8220;structure&#8221; search results on the input side?<br />
Is an effective search of &#8220;ad hoc&#8221; Information space not possible?<br />
Is this search area size dependent?<br />
Or is it resource limited?<br />
Interesting&#8230;</p>
]]></content:encoded>
	</item>
</channel>
</rss>
