<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: A petascale parallel database</title>
	<atom:link href="http://storagemojo.com/2010/02/08/a-petascale-parallel-database/feed/" rel="self" type="application/rss+xml" />
	<link>http://storagemojo.com/2010/02/08/a-petascale-parallel-database/</link>
	<description>Data storage info &#38; analysis</description>
	<lastBuildDate>Tue, 07 Feb 2012 16:02:02 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Roger</title>
		<link>http://storagemojo.com/2010/02/08/a-petascale-parallel-database/comment-page-1/#comment-214861</link>
		<dc:creator>Roger</dc:creator>
		<pubDate>Mon, 31 Jan 2011 12:52:17 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1891#comment-214861</guid>
		<description>You are very good at this! 
HadoopDB is focused on structured data (rather than unstructured data). HadoopDB can also be applied to other workloads, but the less structured the data gets, the less useful it will be relative to just using Hadoop.
I hope the project HadoopDB lives on...</description>
		<content:encoded><![CDATA[<p>You are very good at this!<br />
HadoopDB is focused on structured data (rather than unstructured data). HadoopDB can also be applied to other workloads, but the less structured the data gets, the less useful it will be relative to just using Hadoop.<br />
I hope the project HadoopDB lives on&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ryan Garrett</title>
		<link>http://storagemojo.com/2010/02/08/a-petascale-parallel-database/comment-page-1/#comment-208073</link>
		<dc:creator>Ryan Garrett</dc:creator>
		<pubDate>Fri, 12 Feb 2010 00:42:22 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1891#comment-208073</guid>
		<description>Very interesting article, Sebastian.

Anyone interested in MapReduce, big data management and big data analytics you should check out the Big Data Summit Bay Area next week in Burlingame, CA. You can register at http://bit.ly/5KUX01. This is the premier conference on data warehousing and big data analytics. Learn how leading companies are leveraging technologies like Hadoop and MapReduce to turn data into dollars. Hear from Aster Data customers like Intuit and Mobclix and leading analysts on new technologies and trends in big data management and advanced analytics.</description>
		<content:encoded><![CDATA[<p>Very interesting article, Sebastian.</p>
<p>Anyone interested in MapReduce, big data management and big data analytics you should check out the Big Data Summit Bay Area next week in Burlingame, CA. You can register at <a href="http://bit.ly/5KUX01" rel="nofollow">http://bit.ly/5KUX01</a>. This is the premier conference on data warehousing and big data analytics. Learn how leading companies are leveraging technologies like Hadoop and MapReduce to turn data into dollars. Hear from Aster Data customers like Intuit and Mobclix and leading analysts on new technologies and trends in big data management and advanced analytics.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: rdp</title>
		<link>http://storagemojo.com/2010/02/08/a-petascale-parallel-database/comment-page-1/#comment-208037</link>
		<dc:creator>rdp</dc:creator>
		<pubDate>Wed, 10 Feb 2010 13:40:05 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1891#comment-208037</guid>
		<description>RE:  &quot;google does internally such as their own file system, mapreduce, server builds, their own switches and routers, their own http server, their own java servlet server&quot; all key components of Enterprise Computing and its BIG brother, Cloud Computing&quot;

 I have always been a believer in scalability in computing. To this end I have been trying to decide on a Home/SOHO Cloud Computing design that could be implemented in this lifetime. Remember when &quot;SuperComputing&quot; looked so far out of reach of the little people? 
My spin on this draws on Robin&#039;s previous post &quot;Why private clouds are part of the future&quot;. So for a Home Cloud you would need:
&quot;your heavily modified file system, Hadoop (mapreduce), your custom
server builds, your hand picked switches and routers, your own http
server, your own java servlet server&quot;.
You could use COTS (Commercial Off The Shelf) components for the Home
Cloud since bandwidth and throughput will not make the difference
between your making a profit or not and surviving. This means that a new
market for Private Cloud components is developing to supply some of
the Google in-house developed components. &quot;New&quot; in the sense that these components have to be highly configurable (easily modified) to your local environment. A set of switches (hardware/software/firmware), and a roadmap for &quot;dummies&quot; to produce the required feature/function set without an in-house staff of programmers. Auto/Self configuring components are a possibility. It is not &quot;rocket science&quot; to do this.
IMHO,YMMV the rise of Private Clouds is a major shift in the computing paradigm.
I want a good cheeseburger (Information on Demand) with fries and a Pepsi (all &quot;Value Add&quot; Services) every time.
It is interesting to me that Robin has written on every topic required to compete with and even surpass the IDCs on a lesser scale. All the key components are outlined in his posts. Has anyone put them together?</description>
		<content:encoded><![CDATA[<p>RE:  &#8220;google does internally such as their own file system, mapreduce, server builds, their own switches and routers, their own http server, their own java servlet server&#8221; all key components of Enterprise Computing and its BIG brother, Cloud Computing&#8221;</p>
<p> I have always been a believer in scalability in computing. To this end I have been trying to decide on a Home/SOHO Cloud Computing design that could be implemented in this lifetime. Remember when &#8220;SuperComputing&#8221; looked so far out of reach of the little people?<br />
My spin on this draws on Robin&#8217;s previous post &#8220;Why private clouds are part of the future&#8221;. So for a Home Cloud you would need:<br />
&#8220;your heavily modified file system, Hadoop (mapreduce), your custom<br />
server builds, your hand picked switches and routers, your own http<br />
server, your own java servlet server&#8221;.<br />
You could use COTS (Commercial Off The Shelf) components for the Home<br />
Cloud since bandwidth and throughput will not make the difference<br />
between your making a profit or not and surviving. This means that a new<br />
market for Private Cloud components is developing to supply some of<br />
the Google in-house developed components. &#8220;New&#8221; in the sense that these components have to be highly configurable (easily modified) to your local environment. A set of switches (hardware/software/firmware), and a roadmap for &#8220;dummies&#8221; to produce the required feature/function set without an in-house staff of programmers. Auto/Self configuring components are a possibility. It is not &#8220;rocket science&#8221; to do this.<br />
IMHO,YMMV the rise of Private Clouds is a major shift in the computing paradigm.<br />
I want a good cheeseburger (Information on Demand) with fries and a Pepsi (all &#8220;Value Add&#8221; Services) every time.<br />
It is interesting to me that Robin has written on every topic required to compete with and even surpass the IDCs on a lesser scale. All the key components are outlined in his posts. Has anyone put them together?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: juliet</title>
		<link>http://storagemojo.com/2010/02/08/a-petascale-parallel-database/comment-page-1/#comment-208032</link>
		<dc:creator>juliet</dc:creator>
		<pubDate>Wed, 10 Feb 2010 07:37:57 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1891#comment-208032</guid>
		<description>Forget the problem of an application crash or slow data access or response time for an overloaded SAN switch port with Traverse&#039;s service container that can monitor application response time and correlate that with the underlying storage components which are relevant to that application using its Business Container technology.
http://zyrion.com/solutions/server.php</description>
		<content:encoded><![CDATA[<p>Forget the problem of an application crash or slow data access or response time for an overloaded SAN switch port with Traverse&#8217;s service container that can monitor application response time and correlate that with the underlying storage components which are relevant to that application using its Business Container technology.<br />
<a href="http://zyrion.com/solutions/server.php" rel="nofollow">http://zyrion.com/solutions/server.php</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ryan</title>
		<link>http://storagemojo.com/2010/02/08/a-petascale-parallel-database/comment-page-1/#comment-208027</link>
		<dc:creator>ryan</dc:creator>
		<pubDate>Tue, 09 Feb 2010 22:25:11 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1891#comment-208027</guid>
		<description>Maybe you could elaborate on why one might choose to use this platform over some of the other options, namely, hbase, pig, sqoop, hive, etc, etc.</description>
		<content:encoded><![CDATA[<p>Maybe you could elaborate on why one might choose to use this platform over some of the other options, namely, hbase, pig, sqoop, hive, etc, etc.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: nate</title>
		<link>http://storagemojo.com/2010/02/08/a-petascale-parallel-database/comment-page-1/#comment-208022</link>
		<dc:creator>nate</dc:creator>
		<pubDate>Tue, 09 Feb 2010 18:43:52 +0000</pubDate>
		<guid isPermaLink="false">http://storagemojo.com/?p=1891#comment-208022</guid>
		<description>I was talking to a developer working on a project that will be running on hadoop soon and was interested to hear his comments on hadoop itself, it&#039;s extremely poorly written, apparently Yahoo built it mostly by outsourcing the development overseas to some low quality coders, and the result is some pretty poor code. It can work it&#039;s just not that good. 

I find it pretty interesting how much stuff google does internally such as their own file system, mapreduce, server builds, their own switches and routers, their own http server, their own java servlet server. 

Meanwhile others struggle to keep up trying to use as much off the shelf stuff as possible because they don&#039;t have the engineering resources internally to even begin to approach doing it themselves, even a Microsoft insider admitted as much recently in an interview http://www.theregister.co.uk/2010/02/03/microsoft_bing_number_two_wannabe/

I suppose the message here is hope &amp; pray you aren&#039;t in a market that google is or might become interested in at some point if your relying on hadoop. Because whatever you can do, they can do 1000x faster with their ~billion servers, and their ~million PhDs.</description>
		<content:encoded><![CDATA[<p>I was talking to a developer working on a project that will be running on hadoop soon and was interested to hear his comments on hadoop itself, it&#8217;s extremely poorly written, apparently Yahoo built it mostly by outsourcing the development overseas to some low quality coders, and the result is some pretty poor code. It can work it&#8217;s just not that good. </p>
<p>I find it pretty interesting how much stuff google does internally such as their own file system, mapreduce, server builds, their own switches and routers, their own http server, their own java servlet server. </p>
<p>Meanwhile others struggle to keep up trying to use as much off the shelf stuff as possible because they don&#8217;t have the engineering resources internally to even begin to approach doing it themselves, even a Microsoft insider admitted as much recently in an interview <a href="http://www.theregister.co.uk/2010/02/03/microsoft_bing_number_two_wannabe/" rel="nofollow">http://www.theregister.co.uk/2010/02/03/microsoft_bing_number_two_wannabe/</a></p>
<p>I suppose the message here is hope &amp; pray you aren&#8217;t in a market that google is or might become interested in at some point if your relying on hadoop. Because whatever you can do, they can do 1000x faster with their ~billion servers, and their ~million PhDs.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

