<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Xavier Llorà</title>
	<atom:link href="http://www.xavierllora.net/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.xavierllora.net</link>
	<description>A notebook about data-intensive computing, genetics-based machine learning, semantic-web technology, cloud computing,  and more.</description>
	<lastBuildDate>Thu, 21 Jan 2010 14:59:59 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Fast REST API prototyping with Crochet and Scala</title>
		<link>http://www.xavierllora.net/2010/01/21/fast-rest-api-prototyping-with-crochet-and-scala/</link>
		<comments>http://www.xavierllora.net/2010/01/21/fast-rest-api-prototyping-with-crochet-and-scala/#comments</comments>
		<pubDate>Thu, 21 Jan 2010 14:59:59 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[APIs]]></category>
		<category><![CDATA[Crochet]]></category>
		<category><![CDATA[meandre]]></category>
		<category><![CDATA[REST]]></category>
		<category><![CDATA[scala]]></category>
		<category><![CDATA[SEASR]]></category>

		<guid isPermaLink="false">http://www.xavierllora.net/?p=650</guid>
		<description><![CDATA[I just finished committing the last changes to Crochet and tagged version 0.1.4vcli now publicly available on GitHub (http://github.com/xllora/Crochet).  Also feel free to visit the issues page in case you run into question/problems/bugs.
Motivation
Crochet is a light weight web framework oriented to rapid prototyping of REST APIs. If you are looking for a Rails like [...]


Related posts:<ol><li><a href='http://www.xavierllora.net/2009/12/01/meandre-is-going-scala/' rel='bookmark' title='Permanent Link: Meandre is going Scala'>Meandre is going Scala</a></li>
<li><a href='http://www.xavierllora.net/2008/11/13/fast-mutation-implementation-for-genetic-algorithms-in-python/' rel='bookmark' title='Permanent Link: Fast mutation implementation for genetic algorithms in Python'>Fast mutation implementation for genetic algorithms in Python</a></li>
<li><a href='http://www.xavierllora.net/2006/01/19/fast-rule-matching-using-vector-instructions/' rel='bookmark' title='Permanent Link: Software for fast rule matching using vector instructions'>Software for fast rule matching using vector instructions</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>I just finished committing the last changes to <em>Crochet</em> and tagged version 0.1.4vcli now publicly available on GitHub (<a href="http://github.com/xllora/Crochet">http://github.com/xllora/Crochet</a>).  Also feel free to visit the <a href="http://github.com/xllora/Crochet/issues">issues</a> page in case you run into question/problems/bugs.</p>
<h2>Motivation</h2>
<p><em>Crochet</em> is a light weight web framework oriented to rapid prototyping of REST APIs. If you are looking for a <a href="http://rubyonrails.org/">Rails</a> like framework written in <a href="http://www.scala-lang.org/">Scala</a>, please take a look at Lift at <a href="http://liftweb.net/">http://liftweb.net/</a> instead.</p>
<p><a href="http://github.com/xllora/Crochet">Crochet</a> targets quick prototyping of REST APIs relying on the flexibility of the <a href="http://www.scala-lang.org/">Scala</a> language. The initial ideas for Crochet were inspired while reading Gabriele Renzi post on creating the <a href="http://www.riffraff.info/2009/4/11/step-a-scala-web-picoframework">STEP</a> picoframework with <a href="http://www.scala-lang.org/">Scala</a> and the need for quickly prototyping APIs for pilot projects. <em>Crochet</em> also provides mechanisms to hide repetitive tasks involved with default responses and authentication/authorization piggybacking on the mechanics provided by application servers.</p>
<h2>Who uses Crochet?</h2>
<p><em>Crochet</em> was born from the need for quickly prototyping REST APIs which required exposing legacy code written in Java. I have been actively using <a href="http://github.com/xllora/Crochet">Crochet</a> to provide REST APIs for a variety of projects developed at the <a href="http://www.ncsa.illinois.edu">National Center for Supercomputing Applications</a>. One of the primary adopters and movers of <em>Crochet</em> is the <a href="http://seasr.org/meandre/">Meandre</a> Infrastructure for data-intensive computing developed under the <a href="http://seasr.org/">SEASR</a> project.</p>
<h2>Crochet in 2 minuts</h2>
<p>Before you start please check you have <a href="http://www.scala-lang.org/">Scala</a> installed on your system. You can find more information on how to get Scala up and running <a href="http://www.scala-lang.org/downloads">here</a>.</p>
<ol>
<li>Get the latest <em>Crochet</em> jar from the <a href="http://github.com/xllora/Crochet/downloads">Downloads</a> section at <a href="http://github.com/xllora/Crochet/">GitHub</a> and the third party dependencies. </li>
<li>Copy the following code into a file named <code>hello-world.scala</code>.

<div class="wp_syntax"><div class="code"><pre class="scala" style="font-family:monospace;"><span style="color: #0000ff; font-weight: bold;">import</span> crochet.<span style="color: #000080;">_</span>
<span style="color: #0000ff; font-weight: bold;">new</span> Crochet <span style="color: #F78811;">&#123;</span>
     get<span style="color: #F78811;">&#40;</span><span style="color: #6666FF;">&quot;/message&quot;</span><span style="color: #F78811;">&#41;</span> <span style="color: #F78811;">&#123;</span> 
         <span style="color: #000080;">&lt;</span>html<span style="color: #000080;">&gt;</span>
               <span style="color: #000080;">&lt;</span>head<span style="color: #000080;">&gt;&lt;</span>title<span style="color: #000080;">&gt;</span>Hello World<span style="color: #000080;">&lt;</span>/title<span style="color: #000080;">&gt;&lt;</span>/head<span style="color: #000080;">&gt;</span>
               <span style="color: #000080;">&lt;</span>body<span style="color: #000080;">&gt;&lt;</span>h1<span style="color: #000080;">&gt;</span>Hello World<span style="color: #000080;">!&lt;</span>/h1<span style="color: #000080;">&gt;&lt;</span>/body<span style="color: #000080;">&gt;</span>
         <span style="color: #000080;">&lt;</span>/html<span style="color: #000080;">&gt;</span>
     <span style="color: #F78811;">&#125;</span>
<span style="color: #F78811;">&#125;</span> on <span style="color: #F78811;">8080</span></pre></div></div>

</li>
<li>Get your server up and running by running (please change the version number if needed)
<pre>$ scala -cp crochet-0.1.4.jar:crochet-3dparty-libraries-0.1.X.jar hello-world.scala</pre>
</li>
<p>You just have your first _Crochet_ API up and running. You can check the API working by opening your browser and pointing it to <a href="http://localhost:8080/message">http://localhost:8080/message</a> and you should get the message <code>Hello World!</code> back.</p>
<h2>Where to go from here?</h2>
<p>You will find more information on the <a href="http://wiki.github.com/xllora/Crochet/">Crochet wiki</a> at <a href="http://github.com/xllora/Crochet">GitHub</a>. The wiki contains basic information as a QuickStart guide (which also includes how to deal with static content), descriptions of the basic concepts used in <em>Crochet</em>, and several examples that can get up and running fast. </p>


<p>Related posts:<ol><li><a href='http://www.xavierllora.net/2009/12/01/meandre-is-going-scala/' rel='bookmark' title='Permanent Link: Meandre is going Scala'>Meandre is going Scala</a></li>
<li><a href='http://www.xavierllora.net/2008/11/13/fast-mutation-implementation-for-genetic-algorithms-in-python/' rel='bookmark' title='Permanent Link: Fast mutation implementation for genetic algorithms in Python'>Fast mutation implementation for genetic algorithms in Python</a></li>
<li><a href='http://www.xavierllora.net/2006/01/19/fast-rule-matching-using-vector-instructions/' rel='bookmark' title='Permanent Link: Software for fast rule matching using vector instructions'>Software for fast rule matching using vector instructions</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.xavierllora.net/2010/01/21/fast-rest-api-prototyping-with-crochet-and-scala/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>GECCO 2010 Submission Deadline (Extended)</title>
		<link>http://www.xavierllora.net/2009/12/19/gecco-2010-submission-deadline/</link>
		<comments>http://www.xavierllora.net/2009/12/19/gecco-2010-submission-deadline/#comments</comments>
		<pubDate>Sat, 19 Dec 2009 12:11:34 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
				<category><![CDATA[Estimation of Distribution Algorithms]]></category>
		<category><![CDATA[Human-Computer Interaction]]></category>
		<category><![CDATA[Learning Classifier Systems]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[GECCO]]></category>

		<guid isPermaLink="false">http://www.xavierllora.net/?p=647</guid>
		<description><![CDATA[If you are planning to submit a paper for the 2010 Genetic and Evolutionary Computation Conference, the deadline is January 13, 2010 (and now extended to January 27th). You can find more information at the GECCO 2010 calendar site.


Related posts:GECCO 2009 paper submission deadline extended till January 28
GECCO 2007 deadline extended
GECCO-2006 submissions deadline extended to [...]


Related posts:<ol><li><a href='http://www.xavierllora.net/2009/01/09/gecco-2009-paper-submission-deadline-extended-till-january-28/' rel='bookmark' title='Permanent Link: GECCO 2009 paper submission deadline extended till January 28'>GECCO 2009 paper submission deadline extended till January 28</a></li>
<li><a href='http://www.xavierllora.net/2007/01/16/gecco-2007-deadline-extended/' rel='bookmark' title='Permanent Link: GECCO 2007 deadline extended'>GECCO 2007 deadline extended</a></li>
<li><a href='http://www.xavierllora.net/2006/01/18/gecco-2006-submissions-deadline-extended-to-february-1st/' rel='bookmark' title='Permanent Link: GECCO-2006 submissions deadline extended to February 1st'>GECCO-2006 submissions deadline extended to February 1st</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>If you are planning to submit a paper for the 2010 Genetic and Evolutionary Computation Conference, the deadline is January 13, 2010 (<strong>and now extended to January 27th</strong>). You can find more information at the <a href="http://www.sigevo.org/gecco-2010/calendar.html">GECCO 2010 calendar site</a>.</p>


<p>Related posts:<ol><li><a href='http://www.xavierllora.net/2009/01/09/gecco-2009-paper-submission-deadline-extended-till-january-28/' rel='bookmark' title='Permanent Link: GECCO 2009 paper submission deadline extended till January 28'>GECCO 2009 paper submission deadline extended till January 28</a></li>
<li><a href='http://www.xavierllora.net/2007/01/16/gecco-2007-deadline-extended/' rel='bookmark' title='Permanent Link: GECCO 2007 deadline extended'>GECCO 2007 deadline extended</a></li>
<li><a href='http://www.xavierllora.net/2006/01/18/gecco-2006-submissions-deadline-extended-to-february-1st/' rel='bookmark' title='Permanent Link: GECCO-2006 submissions deadline extended to February 1st'>GECCO-2006 submissions deadline extended to February 1st</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.xavierllora.net/2009/12/19/gecco-2010-submission-deadline/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Meandre is going Scala</title>
		<link>http://www.xavierllora.net/2009/12/01/meandre-is-going-scala/</link>
		<comments>http://www.xavierllora.net/2009/12/01/meandre-is-going-scala/#comments</comments>
		<pubDate>Tue, 01 Dec 2009 20:01:47 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
				<category><![CDATA[Notes]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[clojure]]></category>
		<category><![CDATA[data-intensive flows]]></category>
		<category><![CDATA[erlang]]></category>
		<category><![CDATA[jruby]]></category>
		<category><![CDATA[jython]]></category>
		<category><![CDATA[meandre]]></category>
		<category><![CDATA[scala]]></category>
		<category><![CDATA[ZigZag]]></category>

		<guid isPermaLink="false">http://www.xavierllora.net/?p=637</guid>
		<description><![CDATA[After quite a bit of experimenting with different alternatives, Meandre is moving into Scala. Scala is a general purpose programming language designed to express common programming patterns in a concise, elegant, and type-safe way. This is not a radical process, but a gradual one while I am starting to revisit the infrastructure for the next [...]


Related posts:<ol><li><a href='http://www.xavierllora.net/2008/04/18/meandre-semantic-driven-data-intensive-flow-engine/' rel='bookmark' title='Permanent Link: Meandre: Semantic-Driven Data-Intensive Flow Engine'>Meandre: Semantic-Driven Data-Intensive Flow Engine</a></li>
<li><a href='http://www.xavierllora.net/2010/01/21/fast-rest-api-prototyping-with-crochet-and-scala/' rel='bookmark' title='Permanent Link: Fast REST API prototyping with Crochet and Scala'>Fast REST API prototyping with Crochet and Scala</a></li>
<li><a href='http://www.xavierllora.net/2008/12/02/meandre-infrastructure-14-rc1-tagged/' rel='bookmark' title='Permanent Link: Meandre Infrastructure 1.4 RC1 tagged'>Meandre Infrastructure 1.4 RC1 tagged</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>After quite a bit of experimenting with different alternatives, <a href="http://seasr.org/meandre">Meandre</a> is moving into <a href="http://www.scala-lang.org/">Scala</a>. <a href="http://www.scala-lang.org/">Scala</a> is a general purpose programming language designed to express common programming patterns in a concise, elegant, and type-safe way. This is not a radical process, but a gradual one while I am starting to revisit the infrastructure for the next major release. <a href="http://www.scala-lang.org/">Scala</a> also generates code for the JVM making mix and match trivial. I started fuzzing around with Scala back when I started the development of <a href="http://seasr.org/meandre">Meandre</a> during the summer of 2007, however I did fall back to Java since that was what most of the people in the group was comfortable with. I was fascinated with <a href="http://www.scala-lang.org/">Scala</a> fusion of object oriented programming and functional programming. Time went by and the codebase has grown to a point that I cannot stand anymore cutting through the weeds of Java when I have to extend the infrastructure or do bug fixing&#8212;not to mention its verbosity even for writing trivial code.</p>
<p>This summer I decided to go on a quest to get me out of the woods. I do not mind relying on the JVM and the large collection of libraries available, but I would also like to get my sanity back. Yes, I tested some of the usual suspects for the JVM (<a href="http://www.jython.org/">Jython</a>, <a href="http://jruby.org/">JRuby</a>, <a href="http://clojure.org/">Clojure</a>, and <a href="http://groovy.codehaus.org/">Groovy</a>) but not quite what I wanted. For instance, I wrote most of the <a href="http://seasr.org/meandre">Meandre</a> infrastructure services using <a href="http://www.jython.org/">Jython</a> (much more concise than Java), but still not quite happy to jump on that boat. <a href="http://clojure.org/">Clojure</a> is also interesting (functional programming) but it would be hard to justify for the group to move into it since not everybody may feel comfortable with a pure functional language. I also toyed with some not-so-usual ones like <a href="http://www.erlang.org/">Erlang</a> and <a href="http://www.haskell.org/">Haskell</a>, but again, I ended up with no real argument that could justify such a decision. </p>
<p>So, as I started doing back in 2007, I went back to my original idea of using  <a href="http://www.scala-lang.org/">Scala</a> and its mixed object-oriented- and functional-programming- paradigm. To test it seriously, I started developing the distributed execution engine for <a href="http://seasr.org/meandre">Meandre</a> in  <a href="http://www.scala-lang.org/">Scala</a> using its Earlang-inspired actors. And, boom, suddenly I found myself spending more time thinking that writing/debugging threaded/networking code <img src='http://www.xavierllora.net/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /> . Yes, I regret my 2007 decision instead of running with my original intuition, but better late than never. With a working seed of the distributed engine working and tested (did I mention that <a href="http://code.google.com/p/scalacheck/">scalacheck</a> and <a href="http://code.google.com/p/specs/">specs</a> are really powerful tools for behavior driven development?), I finally decided to start gravitating the Meandre infrastructure development effort from Java to <a href="http://www.scala-lang.org/">Scala</a>&#8212;did I mention that <a href="http://www.scala-lang.org/">Scala</a> is Martin Odersky&#8217;s child? Yes, such a decision has some impact on my colleagues, but I envision that the benefits will eventually weight out the initial resistance and step learning curve. At least, the last two group meetings nobody jumped off the window while presenting the key elements of  <a href="http://www.scala-lang.org/">Scala</a>, and demonstrating how concise and elegant it made the first working seed of the distributed execution engine <img src='http://www.xavierllora.net/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /> . We even got in discussions about the benefits of using <a href="http://www.scala-lang.org/">Scala</a> if it delivered everything I showed. I am lucky to work with such smart guys. If you want to take a peek at the <a href="http://dev-tools.seasr.org/fisheye/browse/Snowfield">distributed execution engine (a.k.a. Snowfield) at SEASR&#8217;s Fisheye</a>.</p>
<p>Oh, one last thing. Are you using Atlassian&#8217;s Fisheye? Do you want <a href='http://www.xavierllora.net/wp-content/uploads/2009/12/scala.def_.gz'>syntax highlighting for Scala</a>? I tweaked the Java definitions to make it highlight  <a href="http://www.scala-lang.org/">Scala</a> code. Remember to drop the <code>scala.def</code> file on <code>$FISHEYE_HOME/syntax</code> directory add an entry on the filename.map to make it highlight anything with extension <code>.scala</code>.</p>


<p>Related posts:<ol><li><a href='http://www.xavierllora.net/2008/04/18/meandre-semantic-driven-data-intensive-flow-engine/' rel='bookmark' title='Permanent Link: Meandre: Semantic-Driven Data-Intensive Flow Engine'>Meandre: Semantic-Driven Data-Intensive Flow Engine</a></li>
<li><a href='http://www.xavierllora.net/2010/01/21/fast-rest-api-prototyping-with-crochet-and-scala/' rel='bookmark' title='Permanent Link: Fast REST API prototyping with Crochet and Scala'>Fast REST API prototyping with Crochet and Scala</a></li>
<li><a href='http://www.xavierllora.net/2008/12/02/meandre-infrastructure-14-rc1-tagged/' rel='bookmark' title='Permanent Link: Meandre Infrastructure 1.4 RC1 tagged'>Meandre Infrastructure 1.4 RC1 tagged</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.xavierllora.net/2009/12/01/meandre-is-going-scala/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Scaling Genetic Algorithms using MapReduce</title>
		<link>http://www.xavierllora.net/2009/10/09/scaling-genetic-algorithms-using-mapreduce/</link>
		<comments>http://www.xavierllora.net/2009/10/09/scaling-genetic-algorithms-using-mapreduce/#comments</comments>
		<pubDate>Fri, 09 Oct 2009 15:51:19 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Data-Intensive Computing]]></category>
		<category><![CDATA[Estimation of Distribution Algorithms]]></category>
		<category><![CDATA[Publications]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Technical Reports]]></category>
		<category><![CDATA[genetic algorithms]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[map-reduce]]></category>

		<guid isPermaLink="false">http://www.xavierllora.net/?p=634</guid>
		<description><![CDATA[Below you may find the abstract to and the link to the technical report of the paper entitled &#8220;Scaling Genetic Algorithms using MapReduce&#8221; that will be presented at the Ninth International Conference on Intelligent Systems Design and Applications (ISDA) 2009 by Verma, A., Llorà, X., Campbell, R.H., Goldberg, D.E. next month. 
Abstract:Genetic algorithms(GAs) are increasingly [...]


Related posts:<ol><li><a href='http://www.xavierllora.net/2009/07/13/data-intensive-computing-for-competent-genetic-algorithms-a-pilot-study-using-meandre-2/' rel='bookmark' title='Permanent Link: Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study using Meandre'>Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study using Meandre</a></li>
<li><a href='http://www.xavierllora.net/2009/01/29/data-intensive-computing-for-competent-genetic-algorithms-a-pilot-study-using-meandre/' rel='bookmark' title='Permanent Link: Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study using  Meandre'>Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study using  Meandre</a></li>
<li><a href='http://www.xavierllora.net/2007/01/18/toward-routine-billion-variable-optimization-using-genetic-algorithms/' rel='bookmark' title='Permanent Link: Toward routine billion-variable optimization using genetic algorithms'>Toward routine billion-variable optimization using genetic algorithms</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Below you may find the abstract to and the link to the technical report of the paper entitled <em>&#8220;Scaling Genetic Algorithms using MapReduce&#8221;</em> that will be presented at the <a href="">Ninth International Conference on Intelligent Systems Design and Applications (ISDA) 2009</a> by Verma, A., Llorà, X., Campbell, R.H., Goldberg, D.E. next month. </p>
<p><strong>Abstract:</strong>Genetic algorithms(GAs) are increasingly being applied to large scale problems. The traditional MPI-based parallel GAs do not scale very well. MapReduce is a powerful abstraction developed by Google for making scalable and fault tolerant applications. In this paper, we mould genetic algorithms into the the MapReduce model. We describe the algorithm design and implementation of GAs on Hadoop, the open source implementation of MapReduce. Our experiments demonstrate the convergence and scalability upto 105 variable problems. Adding more resources would enable us to solve even larger problems without any changes in the algorithms and implementation.</p>
<p>The draft of the paper can be downloaded as <a href="http://www.illigal.uiuc.edu/pub/papers/IlliGALs/2009007.pdf">IlliGAL TR. No. 2009007</a>. For more information see the <a href="http://www.illigal.uiuc.edu/web/technical-reports/2009/10/09/scaling-genetic-algorithms-using-mapreduce/">IlliGAL technical reports web site</a>.</p>


<p>Related posts:<ol><li><a href='http://www.xavierllora.net/2009/07/13/data-intensive-computing-for-competent-genetic-algorithms-a-pilot-study-using-meandre-2/' rel='bookmark' title='Permanent Link: Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study using Meandre'>Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study using Meandre</a></li>
<li><a href='http://www.xavierllora.net/2009/01/29/data-intensive-computing-for-competent-genetic-algorithms-a-pilot-study-using-meandre/' rel='bookmark' title='Permanent Link: Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study using  Meandre'>Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study using  Meandre</a></li>
<li><a href='http://www.xavierllora.net/2007/01/18/toward-routine-billion-variable-optimization-using-genetic-algorithms/' rel='bookmark' title='Permanent Link: Toward routine billion-variable optimization using genetic algorithms'>Toward routine billion-variable optimization using genetic algorithms</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.xavierllora.net/2009/10/09/scaling-genetic-algorithms-using-mapreduce/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Temporary storage for Meandre&#8217;s distributed flow execution</title>
		<link>http://www.xavierllora.net/2009/09/29/temporary-storage-for-meandres-distribute-flow-execution/</link>
		<comments>http://www.xavierllora.net/2009/09/29/temporary-storage-for-meandres-distribute-flow-execution/#comments</comments>
		<pubDate>Tue, 29 Sep 2009 15:14:28 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
				<category><![CDATA[Notes]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Data-Intensive Computing]]></category>
		<category><![CDATA[data-intensive flows]]></category>
		<category><![CDATA[java]]></category>
		<category><![CDATA[meandre]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[storage]]></category>
		<category><![CDATA[tokyo cabinet]]></category>

		<guid isPermaLink="false">http://www.xavierllora.net/?p=615</guid>
		<description><![CDATA[Designing the distributed execution of a generic Meandre flow involves several moving pieces. One of those is the temporary storage required by the computing nodes (think of it as one node as one isolated component of a flow) to keep up with the data generated by a component, and also be able to replicate such [...]


Related posts:<ol><li><a href='http://www.xavierllora.net/2009/08/13/easy-reliable-and-flexible-storage-for-python/' rel='bookmark' title='Permanent Link: Easy, reliable, and flexible storage for Python'>Easy, reliable, and flexible storage for Python</a></li>
<li><a href='http://www.xavierllora.net/2008/05/22/zookeeper-and-orchestrating-distributed-applications/' rel='bookmark' title='Permanent Link: ZooKeeper and distributed applications'>ZooKeeper and distributed applications</a></li>
<li><a href='http://www.xavierllora.net/2008/04/18/meandre-semantic-driven-data-intensive-flow-engine/' rel='bookmark' title='Permanent Link: Meandre: Semantic-Driven Data-Intensive Flow Engine'>Meandre: Semantic-Driven Data-Intensive Flow Engine</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Designing the distributed execution of a generic Meandre flow involves several moving pieces. One of those is the temporary storage required by the computing nodes (think of it as one node as one isolated component of a flow) to keep up with the data generated by a component, and also be able to replicate such storage to the node containing the consumer to be fed. Such storage, local to each node, must guarantee at least three basic properties.</p>
<ul>
<li>Transaction ready</li>
<li>Light weight implementation</li>
<li>Efficient write and read to minimize the contention on ports</li>
</ul>
<p>Also, it is important to keep in mind that in a distributed execution scenario, each node requires to have its one separated and standalone storage system. Thus, it is also important to minimize the overhead of installation and maintenance of such storage subsystem. There are several alternatives available ranging from traditional relational data base systems to home-brewed solutions. Relational data base systems provide a distributed, reliable, stable, and well tested environment, but they may tend to require a quite involved installation and maintenance. Also, tuning those systems to optimize performance may required quite an involved monitoring and tweaking. On the other hand, home-brewed solutions can be optimized for performance by dropping non required functionality and focussing on writing and reading performance. However, such solutions tend to be bug prone and tend to become time consuming, not to mention that proving transaction correctness can be quite involved.</p>
<p>Fortunately there is a middle ground where efficient and stable transaction aware solutions are available. They may not provide SQL interfaces, but they still provide transaction boundaries. Also, since they are oriented to maximize performance, they can provide better throughput and operation latency than having to traverse the SQL stack. Examples of such storage systems can be found under the areas of key-value stores and column stores. Several options were considered while writing these line, but key-value stores were the ones that better matches the three requirements described above. Several options were informally tested, including solutions like HDF and Berkely DB, however the best performing by far under similar stress test conditions as the sketched temporary storage subsystem was <a href="http://1978th.net/tokyocabinet/">Tokyo Cabinet</a>. I already <a href="/2008/06/05/the-next-generation-of-data-bases/">introduced and <a href="/2009/08/13/easy-reliable-and-flexible-storage-for-python/">tested</a> <a href="http://1978th.net/tokyocabinet/">Tokyo Cabinet</a> more than a year ago, but this time I was going to give it a stress test to basically convince myself that that was what I wanted to use for as temporary storage of the distributed flow execution.</p>
<h2>The experiment</h2>
<p>Tokyo cabinet is a collection of storage utilities including, among other facilities, key-value stores implemented as hash files or B-trees and flexible column stores. To illustrate the performance and throughput you can achieve. To implement multiple queues on a single casket (<a href="http://1978th.net/tokyocabinet/">Tokyo Cabinet</a> file containing the data store) B-trees with duplicated keys can help achieving such goal. The duplicated keys are the queue names, and the values are the <a href="http://en.wikipedia.org/wiki/Universally_Unique_Identifier">UUID</a>s of the objects being store. Objects are also stored in the same B-tree by using the <a href="http://en.wikipedia.org/wiki/Universally_Unique_Identifier">UIUD</a> as a key and the value become the payload to store (usually an array of bytes). </p>
<p>Previously, I have been heavily using Python bindings to test <a href="http://1978th.net/tokyocabinet/">Tokyo Cabinet</a>, but this time I went down the Java route (since the Meandre infrastructure is written on Java). The Java bindings are basically build around JNI and statically link to the C version of <a href="http://1978th.net/tokyocabinet/">Tokyo Cabinet</a> library, giving away the best of both world. To measure how fast can I write data out of a port into the local storage in a transactional mode, I used the following piece of code.</p>

<div class="wp_syntax"><div class="code"><pre class="java" style="font-family:monospace;">	<span style="color: #000000; font-weight: bold;">public</span> <span style="color: #000000; font-weight: bold;">static</span> <span style="color: #000066; font-weight: bold;">void</span> main <span style="color: #009900;">&#40;</span> <span style="color: #003399;">String</span> args <span style="color: #009900;">&#91;</span><span style="color: #009900;">&#93;</span> <span style="color: #009900;">&#41;</span> <span style="color: #009900;">&#123;</span>
		<span style="color: #000066; font-weight: bold;">int</span> MAX <span style="color: #339933;">=</span> <span style="color: #cc66cc;">10000000</span><span style="color: #339933;">;</span>
		<span style="color: #000066; font-weight: bold;">int</span> inc <span style="color: #339933;">=</span> <span style="color: #cc66cc;">10</span><span style="color: #339933;">;</span>
		<span style="color: #000066; font-weight: bold;">int</span> cnt <span style="color: #339933;">=</span> <span style="color: #cc66cc;">0</span><span style="color: #339933;">;</span>
		<span style="color: #000066; font-weight: bold;">float</span> fa <span style="color: #009900;">&#91;</span><span style="color: #009900;">&#93;</span> <span style="color: #339933;">=</span> <span style="color: #000000; font-weight: bold;">new</span> <span style="color: #000066; font-weight: bold;">float</span><span style="color: #009900;">&#91;</span><span style="color: #cc66cc;">8</span><span style="color: #009900;">&#93;</span><span style="color: #339933;">;</span>
		<span style="color: #000066; font-weight: bold;">int</span> reps <span style="color: #339933;">=</span> <span style="color: #cc66cc;">10</span><span style="color: #339933;">;</span>
&nbsp;
		<span style="color: #000000; font-weight: bold;">for</span> <span style="color: #009900;">&#40;</span> <span style="color: #000066; font-weight: bold;">int</span> i<span style="color: #339933;">=</span><span style="color: #cc66cc;">1</span> <span style="color: #339933;">;</span> i<span style="color: #339933;">&lt;=</span>MAX <span style="color: #339933;">;</span> i<span style="color: #339933;">*=</span>inc  <span style="color: #009900;">&#41;</span> <span style="color: #009900;">&#123;</span>
			<span style="color: #666666; font-style: italic;">//System.out.println(&quot;Size: &quot;+i);</span>
			<span style="color: #000000; font-weight: bold;">for</span> <span style="color: #009900;">&#40;</span> <span style="color: #000066; font-weight: bold;">int</span> j<span style="color: #339933;">=</span><span style="color: #cc66cc;">0</span> <span style="color: #339933;">;</span> j<span style="color: #339933;">&lt;</span>reps <span style="color: #339933;">;</span> j<span style="color: #339933;">++</span> <span style="color: #009900;">&#41;</span> <span style="color: #009900;">&#123;</span>	
				<span style="color: #666666; font-style: italic;">//System.out.println(&quot;\tRepetition: &quot;+j);</span>
&nbsp;
				<span style="color: #666666; font-style: italic;">// open the database</span>
				BDB bdb <span style="color: #339933;">=</span> <span style="color: #000000; font-weight: bold;">new</span> BDB<span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
&nbsp;
				<span style="color: #000000; font-weight: bold;">if</span><span style="color: #009900;">&#40;</span><span style="color: #339933;">!</span>bdb.<span style="color: #006633;">open</span><span style="color: #009900;">&#40;</span>TEST_CASKET_TCB, BDB.<span style="color: #006633;">OWRITER</span> <span style="color: #339933;">|</span> BDB.<span style="color: #006633;">OCREAT</span> <span style="color: #339933;">|</span> BDB.<span style="color: #006633;">OTSYNC</span> <span style="color: #009900;">&#41;</span><span style="color: #009900;">&#41;</span><span style="color: #009900;">&#123;</span>
					<span style="color: #000066; font-weight: bold;">int</span> ecode <span style="color: #339933;">=</span> bdb.<span style="color: #006633;">ecode</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
					fail<span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;open error: &quot;</span> <span style="color: #339933;">+</span> bdb.<span style="color: #006633;">errmsg</span><span style="color: #009900;">&#40;</span>ecode<span style="color: #009900;">&#41;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
				<span style="color: #009900;">&#125;</span>
&nbsp;
				<span style="color: #666666; font-style: italic;">// Add a bunch of duplicates</span>
				<span style="color: #000066; font-weight: bold;">long</span> start <span style="color: #339933;">=</span> <span style="color: #003399;">System</span>.<span style="color: #006633;">currentTimeMillis</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
				bdb.<span style="color: #006633;">tranbegin</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
				<span style="color: #000000; font-weight: bold;">for</span> <span style="color: #009900;">&#40;</span> <span style="color: #000066; font-weight: bold;">int</span> k<span style="color: #339933;">=</span><span style="color: #cc66cc;">0</span><span style="color: #339933;">;</span> k<span style="color: #339933;">&lt;</span>i<span style="color: #339933;">;</span> k<span style="color: #339933;">++</span> <span style="color: #009900;">&#41;</span> <span style="color: #009900;">&#123;</span>
					<span style="color: #003399;">String</span> uuid <span style="color: #339933;">=</span> UUID.<span style="color: #006633;">randomUUID</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span>.<span style="color: #006633;">toString</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
					bdb.<span style="color: #006633;">putdup</span><span style="color: #009900;">&#40;</span>QUEUE_KEY, uuid<span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
					bdb.<span style="color: #006633;">putdup</span><span style="color: #009900;">&#40;</span>uuid.<span style="color: #006633;">getBytes</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span>, uuid.<span style="color: #006633;">getBytes</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>	
				<span style="color: #009900;">&#125;</span>
				bdb.<span style="color: #006633;">trancommit</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
				fa<span style="color: #009900;">&#91;</span>cnt<span style="color: #009900;">&#93;</span> <span style="color: #339933;">+=</span> <span style="color: #003399;">System</span>.<span style="color: #006633;">currentTimeMillis</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">-</span>start<span style="color: #339933;">;</span>
&nbsp;
				<span style="color: #666666; font-style: italic;">// Clean up</span>
				bdb.<span style="color: #006633;">close</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
				<span style="color: #000000; font-weight: bold;">new</span> <span style="color: #003399;">File</span><span style="color: #009900;">&#40;</span>TEST_CASKET_TCB<span style="color: #009900;">&#41;</span>.<span style="color: #006633;">delete</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
			<span style="color: #009900;">&#125;</span>
			fa<span style="color: #009900;">&#91;</span>cnt<span style="color: #009900;">&#93;</span> <span style="color: #339933;">/=</span> reps<span style="color: #339933;">;</span>
			<span style="color: #003399;">System</span>.<span style="color: #006633;">out</span>.<span style="color: #006633;">println</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;&quot;</span><span style="color: #339933;">+</span>i<span style="color: #339933;">+</span><span style="color: #0000ff;">&quot;<span style="color: #000099; font-weight: bold;">\t</span>&quot;</span><span style="color: #339933;">+</span>fa<span style="color: #009900;">&#91;</span>cnt<span style="color: #009900;">&#93;</span><span style="color: #339933;">+</span><span style="color: #0000ff;">&quot;<span style="color: #000099; font-weight: bold;">\t</span>&quot;</span><span style="color: #339933;">+</span><span style="color: #009900;">&#40;</span>fa<span style="color: #009900;">&#91;</span>cnt<span style="color: #009900;">&#93;</span><span style="color: #339933;">/</span>i<span style="color: #009900;">&#41;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
			cnt<span style="color: #339933;">++;</span>
		<span style="color: #009900;">&#125;</span>
	<span style="color: #009900;">&#125;</span></pre></div></div>

<p>The idea is very simple. Just go and star storing 1, 10, 100, 1000, 10000, 1000000, and 10000000 pieces of data at once in a transaction. Measure the time. For each data number repeat the operation 10 times and average the time trying to palliate the fact that the experiment was run on a laptop running all sorts of other concurrent applications. Plot the results to illustrate:</p>
<ol>
<li>time required to insert one piece of data as a function of the number of data involve in the transaction</li>
<li>number of pieces of data wrote per second as a function of the number of data involve in the transaction</li>
</ol>
<p>The idea is to expose the behavior of <a href="http://1978th.net/tokyocabinet/">Tokyo Cabinet</a> as more data is involved in a transaction to check if degradation happens as the volume increase. This is an important issue, since data intensive flows can generate large volumes of data per firing event. </p>
<h2>The results</h2>
<p>Results are displayed on the figures below.</p>
<p><a href="http://www.xavierllora.net/wp-content/uploads/2009/09/tc_time.png"><img src="http://www.xavierllora.net/wp-content/uploads/2009/09/tc_time-400x400.png" alt="Time per data unit as a function of number of data involve in a transaction" title="Time per data unit as a function of number of data involve in a transaction" width="300" height="300" /></a><a href="http://www.xavierllora.net/wp-content/uploads/2009/09/tc_throughput.png"><img src="http://www.xavierllora.net/wp-content/uploads/2009/09/tc_throughput-400x400.png" alt="Throughput as a function of number of data in a transaction" title="Throughput as a function of number of data in a transaction" width="300" height="300" /></a></p>
<p>The first important element to highlight is that the time to insert one data element does not degrade as the volume increase. Actually, it is quite interesting that <a href="http://1978th.net/tokyocabinet/">Tokyo Cabinet</a> feels more comfortable as the volume per transaction grows. The throughput results are also interesting, since it shows that it is able to sustain transfers of around 40K data units per second, and that the only bottleneck is the disk cache management and bandwidth to the disk itself&#8212;which gets saturated after pushing more than 10K pieces of data.</p>
<h2>The lessons learned</h2>
<p><a href="http://1978th.net/tokyocabinet/">Tokyo Cabinet</a> is a excellent candidate to support the temporary transactional storage required in a distributed execution of a Meandre flow. Other alternatives like <a href="http://www.mysql.com/">MySQL</a>, embedded <a href="http://db.apache.org/derby/">Apache Derby</a>, the <a href="http://www.oracle.com/database/berkeley-db/je/index.html">Java edition of Berkeley DB</a>, <a href="http://www.zentus.com/sqlitejdbc/">SQLite JDBC</a> could not get even get close to such performance falling at least one order of magnitude behind.</p>


<p>Related posts:<ol><li><a href='http://www.xavierllora.net/2009/08/13/easy-reliable-and-flexible-storage-for-python/' rel='bookmark' title='Permanent Link: Easy, reliable, and flexible storage for Python'>Easy, reliable, and flexible storage for Python</a></li>
<li><a href='http://www.xavierllora.net/2008/05/22/zookeeper-and-orchestrating-distributed-applications/' rel='bookmark' title='Permanent Link: ZooKeeper and distributed applications'>ZooKeeper and distributed applications</a></li>
<li><a href='http://www.xavierllora.net/2008/04/18/meandre-semantic-driven-data-intensive-flow-engine/' rel='bookmark' title='Permanent Link: Meandre: Semantic-Driven Data-Intensive Flow Engine'>Meandre: Semantic-Driven Data-Intensive Flow Engine</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.xavierllora.net/2009/09/29/temporary-storage-for-meandres-distribute-flow-execution/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Liquid: RDF endpoint for FluidDB</title>
		<link>http://www.xavierllora.net/2009/09/24/liquid-rdf-endpoint-for-fluiddb/</link>
		<comments>http://www.xavierllora.net/2009/09/24/liquid-rdf-endpoint-for-fluiddb/#comments</comments>
		<pubDate>Thu, 24 Sep 2009 20:45:19 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
				<category><![CDATA[Notes]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[Data-Intensive Computing]]></category>
		<category><![CDATA[FluidDB]]></category>
		<category><![CDATA[meandre]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[sparql]]></category>
		<category><![CDATA[storage]]></category>

		<guid isPermaLink="false">http://www.xavierllora.net/?p=593</guid>
		<description><![CDATA[A while ago I wrote some thoughts about how to map RDF to and from FluidDB. There I explored how you could map RDF onto FluidDB, and how to get it back. That got me thinking about how to get a simple endpoint you could query for RDF. Imagine that you could pull FluidDB data [...]


Related posts:<ol><li><a href='http://www.xavierllora.net/2009/08/25/liquid-rdf-meandering-in-fluiddb/' rel='bookmark' title='Permanent Link: Liquid: RDF meandering in FluidDB'>Liquid: RDF meandering in FluidDB</a></li>
<li><a href='http://www.xavierllora.net/2009/09/29/temporary-storage-for-meandres-distribute-flow-execution/' rel='bookmark' title='Permanent Link: Temporary storage for Meandre&#8217;s distributed flow execution'>Temporary storage for Meandre&#8217;s distributed flow execution</a></li>
<li><a href='http://www.xavierllora.net/2009/02/04/efficient-serialization-for-java-and-beyond/' rel='bookmark' title='Permanent Link: Efficient serialization for Java (and beyond)'>Efficient serialization for Java (and beyond)</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>A while ago I wrote some thoughts about how to map RDF to and from <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a>. There I explored how you could map RDF onto FluidDB, and how to get it back. That got me thinking about how to get a simple endpoint you could query for RDF. Imagine that you could pull <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> data in RDF, then I could just get all the flexibility of <a href="http://www.w3.org/TR/rdf-sparql-query/">SPARQL</a> for free. With this idea in my mind I just went and grabbed <a href="http://seasr.org/meandre">Meandre</a>, the <a href="http://github.com/rossjones/JFluidDB">JFLuidDB</a> library started by <a href="http://github.com/rossjones">Ross Jones</a>, and build a few components.</p>
<p>The main goal was to be able to get an object, list of the tags, and express the result in RDF. <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> helps the mapping since objects are uniquely identified by URIs. For instance, the unique object <code>5ff74371-455b-4299-83f9-ba13ae898ad1</code> (FluidDB relies on UUID version four with the form <code>xxxxxxxx-xxxx-4xxx-yxxx-xxxxxxxxxxxx</code>) is uniquely identified by <code>http://sandbox.fluidinfo.com/objects/5ff74371-455b-4299-83f9-ba13ae898ad1</code> (or a url of the form <code>http://sandbox.fluidinfo.com/objects/xxxxxxxx-xxxx-4xxx-yxxx-xxxxxxxxxxxx</code>), in case you are using the sandbox or <code>http://fluiddb.fluidinfo.com/objects/5ff74371-455b-4299-83f9-ba13ae898ad1</code> if you are using the main instance. Same story for tags. The tag <code>fluiddb/about</code> can be uniquely identified by the URI <code>http://sandbox.fluidinfo.com/tags/fluiddb/about</code>, or <code>http://fluiddb.fluidinfo.com/tags/fluiddb/about</code>.</p>
<h2>A simple RDF description for and object<br />
<h2>
<p>Once you get the object back the basic translated RDF version for object <code>a10ab0f3-ef56-4fc0-a8fa-4d452d8ab1db</code> should look like as the listing below in TURTLE notation.</p>

<div class="wp_syntax"><div class="code"><pre class="xml" style="font-family:monospace;"><span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//sandbox.fluidinfo.com/objects/a10ab0f3-ef56-4fc0-a8fa-4d452d8ab1db<span style="color: #000000; font-weight: bold;">&gt;</span></span>
      <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/1999/02/22-rdf-syntax-ns#type<span style="color: #000000; font-weight: bold;">&gt;</span></span>
              <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//sandbox.fluidinfo.com/objects<span style="color: #000000; font-weight: bold;">/&gt;</span></span> , <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/1999/02/22-rdf-syntax-ns#Bag<span style="color: #000000; font-weight: bold;">&gt;</span></span> ;
      <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/1999/02/22-rdf-syntax-ns#_1<span style="color: #000000; font-weight: bold;">&gt;</span></span>
              <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//sandbox.fluidinfo.com/tags/fluiddb/about<span style="color: #000000; font-weight: bold;">&gt;</span></span> ;
      <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/1999/02/22-rdf-syntax-ns#_2<span style="color: #000000; font-weight: bold;">&gt;</span></span>
              <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//sandbox.fluidinfo.com/tags/fluiddb/tags/path<span style="color: #000000; font-weight: bold;">&gt;</span></span> ;
      <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/1999/02/22-rdf-syntax-ns#_3<span style="color: #000000; font-weight: bold;">&gt;</span></span>
              <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//sandbox.fluidinfo.com/tags/fluiddb/tags/description<span style="color: #000000; font-weight: bold;">&gt;</span></span> ;
      <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//purl.org/dc/elements/1.1/description<span style="color: #000000; font-weight: bold;">&gt;</span></span>
              &quot;Object for the attribute fluiddb/default/tags/permission/update/policy&quot;^^<span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/2001/XMLSchema#string<span style="color: #000000; font-weight: bold;">&gt;</span></span> .</pre></div></div>

<p>I will break the above example into small chunks and explain the above example into the three main pieces involved (the id, the about, and the tags). The basic construct is simple. First a triple to mark the object as a <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> object.</p>

<div class="wp_syntax"><div class="code"><pre class="xml" style="font-family:monospace;"><span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//sandbox.fluidinfo.com/objects/a10ab0f3-ef56-4fc0-a8fa-4d452d8ab1db<span style="color: #000000; font-weight: bold;">&gt;</span></span>
      <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/1999/02/22-rdf-syntax-ns#type<span style="color: #000000; font-weight: bold;">&gt;</span></span>
              <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//sandbox.fluidinfo.com/objects<span style="color: #000000; font-weight: bold;">/&gt;</span></span>   
.</pre></div></div>

<p>Then if the object has an <code>about</code> associated on creation, another triple gets generated and added, as shown below. To be consistent, I suggest reusing DC description since that is what the <code>about</code> for an object tend to indicate.</p>

<div class="wp_syntax"><div class="code"><pre class="xml" style="font-family:monospace;"><span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//sandbox.fluidinfo.com/objects/a10ab0f3-ef56-4fc0-a8fa-4d452d8ab1db<span style="color: #000000; font-weight: bold;">&gt;</span></span>
      <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//purl.org/dc/elements/1.1/description<span style="color: #000000; font-weight: bold;">&gt;</span></span>
              &quot;Object for the attribute fluiddb/default/tags/permission/update/policy&quot;^^<span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/2001/XMLSchema#string<span style="color: #000000; font-weight: bold;">&gt;</span></span> 
.</pre></div></div>

<p>Finally, if there are tags associated to the object, a bag gets created, and all the URI describing the tags get pushed into the bag as shown below.</p>

<div class="wp_syntax"><div class="code"><pre class="xml" style="font-family:monospace;"><span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//sandbox.fluidinfo.com/objects/a10ab0f3-ef56-4fc0-a8fa-4d452d8ab1db<span style="color: #000000; font-weight: bold;">&gt;</span></span>
      <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/1999/02/22-rdf-syntax-ns#type<span style="color: #000000; font-weight: bold;">&gt;</span></span>
              <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/1999/02/22-rdf-syntax-ns#Bag<span style="color: #000000; font-weight: bold;">&gt;</span></span> ;
      <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/1999/02/22-rdf-syntax-ns#_1<span style="color: #000000; font-weight: bold;">&gt;</span></span>
              <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//sandbox.fluidinfo.com/tags/fluiddb/about<span style="color: #000000; font-weight: bold;">&gt;</span></span> ;
      <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/1999/02/22-rdf-syntax-ns#_2<span style="color: #000000; font-weight: bold;">&gt;</span></span>
              <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//sandbox.fluidinfo.com/tags/fluiddb/tags/path<span style="color: #000000; font-weight: bold;">&gt;</span></span> ;
      <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//www.w3.org/1999/02/22-rdf-syntax-ns#_3<span style="color: #000000; font-weight: bold;">&gt;</span></span>
              <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;http:</span>//sandbox.fluidinfo.com/tags/fluiddb/tags/description<span style="color: #000000; font-weight: bold;">&gt;</span></span>
.</pre></div></div>

<h2>Creating and RDF endpoint</h2>
<p>Armed with the previous, the thing should be easy. Just allow querying for objects, then collect the object information, and finally generate the final RDF. Using <a href="http://seasr.org/meandre">Meandre</a> and <a href="http://github.com/rossjones/JFluidDB">JFLuidDB</a> I wrote a few components that allow the simple creation of such an endpoint as illustrated by the picture below.</p>
<p><a href="http://www.xavierllora.net/wp-content/uploads/2009/09/meandre-fluiddb-rdf.png"><img src="http://www.xavierllora.net/wp-content/uploads/2009/09/meandre-fluiddb-rdf.png" alt="Meandre FluidDB RDF endpoint" title="Meandre FluidDB RDF endpoint" width="500" height="204" class="aligncenter size-full wp-image-602" /></a></p>
<p>The basic mechanism is simple. Just push the query into the <em>Query for objects</em> component. This component will stream each of the <code>uuid</code> of the matched objects to <em>Read object</em> which pulls the object information. Then the object is passed to <em>Object to RDF model</em> that basically generates the RDF snipped shown in the example shown above for each of the objects pushed. Finally all the RDF fragments are reduced together by component <em>Wrapped models reducer</em>. Then the resulting RDF model just gets serialize into text using the Turtle notation. Finally the serialized text is printed to the console. The equivalent code could be express as a <a href="http://seasr.org/meandre/documentation/for-developers/zigzag/">ZigZag</a> script as:</p>
<pre>
#
# Imports eliminated for clarity
#

#
# Create the component aliases
#
alias <meandre://fluidinfo.com/fluiddb/meandre/component/object-to-rdf> as OBJECT_TO_RDF
alias <meandre://fluidinfo.com/fluiddb/meandre/component/print-object> as PRINT_OBJECT
alias <meandre://fluidinfo.com/fluiddb/meandre/component/query-for-objects> as QUERY_FOR_OBJECTS
alias <meandre://fluidinfo.com/fluiddb/meandre/component/reads-the-requested-object> as READS_THE_REQUESTED_OBJECT
alias <meandre://seasr.org/components/tools/wrapped-models-reducer> as WRAPPED_MODELS_REDUCER
alias <meandre://seasr.org/components/tools/model-to-rdf-text> as MODEL_TO_RDF_TEXT
alias <meandre://fluidinfo.com/fluiddb/meandre/component/push-string> as PUSH_STRING

#
# Create the component instances
#
push_query_string = PUSH_STRING()
wrapped_models_reducer = WRAPPED_MODELS_REDUCER()
query_for_objects = QUERY_FOR_OBJECTS()
reads_object = READS_THE_REQUESTED_OBJECT()
model_to_rdf_text = MODEL_TO_RDF_TEXT()
print_rdf_text = PRINT_OBJECT()
object_to_rdf_model = OBJECT_TO_RDF()

#
# Set component properties
#
push_query_string.message = "has fluiddb/tag/path"
query_for_objects.fluiddb_url = "http://sandbox.fluidinfo.com"
eads_object.fluiddb_url = "http://sandbox.fluidinfo.com"
model_to_rdf_text.rdf_dialect = "TTL"

#
# Create the flow by connecting the components
#
@query_for_objects_outputs = query_for_objects()
@model_to_rdf_text_outputs = model_to_rdf_text()
@push_query_string_outputs = push_query_string()
@object_to_rdf_model_outputs = object_to_rdf_model()
@reads_object_outputs = reads_object()
@wrapped_models_reducer_outputs = wrapped_models_reducer()

query_for_objects(text: push_query_string_outputs.text)
model_to_rdf_text(model: wrapped_models_reducer_outputs.model)
object_to_rdf_model(object: reads_object_outputs.object)
reads_object(uuid: query_for_objects_outputs.uuid)[+200!]
print_rdf_text(object: model_to_rdf_text_outputs.text)
wrapped_models_reducer(model: object_to_rdf_model_outputs.model)
</pre>
<p>The only interesting element in the script is the [+200!] entry that creates 200 parallel copies of read object that will concurrently hit FluidDB to pull the data, trying to minimize the latency. The script could be compiled into a MAU and run. The output of the execution would look like the following:</p>

<div class="wp_syntax"><div class="code"><pre class="java" style="font-family:monospace;">$ java <span style="color: #339933;">-</span>jar zzre<span style="color: #339933;">-</span>1.4.7.<span style="color: #006633;">jar</span> pull<span style="color: #339933;">-</span>test.<span style="color: #006633;">mau</span> 
Meandre MAU Executor <span style="color: #009900;">&#91;</span>1.0.1vcli<span style="color: #339933;">/</span>1.4.7<span style="color: #009900;">&#93;</span>
All rights reserved by DITA, NCSA, UofI <span style="color: #009900;">&#40;</span><span style="color: #cc66cc;">2007</span><span style="color: #339933;">-</span><span style="color: #cc66cc;">2009</span><span style="color: #009900;">&#41;</span>
<span style="color: #000000; font-weight: bold;">THIS</span> SOFTWARE IS PROVIDED UNDER University of Illinois<span style="color: #339933;">/</span>NCSA OPEN SOURCE LICENSE.
&nbsp;
<span style="color: #006633;">Executing</span> MAU file pull<span style="color: #339933;">-</span>test.<span style="color: #006633;">mau</span>
Creating temp dir pull<span style="color: #339933;">-</span>test.<span style="color: #006633;">mau</span>.<span style="color: #006633;">run</span>
Creating temp dir pull<span style="color: #339933;">-</span>test.<span style="color: #006633;">mau</span>.<span style="color: #006633;">public_resources</span>
&nbsp;
Preparing flow<span style="color: #339933;">:</span> meandre<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//seasr.org/zigzag/1253813636945/4416962494019783033/flow/pull-test-mau/</span>
<span style="color: #cc66cc;">2009</span><span style="color: #339933;">-</span>09<span style="color: #339933;">-</span><span style="color: #cc66cc;">24</span> <span style="color: #cc66cc;">12</span><span style="color: #339933;">:</span><span style="color: #cc66cc;">34</span><span style="color: #339933;">:</span><span style="color: #cc66cc;">38.480</span><span style="color: #339933;">::</span>INFO<span style="color: #339933;">:</span>  jetty<span style="color: #339933;">-</span>6.1.<span style="color: #006633;">x</span>
<span style="color: #cc66cc;">2009</span><span style="color: #339933;">-</span>09<span style="color: #339933;">-</span><span style="color: #cc66cc;">24</span> <span style="color: #cc66cc;">12</span><span style="color: #339933;">:</span><span style="color: #cc66cc;">34</span><span style="color: #339933;">:</span><span style="color: #cc66cc;">38.495</span><span style="color: #339933;">::</span>INFO<span style="color: #339933;">:</span>  Started SocketConnector@0.0.0.0<span style="color: #339933;">:</span><span style="color: #cc66cc;">1715</span>
Preparation completed correctly
&nbsp;
Execution started at<span style="color: #339933;">:</span> <span style="color: #cc66cc;">2009</span><span style="color: #339933;">-</span>09<span style="color: #339933;">-</span>24T12<span style="color: #339933;">:</span><span style="color: #cc66cc;">34</span><span style="color: #339933;">:</span><span style="color: #cc66cc;">38</span>
<span style="color: #339933;">----------------------------------------------------------------------------</span>
<span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/a24b4a18-5483-47c6-9b62-0955210c7ebd&gt;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#type&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/&gt; , &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#Bag&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_1&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/about&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_2&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/path&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_3&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/description&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//purl.org/dc/elements/1.1/description&gt;</span>
              <span style="color: #0000ff;">&quot;Object for the attribute test/Net::FluidDB-name-1253772095.82845-0.944567286499904&quot;</span><span style="color: #339933;">^^&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/2001/XMLSchema#string&gt; .</span>
&nbsp;
<span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/5ff74371-455b-4299-83f9-ba13ae898ad1&gt;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#type&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/&gt; , &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#Bag&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_1&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/about&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_2&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/path&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_3&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/description&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//purl.org/dc/elements/1.1/description&gt;</span>
              <span style="color: #0000ff;">&quot;Object for the attribute test/Net::FluidDB-name-1253622685.3231461-0.437099602163897316&quot;</span><span style="color: #339933;">^^&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/2001/XMLSchema#string&gt; .</span>
&nbsp;
<span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/67e52346-527e-4bb7-b8f3-05fa8a8ae35b&gt;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#type&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/&gt; , &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#Bag&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_1&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/about&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_2&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/path&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_3&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/description&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//purl.org/dc/elements/1.1/description&gt;</span>
              <span style="color: #0000ff;">&quot;Object for the attribute test/Net::FluidDB-name-1253620190.69175-0.861614257420541&quot;</span><span style="color: #339933;">^^&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/2001/XMLSchema#string&gt; .</span>
&nbsp;
<span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/8a65a184-03d9-4881-95df-02fa0561a86f&gt;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#type&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/&gt; , &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#Bag&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_1&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/about&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_2&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/path&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_3&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/description&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//purl.org/dc/elements/1.1/description&gt;</span>
              <span style="color: #0000ff;">&quot;Object for the attribute fluiddb/namespaces/permission/update/exceptions&quot;</span><span style="color: #339933;">^^&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/2001/XMLSchema#string&gt; .</span>
&nbsp;
<span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/335b44e9-a72f-479d-ad60-3661a35231ba&gt;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#type&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/&gt; , &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#Bag&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_1&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/about&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_2&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/path&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_3&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/description&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//purl.org/dc/elements/1.1/description&gt;</span>
              <span style="color: #0000ff;">&quot;Object for the attribute test/Net::FluidDB-name-1253776141.95577-0.284175700598524&quot;</span><span style="color: #339933;">^^&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/2001/XMLSchema#string&gt; .</span>
&nbsp;
<span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/3bbf1cc6-731c-4e56-a664-adeb5484334f&gt;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#type&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/&gt; , &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#Bag&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_1&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/about&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_2&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/path&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_3&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/description&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//purl.org/dc/elements/1.1/description&gt;</span>
              <span style="color: #0000ff;">&quot;Object for the attribute fluiddb/namespaces/permission/delete/policy&quot;</span><span style="color: #339933;">^^&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/2001/XMLSchema#string&gt; .</span>
&nbsp;
<span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/aba5adcf-fd44-40ab-b702-9cc635650bc3&gt;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#type&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/&gt; , &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#Bag&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_1&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/about&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_2&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/path&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_3&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/description&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//purl.org/dc/elements/1.1/description&gt;</span>
              <span style="color: #0000ff;">&quot;Object for the attribute test/Net::FluidDB-name-1253614713.757-0.604769721717702&quot;</span><span style="color: #339933;">^^&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/2001/XMLSchema#string&gt; .</span>
&nbsp;
<span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/f61ceb3b-33df-4356-8e7d-c56d3d0ae338&gt;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#type&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/objects/&gt; , &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#Bag&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_1&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/about&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_2&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/path&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/1999/02/22-rdf-syntax-ns#_3&gt;</span>
              <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//sandbox.fluidinfo.com/tags/fluiddb/tags/description&gt; ;</span>
      <span style="color: #339933;">&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//purl.org/dc/elements/1.1/description&gt;</span>
              <span style="color: #0000ff;">&quot;Object for the attribute test/Net::FluidDB-name-1253615887.80879-0.0437609496034099&quot;</span><span style="color: #339933;">^^&lt;</span>http<span style="color: #339933;">:</span><span style="color: #666666; font-style: italic;">//www.w3.org/2001/XMLSchema#string&gt; .</span>
&nbsp;
...</pre></div></div>

<p>That&#8217;s it! A first RDF dump of the query!</p>
<h2>The not so great news</h2>
<p>The current <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> API does not provide any method to be able to pull data from more than one object at once. That basically means, that for each <code>uuid</code> a call to the server needs to be process. That is a huge latency overhead. The <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> guys know about it and they are scratching their heads on how to provide a &#8220;multi get&#8221;. A full trace of the output can be found on this <a href='http://www.xavierllora.net/wp-content/uploads/2009/09/fluiddb.txt'>FluidDB RDF endpoint trace</a>.</p>
<p>This element is crucial for any RDF endpoint. Above I left out a basic element, the time measures. That part looks like:</p>
<pre>
Flow execution statistics

Flow unique execution ID : meandre://seasr.org/zigzag/1253813636945/4416962494019783033/flow/pull-test-mau/8D8E354A/1253813678323/1493255769/
Flow state               : ended
Started at               : Thu Sep 24 12:34:38 CDT 2009
Last update              : Thu Sep 24 12:37:28 CDT 2009
Total run time (ms)      : 170144
</pre>
<p>Basically 170s to pull only 238 objects, where all the time is spent round tripping to <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a>. </p>
<h2>Getting there</h2>
<p>This basically means that such high latency would not allow efficient interactive usage of the end point. However, this exercise was useful to prof that simple RDF endpoints for <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> are possible and would greatly boost the flexibility of interaction with <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> . The current form of the endpoint is may still have value if you are not in a hurry, allowing you to run <a href="http://www.w3.org/TR/rdf-sparql-query/">SPARQL</a> queries against <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> data and get the best of both worlds.</p>
<h2>The code use</h2>
<p>If you are interested on running the code, you may need <a href="http://seasr.org/meandre">Meandre</a> and the components I put together for the experiment, that you can get from <a href="http://github.com/xllora/liquid">http://github.com/xllora/liquid</a>.</p>


<p>Related posts:<ol><li><a href='http://www.xavierllora.net/2009/08/25/liquid-rdf-meandering-in-fluiddb/' rel='bookmark' title='Permanent Link: Liquid: RDF meandering in FluidDB'>Liquid: RDF meandering in FluidDB</a></li>
<li><a href='http://www.xavierllora.net/2009/09/29/temporary-storage-for-meandres-distribute-flow-execution/' rel='bookmark' title='Permanent Link: Temporary storage for Meandre&#8217;s distributed flow execution'>Temporary storage for Meandre&#8217;s distributed flow execution</a></li>
<li><a href='http://www.xavierllora.net/2009/02/04/efficient-serialization-for-java-and-beyond/' rel='bookmark' title='Permanent Link: Efficient serialization for Java (and beyond)'>Efficient serialization for Java (and beyond)</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.xavierllora.net/2009/09/24/liquid-rdf-endpoint-for-fluiddb/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0</title>
		<link>http://www.xavierllora.net/2009/09/18/from-galapagos-to-twitter-darwin-natural-selection-and-web-2-0/</link>
		<comments>http://www.xavierllora.net/2009/09/18/from-galapagos-to-twitter-darwin-natural-selection-and-web-2-0/#comments</comments>
		<pubDate>Fri, 18 Sep 2009 18:14:05 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
				<category><![CDATA[DISCUS]]></category>
		<category><![CDATA[Meandre]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[Charles Darwin]]></category>
		<category><![CDATA[genetic algorithms]]></category>
		<category><![CDATA[human-based genetic algorithms]]></category>
		<category><![CDATA[interactive genetic algorithms]]></category>
		<category><![CDATA[meandre]]></category>

		<guid isPermaLink="false">http://www.xavierllora.net/?p=590</guid>
		<description><![CDATA[Yesterday I was visiting Monmouth College to participate on the Darwinpalooza which commemorates the 200th anniversary of Charles Darwin&#8217;s birth and the 150th anniversary of the publication of On the Origin of Species. After scratching my head about about what to present, I came out with quite a mix. You will find the abstract of [...]


Related posts:<ol><li><a href='http://www.xavierllora.net/2006/12/18/challenging-lectures-on-line-at-ted/' rel='bookmark' title='Permanent Link: Challenging lectures on-line at TED'>Challenging lectures on-line at TED</a></li>
<li><a href='http://www.xavierllora.net/2009/10/09/scaling-genetic-algorithms-using-mapreduce/' rel='bookmark' title='Permanent Link: Scaling Genetic Algorithms using MapReduce'>Scaling Genetic Algorithms using MapReduce</a></li>
<li><a href='http://www.xavierllora.net/2008/12/30/dusting-my-phd-thesis-off/' rel='bookmark' title='Permanent Link: Dusting my Ph.D. thesis off'>Dusting my Ph.D. thesis off</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Yesterday I was visiting <a href="http://www.monm.edu/">Monmouth College</a> to participate on the <a href="http://department.monm.edu/nineteenth-century/pdf/darwin-flyer.pdf">Darwinpalooza</a> which commemorates the 200th anniversary of Charles Darwin&#8217;s birth and the 150th anniversary of the publication of On the Origin of Species. After scratching my head about about what to present, I came out with quite a mix. You will find the abstract of the talk below, as well as the slides I used.</p>
<p><strong>Abstract:</strong> One hundred and fifty years have passed since the publication of Darwin&#8217;s world-changing manuscript &#8220;The Origins of Species by Means of Natural Selection&#8221;. Darwin&#8217;s ideas have proven their power to reach beyond the biology realm, and their ability to define a conceptual framework which allows us to model and understand complex systems. In the mid 1950s and 60s the efforts of a scattered group of engineers proved the benefits of adopting an evolutionary paradigm to solve complex real-world problems. In the 70s, the emerging presence of computers brought us a new collection of artificial evolution paradigms, among which genetic algorithms rapidly gained widespread adoption. Currently, the Internet has propitiated an exponential growth of information and computational resources that are clearly disrupting our perception and forcing us to reevaluate the boundaries between technology and social interaction. Darwin&#8217;s ideas can, once again, help us understand such disruptive change. In this talk, I will review the origin of artificial evolution ideas and techniques. I will also show how these techniques are, nowadays, helping to solve a wide range of applications, from life science problems to twitter puzzles, and how high performance computing can make Darwin ideas a routinary tool to help us model and understand complex systems.</p>
<object width="425&type=s" height="348"><param name="movie" value="http://static.slideshare.net/swf/ssplayer2.swf?doc=monmouth-talk-slides-090918130141-phpapp01"/><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slideshare.net/swf/ssplayer2.swf?doc=monmouth-talk-slides-090918130141-phpapp01"  type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425&type=s" height="348"></embed></object><!-- ysttest:Array
(
    [id] => 2019013&amp;doc=monmouth-talk-slides-090918130141-phpapp01
)
-->


<p>Related posts:<ol><li><a href='http://www.xavierllora.net/2006/12/18/challenging-lectures-on-line-at-ted/' rel='bookmark' title='Permanent Link: Challenging lectures on-line at TED'>Challenging lectures on-line at TED</a></li>
<li><a href='http://www.xavierllora.net/2009/10/09/scaling-genetic-algorithms-using-mapreduce/' rel='bookmark' title='Permanent Link: Scaling Genetic Algorithms using MapReduce'>Scaling Genetic Algorithms using MapReduce</a></li>
<li><a href='http://www.xavierllora.net/2008/12/30/dusting-my-phd-thesis-off/' rel='bookmark' title='Permanent Link: Dusting my Ph.D. thesis off'>Dusting my Ph.D. thesis off</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.xavierllora.net/2009/09/18/from-galapagos-to-twitter-darwin-natural-selection-and-web-2-0/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Liquid: RDF meandering in FluidDB</title>
		<link>http://www.xavierllora.net/2009/08/25/liquid-rdf-meandering-in-fluiddb/</link>
		<comments>http://www.xavierllora.net/2009/08/25/liquid-rdf-meandering-in-fluiddb/#comments</comments>
		<pubDate>Tue, 25 Aug 2009 18:04:20 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
				<category><![CDATA[Data-Intensive Computing]]></category>
		<category><![CDATA[Notes]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Social Networks]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[FluidDB]]></category>
		<category><![CDATA[meandre]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[storage]]></category>

		<guid isPermaLink="false">http://www.xavierllora.net/?p=577</guid>
		<description><![CDATA[Meandre (NCSA pushed data-intensive computing infrastructure) relies on RDF to describe components, flows, locations and repositories. RDF has become the central piece that makes possible Meandre&#8217;s flexibility and reusability. However, one piece still remains largely sketchy and still has no clear optimal solution: How can we facilitate to anybody sharing, publishing and annotating flows, components, [...]


Related posts:<ol><li><a href='http://www.xavierllora.net/2009/09/24/liquid-rdf-endpoint-for-fluiddb/' rel='bookmark' title='Permanent Link: Liquid: RDF endpoint for FluidDB'>Liquid: RDF endpoint for FluidDB</a></li>
<li><a href='http://www.xavierllora.net/2008/11/15/meandre-semantic-driven-data-intensive-flows-in-the-clouds/' rel='bookmark' title='Permanent Link: Meandre: Semantic-Driven Data-Intensive Flows in the Clouds'>Meandre: Semantic-Driven Data-Intensive Flows in the Clouds</a></li>
<li><a href='http://www.xavierllora.net/2009/01/11/meandre-140-final-release-candidate-tagged/' rel='bookmark' title='Permanent Link: Meandre 1.4.0 final release candidate tagged'>Meandre 1.4.0 final release candidate tagged</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p><a href="http://seasr.org/meandre/">Meandre</a> (<a href="http://www.ncsa.illinois.edu">NCSA</a> pushed data-intensive computing infrastructure) relies on <a href="http://www.w3.org/RDF/">RDF</a> to describe components, flows, locations and repositories. <a href="http://www.w3.org/RDF/">RDF</a> has become the central piece that makes possible <a href="http://seasr.org/meandre/">Meandre</a>&#8217;s flexibility and reusability. However, one piece still remains largely sketchy and still has no clear optimal solution: How can we facilitate to anybody sharing, publishing and annotating flows, components, locations and repositories? More importantly, how can that be done in the cloud in an open-ended fashion and allow anybody to annotate and comment on each of the afore mentioned pieces?</p>
<h3>The FluidDB trip</h3>
<p>During my last summer trip to Europe, <a href="http://blogs.fluidinfo.com/terry/">Terry Jones</a> (CEO) invited me to visit <a href="http://www.fluidinfo.com/">FluidInfo</a> (based in Barcelona) where I also meet <a href="http://blogs.fluidinfo.com/esteve/">Esteve Fernandez</a> (CTO). I had a great opportunity to chat with the masterminds behind an intriguing concept I ran into after a short note I received from <a href="http://www.illigal.uiuc.edu/web/deg/vita/">David E. Goldberg</a>. <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a>, the main product being pushed by <a href="http://www.fluidinfo.com/">FluidInfo</a>, is an online collaborative &#8220;cloud&#8221;  database. On <a href="http://www.fluidinfo.com/">FluidInfo</a> words:</p>
<blockquote><p>
FluidDB lets data be social. It allows almost unlimited information personalization by individual users and applications, and also between them. This makes it simple to build a wide variety of applications that benefit from cooperation, and which are open to unanticipated future enhancements. Even more importantly, FluidDB facilitates and encourages the growth of applications that leave users in control of their own data.
</p></blockquote>
<p><a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> went live on a private alpha last week. The basic concept behind the scenes is simple. <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> stores objects. Objects do not belong to anybody. Objects may be &#8220;blank&#8221; or they may be about something (e.g. <a href="http://seasr.org/meandre">http://seasr.org/meandre</a>). You can create as many blank objects as you want. Creating an object with the same about always returns the same object (thus, there will only be one object about <a href="http://seasr.org/meandre">http://seasr.org/meandre</a>). Once objects exists, things start getting more interesting, you can go and tag any object with whatever tag you want. For instance I could tag the <a href="http://seasr.org/meandre">http://seasr.org/meandre</a> object <code>hosted_by</code> tag, and assign the tag the value <a href="http://www.ncsa.illinois.edu>&#8220;National Center for Supercomputing Applications&#8221;</a> value. Values can be anything you want, from text and numerals to blobs. Finally, <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> introduces one last trick: namespaces. For instance, I got xllora. that means that the above tag I mentioned would look like <code>/tag/xllora/hosted_by</code>. You can create as many nested namespaces under your main namespace as you want. <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> also provides mechanisms to control who can query and see the values of your created tags.</p>
<p>As you can see, the basic object model and mechanics is very simple. When the alpha went live, <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> only provide access via a simple REST-like HTTP API. In a few days a blossom of client libraries that wrap that API were develop by a dynamic community that gather on <code>#fluiddb</code> channel on <code>irc.freenode.net</code> where <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a and early adopters share experiences. </p>
<h3>You were saying something about RDF</h3>
<p>Back to the point. One thing I chatted with the <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> guys was what did they think about the similarities between <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a>&#8217;s object model and RDF. After playing with RDF for a while, the <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> model look awfully familiar, despite a much simplified and manageable model than <a href="http://www.w3.org/RDF/">RDF</a>. They did not have much to say about it, and the question got stuck in the back of my mind. So when I got access to the private alpha, I could not help it but get down the path of what would it mean to map <a href="http://www.w3.org/RDF/">RDF</a> on <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a>. Yes, the simple straight answer would be to stick serialized RDF into the value of a given tag (e.g. <code>xllora/rdf</code>). However, that option seemed poor, since I could not exploit the social aspect of collaborative annotations provided by <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a>. So back to the drawing board. What both models have in common: They are both descriptions about something. In RDF you can see those as the subjects of the triple predicates, whereas in <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> those are simple objects. <a href="http://www.w3.org/RDF/">RDF</a> use properties to qualify objects. <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> uses tags. Both enable you to add value to qualified objects. Mmh, there you go.</p>
<p>With this idea in mind, I started <a href="http://github.com/xllora/liquid/tree/master">Liquid</a>, a simple proof-of-concept library that maps <a href="http://www.w3.org/RDF/">RDF</a> on to <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> and then it gets it back. There was only one thing that needed a bit of patching. <a href="http://www.w3.org/RDF/">RDF</a> properties are arbitrary URIs. Those could not be easily map on the top of <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> tags, so I took a simple compromise route.</p>
<ul>
<li><a href="http://www.w3.org/RDF/">RDF</a>s subject URIs are mapped onto FluidDB qualified objects via the about tag</li>
<li>One <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> tag will contain all the properties for that object (basically a simple dictionary encoded in JSON)</li>
<li>Reference to other <a href="http://www.w3.org/RDF/">RDF</a> URIs will be mapped on to <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> object URIs, and vice versa</li>
</ul>
<p>Let&#8217;s make it a bit more chewable with a simple example.</p>

<div class="wp_syntax"><div class="code"><pre class="xml" style="font-family:monospace;"><span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;?xml</span> <span style="color: #000066;">version</span>=<span style="color: #ff0000;">&quot;1.0&quot;</span><span style="color: #000000; font-weight: bold;">?&gt;</span></span>
&nbsp;
<span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;rdf:RDF</span></span>
<span style="color: #009900;"><span style="color: #000066;">xmlns:rdf</span>=<span style="color: #ff0000;">&quot;http://www.w3.org/1999/02/22-rdf-syntax-ns#&quot;</span></span>
<span style="color: #009900;"><span style="color: #000066;">xmlns:cd</span>=<span style="color: #ff0000;">&quot;http://www.recshop.fake/cd#&quot;</span><span style="color: #000000; font-weight: bold;">&gt;</span></span>
&nbsp;
<span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;rdf:Description</span></span>
<span style="color: #009900;"><span style="color: #000066;">rdf:about</span>=<span style="color: #ff0000;">&quot;http://www.recshop.fake/cd/Empire Burlesque&quot;</span><span style="color: #000000; font-weight: bold;">&gt;</span></span>
  <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;cd:artist<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>Bob Dylan<span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;/cd:artist<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>
 <span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;/rdf:Description<span style="color: #000000; font-weight: bold;">&gt;</span></span></span>
&nbsp;
<span style="color: #009900;"><span style="color: #000000; font-weight: bold;">&lt;/rdf:RDF<span style="color: #000000; font-weight: bold;">&gt;</span></span></span></pre></div></div>

<p>The above <a href="http://www.w3.org/RDF/">RDF</a> represents a single triple</p>

<div class="wp_syntax"><div class="code"><pre class="xml" style="font-family:monospace;">http://www.recshop.fake/cd/Empire Burlesque	http://www.recshop.fake/cd#artist	   &quot;Bob Dylan&quot;</pre></div></div>

<p>This triple could be map onto <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> by creating one qualified <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> object and adding the proper tags. The example below shows how to do so using <a href="http://github.com/njr0/fdb.py/tree/master">Python&#8217;s fdb.py client library</a> by <a href="http://StochasticSolutions.com/about.html">Nicholas J. Radcliffe</a>.</p>

<div class="wp_syntax"><div class="code"><pre class="python" style="font-family:monospace;"><span style="color: #ff7700;font-weight:bold;">import</span> fdb,<span style="color: #dc143c;">sys</span>
<span style="color: #ff7700;font-weight:bold;">if</span> <span style="color: #dc143c;">sys</span>.<span style="color: black;">version_info</span> <span style="color: #66cc66;">&lt;</span> <span style="color: black;">&#40;</span><span style="color: #ff4500;">2</span>, <span style="color: #ff4500;">6</span><span style="color: black;">&#41;</span>:
    <span style="color: #ff7700;font-weight:bold;">import</span> simplejson <span style="color: #ff7700;font-weight:bold;">as</span> json
<span style="color: #ff7700;font-weight:bold;">else</span>:
    <span style="color: #ff7700;font-weight:bold;">import</span> json
&nbsp;
__RDF_TAG__ = <span style="color: #483d8b;">'rdf'</span>
__RDF_TAG_PROPERTIES__  = <span style="color: #483d8b;">'rdf_properties'</span>
__RDF_TAG_MODEL_NAME__ = <span style="color: #483d8b;">'rdf_model_name'</span>
&nbsp;
<span style="color: #808080; font-style: italic;">#</span>
<span style="color: #808080; font-style: italic;"># Initialize the FluidDB client library</span>
<span style="color: #808080; font-style: italic;">#</span>
f = fdb.<span style="color: black;">FluidDB</span><span style="color: black;">&#40;</span><span style="color: black;">&#41;</span>
<span style="color: #808080; font-style: italic;">#</span>
<span style="color: #808080; font-style: italic;"># Create the tags (if they exist, this won't hurt)</span>
<span style="color: #808080; font-style: italic;">#</span>
f.<span style="color: black;">create_abstract_tag</span><span style="color: black;">&#40;</span>__RDF_TAG__<span style="color: black;">&#41;</span>
f.<span style="color: black;">create_abstract_tag</span><span style="color: black;">&#40;</span>__RDF_TAG_PROPERTIES__<span style="color: black;">&#41;</span>
f.<span style="color: black;">create_abstract_tag</span><span style="color: black;">&#40;</span>__RDF_TAG_MODEL_NAME__<span style="color: black;">&#41;</span>
<span style="color: #808080; font-style: italic;">#</span>
<span style="color: #808080; font-style: italic;"># Create the subject object of the triple</span>
<span style="color: #808080; font-style: italic;">#	</span>
o = f.<span style="color: black;">create_object</span><span style="color: black;">&#40;</span><span style="color: #483d8b;">'http://www.recshop.fake/cd/Empire Burlesque'</span><span style="color: black;">&#41;</span>
<span style="color: #808080; font-style: italic;">#</span>
<span style="color: #808080; font-style: italic;"># Map RDF properties</span>
<span style="color: #808080; font-style: italic;">#</span>
properties = <span style="color: black;">&#123;</span><span style="color: #483d8b;">'http://www.recshop.fake/cd#artist'</span>:<span style="color: black;">&#91;</span><span style="color: #483d8b;">'Bob Dylan'</span><span style="color: black;">&#93;</span><span style="color: black;">&#125;</span>
<span style="color: #808080; font-style: italic;">#</span>
<span style="color: #808080; font-style: italic;"># Tag the object as RDF aware, properties available, and to which model/named graph </span>
<span style="color: #808080; font-style: italic;"># it belongs</span>
<span style="color: #808080; font-style: italic;">#</span>
f.<span style="color: black;">tag_object_by_id</span><span style="color: black;">&#40;</span>o.<span style="color: #008000;">id</span>, __RDF_TAG__<span style="color: black;">&#41;</span>
f.<span style="color: black;">tag_object_by_id</span><span style="color: black;">&#40;</span>o.<span style="color: #008000;">id</span>,__RDF_TAG_PROPERTIES__,value=json.<span style="color: black;">dumps</span><span style="color: black;">&#40;</span>properties<span style="color: black;">&#41;</span><span style="color: black;">&#41;</span>
f.<span style="color: black;">tag_object_by_id</span><span style="color: black;">&#40;</span>o.<span style="color: #008000;">id</span>, __RDF_TAG_MODEL_NAME__,<span style="color: #483d8b;">'test_dummy'</span><span style="color: black;">&#41;</span></pre></div></div>

<p>Running along with this basic idea, I quickly stitched <a href="http://github.com/xllora/liquid/tree/master">a simple library (Liquid)</a> that allows ingestion and retrieval of <a href="http://www.w3.org/RDF/">RDF</a> from <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a>. It is still very rudimentary and may not totally map properly all possible <a href="http://www.w3.org/RDF/">RDF</a>, but it is a working proof-of-concept implementation that it is possible to do so.</p>
<p>The Python code above just saves a triple. You can easy retrieve the triple by performing the following operation</p>

<div class="wp_syntax"><div class="code"><pre class="python" style="font-family:monospace;"><span style="color: #ff7700;font-weight:bold;">import</span> fdb,<span style="color: #dc143c;">sys</span>
<span style="color: #ff7700;font-weight:bold;">if</span> <span style="color: #dc143c;">sys</span>.<span style="color: black;">version_info</span> <span style="color: #66cc66;">&lt;</span> <span style="color: black;">&#40;</span><span style="color: #ff4500;">2</span>, <span style="color: #ff4500;">6</span><span style="color: black;">&#41;</span>:
    <span style="color: #ff7700;font-weight:bold;">import</span> simplejson <span style="color: #ff7700;font-weight:bold;">as</span> json
<span style="color: #ff7700;font-weight:bold;">else</span>:
    <span style="color: #ff7700;font-weight:bold;">import</span> json
&nbsp;
__RDF_TAG__ = <span style="color: #483d8b;">'rdf'</span>
__RDF_TAG_PROPERTIES__  = <span style="color: #483d8b;">'rdf_properties'</span>
__RDF_TAG_MODEL_NAME__ = <span style="color: #483d8b;">'rdf_model_name'</span>
&nbsp;
<span style="color: #808080; font-style: italic;">#</span>
<span style="color: #808080; font-style: italic;"># Initialize the FluidDB client library</span>
<span style="color: #808080; font-style: italic;">#</span>
f = fdb.<span style="color: black;">FluidDB</span><span style="color: black;">&#40;</span><span style="color: black;">&#41;</span>
<span style="color: #808080; font-style: italic;">#</span>
<span style="color: #808080; font-style: italic;"># Retrieve the annotated objects</span>
<span style="color: #808080; font-style: italic;">#</span>
objs = f.<span style="color: black;">query</span><span style="color: black;">&#40;</span><span style="color: #483d8b;">'has xllora/%s'</span><span style="color: #66cc66;">%</span><span style="color: black;">&#40;</span>__RDF_TAG__<span style="color: black;">&#41;</span><span style="color: black;">&#41;</span>
<span style="color: #808080; font-style: italic;">#</span>
<span style="color: #808080; font-style: italic;"># Optionally you could retrieve the ones only belonging to a given model by</span>
<span style="color: #808080; font-style: italic;">#</span>
<span style="color: #808080; font-style: italic;"># objs = fdb.query('has xllora/%s and xllora/%s matches &quot;%s&quot;'%(__RDF_TAG__,__RDF_TAG_MODEL_NAME__,modelname))</span>
<span style="color: #808080; font-style: italic;">#</span>
subs = <span style="color: black;">&#91;</span>f.<span style="color: black;">get_tag_value_by_id</span><span style="color: black;">&#40;</span>s,<span style="color: #483d8b;">'/tags/fluiddb/about'</span><span style="color: black;">&#41;</span> <span style="color: #ff7700;font-weight:bold;">for</span> s <span style="color: #ff7700;font-weight:bold;">in</span> objs<span style="color: black;">&#93;</span>
props_tmp = <span style="color: black;">&#91;</span>f.<span style="color: black;">get_tag_value_by_id</span><span style="color: black;">&#40;</span>s,<span style="color: #483d8b;">'/tags/xllora/'</span>+__RDF_TAG_PROPERTIES__<span style="color: black;">&#41;</span> <span style="color: #ff7700;font-weight:bold;">for</span> s <span style="color: #ff7700;font-weight:bold;">in</span> objs<span style="color: black;">&#93;</span>
props = <span style="color: black;">&#91;</span>json.<span style="color: black;">loads</span><span style="color: black;">&#40;</span>s<span style="color: black;">&#91;</span><span style="color: #ff4500;">1</span><span style="color: black;">&#93;</span><span style="color: black;">&#41;</span> <span style="color: #ff7700;font-weight:bold;">if</span> s<span style="color: black;">&#91;</span><span style="color: #ff4500;">0</span><span style="color: black;">&#93;</span>==<span style="color: #ff4500;">200</span> <span style="color: #ff7700;font-weight:bold;">else</span> <span style="color: black;">&#123;</span><span style="color: black;">&#125;</span> <span style="color: #ff7700;font-weight:bold;">for</span> s <span style="color: #ff7700;font-weight:bold;">in</span> props_tmp<span style="color: black;">&#93;</span></pre></div></div>

<p>Now <code>subs</code> contains all the subject URIs for the predicates, and <code>props</code> all the dictionaries containing the properties.</p>
<h3>The bottom line</h3>
<p>OK. So, what is this mapping important? Basically, it will allow collaborative tagging of the created objects (subjects), allowing a collaborative and social gathering of information, besides them mapped <a href="http://www.w3.org/RDF/">RDF</a>. So, what does it all means?</p>
<p>It basically means, that if you do not have the need to ingest <a href="http://www.w3.org/RDF/">RDF</a> (where property URIs are not directly map and you need to Fluidify/reify), any data stored in <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> is already on some form of triplified RDF. Let me explain what I mean by that. Each <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> has a unique URI (e.g. <code>http://fluidDB.fluidinfo.com/objects/4fdf7ff4-f0da-4441-8e63-9b98ed26fc12</code>). Each tag is also uniquely identified by an URI (e.g. <code>http://fluidDB.fluidinfo.com/tags/xllora/rdf_model_name</code>). And finally each pair object/tag may have a value (e.g. a literal <code>'test_dummy'</code> or maybe another URI <code>http://fluidDB.fluidinfo.com/objects/a0dda173-9ee0-4799-a507-8710045d2b07</code>). If a object/tag does not have a value you can just point it to the no value URI (or some other convention you like). </p>
<p>Having said that, now you have all the pieces to express <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> data in plain shareable RDF. That would mean basically get all the tags for and object, query the values, and then just generate and <a href="http://www.w3.org/RDF/">RDF</a> model by adding the gathered triples. That&#8217;s easy. Also, if you align your properties to tags, the ingestion would also become that trivial. I will try to get that piece into <a href="http://github.com/xllora/liquid/tree/master">Liquid</a> as soon as other issues allow me to do so <img src='http://www.xavierllora.net/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /> .</p>
<p>Just to close, I would mention once again a key element of this picture. <a href="http://www.fluidinfo.com/fluiddb">FluidDB</a> opens the door to a truly cooperative, distributed, and online fluid semantic web.  It is one of the first examples of how annotations (a.k.a. metadata) can be easily gathered and used on the &#8220;cloud&#8221; for the masses. Great job guys!</p>


<p>Related posts:<ol><li><a href='http://www.xavierllora.net/2009/09/24/liquid-rdf-endpoint-for-fluiddb/' rel='bookmark' title='Permanent Link: Liquid: RDF endpoint for FluidDB'>Liquid: RDF endpoint for FluidDB</a></li>
<li><a href='http://www.xavierllora.net/2008/11/15/meandre-semantic-driven-data-intensive-flows-in-the-clouds/' rel='bookmark' title='Permanent Link: Meandre: Semantic-Driven Data-Intensive Flows in the Clouds'>Meandre: Semantic-Driven Data-Intensive Flows in the Clouds</a></li>
<li><a href='http://www.xavierllora.net/2009/01/11/meandre-140-final-release-candidate-tagged/' rel='bookmark' title='Permanent Link: Meandre 1.4.0 final release candidate tagged'>Meandre 1.4.0 final release candidate tagged</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.xavierllora.net/2009/08/25/liquid-rdf-meandering-in-fluiddb/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Easy, reliable, and flexible storage for Python</title>
		<link>http://www.xavierllora.net/2009/08/13/easy-reliable-and-flexible-storage-for-python/</link>
		<comments>http://www.xavierllora.net/2009/08/13/easy-reliable-and-flexible-storage-for-python/#comments</comments>
		<pubDate>Thu, 13 Aug 2009 22:01:32 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[pytc]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[tokyo cabinet]]></category>
		<category><![CDATA[tokyo tyrant]]></category>

		<guid isPermaLink="false">http://www.xavierllora.net/?p=573</guid>
		<description><![CDATA[A while ago I wrote a little post about alternative column stores. One that I mentioned was Tokyo Cabinet (and its associated server Tokyo Tyrant. Tokyo Cabinet it is a key-value store written in C and with bindings for multiple languages (including Python and Java). It can maintain data bases in memory or spin them [...]


Related posts:<ol><li><a href='http://www.xavierllora.net/2009/09/29/temporary-storage-for-meandres-distribute-flow-execution/' rel='bookmark' title='Permanent Link: Temporary storage for Meandre&#8217;s distributed flow execution'>Temporary storage for Meandre&#8217;s distributed flow execution</a></li>
<li><a href='http://www.xavierllora.net/2008/07/01/efficient-storage-for-python/' rel='bookmark' title='Permanent Link: Efficient storage for Python'>Efficient storage for Python</a></li>
<li><a href='http://www.xavierllora.net/2008/01/16/a-simple-and-flexible-ga-loop-in-python/' rel='bookmark' title='Permanent Link: A simple and flexible GA loop in Python'>A simple and flexible GA loop in Python</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>A while ago I wrote a little post about <a href="/2008/06/05/the-next-generation-of-data-bases/">alternative column stores</a>. One that I mentioned was <a hef="http://tokyocabinet.sourceforge.net/">Tokyo Cabinet</a> (and its associated server <a href="http://tokyocabinet.sourceforge.net/tyrantdoc/">Tokyo Tyrant</a>. Tokyo Cabinet it is a key-value store written in C and with bindings for multiple languages (including Python and Java). It can maintain data bases in memory or spin them to disk (you can pick between hash or B-tree based stores). </p>
<p>Having heard a bunch of good things, I finally gave it a try. I just installed both Cabinet and Tyrant (you may find useful installation instructions <a href="http://openwferu.rubyforge.org/tokyo.html">here</a> using the usual configure, make, make install cycle). Another nice feature of Tyrant is that it also supports HTTP gets and puts. So having all this said, I just wanted to check how easy it was to use it from Python. And the answer was very simple. <a href="http://github.com/turian/pytc-example/tree/master">Joseph Turian&#8217;s examples</a> got me running in less than 2 minutes&#8212;see the piece of code below&#8212;when dealing with a particular data base. Using Tyrant over HTTP is quite simple too&#8212;see <a href="http://petewarden.typepad.com/searchbrowser/2009/03/tokyo-tyrant-tutorial.html">PeteSearch blog post</a>.</p>

<div class="wp_syntax"><div class="code"><pre class="python" style="font-family:monospace;"><span style="color: #ff7700;font-weight:bold;">import</span> pytc,<span style="color: #dc143c;">pickle</span>
<span style="color: #ff7700;font-weight:bold;">from</span> numpy <span style="color: #ff7700;font-weight:bold;">import</span> <span style="color: #66cc66;">*</span>
&nbsp;
hdb = pytc.<span style="color: black;">HDB</span><span style="color: black;">&#40;</span><span style="color: black;">&#41;</span>
hdb.<span style="color: #008000;">open</span><span style="color: black;">&#40;</span><span style="color: #483d8b;">'casket.tch'</span>,pytc.<span style="color: black;">HDBOWRITER</span>|pytc.<span style="color: black;">HDBOCREAT</span><span style="color: black;">&#41;</span>
&nbsp;
a = arange<span style="color: black;">&#40;</span><span style="color: #ff4500;">100</span><span style="color: black;">&#41;</span>
hdb.<span style="color: black;">put</span><span style="color: black;">&#40;</span><span style="color: #483d8b;">'test'</span>,<span style="color: #dc143c;">pickle</span>.<span style="color: black;">dumps</span><span style="color: black;">&#40;</span>a<span style="color: black;">&#41;</span><span style="color: black;">&#41;</span>
b = <span style="color: #dc143c;">pickle</span>.<span style="color: black;">loads</span><span style="color: black;">&#40;</span>hdb.<span style="color: black;">get</span><span style="color: black;">&#40;</span><span style="color: #483d8b;">'test'</span><span style="color: black;">&#41;</span><span style="color: black;">&#41;</span>
<span style="color: #ff7700;font-weight:bold;">if</span> <span style="color: black;">&#40;</span>a==b<span style="color: black;">&#41;</span>.<span style="color: #008000;">all</span><span style="color: black;">&#40;</span><span style="color: black;">&#41;</span> :
     <span style="color: #ff7700;font-weight:bold;">print</span> <span style="color: #483d8b;">'OK'</span>
hdb.<span style="color: black;">close</span><span style="color: black;">&#40;</span><span style="color: black;">&#41;</span></pre></div></div>



<p>Related posts:<ol><li><a href='http://www.xavierllora.net/2009/09/29/temporary-storage-for-meandres-distribute-flow-execution/' rel='bookmark' title='Permanent Link: Temporary storage for Meandre&#8217;s distributed flow execution'>Temporary storage for Meandre&#8217;s distributed flow execution</a></li>
<li><a href='http://www.xavierllora.net/2008/07/01/efficient-storage-for-python/' rel='bookmark' title='Permanent Link: Efficient storage for Python'>Efficient storage for Python</a></li>
<li><a href='http://www.xavierllora.net/2008/01/16/a-simple-and-flexible-ga-loop-in-python/' rel='bookmark' title='Permanent Link: A simple and flexible GA loop in Python'>A simple and flexible GA loop in Python</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.xavierllora.net/2009/08/13/easy-reliable-and-flexible-storage-for-python/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Large Scale Data Mining using Genetics-Based Machine Learning</title>
		<link>http://www.xavierllora.net/2009/07/15/large-scale-data-mining-using-genetics-based-machine-learning/</link>
		<comments>http://www.xavierllora.net/2009/07/15/large-scale-data-mining-using-genetics-based-machine-learning/#comments</comments>
		<pubDate>Wed, 15 Jul 2009 21:56:17 +0000</pubDate>
		<dc:creator>Xavier</dc:creator>
				<category><![CDATA[Data-Intensive Computing]]></category>
		<category><![CDATA[GBML & LCS]]></category>
		<category><![CDATA[Learning Classifier Systems]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[data-intensive flows]]></category>
		<category><![CDATA[genetics-based machine learning]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[LCS]]></category>
		<category><![CDATA[map-reduce]]></category>

		<guid isPermaLink="false">http://www.xavierllora.net/?p=568</guid>
		<description><![CDATA[Below you may find the slides of the GECCO 2009 tutorial that Jaume Bacardit and I put together. Hope you enjoy it.
Slides
Abstract
We are living in the peta-byte era.We have larger and larger data to analyze, process and transform into useful answers for the domain experts. Robust data mining tools, able to cope with petascale volumes [...]


Related posts:<ol><li><a href='http://www.xavierllora.net/2006/12/13/observer-invariant-histopathology-using-genetics-based-machine-learning/' rel='bookmark' title='Permanent Link: Observer-Invariant Histopathology using Genetics-Based Machine Learning'>Observer-Invariant Histopathology using Genetics-Based Machine Learning</a></li>
<li><a href='http://www.xavierllora.net/2009/04/07/deadline-extended-for-special-issue-on-metaheuristics-for-large-scale-data-mining/' rel='bookmark' title='Permanent Link: Deadline extended for special issue on Metaheuristics for Large Scale Data Mining'>Deadline extended for special issue on Metaheuristics for Large Scale Data Mining</a></li>
<li><a href='http://www.xavierllora.net/2008/03/26/bdcsg2008-algorithmic-perspectives-on-large-scale-social-network-data-jon-kleinberg/' rel='bookmark' title='Permanent Link: [BDCSG2008] Algorithmic Perspectives on Large-Scale Social Network Data (Jon Kleinberg)'>[BDCSG2008] Algorithmic Perspectives on Large-Scale Social Network Data (Jon Kleinberg)</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Below you may find the slides of the <a href="http://www.sigevo.org/gecco-2009/tutorials.html#lsdm">GECCO 2009 tutorial</a> that <a href="http://www.cs.nott.ac.uk/~jqb/">Jaume Bacardit</a> and I put together. Hope you enjoy it.</p>
<p><strong>Slides</strong></p>
<object width="425&type=s" height="348"><param name="movie" value="http://static.slideshare.net/swf/ssplayer2.swf?doc=gecco2009largegbmltutorial-090715163244-phpapp01"/><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slideshare.net/swf/ssplayer2.swf?doc=gecco2009largegbmltutorial-090715163244-phpapp01"  type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425&type=s" height="348"></embed></object><!-- ysttest:Array
(
    [id] => 1727172&amp;doc=gecco2009largegbmltutorial-090715163244-phpapp01
)
-->
<p><strong>Abstract</strong></p>
<p>We are living in the peta-byte era.We have larger and larger data to analyze, process and transform into useful answers for the domain experts. Robust data mining tools, able to cope with petascale volumes and/or high dimensionality producing human-understandable solutions are key on several domain areas. Genetics-based machine learning (GBML) techniques are perfect candidates for this task, among others, due to the recent advances in representations, learning paradigms, and theoretical modeling. If evolutionary learning techniques aspire to be a relevant player in this context, they need to have the capacity of processing these vast amounts of data and they need to process this data within reasonable time. Moreover, massive computation cycles are getting cheaper and cheaper every day, allowing researchers to have access to unprecedented parallelization degrees. Several topics are interlaced in these two requirements: (1) having the proper learning paradigms and knowledge representations, (2) understanding them and knowing when are they suitable for the problem at hand, (3) using efficiency enhancement techniques, and (4) transforming and visualizing the produced solutions to give back as much insight as possible to the domain experts are few of them.</p>
<p>This tutorial will try to answer this question, following a roadmap that starts with the questions of what large means, and why large is a challenge for GBML methods. Afterwards, we will discuss different facets in which we can overcome this challenge: Efficiency enhancement techniques, representations able to cope with large dimensionality spaces, scalability of learning paradigms. We will also review a topic interlaced with all of them: how can we model the scalability of the components of our GBML systems to better engineer them to get the best performance out of them for large datasets. The roadmap continues with examples of real applications of GBML systems and finishes with an analysis of further directions.</p>


<p>Related posts:<ol><li><a href='http://www.xavierllora.net/2006/12/13/observer-invariant-histopathology-using-genetics-based-machine-learning/' rel='bookmark' title='Permanent Link: Observer-Invariant Histopathology using Genetics-Based Machine Learning'>Observer-Invariant Histopathology using Genetics-Based Machine Learning</a></li>
<li><a href='http://www.xavierllora.net/2009/04/07/deadline-extended-for-special-issue-on-metaheuristics-for-large-scale-data-mining/' rel='bookmark' title='Permanent Link: Deadline extended for special issue on Metaheuristics for Large Scale Data Mining'>Deadline extended for special issue on Metaheuristics for Large Scale Data Mining</a></li>
<li><a href='http://www.xavierllora.net/2008/03/26/bdcsg2008-algorithmic-perspectives-on-large-scale-social-network-data-jon-kleinberg/' rel='bookmark' title='Permanent Link: [BDCSG2008] Algorithmic Perspectives on Large-Scale Social Network Data (Jon Kleinberg)'>[BDCSG2008] Algorithmic Perspectives on Large-Scale Social Network Data (Jon Kleinberg)</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.xavierllora.net/2009/07/15/large-scale-data-mining-using-genetics-based-machine-learning/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
