<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Meyer Information Management Blog &#187; lucene</title>
	<atom:link href="http://mimblog.de/tag/lucene/feed/" rel="self" type="application/rss+xml" />
	<link>http://mimblog.de</link>
	<description>Innovations and Technologies in Information Management</description>
	<lastBuildDate>Thu, 25 Mar 2010 20:18:39 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Berlin Buzzwords 2010 Search Store Scale</title>
		<link>http://mimblog.de/2010/02/12/berlin-buzzwords-2010-search-store-scale/</link>
		<comments>http://mimblog.de/2010/02/12/berlin-buzzwords-2010-search-store-scale/#comments</comments>
		<pubDate>Fri, 12 Feb 2010 13:02:15 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[nosql]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=916</guid>
		<description><![CDATA[Search Store Scale &#8211; these three terms sum up exactly what the events organized by Isabel Drost are about! In addition to the regular Hadoop Get Togethers (the next one takes place on March 10), the Berlin Buzzwords 2010 scalability conference is now also planned for June.
Organized by Jan Lehnardt (CouchDB), Simon Willnauer (Lucene committer) and [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Search Store Scale</strong> &#8211; these three terms sum up exactly what the events organized by <a href="http://blog.isabel-drost.de/index.php/about" target="_blank">Isabel Drost</a> are about! In addition to the regular Hadoop Get Togethers (<a title="Apache Hadoop Get Together - March 2010 - Update" href="http://blog.isabel-drost.de/index.php/archives/149/apache-hadoop-get-together-march-2010-update" target="_blank">the next one takes place on March 10</a>), the <a title="Berlin Buzzwords 2010" href="http://hadoopberlin.de/~events/index.html" target="_blank">Berlin Buzzwords 2010 scalability conference</a> is now also planned for June.</p>
<p>Organized by Jan Lehnardt (CouchDB), Simon Willnauer (Lucene committer) and Isabel Drost (co-founder &amp; committer of Mahout), the conference will revolve around NoSQL, Hadoop, Lucene and other scalability topics. Incidentally, the three names should be familiar to anyone who subscribes to one or another of the Lucene mailing lists.</p>
<p>News is available via <a href="http://twitter.com/hadoopberlin" target="_blank">@hadoopberlin</a> and on the <a href="http://hadoopberlin.de/~events/index.html">event website</a> &#8211; where, by the way, &#8220;Helping Hands&#8221; are still wanted!</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2010/02/12/berlin-buzzwords-2010-search-store-scale/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Apache Nutch 1.0 Finally Released</title>
		<link>http://mimblog.de/2009/03/30/apache-nutch-10-finally-released/</link>
		<comments>http://mimblog.de/2009/03/30/apache-nutch-10-finally-released/#comments</comments>
		<pubDate>Mon, 30 Mar 2009 07:35:04 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Information Management]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[mahout]]></category>
		<category><![CDATA[nutch]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=779</guid>
		<description><![CDATA[
It has been two months since I announced the Nutch 1.0 release. Finally, after a 1.0 release candidate, Nutch 1.0 was released last week! Congratulations!


If you are also into machine learning, you should check out the release of Mahout in its early version 0.3, and if you are new to it, check [...]]]></description>
			<content:encoded><![CDATA[<p><img class="size-full wp-image-740 alignnone" title="nutch-logo" src="http://mimblog.de/wp-content/uploads/2009/02/nutch-logo.gif" alt="nutch-logo" width="121" height="48" /></p>
<p>It has been <a title="Nutch 1.0" href="http://mimblog.de/2009/02/02/nutch-release-10/" target="_blank">two months since I announced</a> the Nutch 1.0 release. Finally, after a 1.0 release candidate, <a title="Apache Nutch 1.0 Released" href="http://lucene.apache.org/nutch/#23+March+2009+-+Apache+Nutch+1.0+Released" target="_blank">Nutch 1.0 was released</a> last week! Congratulations!</p>
<p><span id="more-779"></span></p>
<p><img class="size-full wp-image-780 alignnone" title="mahout-logo-82x100" src="http://mimblog.de/wp-content/uploads/2009/03/mahout-logo-82x100.png" alt="mahout-logo-82x100" width="82" height="100" /></p>
<p>If you are also into machine learning, you should check out the <a title="Apache Mahout" href="http://lucene.apache.org/mahout/" target="_blank">release of Mahout</a> in its early version 0.3, and if you are new to it, check out <a title="Introducing Mahout: Apache Machine Learning" href="http://www.eu.apachecon.com/c/aceu2009/sessions/136" target="_blank">Grant Ingersoll&#8217;s presentation on Mahout</a> from ApacheCon Europe 2009.</p>
<p>P.S.: Also note the new <a title="Apache Droids" href="http://incubator.apache.org/droids/" target="_blank">Apache Droids</a> tab in the Lucene navigation bar!</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/03/30/apache-nutch-10-finally-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Lost + Found: Lucene Meetup, 97 Things, Social Networks</title>
		<link>http://mimblog.de/2009/03/02/lost-found-lucene-meetup-97-things-social-networks/</link>
		<comments>http://mimblog.de/2009/03/02/lost-found-lucene-meetup-97-things-social-networks/#comments</comments>
		<pubDate>Mon, 02 Mar 2009 06:28:37 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Lost + Found]]></category>
		<category><![CDATA[event]]></category>
		<category><![CDATA[lucene]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=759</guid>
		<description><![CDATA[O&#8217;Reilly just released the book 97 Things Every Software Architect Should Know. Those 97 things do not come from a single person; they were collected from dozens of well-known software architects such as Neal Ford, Michael Nygard and more. You can have a look at the axioms over here; I picked out my three favorites:

For the end-user, the interface [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://oreilly.com/catalog/9780596522698/index.html#top"><img class="alignleft size-thumbnail wp-image-762" title="97-things-book-cover" src="http://mimblog.de/wp-content/uploads/2009/03/97-things-book-cover-150x150.jpg" alt="97-things-book-cover" width="150" height="150" /></a>O&#8217;Reilly just released the book <strong>97 Things Every Software Architect Should Know</strong>. Those 97 things do not come from a single person; they were collected from dozens of well-known software architects such as <a title="Neal Ford" href="http://www.nealford.com" target="_blank">Neal Ford</a>, <a title="Michael Nygard" href="http://www.michaelnygard.com/" target="_blank">Michael Nygard</a> and more. You can have a look at the <a title="97 Things Every Software Architect Should Know - The Book" href="http://97-things.near-time.net/wiki/97-things-every-software-architect-should-know-the-book" target="_blank">axioms over here</a>; I picked out my three favorites:</p>
<ul>
<li>For the end-user, the interface is the system (Vinayak Hegde)</li>
<li>&#8220;Perfect&#8221; is the enemy of &#8220;Good enough&#8221; (Greg Nyberg)</li>
<li>Business Drives (Dave Muirhead)</li>
</ul>
<p>You can order the book at <a title="97 Things Every Software Architect Should Know (Paperback)" href="http://www.amazon.com/Things-Every-Software-Architect-Should/dp/059652269X/" target="_blank">Amazon</a> or check out the book&#8217;s <a title="97 Things Every Software Architect Should Know (Paperback)" href="http://97-things.near-time.net/wiki/97-things-every-software-architect-should-know-the-book" target="_blank">website here</a>.</p>
<p><span id="more-759"></span></p>
<p><strong>Lucene community gathering in Amsterdam in March 2009</strong></p>
<p>During <a title="ApacheCon Europe 2009" href="http://www.eu.apachecon.com/c/aceu2009/" target="_blank">ApacheCon Europe 2009</a>, a Lucene Meetup will take place alongside the conference. If you are interested in meeting Doug Cutting, Grant Ingersoll or Erik Hatcher, you should check the Lucene wiki page <a title="LuceneMeetupMarch2009" href="http://wiki.apache.org/lucene-java/LuceneMeetupMarch2009" target="_blank">LuceneMeetupMarch2009</a>! <em>I&#8217;m not sure yet whether I&#8217;m going to Amsterdam at the end of March!</em></p>
<p><strong>Social Networks and Web 2.0 Papers at WWW 2009</strong></p>
<p>You can find a summary of interesting papers regarding Social Networks in <a title="Social Networks and Web 2.0 Papers at WWW 2009" href="http://datamining.typepad.com/data_mining/2009/02/social-networks-and-web-20-papers-at-www-2009.html" target="_blank">this post from Matthew Hurst&#8217;s Blog</a>. Let me recommend <strong>Social Search in &#8220;Small World&#8221; Examples</strong>, <strong>Network Analysis of Collaboration Structure in Wikipedia</strong> and <strong>How Opinions are Received by Online Communities</strong>.</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/03/02/lost-found-lucene-meetup-97-things-social-networks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nutch Release 1.0</title>
		<link>http://mimblog.de/2009/02/02/nutch-release-10/</link>
		<comments>http://mimblog.de/2009/02/02/nutch-release-10/#comments</comments>
		<pubDate>Mon, 02 Feb 2009 21:20:17 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[From Mailing Lists]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[nutch]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=739</guid>
		<description><![CDATA[
A couple of weeks ago I already posted a bit of a spoiler about a new Nutch release! Now another message from Andrzej Bialecki has appeared on the Nutch mailing list, announcing the release of Nutch 1.0, possibly in February (this year).
Comment from Andrzej:
We do exist.   We plan to release in February &#8211; I can&#8217;t tell [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone size-full wp-image-740" title="nutch-logo" src="http://mimblog.de/wp-content/uploads/2009/02/nutch-logo.gif" alt="nutch-logo" width="121" height="48" /></p>
<p>A couple of weeks ago I already posted a bit of a spoiler about a new Nutch release! Now another message from Andrzej Bialecki has appeared on the Nutch mailing list, announcing the release of <strong>Nutch 1.0</strong>, possibly in February (this year).</p>
<p>Comment from Andrzej:</p>
<blockquote><p>We do exist. <img src='http://mimblog.de/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' />  We plan to release in February &#8211; I can&#8217;t tell you yet when exactly, we need to review the (few) remaining issues that we want to resolve before the release.</p></blockquote>
<p><strong>Update</strong>: I also heard about a Nutch/SOLR integration coming with this release?! I will start investigating tomorrow.</p>
<p><strong>Update II</strong>, A few important snippets from the CHANGES.txt:</p>
<ul>
<li><a title="Nutch 442" href="http://issues.apache.org/jira/browse/NUTCH-442" target="_blank">NUTCH-442</a> &#8211; Integrate Solr/Nutch. (dogacan, original version by siren)</li>
<li><a title="Upgrade Nutch to Hadoop 0.19" href="http://issues.apache.org/jira/browse/NUTCH-663" target="_blank">NUTCH-663</a> &#8211; Upgrade Nutch to use Hadoop 0.19 (kubes)</li>
<li><a title="Upgrade Nutch to use Lucene 2.4" href="http://issues.apache.org/jira/browse/NUTCH-662" target="_blank">NUTCH-662</a> &#8211; Upgrade Nutch to use Lucene 2.4. (kubes)</li>
<li><a title="Search Load Testing Tool" href="http://issues.apache.org/jira/browse/NUTCH-665" target="_blank">NUTCH-665</a> &#8211; Search Load Testing Tool (kubes)</li>
<li><a title="New Indexing Framework for Nutch" href="http://issues.apache.org/jira/browse/NUTCH-646" target="_blank">NUTCH-646</a> &#8211; New Indexing Framework for Nutch. (kubes)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/02/02/nutch-release-10/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Search Through The Lucene Ecosystem</title>
		<link>http://mimblog.de/2009/01/28/search-through-the-lucene-ecosystem/</link>
		<comments>http://mimblog.de/2009/01/28/search-through-the-lucene-ecosystem/#comments</comments>
		<pubDate>Wed, 28 Jan 2009 20:05:56 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Information Management]]></category>
		<category><![CDATA[lucene]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=735</guid>
		<description><![CDATA[
If you ever wanted to search through the whole Lucene ecosystem (Lucene Java, Mahout, Tika and more), the tool of choice is the search engine by Lucid Imagination. It searches through project websites, wikis, mailing lists (public, dev) and the source code itself &#8211; and its indexing performance is pretty impressive too.
Lucid Imagination is a company [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone size-full wp-image-501" title="lucene_green_300" src="http://mimblog.de/wp-content/uploads/2008/10/lucene_green_300.gif" alt="lucene_green_300" width="300" height="46" /></p>
<p>If you ever wanted to search through the whole Lucene ecosystem (Lucene Java, Mahout, Tika and more), the tool of choice is the <a href="http://www.lucidimagination.com/search" target="_blank">search engine</a> by <a title="Lucid Imagination" href="http://www.lucidimagination.com" target="_blank">Lucid Imagination</a>. It searches through project websites, wikis, mailing lists (public, dev) and the source code itself &#8211; and its indexing performance is pretty impressive too.</p>
<p>Lucid Imagination is a company founded by Marc Krellenstein, Yonik Seeley, Grant Ingersoll and Erik Hatcher (a team of experts) which provides professional search (and beyond) solutions for the enterprise based upon <a title="Lucene" href="http://lucene.apache.org/" target="_blank">Lucene</a>.</p>
<p><strong>Update</strong>: <a title="Lucene SOLR Challenger in Enterprise Search" href="http://arnoldit.com/wordpress/2009/01/27/lucene-solr-challenger-in-enterprise-search/" target="_blank">comment about Lucid Imagination from Beyond Search</a></p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/01/28/search-through-the-lucene-ecosystem/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Interesting ApacheCon Europe 2009 Sessions</title>
		<link>http://mimblog.de/2009/01/26/interesting-apachecon-europe-2009-sessions/</link>
		<comments>http://mimblog.de/2009/01/26/interesting-apachecon-europe-2009-sessions/#comments</comments>
		<pubDate>Mon, 26 Jan 2009 05:10:56 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[event]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[Solr]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=733</guid>
		<description><![CDATA[
Registration for ApacheCon Europe 2009 (23-27 March) in Amsterdam opened last week. I have already spent some time looking through the available sessions on Hadoop, Lucene etc.; have a look&#8230;

Hadoop Tools and Tricks for Data Processing Pipelines
Lucene Boot Camp
Solr Boot Camp
Introduction To Hadoop
Introducing Mahout: Apache Machine Learning
Hadoop Map-Reduce: Tuning and Debugging
Lucene Case Studies
Pig &#8211; Making [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone size-full wp-image-442" title="feather" src="http://mimblog.de/wp-content/uploads/2008/10/feather.jpg" alt="feather" width="285" height="86" /></p>
<p>Registration for ApacheCon Europe 2009 (23-27 March) in Amsterdam <a title="Registration is now open" href="http://us.apachecon.com/c/aceu2009/articles/registration-is-now-open/" target="_blank">opened last week.</a> I have already spent some time looking through the available sessions on Hadoop, Lucene etc.; have a look&#8230;</p>
<ul>
<li><span id="more-733"></span><a title="Hadoop Tools and Tricks for Data Processing Pipelines" href="http://us.apachecon.com/c/aceu2009/sessions/230" target="_blank">Hadoop Tools and Tricks for Data Processing Pipelines</a></li>
<li><a title="Lucene Boot Camp" href="http://us.apachecon.com/c/aceu2009/sessions/197" target="_blank">Lucene Boot Camp</a></li>
<li><a title="Solr Boot Camp" href="http://us.apachecon.com/c/aceu2009/sessions/201" target="_blank">Solr Boot Camp</a></li>
<li><a title="Introduction to Hadoop" href="http://us.apachecon.com/c/aceu2009/sessions/222" target="_blank">Introduction To Hadoop</a></li>
<li><a title="Introducing Mahout" href="http://us.apachecon.com/c/aceu2009/sessions/222" target="_blank">Introducing Mahout: Apache Machine Learning</a></li>
<li><a title="Hadoop Map-Reduce: Tuning and Debugging" href="http://us.apachecon.com/c/aceu2009/sessions/223" target="_blank">Hadoop Map-Reduce: Tuning and Debugging</a></li>
<li><a title="Lucene Case Studies" href="http://us.apachecon.com/c/aceu2009/sessions/137" target="_blank">Lucene Case Studies</a></li>
<li><a title="Pig - Making Hadoop Easy" href="http://us.apachecon.com/c/aceu2009/sessions/224" target="_blank">Pig &#8211; Making Hadoop Easy</a></li>
<li><a title="Advanced Indexing Techniques with Apache Lucene" href="http://us.apachecon.com/c/aceu2009/sessions/138" target="_blank">Advanced Indexing Techniques with Apache Lucene</a></li>
<li><a title="Running Hadoop in the Cloud" href="http://us.apachecon.com/c/aceu2009/sessions/225" target="_blank">Running Hadoop in the Cloud</a></li>
<li><a title="Configuring Hadoop for Grid Services" href="http://us.apachecon.com/c/aceu2009/sessions/226" target="_blank">Configuring Hadoop for Grid Services</a></li>
<li><a title="Dynamic Hadoop Clusters" href="http://us.apachecon.com/c/aceu2009/sessions/227" target="_blank">Dynamic Hadoop Clusters</a></li>
<li><a title="Best of breed - httpd, forrest, solr and droids" href="http://us.apachecon.com/c/aceu2009/sessions/163" target="_blank">Best of breed &#8211; httpd, forrest, solr and droids</a></li>
</ul>
<p>As you can see, Apache Hadoop is playing a big role this year, and cloud computing is starting to take off at Apache too. Unfortunately I didn&#8217;t find any Nutch-related sessions &#8211; what&#8217;s happening there?</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/01/26/interesting-apachecon-europe-2009-sessions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Lucene and Python</title>
		<link>http://mimblog.de/2009/01/12/lucene-and-python/</link>
		<comments>http://mimblog.de/2009/01/12/lucene-and-python/#comments</comments>
		<pubDate>Mon, 12 Jan 2009 16:04:07 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[lucene]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=700</guid>
		<description><![CDATA[
Breakfast Links 12.01.2009
* PyLucene &#8211; Python Lucene &#8211; Joins Apache Lucene 
PyLucene is a Python-based port of Lucene which is built automatically from the original Lucene source. It just joined Apache Lucene as a subproject. Does this mean I can run Lucene in Google&#8217;s App Engine?
* Interview with Norbert Weitkämper
Norbert Weitkämper (Weitkämper Technology) [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2009/01/lucene-python-pylucene.png"><img class="alignnone size-medium wp-image-703" title="lucene-python-pylucene" src="http://mimblog.de/wp-content/uploads/2009/01/lucene-python-pylucene.png" alt="" width="300" height="82" /></a></p>
<p><strong>Breakfast Links 12.01.2009</strong></p>
<p>* <a title="PyLucene - Python Lucene - Joins Apache Lucene " href="http://www.jroller.com/otis/entry/pylucene_python_lucene_joins_apache" target="_blank">PyLucene &#8211; Python Lucene &#8211; Joins Apache Lucene </a></p>
<p>PyLucene is a Python-based port of Lucene which is built automatically from the original Lucene source. It just joined <a title="Apache Lucene" href="http://lucene.apache.org" target="_blank">Apache Lucene</a> as a subproject. <em>Does this mean I can run Lucene in Google&#8217;s App Engine?</em></p>
<p>* <a title="An Interview with Norbert Weitkämper" href="http://www.arnoldit.com/search-wizards-speak/xsearch.html" target="_blank">Interview with Norbert Weitkämper</a></p>
<p>Norbert Weitkämper (<a title="Weitkämper Technology" href="http://www.weitkamper.de/" target="_blank">Weitkämper Technology</a>) was interviewed in December 2008 by Stephen E. Arnold of Beyond Search. <em>After the previous interview with Hans-Christian Brockmann, Beyond Search also means looking well beyond Kentucky!</em></p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/01/12/lucene-and-python/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Enterprise Search 2008 Wrap-Up</title>
		<link>http://mimblog.de/2009/01/07/enterprise-search-2008-wrap-up/</link>
		<comments>http://mimblog.de/2009/01/07/enterprise-search-2008-wrap-up/#comments</comments>
		<pubDate>Wed, 07 Jan 2009 16:23:30 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[Solr]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=692</guid>
		<description><![CDATA[Breakfast Links 07.01.2009
* Enterprise Search 2008 Wrap-Up
Lynda Moulton (The Gilbane Group) gives a short summary of Enterprise Search in 2008, including business developments at the big players (Microsoft and FAST, IBM and OmniFind), search/text mining technologies and a lot of interesting company names in the field of Enterprise Search and beyond.
* New Stuff [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Breakfast Links 07.01.2009</strong></p>
<p>* <a title="Enterprise Search 2008 Wrap-Up" href="http://gilbane.com/search_blog/2008/12/enterprise_search_2008_wrap-up.html" target="_blank">Enterprise Search 2008 Wrap-Up</a></p>
<p><span class="byline">Lynda Moulton (<a title="The Gilbane Group" href="http://gilbane.com/" target="_blank">The Gilbane Group</a>) gives a short summary of <strong>Enterprise Search</strong> in 2008, including business developments at the big players (Microsoft and FAST, IBM and OmniFind), search/text mining technologies and a lot of interesting company names in the field of Enterprise Search and beyond.</span><span class="byline"></span></p>
<p>* <a title="New Stuff in Lucene and Solr" href="http://lucene.grantingersoll.com/2009/01/03/new-stuff-in-lucene-and-solr/" target="_blank">New Stuff in Lucene and Solr</a></p>
<p>Solr insider Grant Ingersoll writes about new features under development for Apache Solr. I&#8217;m really excited about the <strong>SpatialSearch feature</strong>, which provides a kind of geo search (there was also an introduction to local search by Chris Hostetter at ApacheCon 2008 <a href="http://us.apachecon.com/c/acus2008/sessions/10" target="_blank">here</a>).</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/01/07/enterprise-search-2008-wrap-up/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Yahoo&#8217;s Success in Technology</title>
		<link>http://mimblog.de/2008/12/09/yahoos-success-in-technology/</link>
		<comments>http://mimblog.de/2008/12/09/yahoos-success-in-technology/#comments</comments>
		<pubDate>Tue, 09 Dec 2008 11:20:51 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[BOSS]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[uima]]></category>
		<category><![CDATA[yahoo]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=681</guid>
		<description><![CDATA[
Breakfast Links 09.12.2008
* Yahoo BOSS Now Serving 100 Queries Per Second
While Yahoo! has recently been struggling with plenty of business problems (who will be the next Yahoo! CEO?), it has kept publishing and promoting its technologies. According to the Yahoo Search Blog, BOSS is now serving 10 million queries per day! In my [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2008/12/boss_info5.gif"><img class="alignnone size-medium wp-image-682" title="boss_info5" src="http://mimblog.de/wp-content/uploads/2008/12/boss_info5-300x73.gif" alt="" width="300" height="73" /></a></p>
<p><strong>Breakfast Links 09.12.2008</strong></p>
<p>* <a title="Yahoo BOSS Now Serving 100 Queries Per Second" href="http://developer.yahoo.com/search/boss/" target="_blank">Yahoo BOSS Now Serving 100 Queries Per Second</a></p>
<p>While Yahoo! has recently been struggling with plenty of business problems (who will be the next Yahoo! CEO?), it has kept publishing and promoting its technologies. According to the <a title="Yahoo! Search Blog" href="http://www.ysearchblog.com/">Yahoo Search Blog</a>, BOSS is now serving <strong>10 million queries per day</strong>! <em>In my opinion Yahoo&#8217;s search technology is catching up fast and is far ahead of Google&#8217;s APIs.</em></p>
<p>* If you have ever been interested in a <strong>UIMA Lucene CAS Consumer</strong>, check this out:</p>
<blockquote><p>at the JULIE Lab, we&#8217;ve been working (silently) on a new (and completely<br />
altered) version of our Lucene CAS Indexer consumer (Lucas). We are<br />
planning to make this available soon &#8212; preferably in the UIMA sandbox.<br />
In fact, LUCAS now is able to perform offset-based token stream<br />
alignment and merging of UIMA annotations (via position increment) in<br />
the same Lucene field (e.g. &#8220;documenttext&#8221; or &#8220;title&#8221;), which we feel is<br />
more appropriate for text indexing &#8212; instead of putting each UIMA<br />
annotation into a separate field like the Solr approach (still possible<br />
with the new LUCAS).</p>
<p>(UIMA mailing list, Joachim Wermter)</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/12/09/yahoos-success-in-technology/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Information Management Week Summary</title>
		<link>http://mimblog.de/2008/12/05/information-management-week-summary/</link>
		<comments>http://mimblog.de/2008/12/05/information-management-week-summary/#comments</comments>
		<pubDate>Fri, 05 Dec 2008 15:21:52 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Information Management]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[data set]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[search engine]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=671</guid>
		<description><![CDATA[
I haven&#8217;t been able to write any Breakfast Links since Monday due to projects (like the upcoming relaunch of this blog), birthday parties and the hunt for Christmas presents (big, big family)! Finally I turned on my news reader today&#8230;

* Amazon Launches Public Data Sets To Ease Research
Amazon (AWS) provides public data sets to demonstrate use cases [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://family.go.com/santas-list/video/22886-01shwbpfrj6yg3x/"><img class="alignnone size-medium wp-image-672" title="santa-hannes" src="http://mimblog.de/wp-content/uploads/2008/12/santa-hannes-300x221.jpg" alt="" width="210" height="155" /></a></p>
<p>I haven&#8217;t been able to write any <a title="Meyer Information Management | Breakfast Links" href="http://mimblog.de/category/breakfastnews/" target="_blank">Breakfast Links</a> since Monday due to projects (like the upcoming relaunch of this blog), birthday parties and the hunt for Christmas presents (big, big family)! Finally I turned on my news reader today&#8230;</p>
<p><span id="more-671"></span></p>
<p>* <a title="Amazon Launches Public Data Sets To Ease Research" href="http://www.techcrunch.com/2008/12/04/amazon-launches-public-data-sets-to-ease-research/" target="_blank">Amazon Launches Public Data Sets To Ease Research</a></p>
<p>Amazon (AWS) provides public data sets to demonstrate use cases for their (cloud) services. The article on TechCrunch also links to <a title="Tasty Data Goodies" href="http://www.swivel.com/" target="_blank">Swivel</a>, which is a repository for open data sets. Unfortunately you won&#8217;t find any textual data there, but there is plenty of other data such as U.S. Census figures, historical oil prices, world photovoltaic production and more.</p>
<p>* <a title="New Open Source Search Vendor" href="http://arnoldit.com/wordpress/2008/12/05/new-open-source-search-vendor/" target="_blank">New Open Source Search Vendor</a></p>
<p>An interview with Hans-Christian Brockmann (<a title="Brox IT-Solutions GmbH" href="http://www.brox.de" target="_blank">Brox IT-Solutions GmbH</a>) about Brox&#8217;s open source strategy in the field of enterprise search. Especially interesting is Brockmann&#8217;s figure of 180,000 Lucene projects (more or less in production) out there! Brox has already come up with its Eclipse project <a title="The official eccenca weblog" href="http://eccenca.broxblogs.de/" target="_blank">SMILA</a>, which is going to be a base architecture for better integrating search and text mining into common systems. <em>Nice to hear about this kind of project in my business area!</em></p>
<p>* <a title="Z! - Episode 134: Qimaya - Anwendung neuronaler Netze" href="http://z-pod.de/archives/299-Z!-Episode-134-Qimaya-Anwendung-neuronaler-Netze.html" target="_blank">Z! &#8211; Episode 134: Qimaya &#8211; Anwendung neuronaler Netze</a> (German)</p>
<p>Interesting interview with Roy Uhlmann (<a title="Qimaya" href="http://qimaya.wordpress.com/" target="_blank">Qimaya</a>, formerly Queap) about neural networks in their search engine Qimaya. If you are interested in testing Qimaya you can request access to the demo &#8211; the next round of invitations and <a title="Anmeldungen für die 3. Passworttranche werden noch bis Freitag entgegengenommen!" href="http://qimaya.wordpress.com/2008/12/03/anmeldungen-fur-die-3-passworttranche-werden-noch-bis-freitag-entgegengenommen/" target="_blank">access codes goes out today</a>! <em>As far as I can see, Qimaya is currently a German-language-only project!</em></p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/12/05/information-management-week-summary/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
