<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Meyer Information Management Blog&#187; Breakfast Links</title>
	<atom:link href="http://mimblog.de/category/breakfastnews/feed/" rel="self" type="application/rss+xml" />
	<link>http://mimblog.de</link>
	<description>Innovationen und Technologien im Information Management</description>
	<lastBuildDate>Thu, 25 Mar 2010 20:18:39 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Search &amp; Deploy, Search Is Going SciFi</title>
		<link>http://mimblog.de/2009/01/19/search-deploy-search-is-going-scifi/</link>
		<comments>http://mimblog.de/2009/01/19/search-deploy-search-is-going-scifi/#comments</comments>
		<pubDate>Mon, 19 Jan 2009 19:57:55 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[semantic search]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=721</guid>
		<description><![CDATA[
Breakfast Links 19.01.2009
* Search &#38; Deploy
Interesting article regarding semantic search being the basis of lots of scifi applications used in star trek etc &#8211; also worth reading for non IT insider. Doesn&#8217;t the Google Android reminds you on the good ol&#8217; Star Trek Tricoder (picture above!)?
* Open Source Search &#38; Search Appliances Need Expert Attention
Article [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2009/01/tricorder-detail.jpg"><img class="alignnone size-medium wp-image-722" title="tricorder-detail" src="http://mimblog.de/wp-content/uploads/2009/01/tricorder-detail-300x269.jpg" alt="" width="210" height="188" /></a></p>
<p><strong>Breakfast Links 19.01.2009</strong></p>
<p>* <a title="Search &amp; Deploy" href="http://www.adweek.com/aw/content_display/news/digital/e3if42f3145e3efa9c4f299b95e2754a98e?pn=1" target="_blank">Search &amp; Deploy</a></p>
<p>Interesting article regarding <strong>semantic search</strong> being the basis of lots of scifi applications used in star trek etc &#8211; also worth reading for non IT insider. <em>Doesn&#8217;t the Google Android reminds you on the good ol&#8217; Star Trek Tricoder (picture above!)?</em></p>
<p>* <a title="Open Source Search &amp; Search Appliances Need Expert Attention" href="http://gilbane.com/search_blog/2009/01/open_source_search_search_appl.html" target="_blank">Open Source Search &amp; Search Appliances Need Expert Attention</a></p>
<p>Article on enterprise search concerning the need of a search expert or even better, a specialized search company. I totally share the opinion of Lynda Moulton &#8220;<strong>I am going to put my confidence in the industry to spawn a whole new category of search service organizations</strong>&#8220;.</p>
<p>* <a title="Open Source NLP and Search Engine Libraries Updated" href="http://www.searchenginecaffe.com/2009/01/open-source-nlp-and-search-engine.html" target="_blank">Open Source NLP and Search Engine Libraries Updated</a></p>
<p>A useful collection of open source search and text processing tools!</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/01/19/search-deploy-search-is-going-scifi/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Search Team from FAST moving to Information Builders</title>
		<link>http://mimblog.de/2009/01/14/search-team-from-fast-moving-to-information-builders/</link>
		<comments>http://mimblog.de/2009/01/14/search-team-from-fast-moving-to-information-builders/#comments</comments>
		<pubDate>Wed, 14 Jan 2009 15:31:54 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[business]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[hadoop]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=716</guid>
		<description><![CDATA[
Breakfast Links 14.01.2009
* Information Builders (Schweiz) AG übernimmt spezialisiertes Search–Team von fast, A Microsoft Subsidiary
Information Builders (ch) just acquired a 2 person team from FAST to build its new search excellence team. Their current product iWay Enterprise Index is an Enterprise Search solution and being built upon GSA and Lucene. Does anyone has any details [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone size-medium wp-image-719" title="swiss-knife" src="http://mimblog.de/wp-content/uploads/2009/01/swiss-knife-300x201.png" alt="" width="300" height="201" /></p>
<p><strong>Breakfast Links 14.01.2009</strong></p>
<p>* <a title="Information Builders (Schweiz) AG übernimmt spezialisiertes Search–Team von fast, A Microsoft Subsidiary" href="http://moneycab.presscab.com/de/templates/?a=58169&amp;z=79" target="_blank">Information Builders (Schweiz) AG übernimmt spezialisiertes Search–Team von fast, A Microsoft Subsidiary</a></p>
<p>Information Builders (ch) just acquired a 2 person team from FAST to build its new <strong>search excellence</strong> team. Their current product <a title="iWay Enterprise Index" href="http://www.iwaysoftware.com/products/poweredbygoogle.html" target="_blank">iWay Enterprise Index</a> is an Enterprise Search solution and being built upon GSA and Lucene. <em>Does anyone has any details about this combination of GSA and Lucene?</em></p>
<p>* <a title="HDFS Reliability" href="http://www.cloudera.com/resources/hdfs-reliability" target="_blank">HDFS Reliability</a></p>
<p>Tom White from Cloudera (company from US with a business model around Apache Hadoop) released a paper about the reliability of <a title="HDFS" href="http://en.wikipedia.org/wiki/HDFS" target="_blank">HDFS</a> with some useful recommendations on the productive use of it.</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/01/14/search-team-from-fast-moving-to-information-builders/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Information Architecture Conference In Hamburg</title>
		<link>http://mimblog.de/2009/01/13/information-architecture-conference-in-hamburg/</link>
		<comments>http://mimblog.de/2009/01/13/information-architecture-conference-in-hamburg/#comments</comments>
		<pubDate>Tue, 13 Jan 2009 16:53:54 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[data set]]></category>
		<category><![CDATA[event]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=707</guid>
		<description><![CDATA[
Breakfast Links 13.01.2009
* IA Konferenz 2009
I never went to this conference before and pretty happy about the location (100km from my home office) in Hamburg this year. Call for papers is not done yet but there were also interesting topics at the IA 2007 you can find here.
* Virtualization and Search: Performance Tests Summary
The answer [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.iakonferenz.org/de/2009/program.html"><img style="border:0;" src="http://www.iakonferenz.org/de/2009/downloads/banner_400x50.gif" alt="IA Konferenz in Hamburg: 16 und 17 Mai" width="400" height="50" /></a></p>
<p><strong>Breakfast Links 13.01.2009</strong></p>
<p>* <a title="IA Konferenz 2009" href="http://www.iakonferenz.org/de/2009/news.html" target="_blank">IA Konferenz 2009</a></p>
<p>I never went to this conference before and pretty happy about the location (100km from my home office) in Hamburg this year. Call for papers is not done yet but there were also interesting topics at the IA 2007 <a title="IAK Programm 2007" href="http://www.iakonferenz.org/de/2007/downloads/IAK-Programm_2007.pdf" target="_blank">you can find here</a>.</p>
<p>* <a title="Virtualization and Search: Performance Tests Summary" href="http://www.enterprisesearchblog.com/2009/01/virtualization-and-search-performance-tests-summary.html" target="_blank">Virtualization and Search: Performance Tests Summary</a></p>
<p>The answer of what I&#8217;ve ever asked myself and never tried out. <em>The article also brought me to the public <a title="Enron Email Dataset" href="http://www.cs.cmu.edu/~enron/" target="_blank">Enron email data set</a>, wow, never heard about it so far!</em></p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/01/13/information-architecture-conference-in-hamburg/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Lucene and Python</title>
		<link>http://mimblog.de/2009/01/12/lucene-and-python/</link>
		<comments>http://mimblog.de/2009/01/12/lucene-and-python/#comments</comments>
		<pubDate>Mon, 12 Jan 2009 16:04:07 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[lucene]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=700</guid>
		<description><![CDATA[
Breakfast Links 12.01.2009
* PyLucene &#8211; Python Lucene &#8211; Joins Apache Lucene 
PyLucene is a python based port of Lucene which is build automatically from the original Lucene source. It just joined Apache Lucene as a sub project. Does this means I can run Lucene in Google&#8217;s App Engine?
* Interview with Norbert Weitkämper
Norbert Weitkämper (Weitkämper Technology) [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2009/01/lucene-python-pylucene.png"><img class="alignnone size-medium wp-image-703" title="lucene-python-pylucene" src="http://mimblog.de/wp-content/uploads/2009/01/lucene-python-pylucene.png" alt="" width="300" height="82" /></a></p>
<p><strong>Breakfast Links 12.01.2009</strong></p>
<p>* <a title="PyLucene - Python Lucene - Joins Apache Lucene " href="http://www.jroller.com/otis/entry/pylucene_python_lucene_joins_apache" target="_blank">PyLucene &#8211; Python Lucene &#8211; Joins Apache Lucene </a></p>
<p>PyLucene is a python based port of Lucene which is build automatically from the original Lucene source. It just joined <a title="Apache Lucene" href="http://lucene.apache.org" target="_blank">Apache Lucene</a> as a sub project. <em>Does this means I can run Lucene in Google&#8217;s App Engine?</em></p>
<p>* <a title="An Interview with Norbert Weitkämper" href="http://www.arnoldit.com/search-wizards-speak/xsearch.html" target="_blank">Interview with Norbert Weitkämper</a></p>
<p>Norbert Weitkämper (<a title="Weitkämper Technology" href="http://www.weitkamper.de/" target="_blank">Weitkämper Technology</a>) interviewed in december 2008 by Stephen E. Arnold from Beyond Search. <em>After the previous interview with Hans-Christian Brockmann, Beyond Search means also a lot of beyond Kentucky!</em></p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/01/12/lucene-and-python/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>TrustYou Semantic Meta Search Engine</title>
		<link>http://mimblog.de/2009/01/08/trustyou-semantic-meta-search-engine/</link>
		<comments>http://mimblog.de/2009/01/08/trustyou-semantic-meta-search-engine/#comments</comments>
		<pubDate>Thu, 08 Jan 2009 12:27:50 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[semantic]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=694</guid>
		<description><![CDATA[
Breakfast Links 08.01.2009
* TrustYou &#8211; make a good choice
I just stumbled upon a press release from TrustYou, a search engine for hotel and restaurant recommendations. TrustYou calls itself a semantic meta search engine. Meta because it makes use of various existing hotel recommendation platforms like HRS, hotel.de and more. Semantic because of its pre-processing methods [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2009/01/trustyou-logo.jpg"><img class="alignnone size-medium wp-image-695" title="trustyou-logo" src="http://mimblog.de/wp-content/uploads/2009/01/trustyou-logo.jpg" alt="" width="260" height="63" /></a></p>
<p><strong>Breakfast Links 08.01.2009</strong></p>
<p>* <a title="TrustYou" href="http://www.trustyou.com" target="_blank">TrustYou &#8211; make a good choice</a></p>
<p>I just stumbled upon a press release from TrustYou, a search engine for hotel and restaurant recommendations. TrustYou calls itself a semantic meta search engine. <strong>Meta</strong> because it makes use of various existing hotel recommendation platforms like HRS, hotel.de and more. <strong>Semantic</strong> because of its pre-processing methods for heterogeneous text data about hotels/restaurant and user comments. <em>Yeah, finally a new alternaitve search engine from germany, chapeau!</em></p>
<p>See also this post: <a title="TrustYou durchsucht Hotel-Bewertungen" href="http://www.deutsche-startups.de/2009/01/08/trustyou-durchsucht-hotel-bewertungen/" target="_blank">TrustYou durchsucht Hotel-Bewertungen</a> (german)</p>
<p>* <a title="Google Semantic Surfacing" href="http://arnoldit.com/wordpress/2009/01/08/google-semantics-surfacing/" target="_blank">Google Semantics Surfacing</a></p>
<p>Stephen Arnold&#8217;s response to the article &#8220;<a title="Did Google Just Expose Semantic Data in Search Results?" href="http://www.readwriteweb.com/archives/google_semantic_data.php" target="_blank">Did Google Just Expose Semantic Data in Search Results?</a>&#8221; from readwriteweb.com.</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/01/08/trustyou-semantic-meta-search-engine/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Channel Intelligence Case Study at AWS</title>
		<link>http://mimblog.de/2009/01/06/channel-intelligence-case-study-at-aws/</link>
		<comments>http://mimblog.de/2009/01/06/channel-intelligence-case-study-at-aws/#comments</comments>
		<pubDate>Tue, 06 Jan 2009 11:39:25 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[image classification]]></category>
		<category><![CDATA[nutch]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=690</guid>
		<description><![CDATA[
Breakfast Links 06.01.2009
* Channel Intelligence Case Study: Amazon Web Services
Interesting case study about how Channel Intelligence uses a semi-automatic way to categorize masses of images and product data. CI uses Amazaon&#8217;s Mechanical Turk &#8211; an online marketplace for human workforce &#8211; to support their automated categorization system.
* Sky Grid: Thomson Reuters and Bloomberg Challenger
A quick [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2009/01/logo_channelintelligence.gif"><img class="alignnone size-medium wp-image-689" title="logo_channelintelligence" src="http://mimblog.de/wp-content/uploads/2009/01/logo_channelintelligence.gif" alt="" width="198" height="65" /></a></p>
<p><strong>Breakfast Links 06.01.2009</strong></p>
<p>* <a title="Channel Intelligence Case Study: Amazon Web Services" href="http://aws.amazon.com/solutions/case-studies/Channel-Intelligence/" target="_blank">Channel Intelligence Case Study: Amazon Web Services</a></p>
<p>Interesting case study about how <a title="Channel Intelligence" href="http://www.channelintelligence.com/" target="_blank">Channel Intelligence</a> uses a semi-automatic way to categorize masses of images and product data. CI uses Amazaon&#8217;s <a title="Amazon Mechanical Turk" href="http://aws.amazon.com/mturk/" target="_blank">Mechanical Turk</a> &#8211; an online marketplace for human workforce &#8211; to support their automated categorization system.</p>
<p>* <a title="Sky Grid: Thomson Reuters and Bloomberg Challenger" href="http://arnoldit.com/wordpress/2009/01/06/sky-grid-thomson-reuters-and-bloomberg-challenger/" target="_blank">Sky Grid: Thomson Reuters and Bloomberg Challenger</a></p>
<p>A quick capture of <a title="Sky Grid" href="http://www.skygrid.com/" target="_blank">Sky Grid</a> (US company) from Beyond Search, especially regarding its Business Model and opportunities. <em>This is actually a kind of software I built a couple years ago for different companies (10.000+) over here in germany!</em></p>
<p>* <strong>Nutch 1.0 in 1-2 weeks?</strong></p>
<p>Just read a message from Dennis Kubes on the Nutch Mailing-List regarding the release of Nutch 1.0! The <a href="http://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&amp;mode=hide&amp;sorter/order=DESC&amp;sorter/field=priority&amp;resolution=-1&amp;pid=10680&amp;fixfor=12312443" target="_blank">Jira link</a> is also pretty interesting about the upcoming features&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/01/06/channel-intelligence-case-study-at-aws/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Yahoo&#8217;s Success in Technology</title>
		<link>http://mimblog.de/2008/12/09/yahoos-success-in-technology/</link>
		<comments>http://mimblog.de/2008/12/09/yahoos-success-in-technology/#comments</comments>
		<pubDate>Tue, 09 Dec 2008 11:20:51 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[BOSS]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[uima]]></category>
		<category><![CDATA[yahoo]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=681</guid>
		<description><![CDATA[
Breakfast Links 09.12.2008
* Yahoo BOSS Now Serving 100 Queries Per Second
While Yahoo! was recently struggling with lots of negative issues in terms of their business (who is next Yahoo! CEO?), they kept on publishing and propagating their technologies. According to the Yahoo Search Blog, BOSS is now serving 10 million queries per day! In my [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2008/12/boss_info5.gif"><img class="alignnone size-medium wp-image-682" title="boss_info5" src="http://mimblog.de/wp-content/uploads/2008/12/boss_info5-300x73.gif" alt="" width="300" height="73" /></a></p>
<p><strong>Breakfast Links 09.12.2008</strong></p>
<p>* <a title="Yahoo BOSS Now Serving 100 Queries Per Second" href="http://developer.yahoo.com/search/boss/" target="_blank">Yahoo BOSS Now Serving 100 Queries Per Second</a></p>
<p>While Yahoo! was recently struggling with lots of negative issues in terms of their business (who is next Yahoo! CEO?), they kept on publishing and propagating their technologies. According to the <a title="Yahoo! Search Blog" href="http://www.ysearchblog.com/">Yahoo Search Blog</a>, BOSS is now serving <strong>10 million queries per day</strong>! <em>In my opinion Yahoo&#8217;s search technology is catching up fast and is far ahead of Google&#8217;s APIs.</em></p>
<p>* If you were ever interested in an <strong>UIMA Lucene Cas Consumer</strong>, check this out:</p>
<blockquote><p>at the JULIE Lab, we&#8217;ve been working (silently) on a new (and completely<br />
altered) version of our Lucene CAS Indexer consumer (Lucas). We are<br />
planning to make this available soon &#8212; preferably in the UIMA sandbox.<br />
In fact, LUCAS now is able to perform offset-based token stream<br />
alignment and merging of UIMA annotations (via position increment) in<br />
the same Lucene field (e.g. &#8220;documenttext&#8221; or &#8220;title&#8221;), which we feel is<br />
more appropriate for text indexing &#8212; instead of putting each UIMA<br />
annotation into a separate field like the Solr approach (still possible<br />
with the new LUCAS).</p>
<p>(UIMA mailing list, Joachim Wermter)</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/12/09/yahoos-success-in-technology/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Enterprise Search Study</title>
		<link>http://mimblog.de/2008/12/08/enterprise-search-study/</link>
		<comments>http://mimblog.de/2008/12/08/enterprise-search-study/#comments</comments>
		<pubDate>Mon, 08 Dec 2008 09:42:47 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[study]]></category>
		<category><![CDATA[tika]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=677</guid>
		<description><![CDATA[
Breakfast Links 08.12.2008
* Arnold White Study Published
Galatea published the study &#8220;Successful Enterprise Search Management&#8221; by Stephen E. Arnold (Beyond Search) and Martin White (Interview at ArnoldIT). The study is essentially about how to manage an enterprise search environnement, beginning from the selection and integration of a vendor and how a project team is setup. There [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.galatea.co.uk/index.php?page=shop.product_details&amp;flypage=shop.flypage&amp;product_id=36&amp;category_id=8&amp;manufacturer_id=0&amp;option=com_virtuemart&amp;Itemid=44"><img class="alignnone size-medium wp-image-679" title="study-enterprise-search" src="http://mimblog.de/wp-content/uploads/2008/12/study-enterprise-search.jpg" alt="" width="126" height="178" /></a></p>
<p><strong>Breakfast Links 08.12.2008</strong></p>
<p>* <a title="Arnold White Study Published" href="http://arnoldit.com/wordpress/2008/12/08/arnold-white-study-published/" target="_blank">Arnold White Study Published</a></p>
<p>Galatea published the study &#8220;<strong>Successful Enterprise Search Management</strong>&#8221; by Stephen E. Arnold (<a title="Beyond Search" href="http://arnoldit.com/wordpress" target="_blank">Beyond Search</a>) and Martin White (<a title="Search Wizards Speak Martin White" href="http://www.arnoldit.com/search-wizards-speak/martin-white.html" target="_blank">Interview</a> at ArnoldIT). The study is essentially about how <strong>to manage an enterprise search environnement</strong>, beginning from the selection and integration of a vendor and how a project team is setup. There are also some chapters about search and technology in general like content processing (text mining). <em>I don&#8217;t know yet wether it makes sense to get the study for businesses in germany but will write a report on it!</em></p>
<p>* <a title="Tika and Solr" href="http://lucene.grantingersoll.com/2008/12/06/tika-and-solr/" target="_blank">Tika and Solr</a></p>
<p>Grant Ingersoll just committed an <a title="ExtractingRequestHandler Solr Wiki" href="http://wiki.apache.org/solr/ExtractingRequestHandler" target="_blank">ExtractingRequestHandler</a> module for Solr which adds <a title="Apache Tika" href="http://lucene.apache.org/tika/" target="_blank">Tika</a> support to solr. Meaning, it is now possible to <strong>feed Solr with a variety of common document formats</strong> like Office, PDF, HTML and <a title="Tika supported file formats" href="http://lucene.apache.org/tika/formats.html" target="_blank">more</a>. <em>Awesome!</em></p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/12/08/enterprise-search-study/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Start of german Lucene Online Series</title>
		<link>http://mimblog.de/2008/12/01/start-of-german-lucene-online-series/</link>
		<comments>http://mimblog.de/2008/12/01/start-of-german-lucene-online-series/#comments</comments>
		<pubDate>Mon, 01 Dec 2008 10:29:01 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[sharepoint]]></category>
		<category><![CDATA[taxonomy]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=666</guid>
		<description><![CDATA[
Breakfast Links 01.12.2008
* Start of german Lucene Online Series
JAXenter just started a new article series on Apache Lucene by Bernd Fondermann. The first article roughly describes Lucene and it&#8217;s API with and indexing and search example. Nothing new, but in german. Later articles will cover Solr, Nutch and Tika (yeeehaa)!
* Concept Searching
Stephen Arnold on Concept [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2008/10/lucene_green_300.gif"><img class="alignnone size-medium wp-image-501" title="lucene_green_300" src="http://mimblog.de/wp-content/uploads/2008/10/lucene_green_300.gif" alt="" width="300" height="46" /></a></p>
<p><strong>Breakfast Links 01.12.2008</strong></p>
<p>* <a title="Start of german Lucene Online Series" href="http://www.brainlounge.de/blog_lucene_series_1.html" target="_blank">Start of german Lucene Online Series</a></p>
<p><a title="it republik - JAXenter" href="http://it-republik.de/jaxenter/" target="_blank">JAXenter</a> just started a new article series on Apache Lucene by <a title="Brainlounge Bernd Fondermann" href="http://www.brainlounge.de" target="_blank">Bernd Fondermann</a>. The first article roughly describes Lucene and it&#8217;s API with and indexing and search example. <em>Nothing new, but in german</em>. Later articles will cover Solr, Nutch and Tika (yeeehaa)!</p>
<p>* <a title="Concept Searching" href="http://arnoldit.com/wordpress/2008/11/30/concept-searching/" target="_blank">Concept Searching</a></p>
<p>Stephen Arnold on <a title="ConceptSearching" href="http://www.conceptsearching.com/web/" target="_blank">Concept Searching Inc.</a> which delivers software products around taxonomy creation and automatic classification. <em>Again, this company also offers its services and products around Microsoft Sharepoint!</em><br />
<a title="Start of german Lucene Online Series" href="http://www.brainlounge.de/blog_lucene_series_1.html" target="_blank"></a></p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/12/01/start-of-german-lucene-online-series/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Become a Knowledge Hero</title>
		<link>http://mimblog.de/2008/11/28/become-a-knowledge-hero/</link>
		<comments>http://mimblog.de/2008/11/28/become-a-knowledge-hero/#comments</comments>
		<pubDate>Fri, 28 Nov 2008 10:34:11 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[knowledge management]]></category>
		<category><![CDATA[lucene]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=660</guid>
		<description><![CDATA[
Breakfast Links 28.11.2008
* You are a knowledge worker. Become a knowledge hero.
I just discovered the belgium company Whatever which launched a software in the field of knowledge management called Knowledge Plaza. Knowledge Plaza is focused on Enterprise Social Search methods which are roughly described here by a Whatever employee. Under the hood, Knowledge Plaza uses [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2008/11/hero-flickr-r8r.jpg"><img class="alignnone size-medium wp-image-661" title="hero-flickr-r8r" src="http://mimblog.de/wp-content/uploads/2008/11/hero-flickr-r8r-218x300.jpg" alt="" width="218" height="300" /></a></p>
<p><strong>Breakfast Links 28.11.2008</strong></p>
<p>* <a title="Knowledge Plaza" href="http://www.knowledgeplaza.be/features.html" target="_blank">You are a knowledge worker. Become a knowledge hero.</a></p>
<p>I just discovered the belgium company <a title="Whatever" href="http://www.whatever-company.com/" target="_blank">Whatever</a> which launched a software in the field of knowledge management called Knowledge Plaza. Knowledge Plaza is focused on Enterprise Social Search methods which are roughly <a title="Enterprise Social Search slideshow" href="http://raphael.slinckx.net/blog/2008-04-22/enterprise-social-search" target="_blank">described here</a> by a Whatever employee. Under the hood, Knowledge Plaza uses Apache Lucene core for Search (?) and <a title="Aperture" href="http://aperture.sourceforge.net/" target="_blank">Aperture</a> for text extraction.</p>
<p>* <a title="Hadoop Map-Reduce – Tuning and Debugging" href="http://infram.wordpress.com/2008/11/28/hadoop-map-reduce-%E2%80%93-tuning-and-debugging/" target="_blank">Hadoop Map-Reduce – Tuning and Debugging</a></p>
<p>Just stumbled upon this overview for tuning and debugging Hadoop.</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/11/28/become-a-knowledge-hero/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
