<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Meyer Information Management Blog&#187; mahout</title>
	<atom:link href="http://mimblog.de/tag/mahout/feed/" rel="self" type="application/rss+xml" />
	<link>http://mimblog.de</link>
	<description>Innovations and Technologies in Information Management</description>
	<lastBuildDate>Thu, 25 Mar 2010 20:18:39 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Apache Mahout Picks Up Speed</title>
		<link>http://mimblog.de/2009/09/07/apache-mahout-nimmt-fahrt-auf/</link>
		<comments>http://mimblog.de/2009/09/07/apache-mahout-nimmt-fahrt-auf/#comments</comments>
		<pubDate>Mon, 07 Sep 2009 10:15:36 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[machine learning]]></category>
		<category><![CDATA[collaborative filtering]]></category>
		<category><![CDATA[mahout]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=885</guid>
		<description><![CDATA[
Traffic is picking up not only on the project&#8217;s mailing list (Apache Mahout Status by Grant Ingersoll); the first projects that use Mahout successfully are also being announced.

One of them is Mippin, a (mobile) news aggregator and organizer from Great Britain. Mippin collects information about its users&#8217; browsing behavior and generates profile-based suggestions of topically similar websites. That is, the more a user [...]]]></description>
			<content:encoded><![CDATA[<p><a title="Apache Mahout" href="http://lucene.apache.org/mahout/" target="_blank"><img class="size-full wp-image-886 alignleft" title="Mahout-logo-82x100" src="http://mimblog.de/wp-content/uploads/2009/09/Mahout-logo-82x100.png" alt="Mahout-logo-82x100" width="82" height="100" /></a></p>
<p>Traffic is picking up not only on the project&#8217;s mailing list (<a title="Apache Mahout Status" href="http://lucene.grantingersoll.com/2009/06/16/apache-mahout-status/" target="_blank">Apache Mahout Status by Grant Ingersoll</a>); the first projects that use Mahout successfully are also being announced.</p>
<p><span id="more-885"></span></p>
<p>One of them is <a title="Mippin" href="http://mippin.com" target="_blank">Mippin</a>, a (mobile) news aggregator and organizer from Great Britain. Mippin collects information about its users&#8217; browsing behavior and generates profile-based suggestions of topically similar websites. That is, the more a user lets the software &#8220;read along&#8221; and the more users Mippin has, the better the suggestions work. This approach is called <strong>collaborative filtering</strong>.</p>
<p>For this, Mippin uses the recommendation system in Apache Mahout and thereby relies on the same tricks as Amazon&#8217;s recommendation system. According to Sean Owen (of Mippin and a committer on Apache Mahout), the goal of Apache Mahout is to make <strong>machine learning</strong> algorithms available to all developers, so that the discipline <strong>does not remain rocket science</strong>.</p>
<p>Here are a few more sources on Apache Mahout:</p>
<ul>
<li><a title="Apache Mahout: Highly Scalable Machine Learning Algorithms" href="http://www.infoq.com/news/2009/04/mahout" target="_blank">InfoQ: Apache Mahout, Highly Scalable Machine Learning Algorithms</a></li>
<li><a title="Mahout on Elastic MapReduce: Running k-means Clustering" href="http://www.thecepblog.com/2009/05/07/mahout-on-elastic-mapreduce-running-k-means-clustering/" target="_blank">Cyberstrategics: Running k-means Clustering</a></li>
<li><a title="Mahout Review" href="http://www.iletken-project.com/index.php?option=com_content&amp;view=category&amp;layout=blog&amp;id=34&amp;Itemid=59&amp;lang=en" target="_blank">Iletken Blog: Mahout Review</a> (a critical benchmark)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/09/07/apache-mahout-nimmt-fahrt-auf/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Apache Nutch 1.0 Finally Released</title>
		<link>http://mimblog.de/2009/03/30/apache-nutch-10-finally-released/</link>
		<comments>http://mimblog.de/2009/03/30/apache-nutch-10-finally-released/#comments</comments>
		<pubDate>Mon, 30 Mar 2009 07:35:04 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Information Management]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[mahout]]></category>
		<category><![CDATA[nutch]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=779</guid>
		<description><![CDATA[
It has been two months since I announced the Nutch 1.0 release. Finally, after a 1.0 release candidate, Nutch 1.0 was released last week! Congratulations!


If you are also into machine learning, you should check out the release of Mahout in its early version 0.3, and if you are new to it, check [...]]]></description>
			<content:encoded><![CDATA[<p><img class="size-full wp-image-740 alignnone" title="nutch-logo" src="http://mimblog.de/wp-content/uploads/2009/02/nutch-logo.gif" alt="nutch-logo" width="121" height="48" /></p>
<p>It has been <a title="Nutch 1.0" href="http://mimblog.de/2009/02/02/nutch-release-10/" target="_blank">two months since I announced</a> the Nutch 1.0 release. Finally, after a 1.0 release candidate, <a title="Apache Nutch 1.0 Released" href="http://lucene.apache.org/nutch/#23+March+2009+-+Apache+Nutch+1.0+Released" target="_blank">Nutch was released in version 1.0</a> last week! Congratulations!</p>
<p><span id="more-779"></span></p>
<p><img class="size-full wp-image-780 alignnone" title="mahout-logo-82x100" src="http://mimblog.de/wp-content/uploads/2009/03/mahout-logo-82x100.png" alt="mahout-logo-82x100" width="82" height="100" /></p>
<p>If you are also into machine learning, you should check out the <a title="Apache Mahout" href="http://lucene.apache.org/mahout/" target="_blank">release of Mahout</a> in its early version 0.3, and if you are new to it, check out <a title="Introducing Mahout: Apache Machine Learning" href="http://www.eu.apachecon.com/c/aceu2009/sessions/136" target="_blank">Grant Ingersoll&#8217;s presentation on Mahout</a> from ApacheCon Europe 2009.</p>
<p>P.S.: Also note the new <a title="Apache Droids" href="http://incubator.apache.org/droids/" target="_blank">Apache Droids</a> tab in the Lucene navigation bar!</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/03/30/apache-nutch-10-finally-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ApacheCon US 2008 Lucene (And Beyond) Sessions</title>
		<link>http://mimblog.de/2008/11/17/apachecon-us-2008-lucene-and-beyond-sessions/</link>
		<comments>http://mimblog.de/2008/11/17/apachecon-us-2008-lucene-and-beyond-sessions/#comments</comments>
		<pubDate>Mon, 17 Nov 2008 20:39:50 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Information Management]]></category>
		<category><![CDATA[apache]]></category>
		<category><![CDATA[content extraction]]></category>
		<category><![CDATA[event]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[mahout]]></category>
		<category><![CDATA[tika]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=619</guid>
		<description><![CDATA[
Unfortunately, I was not able to attend ApacheCon US 2008 in New Orleans this year &#8211; way too far away from good ol&#8217; Germany! But I reviewed the sessions on Lucene (and Solr, Mahout, Tika) after the conference to get some inspiration. Some comments on them:

* Advanced Indexing Techniques with Apache [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2008/11/acus08basic.jpg"><img class="alignnone size-medium wp-image-620" title="acus08basic" src="http://mimblog.de/wp-content/uploads/2008/11/acus08basic-300x71.jpg" alt="" width="300" height="71" /></a></p>
<p>Unfortunately, I was not able to attend <a title="ApacheCon US 2008" href="http://us.apachecon.com/c/acus2008/" target="_blank">ApacheCon US 2008</a> in New Orleans this year &#8211; way too far away from good ol&#8217; Germany! But I reviewed the sessions on <strong>Lucene</strong> (and <strong>Solr</strong>, <strong>Mahout</strong>, <strong>Tika</strong>) after the conference to get some inspiration. Some comments on them:</p>
<p><span id="more-619"></span></p>
<p>* <a title="Advanced Indexing Techniques with Apache Lucene" href="http://us.apachecon.com/c/acus2008/sessions/7" target="_blank">Advanced Indexing Techniques with Apache Lucene</a> (<a title="Michael Busch" href="http://us.apachecon.com/c/acus2008/speakers/44" target="_blank">Michael Busch</a>)</p>
<p>Detailed presentation about the indexing capabilities of Apache Lucene, with a very interesting part on how to use <strong>Token Payloads</strong> and POS tagging with the new <strong>TokenStream API</strong>.</p>
<p>* <a title="Apache Solr: Out of the Box" href="http://us.apachecon.com/c/acus2008/sessions/9" target="_blank">Apache Solr: Out of the Box</a> (<a title="Chris Hostetter" href="http://people.apache.org/~hossman/" target="_blank">Chris Hostetter</a>)</p>
<p>Introduction to Solr, covering installation and administration (Admin Console, Luke), querying (Facets, Highlighting), and configuration (Analyzers, Multiple Indexes, Replication).</p>
<p>* <a title="Introducing Mahout: Apache Machine Learning" href="http://us.apachecon.com/c/acus2008/sessions/11" target="_blank">Introducing Mahout: Apache Machine Learning</a> (<a title="Grant Ingersoll" href="http://grantingersoll.com/" target="_blank">Grant Ingersoll</a>)</p>
<p>I already posted this session in my <a title="Intro to Mahout" href="http://mimblog.de/2008/11/10/intro-to-mahout/" target="_blank">recent Breakfast Links</a>. A nice presentation about what machine learning is and the approach Mahout takes.</p>
<p>* <a title="Apache Solr: Beyond the Box" href="http://us.apachecon.com/c/acus2008/sessions/10" target="_blank">Apache Solr: Beyond the Box</a> (<a title="Chris Hostetter" href="http://people.apache.org/~hossman/" target="_blank">Chris Hostetter</a>)</p>
<p>Presentation about Solr&#8217;s history and real-world examples such as geo search.</p>
<p>* <a title="Content analysis for ECM with Apache Tika" href="http://us.apachecon.com/c/acus2008/sessions/12" target="_blank">Content analysis for ECM with Apache Tika</a> (<a title="Paolo Mottadelli" href="http://www.paolomottadelli.com/" target="_blank">Paolo Mottadelli</a>)</p>
<p>Impressive and extensive presentation about Apache Tika and its Alfresco integration for content extraction.</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/11/17/apachecon-us-2008-lucene-and-beyond-sessions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Intro to Mahout</title>
		<link>http://mimblog.de/2008/11/10/intro-to-mahout/</link>
		<comments>http://mimblog.de/2008/11/10/intro-to-mahout/#comments</comments>
		<pubDate>Mon, 10 Nov 2008 08:53:23 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[mahout]]></category>
		<category><![CDATA[performance]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=616</guid>
		<description><![CDATA[
Breakfast Links 10.11.2008
* Introducing Mahout: Apache Machine Learning
Grant Ingersoll posted his presentation about Mahout from last week&#8217;s ApacheCon US 2008. The presentation covers machine learning in general, different approaches and methods, and the current status of Mahout. Check it out!
* Multi Core Chips and Search Performance
Interesting article on search performance in terms of [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2008/10/feather.jpg"><img class="alignnone size-medium wp-image-442" title="feather" src="http://mimblog.de/wp-content/uploads/2008/10/feather-300x90.jpg" alt="" width="300" height="90" /></a></p>
<p><strong>Breakfast Links 10.11.2008</strong></p>
<p>* <a title="Introducing Mahout: Apache Machine Learning" href="http://us.apachecon.com/c/acus2008/sessions/11" target="_blank">Introducing Mahout: Apache Machine Learning</a></p>
<p>Grant Ingersoll posted his presentation about Mahout from last week&#8217;s <a title="ApacheCon US 2008" href="http://us.apachecon.com/c/acus2008/" target="_blank">ApacheCon US 2008</a>. The presentation covers machine learning in general, different approaches and methods, and the current status of Mahout. Check it out!</p>
<p>* <a title="Multi Core Chips and Search Performance" href="http://arnoldit.com/wordpress/2008/11/10/multi-core-chips-and-search-performance/" target="_blank">Multi Core Chips and Search Performance</a></p>
<p>Interesting article on search performance in terms of scalability through hardware.</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/11/10/intro-to-mahout/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Another Video Search Engine from Exalead</title>
		<link>http://mimblog.de/2008/11/03/another-video-search-engine-from-exalead/</link>
		<comments>http://mimblog.de/2008/11/03/another-video-search-engine-from-exalead/#comments</comments>
		<pubDate>Mon, 03 Nov 2008 10:43:00 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[mahout]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[video search]]></category>
		<category><![CDATA[voice to text]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=595</guid>
		<description><![CDATA[
Breakfast Links 03.11.2008
* Exalead-Labs: voxalead
The Parisian search engine company Exalead released their video search engine voxalead. It combines Exalead&#8217;s indexing (entity recognition) and search technology with third-party voice recognition (LIMSI). Currently they have indexed mostly French and US channels and a small number of German channels. Besides searching the voice-to-text extracted content, you [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2008/11/logo_voxalead_big.gif"><img class="alignnone size-medium wp-image-596" title="logo_voxalead_big" src="http://mimblog.de/wp-content/uploads/2008/11/logo_voxalead_big.gif" alt="" width="262" height="105" /></a></p>
<p><strong>Breakfast Links 03.11.2008</strong></p>
<p>* <a title="voxalead" href="http://voxalead.labs.exalead.com" target="_blank">Exalead-Labs: voxalead</a></p>
<p>The Parisian search engine company <a title="Exalead" href="http://www.exalead.com/software/" target="_blank">Exalead</a> released their video search engine <a title="About Voxalead" href="http://labs.exalead.com/index.php?option=com_content&amp;view=article&amp;catid=37:features-demos&amp;id=49:tv-news-search" target="_blank">voxalead</a>. It combines Exalead&#8217;s indexing (entity recognition) and search technology with third-party voice recognition (<a title="LIMSI" href="http://www.limsi.fr/" target="_blank">LIMSI</a>). Currently they have indexed <strong>mostly French and US channels and a small number of German channels</strong>. Besides searching the voice-to-text extracted content, you can also view the <strong>original extracted text</strong>, which shows off the quality of voxalead!</p>
<p>* <a title="The German metasearch engine “MetaGer”" href="http://altsearchengines.com/2008/11/02/the-german-metasearch-engine-metager/" target="_blank">Metager at Alt Search Engines</a></p>
<p><a title="Metager" href="http://metager.de/" target="_blank">Metager</a> now has its place at <a title="Alt Search Engines" href="http://www.altsearchengines.com/" target="_blank">AltSearchEngines.com</a> in English and German!</p>
<p>* <a title="Twenty Newsgroups Classification" href="http://cwiki.apache.org/confluence/display/MAHOUT/TwentyNewsgroups" target="_blank">Mahout: Twenty Newsgroups Classification</a></p>
<p>This weekend I stumbled upon a nice <strong>Mahout classification example</strong>. It is all about creating a classification model based on twenty clusters of newsgroup messages. All you need is Mahout and Hadoop to get the example running in about 20 minutes.</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/11/03/another-video-search-engine-from-exalead/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

