<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Meyer Information Management Blog&#187; search engine</title>
	<atom:link href="http://mimblog.de/tag/search-engine/feed/" rel="self" type="application/rss+xml" />
	<link>http://mimblog.de</link>
	<description>Innovationen und Technologien im Information Management</description>
	<lastBuildDate>Thu, 25 Mar 2010 20:18:39 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>NetBase und die semantische Suche (Content Intelligence)</title>
		<link>http://mimblog.de/2009/05/05/netbase-und-die-semantische-suche-content-intelligence/</link>
		<comments>http://mimblog.de/2009/05/05/netbase-und-die-semantische-suche-content-intelligence/#comments</comments>
		<pubDate>Tue, 05 May 2009 16:18:54 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Information Management]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[semantic search]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=828</guid>
		<description><![CDATA[Nachdem ich bereits in der letzten Woche im Rahmen des Semantic Web Day Leipzig über Transinsight und der semantischen Suche von GoPubMed berichtet hatte, ging mir heute die Meldung von NetBase Content Intelligence und deren Demonstrator  (ebenfalls im Gesundheits-Sektor auf Basis von PubMed) ins Netz.
Wo andere Suchmaschinen versuchen den Kontext des Inhalts aus Keyword-Assoziationen zu [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.netbase.com/"><img class="alignleft size-full wp-image-829" title="netbase-150x150" src="http://mimblog.de/wp-content/uploads/2009/05/netbase-150x150.png" alt="netbase-150x150" width="150" height="44" /></a>Nachdem ich bereits in der letzten Woche im Rahmen des <a title="Semantic Web Day Leipzig Teil I" href="http://mimblog.de/2009/04/30/semantic-web-day-2009-teil-i/" target="_blank">Semantic Web Day Leipzig</a> über Transinsight und der semantischen Suche von <a title="GoPubMed" href="http://gopubmed.org/" target="_blank">GoPubMed</a> berichtet hatte, ging mir heute die Meldung von <a title="NetBase" href="http://www.netbase.com" target="_blank">NetBase</a> <strong>Content Intelligence</strong> und deren Demonstrator  (ebenfalls im Gesundheits-Sektor auf Basis von PubMed) ins Netz.</p>
<p><span id="more-828"></span>Wo andere Suchmaschinen versuchen den <strong>Kontext des Inhalts aus Keyword-Assoziationen zu berechnen</strong>, soll NetBase einen Schritt weiter gehen und auf Basis einer <strong>Analyse ganzer Sätze</strong> die Assoziationen von Sätzen und Bedeutungen zueinander automatisch erkennen. Somit soll Content Intelligence auch ganz ohne vorherige Wissens-Modellierung oder Lexika auskommen können.</p>
<p>Damit folgt NetBase dem Trend weg von der Suchmaschine die Suchergebnis-Listen ausspuckt, hin zu Frage-Antwort-Maschine a la <a title="Wolfram Alpha" href="http://www.spiegel.de/netzwelt/tech/0,1518,612268,00.html" target="_blank">WolframAlpha</a>. Netbase wird allerdings vorraussichtlich keine eigene Suchmaschine anbieten (abgesehen vom Demonstrator), sondern auf ein Lizenz-Modell für Unternehmen setzen &#8211; lt. <a title="NetBase Offers Powerful Semantic Indexing Platform" href="http://www.techcrunch.com/2009/04/22/netbase-offers-powerful-semantic-indexing-platform-that-reads-the-web/" target="_blank">TechCrunch</a> soll die Lizenz bei 100.000$ anfangen. Diese Kröte sollen dann Unternehmen schlucken, die ansonsten große Aufwände bei der Modellierung von Taxonomien/Lexika o.ä. in Kauf nehmen müssten.</p>
<p>Doch adressiert NetBase damit tatsächlich ein Problem oder handelt es sich lediglich um ein Sahnehäubchen?</p>
<p>Artikel in der Technology Review: <a title="Smarte Gesundheitsdatenbank" href="http://www.heise.de/tr/Smarte-Gesundheitsdatenbank--/artikel/137174" target="_blank">Smarte Gesundheitsdatenbank</a></p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/05/05/netbase-und-die-semantische-suche-content-intelligence/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>TrustYou Semantic Meta Search Engine</title>
		<link>http://mimblog.de/2009/01/08/trustyou-semantic-meta-search-engine/</link>
		<comments>http://mimblog.de/2009/01/08/trustyou-semantic-meta-search-engine/#comments</comments>
		<pubDate>Thu, 08 Jan 2009 12:27:50 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[semantic]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=694</guid>
		<description><![CDATA[
Breakfast Links 08.01.2009
* TrustYou &#8211; make a good choice
I just stumbled upon a press release from TrustYou, a search engine for hotel and restaurant recommendations. TrustYou calls itself a semantic meta search engine. Meta because it makes use of various existing hotel recommendation platforms like HRS, hotel.de and more. Semantic because of its pre-processing methods [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2009/01/trustyou-logo.jpg"><img class="alignnone size-medium wp-image-695" title="trustyou-logo" src="http://mimblog.de/wp-content/uploads/2009/01/trustyou-logo.jpg" alt="" width="260" height="63" /></a></p>
<p><strong>Breakfast Links 08.01.2009</strong></p>
<p>* <a title="TrustYou" href="http://www.trustyou.com" target="_blank">TrustYou &#8211; make a good choice</a></p>
<p>I just stumbled upon a press release from TrustYou, a search engine for hotel and restaurant recommendations. TrustYou calls itself a semantic meta search engine. <strong>Meta</strong> because it makes use of various existing hotel recommendation platforms like HRS, hotel.de and more. <strong>Semantic</strong> because of its pre-processing methods for heterogeneous text data about hotels/restaurant and user comments. <em>Yeah, finally a new alternaitve search engine from germany, chapeau!</em></p>
<p>See also this post: <a title="TrustYou durchsucht Hotel-Bewertungen" href="http://www.deutsche-startups.de/2009/01/08/trustyou-durchsucht-hotel-bewertungen/" target="_blank">TrustYou durchsucht Hotel-Bewertungen</a> (german)</p>
<p>* <a title="Google Semantic Surfacing" href="http://arnoldit.com/wordpress/2009/01/08/google-semantics-surfacing/" target="_blank">Google Semantics Surfacing</a></p>
<p>Stephen Arnold&#8217;s response to the article &#8220;<a title="Did Google Just Expose Semantic Data in Search Results?" href="http://www.readwriteweb.com/archives/google_semantic_data.php" target="_blank">Did Google Just Expose Semantic Data in Search Results?</a>&#8221; from readwriteweb.com.</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2009/01/08/trustyou-semantic-meta-search-engine/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Information Management Week Summary</title>
		<link>http://mimblog.de/2008/12/05/information-management-week-summary/</link>
		<comments>http://mimblog.de/2008/12/05/information-management-week-summary/#comments</comments>
		<pubDate>Fri, 05 Dec 2008 15:21:52 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Information Management]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[data set]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[search engine]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=671</guid>
		<description><![CDATA[
I wasn&#8217;t able to write any Breakfast Links since monday due to projects (like the upcoming relaunch of this blog), birthday parties and efforts for christmas presents (big, big family)! Finally I turned on my news reader today&#8230;

* Amazon Launches Public Data Sets To Ease Research
Amazon (AWS) provides public data sets to demonstrate use cases [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://family.go.com/santas-list/video/22886-01shwbpfrj6yg3x/"><img class="alignnone size-medium wp-image-672" title="santa-hannes" src="http://mimblog.de/wp-content/uploads/2008/12/santa-hannes-300x221.jpg" alt="" width="210" height="155" /></a></p>
<p>I wasn&#8217;t able to write any <a title="Meyer Information Management | Breakfast Links" href="http://mimblog.de/category/breakfastnews/" target="_blank">Breakfast Links</a> since monday due to projects (like the upcoming relaunch of this blog), birthday parties and efforts for christmas presents (big, big family)! Finally I turned on my news reader today&#8230;</p>
<p><span id="more-671"></span></p>
<p>* <a title="Amazon Launches Public Data Sets To Ease Research" href="http://www.techcrunch.com/2008/12/04/amazon-launches-public-data-sets-to-ease-research/" target="_blank">Amazon Launches Public Data Sets To Ease Research</a></p>
<p>Amazon (AWS) provides public data sets to demonstrate use cases for their (cloud) services. The article on TechCrunch also links to <a title="Tasty Data Goodies" href="http://www.swivel.com/" target="_blank">Swivel</a> which is a repository for open data sets. Unfortunately you won&#8217;t find any textual data but a lot of data like U.S. Census, historical oil prices, world photovoltaic production and more.</p>
<p>* <a title="New Open Source Search Vendor" href="http://arnoldit.com/wordpress/2008/12/05/new-open-source-search-vendor/" target="_blank">New Open Source Search Vendor</a></p>
<p>An interview of Hans-Christian Brockmann (<a title="Brox IT-Solutions GmbH" href="http://www.brox.de" target="_blank">Brox IT-Solutions GmbH</a>) about Brox Open Source strategy in the field of enterprise search. Really interesting is Brockmann&#8217;s number of 180.000 lucene projects (more or less productive) out there! Brox already came up with their Eclipse project <a title="The official eccenca weblog" href="http://eccenca.broxblogs.de/" target="_blank">SMILA</a>, which is going to be a base architecture to better integrate search and text mining into common systems. <em>Nice to hear from those kind of projects in my business area!</em></p>
<p>* <a title="http://z-pod.de/archives/299-Z!-Episode-134-Qimaya-Anwendung-neuronaler-Netze.html" href="Z! - Episode 134: Qimaya - Anwendung neuronaler Netze" target="_blank">Z! &#8211; Episode 134: Qimaya &#8211; Anwendung neuronaler Netze</a> (german)</p>
<p>Interesting interview with Roy Uhlmann (<a title="Qimaya" href="http://qimaya.wordpress.com/" target="_blank">Qimaya</a> aka former Queap) about neuronal networks in their search engine Qimaya. If you are interested in testing Qimaya you can request access to the demo &#8211; next invitation and <a title="Anmeldungen für die 3. Passworttranche werden noch bis Freitag entgegengenommen!" href="http://qimaya.wordpress.com/2008/12/03/anmeldungen-fur-die-3-passworttranche-werden-noch-bis-freitag-entgegengenommen/" target="_blank">access delivery is even today</a>! <em>As I can see, Qimaya is currently a german language only project!</em></p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/12/05/information-management-week-summary/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Another Video Search Engine from Exalead</title>
		<link>http://mimblog.de/2008/11/03/another-video-search-engine-from-exalead/</link>
		<comments>http://mimblog.de/2008/11/03/another-video-search-engine-from-exalead/#comments</comments>
		<pubDate>Mon, 03 Nov 2008 10:43:00 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[mahout]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[video search]]></category>
		<category><![CDATA[voice to text]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=595</guid>
		<description><![CDATA[
Breakfast Links 03.11.2008
* Exalead-Labs: voxalead
The parisian search engine company Exalead released their video search engine voxalead. It combines Exalead&#8217;s indexing (Entity Recognition) and search technology with third partie&#8217;s Voice recognition (LIMSI). Currently they indexed mostly french and us channels and a small amount of german channels. Beside the search through the voice-to-text extracted contents you [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mimblog.de/wp-content/uploads/2008/11/logo_voxalead_big.gif"><img class="alignnone size-medium wp-image-596" title="logo_voxalead_big" src="http://mimblog.de/wp-content/uploads/2008/11/logo_voxalead_big.gif" alt="" width="262" height="105" /></a></p>
<p><strong>Breakfast Links 03.11.2008</strong></p>
<p>* <a title="voxalead" href="http://voxalead.labs.exalead.com" target="_blank">Exalead-Labs: voxalead</a></p>
<p>The parisian search engine company <a title="Exalead" href="http://www.exalead.com/software/" target="_blank">Exalead</a> released their video search engine <a title="About Voxalead" href="http://labs.exalead.com/index.php?option=com_content&amp;view=article&amp;catid=37:features-demos&amp;id=49:tv-news-search" target="_blank">voxalead</a>. It combines Exalead&#8217;s indexing (Entity Recognition) and search technology with third partie&#8217;s Voice recognition (<a title="LIMSI" href="http://www.limsi.fr/" target="_blank">LIMSI</a>). Currently they indexed <strong>mostly french and us channels and a small amount of german channels</strong>. Beside the search through the voice-to-text extracted contents you can also view the <strong>original extracted text</strong> which shows off the quality of voxalead!</p>
<p>* <a title="The German metasearch engine “MetaGer”" href="http://altsearchengines.com/2008/11/02/the-german-metasearch-engine-metager/" target="_blank">Metager at Alt Search Engines</a></p>
<p><a title="Metager" href="http://metager.de/" target="_blank">Metager</a> now has its place at <a title="Alt Search Engines" href="http://www.altsearchengines.com/" target="_blank">AltSearchEngines.com</a> in english and german!</p>
<p>* <a title="Twenty Newsgroups Classification" href="http://cwiki.apache.org/confluence/display/MAHOUT/TwentyNewsgroups" target="_blank">Mahout: Twenty Newsgroups Classification</a></p>
<p>This weekend I stumbled upon a nice <strong>Mahout Classification example</strong>. It is all about creating a classification model based on twenty clusters of newsgroup messages. All you need is Mahout and Hadoop to get the example running in approx. 20 minutes.</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/11/03/another-video-search-engine-from-exalead/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Yotify Social Search Engine</title>
		<link>http://mimblog.de/2008/10/15/yotify-social-search-engine/</link>
		<comments>http://mimblog.de/2008/10/15/yotify-social-search-engine/#comments</comments>
		<pubDate>Wed, 15 Oct 2008 07:48:04 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[social search]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=503</guid>
		<description><![CDATA[
Breakfast Links 15.10.2008
* Yotify: A Social Search Engine
Stephen Arnold reviews the Social Search Engine Yotify regarding to the article on the Technology Review magazine.
]]></description>
			<content:encoded><![CDATA[<p><img class="aligncenter size-medium wp-image-86" title="Breakfast Link" src="http://mimblog.de/wp-content/uploads/2008/08/breakfast-link-english-300x66.gif" alt="" width="300" height="66" /></p>
<p><strong>Breakfast Links 15.10.2008</strong></p>
<p>* <a title="Yotify: A social Search Engine" href="http://arnoldit.com/wordpress/2008/10/15/yotify-a-social-search-engine/" target="_blank">Yotify: A Social Search Engine</a></p>
<p>Stephen Arnold reviews the Social Search Engine <a title="Yotify" href="http://www.yotify.com" target="_blank">Yotify</a> regarding to the <a title="Making Search Social " href="http://www.technologyreview.com/web/21509/?a=f" target="_blank">article on the Technology Review </a>magazine.</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/10/15/yotify-social-search-engine/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title></title>
		<link>http://mimblog.de/2008/10/08/477/</link>
		<comments>http://mimblog.de/2008/10/08/477/#comments</comments>
		<pubDate>Wed, 08 Oct 2008 08:12:25 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[search engine]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=477</guid>
		<description><![CDATA[
Breakfast Links 08.10.2008
* Ask.com with new version of search
Ask.com with changes in terms of search relevance, search speed and question answering. But why am I getting 10 to 15 sponsored links in some result sets?
* Autonomy and OpenText: Equally Good Bellwethers
Stephen Arnold compares Autonomy and OpenText and gives an outlook into the future of them [...]]]></description>
			<content:encoded><![CDATA[<p><img class="aligncenter size-medium wp-image-86" title="Breakfast Link" src="http://mimblog.de/wp-content/uploads/2008/08/breakfast-link-english-300x66.gif" alt="" width="300" height="66" /></p>
<p><strong>Breakfast Links 08.10.2008</strong></p>
<p>* <a title="ASK.COM DELIVERS NEXT GENERATION OF ITS SITE" href="http://sp.uk.ask.com/en/docs/about/press2008/release.shtml?id=pr2008_0610" target="_blank">Ask.com with new version of search</a></p>
<p>Ask.com with changes in terms of search relevance, search speed and question answering. But why am I getting 10 to 15 sponsored links in some result sets?</p>
<p>* <a title="Autonomy and OpenText: Equally Good Bellwethers" href="http://arnoldit.com/wordpress/2008/10/08/autonomy-and-opentext-equally-good-bellwethers/" target="_blank">Autonomy and OpenText: Equally Good Bellwethers</a></p>
<p>Stephen Arnold compares Autonomy and OpenText and gives an outlook into the future of them and enterprise search in general.</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/10/08/477/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Outstanding Graduate Theses</title>
		<link>http://mimblog.de/2008/10/07/outstanding-graduate-theses/</link>
		<comments>http://mimblog.de/2008/10/07/outstanding-graduate-theses/#comments</comments>
		<pubDate>Tue, 07 Oct 2008 07:20:56 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[categorization]]></category>
		<category><![CDATA[search engine]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=465</guid>
		<description><![CDATA[
Breakfast Links 07.10.2008
* Welcome to the Data Cloud
* VN (vietnamese) search engine released
* Three outstanding graduate theses &#8211; with a really interesting language-based approach in categorization -  on IR Touthgs
]]></description>
			<content:encoded><![CDATA[<p><img class="aligncenter size-medium wp-image-86" title="Breakfast Link" src="http://mimblog.de/wp-content/uploads/2008/08/breakfast-link-english-300x66.gif" alt="" width="300" height="66" /></p>
<p><strong>Breakfast Links 07.10.2008</strong></p>
<p>* <a title="Welcome to the Data Cloud?" href="http://blogs.zdnet.com/semantic-web/?p=205" target="_blank">Welcome to the Data Cloud</a></p>
<p>* <a title="http://vietnamnews.vnagency.com.vn/showarticle.php?num=01BUS041008" href="http://vietnamnews.vnagency.com.vn/showarticle.php?num=01BUS041008" target="_blank">VN (vietnamese) search engine released</a></p>
<p>* <a title="Outstanding Graduate Theses" href="http://irthoughts.wordpress.com/2008/10/06/outstanding-graduate-theses/" target="_blank">Three outstanding graduate theses</a> &#8211; with a really interesting language-based approach in categorization -  on IR Touthgs</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/10/07/outstanding-graduate-theses/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Street View to Fail in Germany?</title>
		<link>http://mimblog.de/2008/10/01/bl/</link>
		<comments>http://mimblog.de/2008/10/01/bl/#comments</comments>
		<pubDate>Wed, 01 Oct 2008 05:10:47 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[search engine]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=429</guid>
		<description><![CDATA[
Breakfast Links 01.10.2008
* Schleswig-Holstein communities saying no to Google Street View and planning legal actions to protect themselves from Google
* Whole Travel beta released, a travel search engine with the possibility to rank results by sustainability
]]></description>
			<content:encoded><![CDATA[<p><img class="aligncenter size-medium wp-image-86" title="Breakfast Link" src="http://mimblog.de/wp-content/uploads/2008/08/breakfast-link-english-300x66.gif" alt="" width="300" height="66" /></p>
<p><strong>Breakfast Links 01.10.2008</strong></p>
<p>* <a title="German Towns Saying 'Nein' to Google 'Street View'" href="http://www.spiegel.de/international/germany/0,1518,581177,00.html" target="_blank">Schleswig-Holstein communities saying no to Google Street View</a> and planning legal actions to protect themselves from Google</p>
<p>*<a title="Whole Travel" href="http://www.wholetravel.com/beta" target="_blank"> Whole Travel</a> beta released, a travel search engine with the possibility to rank results by sustainability</p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/10/01/bl/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Enterprise Search Simplexo</title>
		<link>http://mimblog.de/2008/09/11/new-enterprise-search-simplexo/</link>
		<comments>http://mimblog.de/2008/09/11/new-enterprise-search-simplexo/#comments</comments>
		<pubDate>Thu, 11 Sep 2008 19:52:16 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Information Management]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[search engine]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=344</guid>
		<description><![CDATA[
Today I read some articles about Simplexo, a new open source enterprise search application provided by a privately held company from the UK (Simplexo Ltd.). Since I&#8217;m already experienced with open and commercial enterprise search solutions I was asking some questions in terms of features and open source to Simplexo&#8217;s website (please keep in mind [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.simplexo.com"><img class="alignnone size-medium wp-image-355" title="simplexo-enterprise-search" src="http://mimblog.de/wp-content/uploads/2008/09/simplexo-enterprise-search.gif" alt="" width="130" height="124" /></a></p>
<p>Today I read some articles about <a title="Simplexo" href="http://www.simplexo.com" target="_blank">Simplexo</a>, a new open source enterprise search application provided by a privately held company from the UK (Simplexo Ltd.). Since I&#8217;m already experienced with open and commercial enterprise search solutions I was asking some questions in terms of features and open source to Simplexo&#8217;s website (please keep in mind that this is not a test in practice).</p>
<p><span id="more-344"></span></p>
<p><strong>What are the system Requirements, Hard- and Software?</strong><br />
Simplexo is a Microsoft only product, it needs Windows XP, 2000, 2003 or Vista, the IIS server, .NET 2 Framework and finally a relational database like MySQL or MS SQL.</p>
<p><strong>Which kind of files are getting indexed?</strong><br />
Simplexo already comes with parsers for structured and unstructured documents. It can handle documents like MS/Open Office, PDF, PS, Webpages, Flat-text, Images (I guess metadata) and E-Mails (?). Beside those files from you&#8217;re local filesystem or network drive, Simplexo can also index information from databases, business applications like salesforce or sap and online sources such Google, Reuters and arbitrary feeds.</p>
<p><strong>Is it possible to integrate modules for content processing?<br />
</strong>I couldn&#8217;t find any information on how to extend Simplexo on their website and I&#8217;m still missing information on mailing lists, developer forums etc.</p>
<p><strong>Which search functions are available?</strong><br />
Simplexo can handle ordinary term, phrase and boolean queries. Beside that Simplexo also offers Synonym support, date range, fuzzy and semantic queries (detail information missing).</p>
<p><strong>Is it possible to integrate Simplexo in existing applications?</strong><br />
Yes, Simplexo comes with a SOAP interface and already provides an interface to search through existing Microsoft and Autonomy indexes.</p>
<p><strong>What about scalability?</strong><br />
There is a page about scalability at Simplexo&#8217;s website but it only tells you about 600k simultaneous users, less search time than Google and a much smaller index size than on MS databases. I&#8217;m missing information about how the Scalability is done technically, is it possible to use more hardware?</p>
<p>So, what about a test in practice? Lately there is no online demonstration of Simplexo yet and since I&#8217;m running a Linux desktop I can&#8217;t perform the installation of *.exes &#8211; Simplexo is a pure Microsoft compatible product.</p>
<p>Original press article on CBR: <a title="Simplexo launches open source enterprise search platform " href="http://www.cbronline.com/article_news.asp?guid=D4082B6C-7C32-4FC3-B289-3E2F439DED67" target="_blank">Simplexo launches open source enterprise search platform</a></p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/09/11/new-enterprise-search-simplexo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Video Search Engine: VideoSurf</title>
		<link>http://mimblog.de/2008/09/11/new-video-search-engine-videosurf/</link>
		<comments>http://mimblog.de/2008/09/11/new-video-search-engine-videosurf/#comments</comments>
		<pubDate>Thu, 11 Sep 2008 04:15:00 +0000</pubDate>
		<dc:creator>Hannes Carl Meyer</dc:creator>
				<category><![CDATA[Breakfast Links]]></category>
		<category><![CDATA[application server]]></category>
		<category><![CDATA[geronimo]]></category>
		<category><![CDATA[glassfish]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[jboss]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[tomcat]]></category>
		<category><![CDATA[video search]]></category>

		<guid isPermaLink="false">http://mimblog.de/?p=338</guid>
		<description><![CDATA[
Breakfast Links 11.09.2008
New Video Search Engine: VideoSurf (I&#8217;m still waiting for my invitation, can anybody help?)
http://searchengineland.com/080910-050000.php
Cascading tries to simplify building Hadoop MapReduce Applications
http://www.theserverside.com/news/thread.tss?thread_id=50629
Free Application Servers tested: Tomcat vs. Geronimo vs. JBoss vs. GlassFish (german)
http://www.computerwoche.de/knowledge_center/software_infrastruktur/1873146/
]]></description>
			<content:encoded><![CDATA[<p><img class="aligncenter size-medium wp-image-86" title="Breakfast Link" src="http://mimblog.de/wp-content/uploads/2008/08/breakfast-link-english-300x66.gif" alt="" width="300" height="66" /></p>
<p><strong>Breakfast Links 11.09.2008</strong></p>
<p>New Video Search Engine: VideoSurf (I&#8217;m still waiting for my invitation, can anybody help?)<br />
<a title="VideoSurf: New, Genuinely Radical Video Search" href="http://searchengineland.com/080910-050000.php" target="_blank">http://searchengineland.com/080910-050000.php</a></p>
<p>Cascading tries to simplify building Hadoop MapReduce Applications<br />
<a title="Cascading: Simplifying Hadoop MapReduce Applications" href="http://www.theserverside.com/news/thread.tss?thread_id=50629" target="_blank">http://www.theserverside.com/news/thread.tss?thread_id=50629</a></p>
<p>Free Application Servers tested: Tomcat vs. Geronimo vs. JBoss vs. GlassFish (german)<br />
<a title="Freie App-Server im Vergleich" href="http://www.computerwoche.de/knowledge_center/software_infrastruktur/1873146/" target="_blank">http://www.computerwoche.de/knowledge_center/software_infrastruktur/1873146/</a></p>
]]></content:encoded>
			<wfw:commentRss>http://mimblog.de/2008/09/11/new-video-search-engine-videosurf/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
