Breakfast Links 04.11.2008
* Truevert Green Sarch Engine
OrcaTec from the US is running the green search engine truevert. Its aim is to display only search results with the topic of green, natural etc. stuff. Truevert is not using its own crawler but uses Yahoo! BOSS to retrieve search results and their technology for clustering and some [...]
Posts Tagged ‘lucene’
Truevert Green Search
Birds Of A Feather Sessions in New Orleans
Breakfast Links 31.10.2008
* Birds Of A Feather Sessions (BOFs) in New Orleans
BoF sessions on the ApacheCon are informal gatherings of persons interested in a particular topic. Grant Ingersoll from Lucene proposed two sessions, one on Apache Machine Learning (Mahout, UIMA) and one on Apache Search (Lucene, Nutch, Search). I would have increased the counter, unfortunately [...]
Lucene + Hadoop + Zookeeper = Katta
Breakfast Links 23.10.2008
* katta and hadoop survey slides
Katta is a project by 101tec.com a consulting and software development company specialized on large-scale data processing and information management software. Katta adds grid support to Apache Lucene with a combination of Hadoop and Zookeeper (which is going to be moved into Hadoop).
* Mozenda: SaaS Data Extraction
Mozenda is [...]
Lucene 2.4.0 released
Lucene 2.4.0 was released today, important new features are:
New RAM-based index for much faster searching than with RAMDirectory
Machine or OS crash while writing will no longer corrupt the whole index
Delete documents by query with new IndexWriter
Ability to define a search timeout
and more…
You can read the detailed changelog here: Lucene Change Log (by the way, like [...]
Upcoming Book from Grant Ingersoll, Taming Text
I just stumbled upon a new book from Grant Ingersoll and Thomas Morton called “Taming Text“. It is currently in progress but you can already look into it via Manning early access. Since I was really excited on the Chapter 4 (Identifying People, Places and Things) I already bought the MEAP eBook.
Solr 1.3 released
Solr 1.3 final was released today, congratulations to the development team!
This release ships with a lot of new features:
Distributed search capabilities
Supporting multiple indexes in one deployment (this means also multiple languages in one deployment)
Chained search components (More Like This, Spell Checking, etc.)
The new DataImportHandler to import easily from Database to Solr
Haloe for Lucene/SOLR released
Today I noticed the release of a new Lucene/SOLR package called “Haloe”. It addresses an abstract way on how to use Indexer/Searcher to either use with Lucene or SOLR.
Mail from Marcus Herou on the SOLR mailing list:
Glad to announce that I finally managed to move this package out of the
company code and released it to [...]