Meyer Information Management Blog

Lucene + Hadoop + Zookeeper = Katta

Breakfast Links 23.10.2008

* Katta and Hadoop survey slides

Katta is a project by 101tec.com, a consulting and software development company specializing in large-scale data processing and information management software. Katta adds grid support to Apache Lucene by combining it with Hadoop and Zookeeper (which is going to be moved into Hadoop).

* Mozenda: SaaS Data Extraction

Mozenda is software that can extract data from a variety of sources, such as websites, databases, RSS feeds and more. The extraction of content from forums and blogs sounds really interesting.

Wow, there is really too much stuff to actually test… Does anybody have any experience with these projects/products?

2 Comments

  1. [...] there is also an up to date article at Computerwoche. I’m currently writing an article on Mozenda and GoGrid – after those I’m going to check out Amazon [...]

  2. [...] already posted Mozenda in my Breakfast Links last week after BS recommended it to check it out. Since web monitoring and data extraction always [...]
