Breakfast Links 23.10.2008
* Katta and Hadoop survey slides
Katta is a project by 101tec.com, a consulting and software development company specializing in large-scale data processing and information management software. Katta adds grid support to Apache Lucene using a combination of Hadoop and ZooKeeper (which is going to be moved into Hadoop).
* Mozenda: SaaS Data Extraction
Mozenda is software that can extract data from different sources such as websites, databases, RSS feeds and more. The extraction of content from forums and blogs sounds really interesting.
Wow, there is really too much stuff to actually test… Does anybody have any experience with these projects/products?
