MC+A Open Pipeline Connector for Google Search Appliance

November 4th, 2009

OpenPipline is an open source software project for crawling, parsing, analyzing and routing documents.  I joined the board of advisors a while back and I’m happy to say that I’ve finally got around some time to put effort into really contributing.  The goal is to develop a standard framework for developing enterprise search applications.  Enterprise search products have similar architectures yet they are typically incompatible with one and another.  Something developed for one…can not be processed by another.

We’ve started out building on a commercial release of the pipeline processor and have begun to integrate it with some of our Google Search Appliance customers.  For them, this tool provides the ability to convert document types that are not support by the GSA or other extraction techniques.  It really depends on the requirements.

Our first contribution to the project is a commercial release of a stage that publishes the item from OpenPipeline to a Google Search Appliance.  By combining the technologies, you get an open text processing framework along with Google’s powerful algorithm.

Stay tuned for some interesting updates!

image

Leave a Reply