Do you have a question? Post it now! No Registration Necessary. Now with pictures!
- Arnold Shore
May 30, 2006, 12:56 am
rate this thread
simpler than, say, OWL. I18n not needed, in an attempt to keep size and
complexity down - as an example.
The only semi-heavy stuff it needs to do is to parse and full-text index the
common MS Office file/formats like Word and Excel, and PDF's. And the
Porter stemmer function.
I'm satisfied that all the pieces exist. While I could build this using
antiWord, some php/excel and PDF classes, I want to make sure that ground
hasn't already been well plowed. I've hit the usual sources, FreshMeat and
SourceForge somewhat casually, but struck out.
Will appreciate any thoughts, URL's, etc. Thanks, all.