Module for text analysis and comparison?


Is there a CPAN module that does statistical text comparison, i.e. given
two paragraph-length strings, returns a probability indicating whether
they are similar or different?

I'm thinking of the kind of module that you'd use to implement your own
Bayesian filtering, or to try to determine whether two anonymous posts on
a message board were written by the same person; things like that.
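
To make the request concrete, here is a rough sketch of the kind of
interface I mean. It uses plain word-frequency cosine similarity, which
is only an assumption on my part about one possible approach; it gives a
score between 0 and 1 rather than a true probability, and the sub names
(word_counts, cosine_similarity) are made up for illustration, not taken
from any existing CPAN module.

```perl
use strict;
use warnings;

# Count lowercase word frequencies in a string.
# (Hypothetical helper, not from any CPAN module.)
sub word_counts {
    my ($text) = @_;
    my %count;
    $count{ lc $1 }++ while $text =~ /(\w+)/g;
    return \%count;
}

# Cosine similarity between two word-frequency vectors:
# 1 means identical word proportions, 0 means no words in common.
# This is a similarity score, not a calibrated probability.
sub cosine_similarity {
    my ($text_a, $text_b) = @_;
    my ($ca, $cb) = (word_counts($text_a), word_counts($text_b));

    # Dot product over the words the two texts share.
    my $dot = 0;
    for my $word (keys %$ca) {
        $dot += $ca->{$word} * $cb->{$word} if exists $cb->{$word};
    }

    # Squared vector lengths.
    my $norm_a = 0; $norm_a += $_ ** 2 for values %$ca;
    my $norm_b = 0; $norm_b += $_ ** 2 for values %$cb;

    return 0 if $norm_a == 0 || $norm_b == 0;    # empty input
    return $dot / sqrt($norm_a * $norm_b);
}

printf "%.3f\n", cosine_similarity(
    "the quick brown fox jumps over the lazy dog",
    "the lazy dog sleeps while the quick fox runs",
);
```

Something along these lines, but with proper statistics behind it
instead of raw word counts, is what I'm hoping already exists.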
