Monday, May 21, 2007

Ocropus

Seems that things are moving along quite quickly. Ocropus is an open source document analysis and OCR system. It uses tesseract as ocr, and a bunch of other stuff for statistical analysis, aspell for spell checking, etc. Not even alpha yet though.

I'm building it right now.

Hmm. How hard would it be to write a scan -> pdf generator?

I think Google is paying people to work on this stuff.


Comments: Post a Comment

Subscribe to Post Comments [Atom]





<< Home

This page is powered by Blogger. Isn't yours?

Subscribe to Posts [Atom]