Monday, May 21, 2007
Ocropus
Seems that things are moving along quite quickly. Ocropus is an open source document analysis and OCR system. It uses tesseract as ocr, and a bunch of other stuff for statistical analysis, aspell for spell checking, etc. Not even alpha yet though.
I'm building it right now.
Hmm. How hard would it be to write a scan -> pdf generator?
I think Google is paying people to work on this stuff.
Subscribe to Posts [Atom]