Friday, 12 February 2010

Version 1.2

It's been very quiet on the VelOCRaptor front recently, as I have had to concentrate on another project to feed my family, and we are still waiting (patiently) for the much-vaunted next release of our OCRopus engine.

In the meantime, I finally got round to adding a checkbox that disables the spell checking. Our first preference!

In order to improve the quality of the output, I run the OCR'd text through the Mac spill chequer, replacing mis-spells with its top suggestion. This works well for most documents, but can give hilariously bad results on others.

If you're reading a document that is not in the same language as your Mac, or that doesn't really contain words (someone recently sent me DNA sequence data - I hope they didn't need 100% accuracy before starting that gene therapy), then try turning spell checking off to improve the results.
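
For the curious, here is a rough sketch of the "replace each mis-spell with the top suggestion" idea, with a flag to switch it off. It is not VelOCRaptor's actual code and it does not call the Mac spell checker: the tiny `WORDS` list, the `top_suggestion` and `correct_text` functions, and the `spell_check` flag are all made up for illustration, with Python's `difflib` standing in for a real dictionary.

```python
import difflib
import re

# Toy vocabulary standing in for a real dictionary (an assumption for this sketch).
WORDS = ["the", "quick", "brown", "fox", "jumps", "over", "lazy", "dog"]


def top_suggestion(word, vocabulary=WORDS):
    """Return the closest vocabulary word, or None if nothing is similar enough."""
    matches = difflib.get_close_matches(word.lower(), vocabulary, n=1, cutoff=0.6)
    return matches[0] if matches else None


def correct_text(text, spell_check=True, vocabulary=WORDS):
    """Replace each unknown word with its top suggestion.

    With spell_check=False the text is returned untouched, which is the
    behaviour the new checkbox is meant to give you.
    """
    if not spell_check:
        return text

    known = set(vocabulary)

    def fix(match):
        word = match.group(0)
        if word.lower() in known:
            return word  # already a known word, leave it alone
        suggestion = top_suggestion(word, vocabulary)
        # Keep the original word when there is no suggestion at all.
        # Note: suggestions come back lower-case in this toy version.
        return suggestion if suggestion is not None else word

    # Only runs of letters are treated as words; digits and punctuation pass through.
    return re.sub(r"[A-Za-z]+", fix, text)


print(correct_text("the quick brwn fox"))
print(correct_text("the quick brwn fox", spell_check=False))
```

The trouble described above comes from the `fix` step: it cannot tell a genuine OCR slip from a string that was never meant to be a word, so anything that happens to sit close to a dictionary entry gets rewritten.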