tesseract

tesseract - Open Source OCR Engine

Website: http://sourceforge.net/projects/tesseract-ocr
License: Apache License
Vendor: olea.org/paquetes-rpm/
Description:
This package contains the Tesseract Open Source OCR Engine.
Orignally developed at Hewlett Packard Laboratories Bristol and
at Hewlett Packard Co, Greeley Colorado

A few things to know about Tesseract OCR: for now it only supports the
English language, and does not include a page layout analysis module (yet),
so it will perform poorly on multi-column material. It also doesn't do well
on grayscale and color documents, and it's not nearly as accurate as some of
the best commercial OCR packages out there. Yet, as far as we know, despite
its shortcomings, Tesseract is far more accurate than any other Open Source
OCR package out there. If you know of one that is more accurate, please do
tell us!

http://google-code-updates.blogspot.com/2006/08/announcing-tesseract-ocr.html

Packages

tesseract-1.02-1olea.i386 [550 KiB] Changelog by Ismael Olea (2006-12-30):
- first version
tesseract-1.02-1olea.src [2.7 MiB] Changelog by Ismael Olea (2006-12-30):
- first version