ocropus | document analysis and OCR system |
OCRopus(tm) is a state-of-the-art document analysis and Optical Character Recognition (OCR) system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multilingual capabilities. The OCRopus engine is based on two research projects: a high-performance handwriting recognizer developed in the mid-1990s and deployed by the US Census Bureau, and novel high-performance layout analysis methods. OCRopus development is sponsored by Google and is initially intended for high-throughput, high-volume document conversion efforts. It will also be an excellent OCR system for many other applications. |
ocropus-data | document analysis and OCR system --- data files |
OCRopus(tm) is a state-of-the-art document analysis and Optical Character Recognition (OCR) system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multilingual capabilities. The OCRopus engine is based on two research projects: a high-performance handwriting recognizer developed in the mid-1990s and deployed by the US Census Bureau, and novel high-performance layout analysis methods. OCRopus development is sponsored by Google and is initially intended for high-throughput, high-volume document conversion efforts. It will also be an excellent OCR system for many other applications. The ocropus-data package contains the architecture-independent data required by ocropus. |