Difference between revisions of "Optical Character Recognition"

From ArchWiki
Jump to: navigation, search
(Layout analysers and user interfaces: Added gImageReader)
(merged to List of applications/Documents#OCR software, redirect there)
 
(6 intermediate revisions by 3 users not shown)
Line 1: Line 1:
[[Category:Applications]]
+
#REDIRECT [[List of applications/Documents#OCR software]]
{{Stub}}
 
 
 
There are several steps to the whole OCR process, the actual OCR engine is only part of this:
 
# scanning
 
# document layout analysis
 
# optical character recognition
 
# post-processing (formatting, PDF creation)
 
 
 
== OCR software ==
 
=== OCR (Optical Character Recognition) Engines ===
 
* {{App|[[CuneiForm]]|A command line OCR system originally developed and open sourced by Cognitive technologies. Supported languages: eng, ger, fra, rus, swe, spa, ita, ruseng, ukr, srp, hrv, pol, dan, por, dut, cze, rum, hun, bul, slo, lav, lit, est, tur.|https://launchpad.net/cuneiform-linux|{{Pkg|cuneiform}}}}
 
* {{App|[[GOCR]]/JOCR|An OCR engine which also supports barcode recognition.|http://jocr.sourceforge.net/|{{Pkg|gocr}}}}
 
* {{App|[[Ocrad]]|An OCR program based on a feature extraction method.|http://www.gnu.org/software/ocrad/|{{Pkg|ocrad}}}}
 
* {{App|[[Tesseract]]|"Probably one of the most accurate open source OCR engines available". Package splitted, you need install some datafiles for each language ({{Pkg|tesseract-data-eng}} for examle).|http://code.google.com/p/tesseract-ocr/|{{Pkg|tesseract}}}}
 
 
 
=== Layout analysers and user interfaces ===
 
* {{App|[[OCRFeeder]]|Python GUI for Gnome which performs document analysis and rendition, and can use either [[CuneiForm]], [[GOCR]], [[Ocrad]] or [[Tesseract]] as OCR engines. It can import from PDF or image files, and export to HTML or OpenDocument. |http://live.gnome.org/OCRFeeder|{{pkg|ocrfeeder}}}}
 
* {{App|[[YAGF]]|graphical interface for the [[CuneiForm]] text recognition program on the Linux platform. Available from community repository|http://symmetrica.net/cuneiform-linux/yagf-en.html|{{Pkg|yagf}}}}
 
* {{App|[[gImageReader]]|A graphical GTK frontend to Tesseract|http://gimagereader.sourceforge.net/|{{AUR|gimagereader}}}}
 
* {{App|[[gscan2pdf]]|scans, runs Tesseract and creates a PDF all in one go|http://gscan2pdf.sourceforge.net/|{{AUR|gscan2pdf}}}}
 
* {{App|[[OCRopus]]|OCR ''platform'', modules exist for document layout analysis, OCR engines (it can use Tesseract or its own engine), natural language modelling, etc. Available from [[AUR]]|http://code.google.com/p/ocropus/|{{AUR|ocropus}}}}
 

Latest revision as of 12:46, 15 April 2014