Difference between revisions of "Optical Character Recognition"

From ArchWiki
Jump to: navigation, search
(Layout analysers and user interfaces: removed Kooka: the website is gone, so is the AUR package)
(In "Community" now)
Line 17: Line 17:
  
 
=== Layout analysers and user interfaces ===
 
=== Layout analysers and user interfaces ===
 +
* {{App|[[OCRFeeder]]|Python GUI for Gnome which performs document analysis and rendition, and can use either [[CuneiForm]], [[GOCR]], [[Ocrad]] or [[Tesseract]] as OCR engines. It can import from PDF or image files, and export to HTML or OpenDocument. |http://live.gnome.org/OCRFeeder|{{pkg|ocrfeeder}}}}
 
* {{App|[[YAGF]]|graphical interface for the [[CuneiForm]] text recognition program on the Linux platform. Available from community repository|http://symmetrica.net/cuneiform-linux/yagf-en.html|{{Pkg|yagf}}}}
 
* {{App|[[YAGF]]|graphical interface for the [[CuneiForm]] text recognition program on the Linux platform. Available from community repository|http://symmetrica.net/cuneiform-linux/yagf-en.html|{{Pkg|yagf}}}}
 
* {{App|[[gscan2pdf]]|scans, runs Tesseract and creates a PDF all in one go|http://gscan2pdf.sourceforge.net/|{{AUR|gscan2pdf}}}}
 
* {{App|[[gscan2pdf]]|scans, runs Tesseract and creates a PDF all in one go|http://gscan2pdf.sourceforge.net/|{{AUR|gscan2pdf}}}}
* {{App|[[OCRFeeder]]|Python GUI for Gnome which performs document analysis and rendition, and can use either [[CuneiForm]], [[GOCR]], [[Ocrad]] or [[Tesseract]] as OCR engines. It can import from PDF or image files, and export to HTML or OpenDocument. Available from [[AUR]]|http://live.gnome.org/OCRFeeder|{{AUR|ocrfeeder}}}}
 
 
* {{App|[[OCRopus]]|OCR ''platform'', modules exist for document layout analysis, OCR engines (it can use Tesseract or its own engine), natural language modelling, etc. Available from [[AUR]]|http://code.google.com/p/ocropus/|{{AUR|ocropus}}}}
 
* {{App|[[OCRopus]]|OCR ''platform'', modules exist for document layout analysis, OCR engines (it can use Tesseract or its own engine), natural language modelling, etc. Available from [[AUR]]|http://code.google.com/p/ocropus/|{{AUR|ocropus}}}}

Revision as of 12:21, 17 April 2012

This template has only maintenance purposes. For linking to local translations please use interlanguage links, see Help:i18n#Interlanguage links.


Local languages: Català – Dansk – English – Español – Esperanto – Hrvatski – Indonesia – Italiano – Lietuviškai – Magyar – Nederlands – Norsk Bokmål – Polski – Português – Slovenský – Česky – Ελληνικά – Български – Русский – Српски – Українська – עברית – العربية – ไทย – 日本語 – 正體中文 – 简体中文 – 한국어


External languages (all articles in these languages should be moved to the external wiki): Deutsch – Français – Română – Suomi – Svenska – Tiếng Việt – Türkçe – فارسی

Tango-document-new.pngThis article is a stub.Tango-document-new.png

Notes: please use the first argument of the template to provide more detailed indications. (Discuss in Talk:Optical Character Recognition#)

There are several steps to the whole OCR process, the actual OCR engine is only part of this:

  1. scanning
  2. document layout analysis
  3. optical character recognition
  4. post-processing (formatting, PDF creation)

OCR software

OCR (Optical Character Recognition) Engines

  • CuneiForm — A command line OCR system originally developed and open sourced by Cognitive technologies. Supported languages: eng, ger, fra, rus, swe, spa, ita, ruseng, ukr, srp, hrv, pol, dan, por, dut, cze, rum, hun, bul, slo, lav, lit, est, tur.
https://launchpad.net/cuneiform-linux || cuneiform
  • GOCR/JOCR — An OCR engine which also supports barcode recognition.
http://jocr.sourceforge.net/ || gocr
  • Ocrad — An OCR program based on a feature extraction method.
http://www.gnu.org/software/ocrad/ || ocrad
  • Tesseract — "Probably one of the most accurate open source OCR engines available".
http://code.google.com/p/tesseract-ocr/ || tesseract

Layout analysers and user interfaces

  • OCRFeeder — Python GUI for Gnome which performs document analysis and rendition, and can use either CuneiForm, GOCR, Ocrad or Tesseract as OCR engines. It can import from PDF or image files, and export to HTML or OpenDocument.
http://live.gnome.org/OCRFeeder || ocrfeeder
  • YAGF — graphical interface for the CuneiForm text recognition program on the Linux platform. Available from community repository
http://symmetrica.net/cuneiform-linux/yagf-en.html || yagf
  • gscan2pdf — scans, runs Tesseract and creates a PDF all in one go
http://gscan2pdf.sourceforge.net/ || gscan2pdfAUR
  • OCRopus — OCR platform, modules exist for document layout analysis, OCR engines (it can use Tesseract or its own engine), natural language modelling, etc. Available from AUR
http://code.google.com/p/ocropus/ || ocropusAUR