Sonntag, 29. April 2012

TessOCR 1.05 - Free (GNU) OCR software based on Tessaract.. (Free)

TessOCR 1.05 - Free (GNU) OCR software based on Tessaract.. (Free): "TessOCR is free OCR tool using tesseract, ImageMagick and Xpdf as a framework with JVM. TessOCR is released and distributed under the Apache License, Version 2.0.

Features:

  • Supported language: Japanese, English, French and so on. Additional support for character recognition dictionary.
  • Layout recognition: Detects horizontal-writing and vertical-writing automatically. Recognizes only content of tabular.
  • Recognizable format of image data: JPEG,PNG,GIF,BMP, TIFF and PDF.
  • Recognizable image dimensions: There is no particular limitation.
  • Recognizable character size: (Under the investigation)
  • Elimination of noise in the image: Manual control.
  • Correction of the inclination of the image: Manual control.
  • Crop the image: Manual control. Spread pages can be specified.
  • Convert to the grayscaled image by threshold: Manual control.
  • Training the character recognition dictionary: Semi-automatic control. You can edit the box.
  • Text Editing : You can input the text and edit it, and save it. You can search the text and replace with another string.

TessOCR uses internally tesseract, ImageMagick and Xpdf to process the image. However, tesseract, ImageMagick and/or Xpdf do not include as a framework of TessOCR. If tesseract, ImageMagick and/or Xpdf is already installed in your environment, TessOCR will link to it. If tesseract, ImageMagick and/or Xpdf have not been installed yet, that thing will notify to you. You have to install tesseract, ImageMagick and/or Xpdf using MacPorts.



Version 1.05:
  • Abolition of the framework.
  • Append function that accesses the NDL digital Library.
  • Append function that detect the page area automatically.
Version 1.04:
  • Fixed bug on handling half-width blank of Japanese.
  • Append function that wrap each text line.
  • Fixed bug in the Search and Replace.
  • Fixed bug in the choice of language recognition.
  • Fixed bug on handling of the file name.
  • Fixed bug on displaying the character recognition window.
  • Append function that extracts text from the PDF.
Version 1.03:
  • Fixed bug when display the character recognition window.
  • Append function that can split the character recognition window.
  • Append function that accesses the PDF.




Download Now"

(Via MacUpdate - Mac OS X.)

Keine Kommentare: