Recognize Text & Objects in Graphical Images with PHP

Written by on April 3, 2006 in General - 3 Comments

An OCR with PHP ? it doesn’t sounds very common topic for PHP developers, but Andrey Kucherenko from Ukraine have made a very interesting project to realize the first phpOCR. His classes can recognize text in monochrome graphical images after a training phase. The training phase is necessary to let the class build recognition data structures from images that have known characters. The training data structures are used during the recognition process to attempt to identify text in real images using the corner algorithm.
PHPOCR have win the PHPClasses innovation awards of march 2006, and it shows the power of what could be implemented with PHP5. Manuel Lemos have reviewed the project :

Certain types of applications require reading text from documents that are stored as graphical images. That is the case of scanned documents.
An OCR (Optical Character Recognition) tool can be used to recover the original text that is written in scanned documents. These are sophisticated tools that are trained to recognize text in graphical images.
This class provides a base implementation for an OCR tool. It can be trained to learn how to recognize each letter drawn in an image. Then it can be used to recognize longer texts in real documents.

3 Comments on "Recognize Text & Objects in Graphical Images with PHP"

  1. Thinh Nguyen Xuan July 28, 2006 at 5:17 am · Reply

    - How to recognize the letter and number on image?
    - what is corner algorithm

  2. Jordan Willms November 14, 2006 at 11:02 pm · Reply

    I can just see the myspace bots have new enhanced captcha readers!

  3. prem2 January 16, 2010 at 4:55 am · Reply

    I want to do a project in php.To convert pdf into text.Will u guide me the coding part and what is corner algorithm.
    Thank you,

Leave a Comment