Python OCR and Barcode Recognition
High performance, royalty-free Python OCR and barcode recognition on Windows, Linux, Mac OS and Unix
Resources and FAQs for Asprise OCR for Python
Recognize text only or barcode only
To recognize text only:
s = ocrEngine.recognize("test.png", -1, -1, -1, -1, -1, OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)
To recognize barcodes only:
s = ocrEngine.recognize("test.png", -1, -1, -1, -1, -1, OCR_RECOGNIZE_TYPE_BARCODE, OCR_OUTPUT_FORMAT_PLAINTEXT)
Perform OCR on part of the image
You do not always need to OCR the whole image. To save time, you can restrict recognition to a rectangular region by passing its top-left corner, width and height:
s = ocrEngine.recognize("test.png", -1, 0, 0, 400, 200, OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)
The above code performs OCR on the top-left region of the image, starting at (0, 0), 400 pixels wide and 200 pixels high.
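A region that extends past the image edge is easy to request by accident. The sketch below clips a rectangle to the image bounds before it is handed to the recognize call. The helper name clamp_region is hypothetical and not part of Asprise OCR, and the image dimensions are assumed to be known to the caller.

```python
def clamp_region(x, y, width, height, image_width, image_height):
    """Clip a rectangle to the image bounds and return (x, y, width, height).

    Hypothetical helper, not part of the Asprise OCR API. The image size
    must be supplied by the caller.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    # Compute the clipped edges of the rectangle.
    left = max(0, x)
    top = max(0, y)
    right = min(image_width, x + width)
    bottom = min(image_height, y + height)
    if right <= left or bottom <= top:
        raise ValueError("region lies entirely outside the image")
    return left, top, right - left, bottom - top


# A 400 x 200 request on an image that is only 300 pixels wide.
region = clamp_region(0, 0, 400, 200, image_width=300, image_height=1000)
print(region)
```

The four returned values can be passed in place of 0, 0, 400, 200 in the call above.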
Perform OCR on multiple input files in one shot
s = ocrEngine.recognize("test1.png;test2.png", -1, -1, -1, -1, -1, OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)
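As the example shows, multiple input files are passed as a single string separated by semicolons. When the file names come from a list, a small helper can build that string. This is a minimal sketch; join_input_files is a hypothetical name, not an Asprise OCR function, and rejecting paths that contain a semicolon is an assumption made to keep the separator unambiguous.

```python
def join_input_files(paths):
    """Build the semicolon-separated file argument from a list of paths.

    Hypothetical helper, not part of the Asprise OCR API.
    """
    names = [str(p) for p in paths]
    if not names:
        raise ValueError("at least one input file is required")
    for name in names:
        # A semicolon inside a path would be read as a separator.
        if ";" in name:
            raise ValueError("path contains ';': %r" % name)
    return ";".join(names)


files = join_input_files(["test1.png", "test2.png"])
print(files)
```

The resulting string can be used as the first argument of the recognize call.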
Perform OCR on a certain page from the specified TIFF file
A TIFF file may contain multiple pages. If you need to recognize only a certain page, specify the page number as follows:
s = ocrEngine.recognize("test.tif", 2, -1, -1, -1, -1, OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)
Note that 2 means the second page (page numbers start at 1).
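Because page numbers start at 1 while Python sequences are indexed from 0, an off-by-one error is easy to make. The following sketch converts a zero-based index into the page argument. The helper page_argument and the constant ALL_PAGES are hypothetical names; treating -1 as "every page" is an assumption based on how -1 is used in the other examples on this page.

```python
ALL_PAGES = -1  # assumption: -1 selects every page, as in the other examples


def page_argument(index=None):
    """Convert a zero-based page index to the one-based page argument.

    Hypothetical helper, not part of the Asprise OCR API.
    Passing None returns ALL_PAGES.
    """
    if index is None:
        return ALL_PAGES
    if index < 0:
        raise ValueError("page index must not be negative")
    return index + 1


# Index 1 is the second page of the file, so the page argument is 2.
print(page_argument(1))
```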
Perform OCR on a PDF input file
Use the following call to perform OCR on a PDF input file:
s = ocrEngine.recognize("test.pdf", -1, -1, -1, -1, -1, OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)
Read the developer's guide for more details.