
OCR with tesseract, pre-processing image

Original 2019-05-31 03:08:24 9 1 python/ python-3.x/ image-processing/ ocr/ python-tesseract

I need to extract digits from images like the one shown below. I'm using Tesseract now, but it isn't working. Can anyone help me pre-process the images before feeding them to Tesseract?

1 answer

I don't think Tesseract is the right tool for this; it only works reliably on clean, high-contrast text.
If all your numbers look like those in the picture, you can try the OpenCV ORB detector: https://opencv-python-tutroals.readthedocs.io/en/latest/py_tutorials/py_feature2d/py_orb/py_orb.html
Or, if that doesn't work, you can use a deep learning approach, such as SSD in Keras or YOLO:
https://github.com/pierluigiferrari/ssd_keras
Another option is to split the image into individual digits (easy if they are all the same size) and train a very simple convolutional neural network with Keras:
https://keras.io/
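The split-and-classify option could look roughly like the sketch below. It assumes the digits are equally wide and the number of digits per image is known; `split_equal_width` and `build_digit_cnn` are hypothetical helper names, and the 28x28 grayscale input size and layer sizes are arbitrary choices, not something taken from the question.

```python
import numpy as np


def split_equal_width(image, n_digits):
    """Cut a 2-D grayscale image into n_digits side-by-side slices."""
    return np.array_split(image, n_digits, axis=1)


def build_digit_cnn(input_shape=(28, 28, 1), num_classes=10):
    """Build a very small CNN that classifies one digit crop."""
    # Imported lazily so the splitting helper works without Keras installed.
    import keras
    from keras import layers

    model = keras.Sequential([
        keras.Input(shape=input_shape),
        layers.Conv2D(16, 3, activation="relu"),
        layers.MaxPooling2D(),
        layers.Conv2D(32, 3, activation="relu"),
        layers.MaxPooling2D(),
        layers.Flatten(),
        layers.Dense(num_classes, activation="softmax"),
    ])
    # Integer labels 0-9 pair with sparse categorical cross-entropy.
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model
```

Each slice would still need to be resized to the model's input shape and scaled to the 0-1 range before calling `model.fit` or `model.predict`, and you would need labeled digit crops to train on.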

Improving image pre-processing for tesseract (video game screenshot)

Pre-processing image before text recognition with Tesseract

Implementation of image pre-processing methods on android

Measuring image processing quality for tesseract ocr

Keras Image Pre-processing Flow Converts RGB Images to BGR

Replicating a Python workflow for pre-processing of an image for Tensorflow in a Javascript environment

Text Pre-processing with NLTK

Tesseract OCR on binary image

How to OCR image with Tesseract

How to data pre-processing in Spark in this case


The technical posts on this site are licensed under CC BY-SA 4.0. If you reprint them, please cite the site URL or the original address. For any questions, please contact: yoyou2525@163.com.


solution1 0 2019-05-31 03:49:30
