Pytesseract cannot recognize even a very simple text line

Question

I think these images are quite simple and clear. Still pytesseract does not work. I really wonder why.

Here is my code

from pytesseract import pytesseract as tesseract
import cv2 as cv

binary = cv.imread(filepath)

lang = 'eng'
config = 'tessedit_char_whitelist=RGB123'
print(tesseract.image_to_string(binary, lang=lang, config=config))

The output is just a blank string.

Answer 1

To Dennlinger's point, I would definitely rotate the image before sending it through PyTess. PyTess should rotate it automatically, though. Should.

Alternatively, I see in your configuration that you have whitelisted "RGB123", which, correct me if I'm wrong, may mean that PyTess is mainly looking for those specific letters and digits.

I'd try omitting that whitelist from your configuration so that it can pick up the "Y" in there.
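
Here is a rough sketch of both suggestions, not a verified fix for your images. Some assumptions: the helper names build_config and ocr_line are made up for illustration, the rotation direction is a guess since I can't tell from here which way the text is turned, and "--psm 7" (treat the image as a single text line) is my own addition. As far as I know, Tesseract's default page segmentation mode does not do orientation detection, so the sketch rotates explicitly. Also note that config variables such as tessedit_char_whitelist are normally passed with the -c flag.

```python
def build_config(whitelist=None, psm=7):
    """Build a Tesseract config string (hypothetical helper).

    psm=7 asks Tesseract to treat the image as a single text line.
    """
    parts = [f"--psm {psm}"]
    if whitelist:
        # Config variables have to be passed with the -c flag.
        parts.append(f"-c tessedit_char_whitelist={whitelist}")
    return " ".join(parts)


def ocr_line(filepath, rotate_code=None, whitelist=None):
    """Optionally rotate the image, then run OCR on it (hypothetical helper)."""
    # Imported here so build_config works without OpenCV/Tesseract installed.
    import cv2 as cv
    import pytesseract

    image = cv.imread(filepath)
    if image is None:
        # cv.imread returns None instead of raising when the file can't be read.
        raise FileNotFoundError(filepath)
    if rotate_code is not None:
        # e.g. cv.ROTATE_90_CLOCKWISE or cv.ROTATE_90_COUNTERCLOCKWISE
        image = cv.rotate(image, rotate_code)
    # OpenCV loads images as BGR; convert to RGB before handing to pytesseract.
    image = cv.cvtColor(image, cv.COLOR_BGR2RGB)
    return pytesseract.image_to_string(
        image, lang="eng", config=build_config(whitelist)
    )
```

I'd first call ocr_line(filepath, rotate_code=cv.ROTATE_90_CLOCKWISE) with no whitelist to see what comes back, and only then add something like whitelist="RGBY123" if stray characters show up.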

solution1
0 2021-11-09 19:16:14