如何從python中的驗證碼圖像中提取數字？

Question

我想從驗證碼圖像中提取數字，所以我從這個答案這個答案中嘗試了這個代碼：

try:
    from PIL import Image
except ImportError:
    import Image
import pytesseract
import cv2

file = 'sample.jpg'

img = cv2.imread(file, cv2.IMREAD_GRAYSCALE)
img = cv2.resize(img, None, fx=10, fy=10, interpolation=cv2.INTER_LINEAR)
img = cv2.medianBlur(img, 9)
th, img = cv2.threshold(img, 185, 255, cv2.THRESH_BINARY)
kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (4,8))
img = cv2.morphologyEx(img, cv2.MORPH_CLOSE, kernel)
cv2.imwrite("sample2.jpg", img)


file = 'sample2.jpg'
text = pytesseract.image_to_string(file)
print(''.join(x for x in text if x.isdigit()))

它適用於這張圖片：

outPut: 436359
但是，當我在這張圖片上嘗試時：

它什么也沒給我， outPut: 。
如何修改我的代碼以從第二張圖像中獲取數字作為字符串？

Answer 1

通常，在像這樣的圖像上獲得正確的 OCR 與轉換的順序和參數有關。 例如，在下面的代碼片段中，我首先轉換為灰度，然后腐蝕像素，然后膨脹，然后再次腐蝕。 我使用閾值轉換為二進制（只是黑色和白色），然后再膨脹和腐蝕一次。 這對我來說產生了正確的 859917 值並且應該是可重現的。

import cv2
import numpy as np
import pytesseract

file = 'sample2.jpg'
img = cv2.imread(file)
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
ekernel = np.ones((1,2),np.uint8)
eroded = cv2.erode(gray, ekernel, iterations = 1)
dkernel = np.ones((2,3),np.uint8)
dilated_once = cv2.dilate(eroded, dkernel, iterations = 1)
ekernel = np.ones((2,2),np.uint8)
dilated_twice = cv2.erode(dilated_once, ekernel, iterations = 1)
th, threshed = cv2.threshold(dilated_twice, 200, 255, cv2.THRESH_BINARY)
dkernel = np.ones((2,2),np.uint8)
threshed_dilated = cv2.dilate(threshed, dkernel, iterations = 1)
ekernel = np.ones((2,2),np.uint8)
threshed_eroded = cv2.erode(threshed_dilated, ekernel, iterations = 1)
text = pytesseract.image_to_string(threshed_eroded)
print(''.join(x for x in text if x.isdigit()))

如何從python中的驗證碼圖像中提取數字？

問題描述

1 個解決方案

解決方案1
0 2021-11-01 02:32:24

如何從python中的驗證碼圖像中提取數字？

問題描述

1 個解決方案

解決方案1 0 2021-11-01 02:32:24

解決方案1
0 2021-11-01 02:32:24