Optimizing images of varying brightness for OCR with OpenCV

Question

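I have images of the following kinds:

I want to preprocess them to get the best possible OCR results, but as you can see, they differ in brightness and in sharpness. Is there some "generic" adjustment I can apply to extract the text and get the best OCR results?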

Answer 1

You can use EasyOCR to get correct results for these cases. It works on both the blurred and the non-blurred images.

import easyocr
import cv2
import numpy as np


def unsharp_mask(image, kernel_size=(5, 5), sigma=1.0, amount=1.0, threshold=0):
    """Return a sharpened version of the image, using an unsharp mask."""
    blurred = cv2.GaussianBlur(image, kernel_size, sigma)
    # Work in float so the differences below cannot wrap around in uint8.
    image_f = image.astype(np.float32)
    blurred_f = blurred.astype(np.float32)
    sharpened = (amount + 1.0) * image_f - amount * blurred_f
    sharpened = np.clip(sharpened, 0, 255).round().astype(np.uint8)
    if threshold > 0:
        # Keep the original pixels where local contrast is below the threshold.
        low_contrast_mask = np.abs(image_f - blurred_f) < threshold
        np.copyto(sharpened, image, where=low_contrast_mask)
    return sharpened

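def increase_brightness(img, value):
    """Raise the HSV value channel of a BGR image by `value` (expected 0..255)."""
    hsv = cv2.cvtColor(img, cv2.COLOR_BGR2HSV)
    h, s, v = cv2.split(hsv)

    # Saturate at 255: pixels that would overflow uint8 are pinned to white,
    # and only the remaining pixels get the offset added.
    lim = 255 - value
    v[v > lim] = 255
    v[v <= lim] += value

    final_hsv = cv2.merge((h, s, v))
    img = cv2.cvtColor(final_hsv, cv2.COLOR_HSV2BGR)
    return img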

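image = cv2.imread('if8nC.png')
if image is None:
    # cv2.imread returns None instead of raising when the file cannot be read.
    raise FileNotFoundError('if8nC.png')
sharpened = unsharp_mask(image)
# Brightness value per sample image: 60 for 5qoOk.png, 10 for if8nC.png.
brightened = increase_brightness(sharpened, value=10)
cv2.imwrite('resize.png', brightened)

reader = easyocr.Reader(['en'], gpu=False)
result = reader.readtext('resize.png')
for detection in result:
    print(detection)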
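
A side note on the `lim` check in increase_brightness: OpenCV images are uint8 arrays, and adding to a uint8 array wraps around modulo 256 instead of stopping at 255, so bright pixels would turn dark without it. The sketch below shows the wrap and one alternative way to saturate. `add_brightness` is a hypothetical helper written for this illustration, not part of OpenCV or NumPy.

```python
import numpy as np

def add_brightness(v, value):
    """Add `value` to a uint8 channel, saturating at 0 and 255 instead of wrapping."""
    v = np.asarray(v, dtype=np.uint8)
    # Widen to int16 so the sum cannot overflow, then clip back into 0..255.
    return np.clip(v.astype(np.int16) + value, 0, 255).astype(np.uint8)

v = np.array([0, 100, 250], dtype=np.uint8)
wrapped = v + np.uint8(10)         # plain uint8 addition wraps modulo 256
saturated = add_brightness(v, 10)  # clipped addition stops at 255

print(wrapped.tolist())    # [10, 110, 4]
print(saturated.tolist())  # [10, 110, 255]
```

The masked assignment in the answer and the clip shown here give the same result for values from 0 to 255; the clip version also handles negative values, which darken the image.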

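The only adjustment you have to make is to the brightness value, somewhere in the range 0 to 100. With that, it works for all of these cases. The output is: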

([[1, 0], [282, 0], [282, 68], [1, 68]], 'Tvrdosin', 0.4517089309490733)
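
Each detection is a (bounding box, text, confidence) tuple, so rather than picking the brightness by hand you could try several values and keep the most confident result. This is only a rough sketch of that idea, not something tested on the images above. `best_detection`, `sweep_brightness` and `run_ocr` are hypothetical names, and `fake_ocr` returns made-up placeholder numbers so that the example runs without EasyOCR installed.

```python
def best_detection(detections):
    """Return the (bbox, text, confidence) tuple with the highest confidence, or None."""
    return max(detections, key=lambda d: d[2], default=None)

def sweep_brightness(run_ocr, values=range(0, 101, 10)):
    """Try each brightness value and keep the most confident detection.

    `run_ocr` is a hypothetical callable: it takes a brightness value and
    returns a list of (bbox, text, confidence) tuples, for example by calling
    increase_brightness and reader.readtext from the code above.
    """
    best_value, best = None, None
    for value in values:
        candidate = best_detection(run_ocr(value))
        if candidate is not None and (best is None or candidate[2] > best[2]):
            best_value, best = value, candidate
    return best_value, best

# Stand-in for real OCR; the texts and confidences here are invented placeholders.
def fake_ocr(value):
    box = [[1, 0], [282, 0], [282, 68], [1, 68]]
    made_up = {0: [], 10: [(box, 'Tvrdosin', 0.45)], 20: [(box, 'Tvrdosln', 0.30)]}
    return made_up.get(value, [])

print(sweep_brightness(fake_ocr))
```

In real use, `run_ocr` would wrap the sharpening, brightening and readtext steps. Confidence is only a proxy for correctness, so it is worth checking the winning text by eye on a few images.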

Solution 1
1 2022-06-28 18:48:34
