[英]How can i search specific letters / words in image by pytesseract
我使用pytesseract
构建一个项目,它通常会给出一个图像作为回报,其中所有字母都被颜色覆盖。 例如 -
我的这个项目的代码是 -
import cv2
import pytesseract
pytesseract.pytesseract.tesseract_cmd = 'C:\\Program Files\\Tesseract-OCR\\tesseract.exe'
img = cv2.imread('testing.png')
img = cv2.cvtColor(img,cv2.COLOR_BGR2RGB)
#Detecting Characters
hImg,wImg,_ = img.shape
boxes = pytesseract.image_to_boxes(img)
for b in boxes.splitlines():
print(b)
b = b.split(' ')
print(b)
x,y,w,h = int(b[1]),int(b[2]),int(b[3]),int(b[4])
cv2.rectangle(img,(x,hImg-y),(w,hImg-h),(0,0,255),1)
cv2.putText(img,b[0],(x,hImg-y+20),cv2.FONT_HERSHEY_COMPLEX,0.5,(0,0,255),1)
cv2.imshow('Result',img)
cv2.waitKey(0)
现在我试图从中找出特定的字母,作为回报,程序将提供一个图像,其中只有搜索到的字母会被彩色覆盖。 我在代码中所做的编辑是
import cv2
import pytesseract
pytesseract.pytesseract.tesseract_cmd = 'C:\\Program Files\\Tesseract-OCR\\tesseract.exe'
img = cv2.imread('testing.png')
img = cv2.cvtColor(img,cv2.COLOR_BGR2RGB)
Q = input("Which letter do you want to find? : ")
#Detecting Characters
hImg,wImg,_ = img.shape
boxes = pytesseract.image_to_boxes(img)
search = []
result = []
count = 0
for b in boxes.splitlines():
# print(b)
b = b.split(' ')
#print(b)
x,y,w,h,= int(b[1]),int(b[2]),int(b[3]),int(b[4],)
s = b[0]
search.append(s)
print(search)
for i in range(len(search)):
if search[i] == Q:
result.append(search[i])
count += 1
# else:
# print("Your desired letter is not found")
print(result)
print("Your desired letter which is", Q , "is found", count , "times")
# cv2.rectangle(img,(x,hImg-y),(w,hImg-h),(0,0,255),1)
# cv2.putText(img,b[0],(x,hImg-y+20),cv2.FONT_HERSHEY_COMPLEX,0.5,(0,0,255),1)
#
# cv2.imshow('Result',img)
# cv2.waitKey(0)
我可以找出字母,但我无法将其注入图片中。
我要的程序是,如果我在程序中搜索H
,那么图片将覆盖程序中的所有H
,仅此而已
任何人都可以帮助我吗?
我已经解决了这个问题。 如果有人想知道,那么修复就在这个循环中——
for b in boxes.splitlines():
if (b[0]== Q):
b = b.split(' ')
x, y, w, h = int(b[1]), int(b[2]), int(b[3]), int(b[4])
cv2.rectangle(img,(x,hImg-y),(w,hImg-h),(0,0,255),1)
cv2.putText(img,b[0],(x,hImg-y+20),cv2.FONT_HERSHEY_COMPLEX,0.5,(0,0,255),1)
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.