
Tesseract OCR: image to text containing 2 columns of text

I have an article in PNG format with 2 columns of text that I'm trying to read using Python and Tesseract OCR. However, by default, Tesseract reads from left to right in a horizontal way, so each line runs across both columns. Is there an option to automatically detect the columns in the text and read them column by column, from left to right?

As far as I know, your only chance is that one of the available page segmentation modes works with your image.

Docs here: https://github.com/tesseract-ocr/tesseract/wiki/Command-Line-Usage#using-different-page-segmentation-modes
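To make that concrete, here is a minimal sketch of trying several page segmentation modes from Python. It assumes the `pytesseract` wrapper and Pillow are installed and that the `tesseract` binary is on your PATH. The helper names `psm_config` and `ocr_with_each_psm`, and the choice of which modes to try, are my own for illustration and not something from the docs. Which mode (if any) handles your columns depends on the image, so treat this as a way to compare outputs rather than a guaranteed fix.

```python
import sys

# Page segmentation modes that seem worth comparing on a multi-column page.
# The numbers and descriptions follow the list printed by `tesseract --help-psm`;
# picking these four is an assumption for illustration, not a recommendation.
CANDIDATE_PSMS = {
    1: "automatic page segmentation with OSD",
    3: "fully automatic page segmentation, but no OSD",
    4: "single column of text of variable sizes",
    6: "single uniform block of text",
}


def psm_config(psm: int) -> str:
    """Build the config string that selects a page segmentation mode."""
    if not 0 <= psm <= 13:
        raise ValueError(f"psm must be between 0 and 13, got {psm}")
    return f"--psm {psm}"


def ocr_with_each_psm(image_path, psms=CANDIDATE_PSMS):
    """Run OCR once per mode and return {psm: recognized text}."""
    # Imported here so the helpers above can be used without Tesseract installed.
    import pytesseract
    from PIL import Image

    image = Image.open(image_path)
    return {
        psm: pytesseract.image_to_string(image, config=psm_config(psm))
        for psm in psms
    }


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python this_script.py article.png
    for psm, text in ocr_with_each_psm(sys.argv[1]).items():
        print(f"=== --psm {psm} ({CANDIDATE_PSMS[psm]}) ===")
        print(text)
```

Run it on your PNG and check which output keeps each column's text together. On the command line the same thing is `tesseract article.png out --psm 1`, where `article.png` is a placeholder file name.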

