[英]Tesseract OCR: image to text containing 2 columns of text
I have an article in PNG format with 2 columns of text that I'm trying to read using Python and Tesseract OCR. 我有一篇PNG格式的文章,我尝试使用Python和Tesseract OCR读取两列文本。 However, by default, Tesseract reads from left to right in a horizontal wa.
但是,默认情况下,Tesseract在水平wa中从左到右读取。 Is there an option to automatically detect the columns in the text and read from left to right column by column?
是否可以选择自动检测文本中的列并从左到右逐列读取?
As far as I know your only chance is that one of the page segment mode available will work with your image 据我所知,您唯一的机会是可以使用一种页面细分模式来处理您的图像
docs here: https://github.com/tesseract-ocr/tesseract/wiki/Command-Line-Usage#using-different-page-segmentation-modes 此处的文档: https : //github.com/tesseract-ocr/tesseract/wiki/Command-Line-Usage#using-different-page-segmentation-modes
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.