[英]How to install textract in Python 3?
我想从 pdf 中提取,但pypdf2
没有提取所有信息,并且由于以下错误, textract
无法在 3.7 中安装:
UnicodeDecodeError: 'charmap' codec can't decode byte 0x8d in position 1671: character maps to <undefined>
Download the source file for textract
from: https://pypi.python.org/pypi/textract从以下位置下载
textract
的源文件: https : textract
pip3 install pdfminer3k
untar
the downloaded file untar
下载的文件
cd
into the directory cd
进入目录
run: python3 setup.py install
运行:
python3 setup.py install
Hope it works for you :)
希望它对你
:)
I have installed textract
on windows 10 with following steps: -我已经通过以下步骤在 Windows 10 上安装了
textract
:-
pip install textract
C:\\Program Files
C:\\Program Files
C:\\Program Files\\poppler-0.68.0\\bin
to path variableC:\\Program Files\\poppler-0.68.0\\bin
到路径变量import textract
import textract
textract.process('path_to_file_with_extension')
For further reference, you can click here如需进一步参考,您可以点击这里
Hope it will be helpful to you!希望对你有帮助!
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.