[英]Extract header and footer from pdf in python
I have read a pdf using pdfminer
. 我已经使用
pdfminer
阅读了pdf。 I want to detect the header and footer of the pdf. 我想检测pdf的页眉和页脚。 Please let me know if there is any possibility.
请让我知道是否有可能。
Also possible with Apache Tika: Apache Tika也可以:
import tika
from tika import parser
FileName = "PDF File Name"
PDF_Parse = parser.from_file(FileName)
print(PDF_Parse ['content'])
print(PDF_Parse ['metadata']) # Format-Dictionary
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.