在python中从pdf提取页眉和页脚

Question

I have read a pdf using pdfminer . 我已经使用pdfminer阅读了pdf。 I want to detect the header and footer of the pdf. 我想检测pdf的页眉和页脚。 Please let me know if there is any possibility. 请让我知道是否有可能。

Answer 1

Also possible with Apache Tika: Apache Tika也可以：

import tika
from tika import parser

FileName = "PDF File Name"
PDF_Parse = parser.from_file(FileName)
print(PDF_Parse ['content'])
print(PDF_Parse ['metadata']) # Format-Dictionary

在python中从pdf提取页眉和页脚

问题描述

1 个解决方案

解决方案1
1 已采纳 2019-01-30 09:53:05

在python中从pdf提取页眉和页脚

问题描述

1 个解决方案

解决方案1 1 已采纳 2019-01-30 09:53:05

解决方案1
1 已采纳 2019-01-30 09:53:05