
Reading a PDF with PyPDF2 returns nothing

Originally posted 2016-01-15 12:24:26. Tags: python, pdf

Here is my code, courtesy of http://code.activestate.com/recipes/511465-pure-python-pdf-to-text-converter/. I modified it to use the next version of PyPDF (PyPDF2).

import PyPDF2

def getPDFContent(path):
    content = ""
    # Open in binary mode; file() no longer exists in Python 3
    with open(path, "rb") as f:
        # Load PDF into PyPDF2 (legacy API; PyPDF2 3.0 replaced it with PdfReader)
        pdf = PyPDF2.PdfFileReader(f)
        # Iterate pages
        print("Number of pages is", pdf.getNumPages())

        for i in range(pdf.getNumPages()):
            # Extract text from page and add to content
            content += pdf.getPage(i).extractText() + "\n"
            # Debug output: everything extracted so far
            print(content)

    # Collapse whitespace
    content = " ".join(content.replace("\xa0", " ").strip().split())
    return content

# Non-ASCII characters are printed as XML character references
print(getPDFContent("RL.pdf").encode("ascii", "xmlcharrefreplace").decode("ascii"))
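The whitespace-collapsing and ASCII-encoding steps in the code above can be tried on their own, without a PDF, which helps separate "extraction returned nothing" from "cleanup removed everything". This is a minimal sketch; the helper names `collapse_whitespace` and `to_ascii` and the sample string are made up for illustration and are not part of the original recipe.

```python
def collapse_whitespace(text):
    """Replace non-breaking spaces and squeeze whitespace runs to one space."""
    # split() with no arguments splits on runs of whitespace and drops
    # leading and trailing whitespace; join() then rebuilds the string.
    return " ".join(text.replace("\xa0", " ").split())


def to_ascii(text):
    """Return ASCII text, with other characters as XML character references."""
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


# Made-up sample standing in for text extracted from a page.
sample = "  caf\u00e9\xa0 menu \n\n page 1\t"
cleaned = collapse_whitespace(sample)
print(repr(cleaned))
print(to_ascii(cleaned))
```

If the cleaned string is empty here only when the input is empty or all whitespace, then an empty result from the full script points at the extraction step rather than at the cleanup.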

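The file I am reading is here: http://dmc.kar.nic.in/RL.pdf

All I get is this:

Number of pages is 1, and nothing after that.

Is this a problem with the PDF, or am I making a mistake somewhere? All help is appreciated!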

1 answer

The file turned out to be corrupted.
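A quick way to look for obvious damage before blaming the extraction code is to check the raw bytes for the markers a PDF normally carries. The sketch below is a heuristic only, not a validator: the function name `check_pdf_markers` and the 1024/2048-byte search windows are assumptions chosen for illustration, and a file that passes can still be broken or contain only scanned images with no extractable text.

```python
def check_pdf_markers(data):
    """Return a list of problems found in the raw bytes of a supposed PDF.

    Heuristic only: an empty list does not prove the file is valid.
    """
    problems = []
    # A PDF normally starts with a "%PDF-" header. Search a small window
    # instead of offset 0 only, since some files have leading junk bytes.
    if b"%PDF-" not in data[:1024]:
        problems.append("missing %PDF- header")
    # The file normally ends with "%%EOF"; truncated downloads often lack it.
    if b"%%EOF" not in data[-1024:]:
        problems.append("missing %%EOF marker")
    # "startxref" near the end gives the offset of the cross-reference data.
    if b"startxref" not in data[-2048:]:
        problems.append("missing startxref")
    return problems


# In-memory demo bytes (not a real, renderable PDF).
minimal = b"%PDF-1.4\n1 0 obj\n<< >>\nendobj\nstartxref\n9\n%%EOF\n"
truncated = minimal[:20]

print(check_pdf_markers(minimal))
print(check_pdf_markers(truncated))
```

To run the check against a file on disk, read it in binary mode first, for example `check_pdf_markers(open("RL.pdf", "rb").read())`.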

Related questions

Reading pdf using pyPDF2 with polish characters

PyPDF2: Reading a pdf from a zipfile

How to get Pdf Orientation using PyPDF2

Adding nested bookmarks to a PDF using PyPDF2

Replacing text in a pdf using PyPdf2

how to open pdf file using pypdf2

Concatenate multiple Pdf using PyPDF2

Update a fillable pdf using PyPDF2

How to append PDF pages using PyPDF2

Merge PDF Files using python PyPDF2
