简体繁体 English

如何在pdf中获取页面的特定部分并将其保存到python中的新pdf？

[英]How do I get a specific part of a page in a pdf and save it to a new pdf in python?

原文 2022-02-25 16:35:02 5 1 python/ python-3.x/ pdf/ pdf-generation/ crop

I have very little experience in manipulating pdfs using python, and my experience is restricted only to reading using 'pdfreader' a python library.我在使用 python 处理 pdf 方面经验很少，我的经验仅限于使用“pdfreader”python 库进行阅读。 I have a pdf, (which in this case is a past exam paper), I want it to split a page when it encounters a question number, let's say 12 for this example (it would be formatted "12."), and save the split part containing the number 12. in a new pdf. How do I do this?我有一个 pdf，（在这种情况下是过去的试卷），我希望它在遇到问题编号时拆分页面，假设这个例子是 12（格式为“12.”），然后保存在新的 pdf 中包含数字 12 的拆分部分。我该怎么做？

I'm not a very good programmer so sorry if my question is stupid, but searching on the inte.net I could not find how to do this.我不是一个很好的程序员，如果我的问题很愚蠢，我很抱歉，但是在 inte.net 上搜索我找不到如何做到这一点。

1 个解决方案

The solution at the end was to transform the pdf page into an image, crop it where I want it, then back to a pdf. To get the coordinates I had to use pdf miner, to then get the pixels to modify the image I had to make a proportion between the height of the page in pdf coordinates and the height of the image I wanted to create in pixels, so then I could transform the coordinates of one into the coordinates of the other.最后的解决方案是将 pdf 页面转换为图像，将其裁剪到我想要的位置，然后返回到 pdf。要获取坐标，我必须使用 pdf 矿工，然后获取像素来修改我的图像在 pdf 坐标中的页面高度与我想以像素为单位创建的图像的高度之间建立一个比例，这样我就可以将一个坐标转换为另一个坐标。

如何使用 Python 拆分 PDF，每个页面都包含一组特定的唯一文本 - How do I split a PDF using Python, every page that contains a set of specific unique text

如何通过python保存Google pdf文件？ - How do I save a Google pdf file through python?

如何在Selenium（Python）中将打开的页面保存为pdf - how to save opened page as pdf in Selenium (Python)

Python阅读了pdf页面的一部分 - Python read part of a pdf page

如何将多页 PDF 转换为 pdf 页面中的 PNG 图像 Python - How do I convert a multiple paged PDF into a PNG image per pdf page in Python

如何使用 selenium 在 python 中获取 web 属性的特定部分？ - how do I get a Specific part of a web attribute in python with selenium?

如何在 Selenium Z23EEEB4347BDD26BFC6B7EE9A3B755 中自动将 web 页面保存为 pdf - How to save web page as pdf automatically in Selenium python

如何将 python plot 保存到 PDF - How to save python plot to a PDF

用python 3中另一个PDF的页面替换PDF中的特定页面 - Replace a Specific page in a PDF with a page from another PDF in python 3

如何在 Python 文档中的 PDF 文档中包含 PDF 中的页面 - How to include page in PDF in PDF document in Python

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 如何使用 Python 拆分 PDF，每个页面都包含一组特定的唯一文本 - How do I split a PDF using Python, every page that contains a set of specific unique text 如何通过python保存Google pdf文件？ - How do I save a Google pdf file through python? 如何在Selenium（Python）中将打开的页面保存为pdf - how to save opened page as pdf in Selenium (Python) Python阅读了pdf页面的一部分 - Python read part of a pdf page 如何将多页 PDF 转换为 pdf 页面中的 PNG 图像 Python - How do I convert a multiple paged PDF into a PNG image per pdf page in Python 如何使用 selenium 在 python 中获取 web 属性的特定部分？ - how do I get a Specific part of a web attribute in python with selenium? 如何在 Selenium Z23EEEB4347BDD26BFC6B7EE9A3B755 中自动将 web 页面保存为 pdf - How to save web page as pdf automatically in Selenium python 如何将 python plot 保存到 PDF - How to save python plot to a PDF 用python 3中另一个PDF的页面替换PDF中的特定页面 - Replace a Specific page in a PDF with a page from another PDF in python 3 如何在 Python 文档中的 PDF 文档中包含 PDF 中的页面 - How to include page in PDF in PDF document in Python

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM