简体繁体 English

从 pdf 中提取图像作为 pdf

[英]Extract images from a pdf as pdfs

原文 2017-10-03 15:17:27 7 1 python/ pdf/ adobe/ acrobat

I need to find a tool (python, adobe suite, some cmd line utility, etc) that can extract images from a PDF as individual PDF files - not jpegs, pngs, etc.我需要找到一个工具（python、adobe 套件、一些 cmd 行实用程序等），它可以从 PDF 中提取图像作为单独的 PDF 文件 - 而不是 jpegs、pngs 等。

Does such a thing exist?这样的事情存在吗？ Seems like there is a bunch of stuff out there for extracting image files to png, jpeg, etc, but nothing for extracting the images as PDFs.似乎有很多东西可以将图像文件提取为 png、jpeg 等，但没有用于将图像提取为 PDF 的东西。 A strange request I know.我知道一个奇怪的要求。

I am working with a large set of PDFs that contain images that are comprised of all kinds of different images formats, bitmaps, vectors, etc. If there was some way to programmatically pull out images as pdfs it would save me a lot of time.我正在处理大量 PDF，其中包含由各种不同图像格式、位图、矢量等组成的图像。如果有某种方法可以以编程方式将图像提取为 pdf，这将为我节省大量时间。

Right now I am selecting a portion of the page in the PDF in acrobat pro, choosing to edit in illustrator, and then saving as PDF.现在我在 acrobat pro 中选择 PDF 中的一部分页面，选择在 illustrator 中编辑，然后另存为 PDF。

Very time consuming.非常耗时。

Any ideas?有任何想法吗？

1 个解决方案

You could use poppler 's pdfimages utility to extract all bitmap images as-is from a PDF.您可以使用poppler的pdfimages实用程序从 PDF 中按pdfimages提取所有位图图像。 In a second step, you can convert these bitmaps back to PDFs.第二步，您可以将这些位图转换回 PDF。 img2pdf seems like a good candidate for this. img2pdf似乎是一个很好的候选者。