嘗試遍歷多個 PDF 文件並將這些 PDF 的各個頁面保存為圖像

Question

我正在研究一個 python 項目，該項目需要我一個接一個地遍歷存儲在當前目錄的 sample/ 文件夾中的多個 pdf，並將這些 pdf 的各個頁面作為圖像保存在另一個名為 convert_images/ 的目錄中。 有人能幫我嗎？ 所有 pdf 文件都是隨機命名的，但具有“.pdf”擴展名。

Answer 1

你可以用pdf2image

pip install pdf2image

    from pdf2image import convert_from_path
    pages = convert_from_path('pdf_file', 500)
    for page in pages:
        page.save('out.jpg', 'JPEG')

或者：

import pypdfium2 as pdfium

pdffile = 'path/to/your_doc.pdf'

# render multiple pages concurrently (in this case: all)
for image, suffix in pdfium.render_pdf(pdffile):
    image.save(f'output_{suffix}.jpg')

# render a single page (in this case: the first one)
with pdfium.PdfContext(pdffile) as pdf:
    image = pdfium.render_page(pdf, 0)
    image.save('output.jpg')

嘗試遍歷多個 PDF 文件並將這些 PDF 的各個頁面保存為圖像

問題描述

1 個解決方案

解決方案1
0 2022-01-20 10:22:18

嘗試遍歷多個 PDF 文件並將這些 PDF 的各個頁面保存為圖像

問題描述

1 個解決方案

解決方案1 0 2022-01-20 10:22:18

解決方案1
0 2022-01-20 10:22:18