Apache Commons IO: downloading only the first page of a PDF

Question

I'm using Java with Apache Commons IO to download a PDF, but I only want the first page. Is there a way to do that?

Here's the code that downloads the whole document:

public void getPDF(String route) throws IOException {
    URL url = new URL(route);
    File file = new File("file.pdf");
    FileUtils.copyURLToFile(url, file);
}
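
For context, FileUtils.copyURLToFile streams every byte behind the URL into the target file and has no notion of PDF pages, so picking a page has to happen after the download. Below is a minimal JDK-only sketch of roughly the same copy step. The class name UrlDownload and the method download are hypothetical and not part of Commons IO, and the library's own implementation may differ in details such as buffering.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URL;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Hypothetical helper for illustration only; not a Commons IO class.
final class UrlDownload {

    // Copies the complete resource behind `url` to `target` and returns the byte count.
    // The copy is byte-for-byte: nothing here can stop at a PDF page boundary.
    static long download(URL url, Path target) throws IOException {
        Path parent = target.toAbsolutePath().getParent();
        if (parent != null) {
            Files.createDirectories(parent); // make sure the destination folder exists
        }
        try (InputStream in = url.openStream()) {
            return Files.copy(in, target, StandardCopyOption.REPLACE_EXISTING);
        }
    }
}
```

Calling UrlDownload.download(new URL(route), Paths.get("file.pdf")) would leave the full PDF on disk, ready for a PDF library to pick out a single page.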

Answer 1

Continuing from your code, you can create a new PDDocument (from Apache PDFBox) that holds only the first page of the downloaded PDF:

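// Requires org.apache.pdfbox.pdmodel.PDDocument and PDPage (PDFBox 1.8.x API).
// Download the whole file first; Commons IO copies bytes and knows nothing about pages.
URL url = new URL(route);
File file = new File("file.pdf");
FileUtils.copyURLToFile(url, file);

int pageNumberToRead = 0; // zero-based index, so 0 is the first page

PDDocument pdDoc = PDDocument.load(file);
PDDocument document = new PDDocument();
try {
    // In PDFBox 2.x getAllPages() no longer exists; use pdDoc.getPage(pageNumberToRead) instead.
    PDPage firstPage = (PDPage) pdDoc.getDocumentCatalog().getAllPages().get(pageNumberToRead);
    document.addPage(firstPage);
    document.save("basepath/first_page.pdf");
} catch (Exception e) {
    // Do not swallow failures silently; report them to the caller.
    throw new IOException("Could not write basepath/first_page.pdf", e);
} finally {
    document.close();
    pdDoc.close(); // close the source last: the new document still references its page
}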

Solution 1
0 | Accepted | 2019-02-15 06:27:31