如何从具有.html扩展名的网页以编程方式下载pdf文件？

Question

我在这个论坛上已经审查了所有类似的问题（不仅是这个！），并尝试了所有这些方法，但是仍然无法以编程方式下载测试文件： http : //pdfobject.com/markup/examples/full-browser- window.html

以下是我要下载的测试文件的直接链接。 这是一个具有开放访问权限的测试pdf文件，因此任何人都可以使用它来测试下载方法。

如何下载此特定文件，使其具有pdf扩展名？

Answer 1

要下载文件，也许您可以尝试执行以下操作：

import java.io.File;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.net.URL;
import java.net.URLConnection;

public final class FileDownloader {

    private FileDownloader(){}

    public static void main(String args[]) throws IOException{
        download("http://pdfobject.com/pdf/sample.pdf", new File("sample.pdf"));
    }

    public static void download(final String url, final File destination) throws IOException {
        final URLConnection connection = new URL(url).openConnection();
        connection.setConnectTimeout(60000);
        connection.setReadTimeout(60000);
        connection.addRequestProperty("User-Agent", "Mozilla/5.0");
        final FileOutputStream output = new FileOutputStream(destination, false);
        final byte[] buffer = new byte[2048];
        int read;
        final InputStream input = connection.getInputStream();
        while((read = input.read(buffer)) > -1)
            output.write(buffer, 0, read);
        output.flush();
        output.close();
        input.close();
    }
}

Answer 2

让我给您一个简短的解决方案，它带有一个名为JSoup的库， BalusC经常在他的答案中使用该库。

//Get the response
Response response=Jsoup.connect(location).ignoreContentType(true).execute();

//Save the file 
FileOutputStream out = new FileOutputStream(new File(outputFolder + name));
out.write(response.bodyAsBytes());
out.close();

好吧，您现在必须已经猜到了，pdf就是其中的response.body() 。 您可以使用这段代码下载任何二进制文件。

如何从具有.html扩展名的网页以编程方式下载pdf文件？

问题描述

2 个解决方案

解决方案1
4 已采纳 2013-10-11 02:41:36

解决方案2
1 2013-10-21 16:14:16

如何从具有.html扩展名的网页以编程方式下载pdf文件？

问题描述

2 个解决方案

解决方案1 4 已采纳 2013-10-11 02:41:36

解决方案2 1 2013-10-21 16:14:16

解决方案1
4 已采纳 2013-10-11 02:41:36

解决方案2
1 2013-10-21 16:14:16