Java-从url读取页面源不起作用

Question

I am using the code below to read page source from url. 我正在使用下面的代码从url读取页面源代码。 It works almost for all urls but not for this url and just returns the url itself. 它几乎适用于所有网址，但不适用于此网址，仅返回网址本身。

public static String getURLSource(String url) throws IOException
{
    URL urlObject = new URL(url);
    URLConnection urlConnection = urlObject.openConnection();
    //urlConnection.setRequestProperty("User-Agent", "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.11 (KHTML, like Gecko) Chrome/23.0.1271.95 Safari/537.11");

    return toString(urlConnection.getInputStream());
}

private static String toString(InputStream inputStream) throws IOException
{
    try (BufferedReader bufferedReader = new BufferedReader(new InputStreamReader(inputStream, "UTF-8")))
    {
        String inputLine;
        StringBuilder stringBuilder = new StringBuilder();
        while ((inputLine = bufferedReader.readLine()) != null)
        {
            stringBuilder.append(inputLine);
        }

        return stringBuilder.toString();
    }
}

What is the problem and how can I modify the code to work properly? 有什么问题，如何修改代码才能正常工作？ Thanks. 谢谢。

Answer 1

您必须使用HttpsURLConnection，因为它是https。

Java-从url读取页面源不起作用

问题描述

1 个解决方案

解决方案1
1 2019-01-29 14:17:42

Java-从url读取页面源不起作用

问题描述

1 个解决方案

解决方案1 1 2019-01-29 14:17:42

解决方案1
1 2019-01-29 14:17:42