简体   繁体   中英

java.io.IOException: unexpected end of stream on Connection jsoup

I am parsing some lyrics site, and i have error from the header. URL, that i give to it (for example): http://www.azlyrics.com/lyrics/linkinpark/intheend.html

class GetLyrics extends AsyncTask<String, Void, String> {
protected String doInBackground(String... urls) {
    String url = urls[0];
    String output;
    output = "If you see this, some kind of error has occupied";
    try {
        Document document = Jsoup.connect(url).get(); //I dont know how it works, its google
        document.outputSettings(new Document.OutputSettings().prettyPrint(false));//makes html() preserve linebreaks and spacing
        document.select("br").append("\\n");
        Elements lyrics = document.select("b + br + br + div"); //Search for lyrics <div> tag, that after <b> and 2 <br> tags
        String s = lyrics.html().replaceAll("\\\\n", "\n"); //Google again
        output = Jsoup.clean(s, "", Whitelist.none(), new Document.OutputSettings().prettyPrint(false));
        output = output.replace("\n\n", "\n");
        output = output.substring(4); //Remove first enters
    }
    catch (HttpStatusException e) {
        System.err.println("404 error: " + e);
        System.err.println("Check your input data");
        output = "An 404 error has occurred, more info:\n" + e + "\nCheck your input data";
        Log.d("LyricFinder", e.toString());
    }
    catch (Exception e) {
        System.err.println("Some error: " + e);
        output = "An uknown error has occurred\nCheck your internet connection";
        Log.d("LyricFinder", e.toString());
    }
    return output;
}

protected void onPostExecute(String lyrics) {
    lyricsOutput.setText(lyrics);
}

} And logs are:

D/LyricFinder: java.io.IOException: unexpected end of stream on Connection{www.azlyrics.com:80, proxy=DIRECT@ hostAddress=85.17.159.246 cipherSuite=none protocol=http/1.1} (recycle count=0)

In eclipse console project this code works perfectly (but without this asynctask :/ )
Idk what to do, because my question remain unanswered

Ok, so on another forum I found the solution:

Document document = Jsoup.connect(url)
                    .userAgent("Mozilla/5.0 (Windows; U; WindowsNT 5.1; en-US; rv1.8.1.6) Gecko/20070725 Firefox/2.0.0.6")
                    .referrer("http://www.google.com")
                    .get();

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM