Problems with parsing facebook using Jsoup

Question

I wrote a program to parse facebook, and I can get the whole DOM tree already. Things go well but when I want to select all <p> -tags, the problem is it returns a zero sized array. PS: Nothing goes wrong when I parse other websites but facebook.

Here is my code:

public static void main(String[] args) throws IOException {
    doc = connect(); //connect the website,
    System.out.print(doc.outerHtml());//in the wole html file, i can find the tag <p>
    newsHeadlines = doc.select("p"); //nothing
    doc.getElementsByTag("p");//nothing either
    oldEleStr = newsHeadlines.text();
    System.out.println(oldEleStr);//nothing
}


static Document connect() throws IOException {
    org.jsoup.Connection connection = Jsoup
            .connect("facebook.com")
            .cookies(
                    splitCookies(facebookCookies));
    Document doc = connection.get();
    return doc;
}

Answer 1

You may want to try something like:

Document new_doc = Jsoup.parse(doc.outerHtml());
Elements elements = doc.select("p");
for (Element aa : elements) {
    //TODO:
}

Problems with parsing facebook using Jsoup

Question

1 answers

solution1
0 2015-01-19 15:09:01

Problems with parsing facebook using Jsoup

Question

1 answers

solution1 0 2015-01-19 15:09:01

solution1
0 2015-01-19 15:09:01