Reading the page source inside <form> of a web page

Original post: 2012-06-13 10:39:18 Tags: java, html-parsing, jsoup, htmlunit

Can anyone help me read the page source inside the <form> tag?

I have tried HtmlUnit and jsoup, but they return only the contents inside and tags. Any response is highly appreciated.

2 answers

In jsoup, use element.html() to read the HTML inside an element. It returns the inner markup, not the tag itself (element.outerHtml() includes the tag).

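For example:

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
// Quotes inside the Java string literal must be escaped
String html = "<p>An </p><form action=\"SOMESERVLET\"><b>example</b></form> ";
Document doc = Jsoup.parse(html);
String htmlContent = doc.select("form").first().html(); // markup inside the first <form>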

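For your case:

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

// connect() needs a full URL, including the protocol
Document doc = Jsoup.connect("http://example.com").get();
// Print the inner HTML of every <form> on the page
for (Element element : doc.select("form")) {
    System.out.println(element.html());
}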

Step by step:

1. Read the HTML from the URL into a string.
2. Find the <form> tag; its position is the start index.
3. Find the </form> tag; its position is the end index. If this tag is not present, use the length of the string as the end index.
4. Take the substring from the start index to the end index.

It is a simple algorithm, but I think there are a lot of tools that can help you.
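The steps above can be sketched with only the standard library, roughly as follows. This is a minimal, naive sketch and not a real HTML parser: the class and method names (FormExtractor, fetch, extractFirstForm) are hypothetical, the page is assumed to be UTF-8, and nested forms or attribute values containing ">" are not handled. One small variation on the steps: the start index is taken just after the opening tag's ">", so only the markup inside the form is returned.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.net.URI;
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
import java.util.stream.Collectors;

// Hypothetical helper class, for illustration only.
class FormExtractor {

    // Case-insensitive so that <FORM ...> and </FORM> are found as well.
    private static final Pattern OPEN =
            Pattern.compile("<form\\b[^>]*>", Pattern.CASE_INSENSITIVE);
    private static final Pattern CLOSE =
            Pattern.compile("</form\\s*>", Pattern.CASE_INSENSITIVE);

    /** Step 1: read the HTML from a URL into a string (assumes UTF-8). */
    static String fetch(String url) throws IOException {
        try (BufferedReader reader = new BufferedReader(new InputStreamReader(
                URI.create(url).toURL().openStream(), StandardCharsets.UTF_8))) {
            return reader.lines().collect(Collectors.joining("\n"));
        }
    }

    /** Steps 2-4: return the markup inside the first form, or null if there is none. */
    static String extractFirstForm(String html) {
        Matcher open = OPEN.matcher(html);
        if (!open.find()) {
            return null; // no <form> tag at all
        }
        int start = open.end(); // just past the opening tag
        Matcher close = CLOSE.matcher(html);
        // If </form> is missing, fall back to the end of the string.
        int end = close.find(start) ? close.start() : html.length();
        return html.substring(start, end);
    }

    public static void main(String[] args) throws IOException {
        // With no argument, use the sample string from the first answer.
        String html = args.length > 0
                ? fetch(args[0])
                : "<p>An </p><form action=\"SOMESERVLET\"><b>example</b></form> ";
        System.out.println(extractFirstForm(html)); // sample input prints <b>example</b>
    }
}
```

For anything beyond a quick check, a real parser such as jsoup is likely the safer choice, since index-based matching breaks easily on irregular markup.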

