[英]Keep the original Html format when scrapping a website element using Jsoup
The below code only retrieves the simple text but I want to retain the HTML format.下面的代码只检索简单的文本,但我想保留 HTML 格式。 Here is my sample code:
这是我的示例代码:
public class doIT extends AsyncTask<Void,Void,Void> {
//String words = wordToTranslate;
String translated;
@Override
protected void onPreExecute() {
super.onPreExecute();
}
@Override
protected Void doInBackground(Void... params) {
try
{
Document document = Jsoup.connect("https://studentdevos.com/").get();
org.jsoup.select.Elements elements = document.getElementsByClass("post-entry");
translated = Jsoup.parse(elements.html()).wholeText();
}
catch (IOException e)
{
e.printStackTrace();
} return null;
}
@Override
protected void onPostExecute(Void aVoid)
{
tv_Jsoupe.setText(Html.fromHtml(translated));
super.onPostExecute(aVoid);
}
}
I think that the Jsoup.parse() is what retrieves the simple text.我认为 Jsoup.parse() 是检索简单文本的方法。
code:代码:
translated = elements.html();
Change this line as well from:也从以下位置更改此行:
tv_Jsoupe.setText(Html.fromHtml(translated));
to到
tv_Jsoupe.setText(translated);
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.