
jsoup not scraping all elements?

Original post 2017-06-30 10:14:13 5 1 java / web-scraping / jsoup


When I scrape data from a page that lists 200 elements, the output contains only the first 49 or 50 of them. Why? How can I fix this and get the data for all 200 elements?

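import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;

// Fetch the search results page with a 6-second timeout.
Document d = Jsoup.connect("https://www.ebay.com/sch/i.html?_from=R40&_trksid=p2047675.m570.l1313.TR0.TRC0.H0.Xsilver.TRS0&_nkw=silver&_sacat=0").timeout(6000).get();
Elements ele = d.select("div#ResultSetItems");

// Print the listingid attribute of each result item.
for (Element element : ele.select("li.shic")) {
    String listingId = element.select("li").attr("listingid");
    System.out.println(listingId);
}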

1 answer

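The other elements are fetched on demand by JavaScript, which is very common these days, so they are invisible to jsoup. jsoup only parses the HTML the server returns and does not execute scripts, so there is no way to make it perform those fetches. You will have to find a better way than scraping to get the data. I suggest looking at the API options eBay provides.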
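One way to check this diagnosis is to look at the raw HTML the server returns, since that is all jsoup ever gets to parse. The sketch below is a minimal illustration using only the JDK (Java 11+ `java.net.http`), not jsoup itself. The class name `StaticHtmlCheck`, the helper names, the sample markup, and the `loadMoreItems()` script call are all made up for illustration. It counts how often a marker such as `listingid="` appears in the unrendered response. If that count is closer to 50 than to 200, the remaining items are most likely added by JavaScript after the page loads.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class StaticHtmlCheck {

    // Counts non-overlapping occurrences of marker in html.
    public static int countOccurrences(String html, String marker) {
        if (marker.isEmpty()) {
            return 0;
        }
        int count = 0;
        int from = 0;
        while ((from = html.indexOf(marker, from)) != -1) {
            count++;
            from += marker.length();
        }
        return count;
    }

    // Downloads the response body as text; no JavaScript is executed.
    public static String fetchRaw(String url) throws IOException, InterruptedException {
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder(URI.create(url)).GET().build();
        return client.send(request, HttpResponse.BodyHandlers.ofString()).body();
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical stand-in page: two items exist in the markup, and a
        // script (never run here) would add the rest in a real browser.
        String sample = "<ul><li listingid=\"1\"></li><li listingid=\"2\"></li></ul>"
                + "<script>loadMoreItems();</script>";

        // With a URL argument, inspect that page; otherwise use the sample.
        String html = args.length > 0 ? fetchRaw(args[0]) : sample;
        System.out.println(countOccurrences(html, "listingid=\""));
    }
}
```

Run with no arguments, it uses the built-in sample; with a URL as the first argument, it fetches that page instead. Some sites respond differently depending on request headers, so treat the count as a rough indicator rather than proof.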

Related questions

Get all elements with Jsoup
Jsoup select is not fetching all elements
Jsoup select not returning all elements
Jsoup select and iterate all elements
Scraping with Jsoup
Scraping an image source nested within several elements in JSoup
Jsoup: get text from all children elements
JSoup get all elements based on class
Jsoup doesn't extract all the expected elements
Why JSoup does not read all the elements of the page?


Notice: The technical posts on this site are licensed under CC BY-SA 4.0. If you repost them, please credit this site's URL or the original address.
