R中使用RVest和XML2提取表进行Web抓取

Question

I wish to extract the table with the ranks and returns from a sample URL https://www.valueresearchonline.com/funds/fundSelector/returns.asp?cat=10&exc=susp%2Cclose&rettab=st 我希望从示例URL https://www.valueresearchonline.com/funds/fundSelector/returns.asp?cat=10&exc=susp%2Cclose&rettab=st提取具有排名和回报的表格

So far tried rvest 到目前为止尝试过rvest

#Reading the HTML code from the website
webpage <- read_html(urlString)

#Using CSS selectors to scrap the section
tables <- webpage %>% html_node("tr") %>% html_text()
tables <- html_node(".fundtool_cat") %>% html_text()

I need a dataframe/table with name of the scheme along with ranks and returns for all periods mentioned Any help would be appreciated. 我需要一个带有方案名称的数据框/表，以及所提及的所有期间的等级和收益。希望能有所帮助。

Answer 1

Hope this helps! 希望这可以帮助！

library(rvest)
urlString <- "https://www.valueresearchonline.com/funds/fundSelector/returns.asp?cat=10&exc=susp%2Cclose&rettab=st"
urlString %>%
  read_html() %>%
  html_nodes(xpath='//*[@id="fundCatData"]/table[1]') %>%
  html_table(fill=T)

R中使用RVest和XML2提取表进行Web抓取

问题描述

1 个解决方案

解决方案1
2 已采纳 2017-09-07 19:58:27

R中使用RVest和XML2提取表进行Web抓取

问题描述

1 个解决方案

解决方案1 2 已采纳 2017-09-07 19:58:27

解决方案1
2 已采纳 2017-09-07 19:58:27