Web Scraping a table into R

Question

I'm new to trying to web scrape, and am sure there's a very obvious answer I'm missing here, but have exhausted every post I can find on using rvest, XML, xml2, etc on reading a table from the web into R, and I've had no success.

An example of the table I'm looking to scrape can be found here: https://www.eliteprospects.com/iframe_player_stats.php?player=364033

I've tried

EXAMPLE <- read_html("http://www.eliteprospects.com/iframe_player_stats.php? 
player=364033")
EXAMPLE


URL <- 'http://www.eliteprospects.com/iframe_player_stats.php?player=364033'
table <- URL %>%  
read_html %>% 
html_nodes("table")

But am unsure what to do with these results to get them into a dataframe, or anything usable.

Answer 1

You need to extract the correct html_nodes , and then convert them into a data.frame . The code below is an example of how to go about doing something like this. I find Selector Gadget very useful for finding the right CSS selectors.

library(tidyverse)
library(rvest)

# read the html
html <- read_html('http://www.eliteprospects.com/iframe_player_stats.php?player=364033')

# function to read columns
read_col <- function(x){
  col <- html %>%  
    # CSS nodes to select by using selector gadget
    html_nodes(paste0("td:nth-child(", x, ")")) %>% 
    html_text()
  return(col)
}

# apply the function
col_list <- lapply(c(1:8, 10:15), read_col)

# collapse into matrix
mat <- do.call(cbind, col_list)

# put data into dataframe
df <- data.frame(mat[2:nrow(mat), ] %>% data.frame()) 

# assign names
names(df) <- mat[1, ] 

df

Web Scraping a table into R

Question

1 answers

solution1
0 ACCPTED 2018-06-30 18:46:13

Web Scraping a table into R

Question

1 answers

solution1 0 ACCPTED 2018-06-30 18:46:13

solution1
0 ACCPTED 2018-06-30 18:46:13