简体繁体中英

R, Reading html source code using XML library and htmlTreeParse. I am new to this, so it may be a simple solution

原文 2021-04-29 22:53:06 8 2 html/ r/ xml

I want to be able to read in the source code to extract nodes from the HTML file.

library(XML)
url <- ("https://www.mlb.com/marlins")
html <- htmlTreeParse(url, useInternal=T)

The issue is when I try this i get an error message saying: "XML content does not seem to be XML: '' "

thanks ahead of time

2 answers

Because it is really not an XML file. To read the source code, try the following script

library(httr)
html <- httr::content(httr::GET("https://www.mlb.com/marlins"))

You can use rvest::read_html to read the source.

data <- rvest::read_html("https://www.mlb.com/marlins")

Reading XML data into R from a html source

Web scraping: html structure visible with chrome developer tool, but not with htmlTreeParse (R)

How do I scrape information from website source code/html using R?

Need to change HTML structure so it will work with a script I am using

i am experiencing an unusual behaviour in my html code. what may be the problem?

How to display XML source code using HTML with Emacs?

HTML code I am using will not display properly?

Reading Web 2.0 HTML Source Code with Perl

Reading in HTML/XML PDF file formats into R

How to get rendered source code of an html page from code behind so I can send it in mail

暂无

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Reading XML data into R from a html source Web scraping: html structure visible with chrome developer tool, but not with htmlTreeParse (R) How do I scrape information from website source code/html using R? Need to change HTML structure so it will work with a script I am using i am experiencing an unusual behaviour in my html code. what may be the problem? How to display XML source code using HTML with Emacs? HTML code I am using will not display properly? Reading Web 2.0 HTML Source Code with Perl Reading in HTML/XML PDF file formats into R How to get rendered source code of an html page from code behind so I can send it in mail

Related Tags

粤ICP备18138465号 © 2020-2024 STACKOOM.COM