Error with getURL function in R
I am trying to automatically translate text from several languages into English.
I am following the steps explained in this other question: "Google translate via web scraping r",
but I get this error:
Error in function (type, msg, asError = TRUE) : Illegal characters found in URL
My code is:
getParam <- as.character(db$text)
translateFrom <- as.character(db$language)
translateTo <- "en"
search <- gsub(" ", "%20", getParam)
URL <- paste("https://translate.google.pl/m?hl=",translateFrom,"&sl=",translateFrom,"&tl=",translateTo,"&ie=UTF-8&prev=_m&q=",search,sep="", ssl.verifypeer = FALSE)
page <- getURL(URL)
tree <- htmlTreeParse(page)
body <- tree$children$html$children$body
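One detail in the code above may be worth checking: ssl.verifypeer = FALSE is passed to paste(), not to getURL(). paste() treats every argument other than sep and collapse as another string to concatenate, so the URL ends in the literal text FALSE. Below is a minimal base R sketch of that behaviour. The example.org address and the query word are placeholders for illustration, not taken from the question, and this may not be the only cause of the error.

```r
# Placeholder URL; example.org is used only for illustration.
url_bad <- paste("https://example.org/?q=", "chat", sep = "", ssl.verifypeer = FALSE)
print(url_bad)  # the option was pasted in, so the URL ends in "FALSE"

# Keeping the option out of paste() leaves the URL intact.
url_ok <- paste("https://example.org/?q=", "chat", sep = "")
print(url_ok)

# With RCurl, curl options are normally given to getURL() itself, e.g.
# page <- getURL(url_ok, ssl.verifypeer = FALSE)
```

Note also that when db has several rows, URL is a vector with one element per row rather than a single string.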
library(XML)
library(RCurl)

# Example data: one text and its source language per row
db <- data.frame(text = c("traduire", "tradurre"), language = c("fr", "it"))

# Turn each row into a character vector and translate the rows one at a time
rows <- unlist(apply(db, 1, list), recursive = FALSE)
lapply(rows, function(x) {
  getParam <- as.character(x[1])       # text to translate
  translateFrom <- as.character(x[2])  # source language code
  translateTo <- "en"
  search <- gsub(" ", "%20", getParam)
  URL <- paste("https://translate.google.pl/m?hl=", translateFrom, "&sl=", translateFrom, "&tl=", translateTo, "&ie=UTF-8&prev=_m&q=", search, sep = "")
  page <- getURL(URL)
  tree <- htmlTreeParse(page)
  body <- tree$children$html$children$body
  # The position of the translated text depends on the page layout
  body_text <- body$children[[5]]$children[[1]]
  body_text
})
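The gsub() call only escapes spaces, so text containing accents, line breaks, or characters such as & may still produce a URL that curl rejects. As a sketch, assuming base R's utils::URLencode() is acceptable here, the whole query can be percent-encoded before the URL is built. build_translate_url is a hypothetical helper name, the example rows are made up, and no request is sent.

```r
# Hypothetical helper: build one request URL from a text and a language code.
build_translate_url <- function(text, from, to = "en") {
  # reserved = TRUE also encodes characters such as "&", "?" and "=",
  # in addition to spaces and non-ASCII bytes.
  query <- utils::URLencode(enc2utf8(text), reserved = TRUE)
  paste0("https://translate.google.pl/m?hl=", from,
         "&sl=", from, "&tl=", to,
         "&ie=UTF-8&prev=_m&q=", query)
}

# Made-up example rows, mirroring the structure of db in the post.
db <- data.frame(text = c("traduire un texte", "tradurre"),
                 language = c("fr", "it"),
                 stringsAsFactors = FALSE)

# One URL per row; mapply pairs each text with its language code.
urls <- mapply(build_translate_url, db$text, db$language, USE.NAMES = FALSE)
print(urls)
```

Each element of urls can then be passed to getURL() one at a time, for example inside lapply(), which keeps a failure on one row from hiding the results for the others.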