简体   繁体   English

R中的getURL函数出错

[英]Error with getURL function in r

I am trying to automatically translate a text from different language to English. 我正在尝试自动将文本从其他语言翻译成英语。

i am doing the passages explained on this other question: Google translate via web scraping r 我正在做另一个问题的解释: Google通过网络抓取翻译

but I obtain this error: 但我收到此错误:

Error in function (type, msg, asError = TRUE)  :  Illegal characters found in URL

my code is 我的代码是

 getParam <- as.character(db$text) 
 translateFrom <- as.character(db$language)

 translateTo <- "en"  
 search <- gsub(" ", "%20", getParam) 
 URL <- paste("https://translate.google.pl/m?hl=",translateFrom,"&sl=",translateFrom,"&tl=",translateTo,"&ie=UTF-8&prev=_m&q=",search,sep="", ssl.verifypeer = FALSE)

 page <- getURL(URL)  

 tree <- htmlTreeParse(page)

 body <- tree$children$html$children$body 
library(XML)
library(RCurl)

db=data.frame(text = c("traduire", "tradurre"), langage=c("fr", "it"))

ls = unlist(apply(db, 1, list), recursive = FALSE)

lapply(unlist(apply(db, 1, list), recursive = FALSE), function(x){

  getParam <- as.character(x[1])
  translateFrom <- as.character(x[2])

  translateTo <- "en"
  search <- gsub(" ", "%20", getParam)
  URL <- paste("https://translate.google.pl/m?hl=",translateFrom,"&sl=",translateFrom,"&tl=",translateTo,"&ie=UTF-8&prev=_m&q=",search,sep="")
  page <- getURL(URL)
  tree <-htmlTreeParse(page)
  body <- tree$children$html$children$body
  body_text <- body$children[[5]]$children[[1]]
  body_text

})

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM