簡體   English   中英

R中的getURL函數出錯

[英]Error with getURL function in r

我正在嘗試自動將文本從其他語言翻譯成英語。

我正在做另一個問題的解釋: Google通過網絡抓取翻譯

但我收到此錯誤:

Error in function (type, msg, asError = TRUE)  :  Illegal characters found in URL

我的代碼是

 getParam <- as.character(db$text) 
 translateFrom <- as.character(db$language)

 translateTo <- "en"  
 search <- gsub(" ", "%20", getParam) 
 URL <- paste("https://translate.google.pl/m?hl=",translateFrom,"&sl=",translateFrom,"&tl=",translateTo,"&ie=UTF-8&prev=_m&q=",search,sep="", ssl.verifypeer = FALSE)

 page <- getURL(URL)  

 tree <- htmlTreeParse(page)

 body <- tree$children$html$children$body 
library(XML)
library(RCurl)

db=data.frame(text = c("traduire", "tradurre"), langage=c("fr", "it"))

ls = unlist(apply(db, 1, list), recursive = FALSE)

lapply(unlist(apply(db, 1, list), recursive = FALSE), function(x){

  getParam <- as.character(x[1])
  translateFrom <- as.character(x[2])

  translateTo <- "en"
  search <- gsub(" ", "%20", getParam)
  URL <- paste("https://translate.google.pl/m?hl=",translateFrom,"&sl=",translateFrom,"&tl=",translateTo,"&ie=UTF-8&prev=_m&q=",search,sep="")
  page <- getURL(URL)
  tree <-htmlTreeParse(page)
  body <- tree$children$html$children$body
  body_text <- body$children[[5]]$children[[1]]
  body_text

})

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM