简体   繁体   English

“无法打开连接” - 带雪的HP中的HPC

[英]“Cannot open the connection” - HPC in R with snow

I'm attempting to run a parallel job in R using snow. 我正在尝试使用雪在R中运行并行作业。 I've been able to run extremely similar jobs with no trouble on older versions of R and snow. 我已经能够在旧版本的R和雪上毫无困难地运行非常相似的工作。 R package dependencies prevent me from reverting. R包依赖性阻止我恢复。

What happens: My jobs terminate at the parRapply step, ie, the first time the nodes have to do anything short of reporting Sys.info() . 会发生什么:我的作业在parRapply步骤终止,即第一次节点必须做任何报告Sys.info()事情。 The error message reads: 错误消息显示:

Error in checkForRemoteErrors(val) : 
3 nodes produced errors; first error: cannot open the connection 
Calls: parRapply ... clusterApply -> staticClusterApply -> checkForRemoteErrors

Specs: R 2.14.0, snow 0.3-8, RedHat Enterprise Linux Client release 5.6. 规格: R 2.14.0,雪0.3-8,RedHat Enterprise Linux客户端版本5.6。 The snow package has been built on the correct version of R. 雪包已经建立在正确版本的R上。

Details: The following code appears to execute fine: 详细信息:以下代码似乎执行正常:

cl <- makeCluster(3)
clusterEvalQ(cl,library(deSolve,lib="~/R/library"))
clusterCall(cl,function() Sys.info()[c("nodename","machine")])

I'm an end-user, not a system admin, but I'm desperate for suggestions and insights into what could be going wrong. 我是最终用户,而不是系统管理员,但我迫切需要建议并深入了解可能出现的问题。

This cryptic error appeared because an input file that's requested during program execution wasn't actually present. 出现这种神秘错误是因为在程序执行期间请求的输入文件实际上并不存在。 Each node would attempt to load this file and then fail, but this would result only in a "cannot open the connection" message. 每个节点都会尝试加载此文件然后失败,但这只会导致“无法打开连接”消息。

What this means is that almost anything can cause a "connection" error. 这意味着几乎任何事都可能导致“连接”错误。 Incredibly annoying! 令人难以置信的烦人!

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM