R and snow on Amazon EC2 using StarCluster
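I am trying to run an analysis in parallel in R on an AWS EC2 cluster. I am using starcluster to set up and manage the EC2 cluster, and I am trying to use snow and foreach in R. To start, I have 2 nodes in the cluster: 1 master and 1 worker.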
starcluster start mycluster
starcluster listinstances
-----------------------------------------
mycluster (security group: @sc-mycluster)
-----------------------------------------
....
Cluster nodes:
master running i-xxxxxxxxx masterIP.compute-1.amazonaws.com
node001 running i-xxxxxxxxx node001IP.compute-1.amazonaws.com
Total nodes: 2
starcluster sshmaster mycluster
Then I start R, load the snow package, and try to create a cluster object.
R
library("snow")
cl = makeCluster(c("masterIP.compute-1.amazonaws.com", "node001IP.compute-1.amazonaws.com"), type = "SOCK")
However, this gives me the following error message:
The authenticity of host 'masterIP.compute-1.amazonaws.com (xx.xxx.xx.xx)' can't be established.
ECDSA key fingerprint is xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.
Are you sure you want to continue connecting (yes/no)? yes
Warning: Permanently added 'masterIP.compute-1.amazonaws.com,xx.xxx.xx.xx' (ECDSA) to the list of known hosts.
Permission denied (publickey).
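So I tried copying my ssh key (keyname.rsa specifically) into the .ssh directory on EC2 and trying again. That still didn't work; I got the same Permission denied (publickey). error. I thought starcluster handles setting up ssh and communication between the nodes, so I'm a bit confused about why I can't get this working. I also tried using only node001, that is cl = makeCluster(c("node001IP.compute-1.amazonaws.com"), type = "SOCK"), but the same error occurred.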
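Since snow starts SOCK workers on remote hosts over ssh, one way to narrow down a Permission denied (publickey). error is to test a non-interactive ssh login to each host from within R before calling makeCluster. The following is only a minimal sketch: ssh_check_args and can_ssh are hypothetical helper names, not part of snow or starcluster, and it assumes an ssh client is on the PATH of the node where R runs.

```r
# Hypothetical helper: build ssh arguments for a non-interactive login test.
# BatchMode=yes makes ssh fail instead of prompting for a password or
# for confirmation of an unknown host key.
ssh_check_args <- function(host, timeout = 5) {
  c("-o", "BatchMode=yes",
    "-o", paste0("ConnectTimeout=", timeout),
    host, "true")
}

# Hypothetical helper: TRUE if `ssh host true` exits with status 0.
can_ssh <- function(host, timeout = 5) {
  status <- suppressWarnings(
    system2("ssh", ssh_check_args(host, timeout),
            stdout = FALSE, stderr = FALSE)
  )
  identical(as.integer(status), 0L)
}

# Example with the placeholder host names from the question:
# hosts <- c("masterIP.compute-1.amazonaws.com",
#            "node001IP.compute-1.amazonaws.com")
# sapply(hosts, can_ssh)
```

A FALSE result only says that the login did not succeed without interaction; it may reflect a missing key, an unknown host key, or an unreachable host, so it is a starting point rather than a diagnosis.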
It turns out, after much tinkering, that all that was needed was an update to R version 2.15. After that, the command cl = makeCluster(c("masterIP.compute-1.amazonaws.com", "node001IP.compute-1.amazonaws.com"), type = "SOCK")
ran perfectly.
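For completeness, here is a rough sketch of checking the R version and then using a snow cluster with foreach, which the question mentions but never shows. It assumes the doSNOW package is installed (it is not named in the post), and it uses two local workers in place of the EC2 host names so that it can be tried on a single machine.

```r
# Stop early if this R is older than 2.15.0, the version mentioned in the answer.
stopifnot(getRversion() >= "2.15.0")

library(snow)
library(doSNOW)   # assumption: doSNOW provides the foreach backend for snow
library(foreach)

# Two local workers stand in for the master and node001 host names.
cl <- makeCluster(2, type = "SOCK")
registerDoSNOW(cl)

# Square the numbers 1 to 4 on the workers and combine the results into a vector.
res <- foreach(i = 1:4, .combine = c) %dopar% i^2
print(res)

stopCluster(cl)
```

On the EC2 cluster, the makeCluster call would instead take the vector of host names from the question; the rest of the sketch should stay the same under that assumption.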