[英]Cloudera Manager installation failed to receive heartbeat from agent - to add new hosts to cluster
我嘗試在Ubuntu 12.04.1 LTS上使用標准版本安裝cloudera管理器,當我想添加新主機時,出現下一個錯誤:
Installation failed.Failed to receive heartbeat from agent.
Ensure that the host's hostname is configured properly.
Ensure that port 7182 is accesible on the Cloudera Manager server (check firewall rules).
Ensure that ports 9000 an 9001 are free on the host being added.
Check agent logs in /var/log/cloudera-scm-agent/ on the host being added (some of the logs can be found in the installation details).
在/etc/hosts
文件中,我將其配置為:
127.0.0.1 localhost
127.0.0.1 hadoop-ubuntu
192.168.5.xyz hadoop-ubuntu.dana.local hadoop-ubuntu
192.168.3.xyz ro-m81.dana.local ro-m81
192.168.3.abc ro-m41.dana.local ro-m41
以下行對於支持IPv6的主機是理想的
::1 ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters
The **/var/log/cloudera-scm-agent/cloudera-scm-agent.log** shows the next error::
[09/Oct/2013 16:04:23 +0000] 4532 MainThread agent ERROR Heartbeating to 192.168.5.xyz:7182 failed.
Traceback (most recent call last):
File "/usr/lib64/cmf/agent/src/cmf/agent.py", line 747, in send_heartbeat
response = self.requestor.request('heartbeat', dict(request=heartbeat))
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 145, in request
return self.issue_request(call_request, message_name, request_datum)
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 256, in issue_request
call_response = self.transceiver.transceive(call_request)
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 485, in transceive
result = self.read_framed_message()
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 489, in read_framed_message
response = self.conn.getresponse()
File "/usr/lib64/python2.6/httplib.py", line 990, in getresponse
response.begin()
File "/usr/lib64/python2.6/httplib.py", line 391, in begin
version, status, reason = self._read_status()
File "/usr/lib64/python2.6/httplib.py", line 349, in _read_status
line = self.fp.readline()
File "/usr/lib64/python2.6/socket.py", line 433, in readline
data = recv(1)
error: [Errno 104] Connection reset by peer
請幫助我找到為什么出現此錯誤或我遺漏了什么。
我遇到過同樣的問題。 這就是我的訣竅。
輸入ifconfig並找到您的IP地址。 不是127.0.0.1。
輸入$ hostname並找到您的主機名
編輯/ etc / hosts文件
在那添加您的ipaddress的條目。 就像是
192.168.8.xxx hostname.test.com hostname
重新啟動cloudera服務。 請訪問sonic.test.com:7180,然后重試。 它應該工作。 即使不起作用,也請轉到http://hostname.test.com:7180/cmf/home檢查主機的狀態。
事實證明,即使我遇到心跳錯誤,主機實際上仍已啟動並正在運行。
我遇到了同樣的問題,然后找到了解決方案 。
我用了兩台機器,一台是master
,另一台是slave
具有cloudera-scm-server
。
我在兩台機器上都配置了/etc/hosts
,最后錯誤消失了。
主計算機IP為:192.168.1.10
In Master Machine /etc/hosts
127.0.0.1 localhost
192.168.1.10 <hostname>
從機IP為:192.168.1.8
In Slave Machine /etc/hosts
127.0.0.1 localhost
192.168.1.8 <hostname>
檢查群集中所有節點上的主機文件后,請確保打開安裝程序上的端口7180和7182以及群集節點(安裝程序除外)上的端口9000。
我從Cloudera安裝中收到“檢查器失敗。拋出IO異常”錯誤,直到我查看安裝程序(服務器)日志並看到客戶端無法在端口9000上通信。
我也遇到了同樣的問題,終於解決了。
我的問題是代理的cloudera-scm-agent
的版本與服務器的cloudera-scm-server
,您可以使用dpkg
或yum
進行檢查。
2.在/ var / log / cloudera-scm-agent /中檢查此目錄中的代理日志文件
解決資源: http : //commandstech.com/what-is-heartbeat-in-hadoop-how-to-resolve-heartbeat-lost-in-cloudera-and-hortonworks/
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.