
Ambari 1.7.0 cannot register datanodes in CentOS cluster

Here is another question about getting hosts to register. I am using Ambari 1.7.0 on CentOS 6 machines, and I am trying to install HDP 2.1.

First, here is the hosts file I am using. Note that each node has the same hosts file:

127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
::1         localhost localhost.localdomain localhost6 localhost6.localdomain6
192.168.200.144 datanode10.localdomain.com
192.168.200.107 datanode01.localdomain.com
192.168.200.143 namenode.localdomain.com
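One way to confirm that every node really carries the same entries is to parse the hosts file and look for the expected cluster names. The following is a minimal sketch, not part of Ambari; `parse_hosts`, `missing_hosts`, and the embedded `HOSTS_TEXT` are hypothetical names introduced here, and the text is the hosts file quoted above.

```python
# Hypothetical helper, not part of Ambari: parse hosts-file text and report
# which expected cluster hostnames have no entry.

HOSTS_TEXT = """\
127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
::1         localhost localhost.localdomain localhost6 localhost6.localdomain6
192.168.200.144 datanode10.localdomain.com
192.168.200.107 datanode01.localdomain.com
192.168.200.143 namenode.localdomain.com
"""

EXPECTED = [
    "namenode.localdomain.com",
    "datanode01.localdomain.com",
    "datanode10.localdomain.com",
]


def parse_hosts(text):
    """Map each hostname or alias to the first IP address listed for it."""
    mapping = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and padding
        if not line:
            continue
        fields = line.split()
        ip, names = fields[0], fields[1:]
        for name in names:
            mapping.setdefault(name, ip)  # keep the first match
    return mapping


def missing_hosts(text, expected):
    """Return the expected hostnames that have no entry in the hosts text."""
    mapping = parse_hosts(text)
    return [name for name in expected if name not in mapping]


print(missing_hosts(HOSTS_TEXT, EXPECTED))
```

On a real node, the text could be read from `/etc/hosts` instead of the embedded string. This only checks the file contents; it does not show what name the agent was configured to contact.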

Also, I can ping each machine from any other machine. I can SSH without a password from the name node into the datanodes. I disabled SELinux and iptables on all machines.

I am following the startup procedure listed here: https://cwiki.apache.org/confluence/display/AMBARI/Install+Ambari+1.7.0+from+Public+Repositories . Please note that these install instructions mention nothing about iptables or SELinux. People on the mailing list have told me that I need to disable both.

Ambari can discover the namenode it is running on, but it cannot discover the datanodes. I get this error from the registration log file:

Verifying ambari-agent process status...
Ambari Agent successfully started
Agent PID at: /var/run/ambari-agent/ambari-agent.pid
Agent out at: /var/log/ambari-agent/ambari-agent.out
Agent log at: /var/log/ambari-agent/ambari-agent.log
("WARNING 2014-12-17 10:43:08,349 NetUtil.py:92 - Server at https://namenode.localdomain.com.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode:8440 is not reachable, sleeping for 10 seconds...

Why is "namenode" being appended repeatedly to the namenode.localdomain.com URL? Why is the script treating this as a valid URL instead of throwing an error?
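For illustration, here is a minimal sketch of a syntactic hostname check based on the usual DNS limits (labels of 1 to 63 characters, at most 253 characters in total). It is not Ambari's code and does not explain where the extra suffix comes from; `is_valid_hostname` is a hypothetical helper, and the repeat count of 30 is illustrative rather than the exact count in the log. Every label in the malformed name is well-formed on its own, so only the total-length limit would reject it.

```python
import re

# One DNS label: 1-63 letters, digits, or hyphens, not starting or ending
# with a hyphen.
_LABEL = re.compile(r"^(?!-)[A-Za-z0-9-]{1,63}(?<!-)$")


def is_valid_hostname(name):
    """Return True if name satisfies the usual DNS hostname syntax limits."""
    if name.endswith("."):
        name = name[:-1]  # allow a trailing dot (fully qualified form)
    if not name or len(name) > 253:
        return False
    return all(_LABEL.match(label) for label in name.split("."))


GOOD = "namenode.localdomain.com"
# Illustrative reconstruction of the malformed name; 30 repeats is an
# assumption, chosen only to exceed the 253-character limit.
BAD = GOOD + ".namenode" * 30

print(is_valid_hostname(GOOD), len(GOOD))
print(is_valid_hostname(BAD), len(BAD))
```

The log above suggests the agent performs no such check: it attempts to connect and reports the resolver error `[Errno -2] Name or service not known`, then retries.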

What follows is the full registration log file.

==========================
Creating target directory...
==========================

Command start time 2014-12-17 10:43:22

Connection to datanode10.localdomain.com closed.
SSH command execution finished
host=datanode10.localdomain.com, exitcode=0
Command end time 2014-12-17 10:43:22

==========================
Copying common functions script...
==========================

Command start time 2014-12-17 10:43:22

scp /usr/lib/python2.6/site-packages/ambari_commons
host=datanode10.localdomain.com, exitcode=0
Command end time 2014-12-17 10:43:23

==========================
Copying OS type check script...
==========================

Command start time 2014-12-17 10:43:23

scp /usr/lib/python2.6/site-packages/ambari_server/os_check_type.py
host=datanode10.localdomain.com, exitcode=0
Command end time 2014-12-17 10:43:23

==========================
Running OS type check...
==========================

Command start time 2014-12-17 10:43:23
Cluster primary/cluster OS type is redhat6 and local/current OS type is redhat6

Connection to datanode10.localdomain.com closed.
SSH command execution finished
host=datanode10.localdomain.com, exitcode=0
Command end time 2014-12-17 10:43:23

==========================
Checking 'sudo' package on remote host...
==========================

Command start time 2014-12-17 10:43:23
sudo-1.8.6p3-15.el6.x86_64

Connection to datanode10.localdomain.com closed.
SSH command execution finished
host=datanode10.localdomain.com, exitcode=0
Command end time 2014-12-17 10:43:24

==========================
Copying repo file to 'tmp' folder...
==========================

Command start time 2014-12-17 10:43:24

scp /etc/yum.repos.d/ambari.repo
host=datanode10.localdomain.com, exitcode=0
Command end time 2014-12-17 10:43:24

==========================
Moving file to repo dir...
==========================

Command start time 2014-12-17 10:43:24

Connection to datanode10.localdomain.com closed.
SSH command execution finished
host=datanode10.localdomain.com, exitcode=0
Command end time 2014-12-17 10:43:24

==========================
Copying setup script file...
==========================

Command start time 2014-12-17 10:43:24

scp /usr/lib/python2.6/site-packages/ambari_server/setupAgent.py
host=datanode10.localdomain.com, exitcode=0
Command end time 2014-12-17 10:43:24

==========================
Running setup agent script...
==========================

Command start time 2014-12-17 10:43:24
Verifying Python version compatibility...
Using python  /usr/bin/python2.6
Found ambari-agent PID: 3622
Stopping ambari-agent
Removing PID file at /var/run/ambari-agent/ambari-agent.pid
ambari-agent successfully stopped
Restarting ambari-agent
Verifying Python version compatibility...
Using python  /usr/bin/python2.6
ambari-agent is not running. No PID found at /var/run/ambari-agent/ambari-agent.pid
Verifying Python version compatibility...
Using python  /usr/bin/python2.6
Checking for previously running Ambari Agent...
Starting ambari-agent
Verifying ambari-agent process status...
Ambari Agent successfully started
Agent PID at: /var/run/ambari-agent/ambari-agent.pid
Agent out at: /var/log/ambari-agent/ambari-agent.out
Agent log at: /var/log/ambari-agent/ambari-agent.log
("WARNING 2014-12-17 10:43:08,349 NetUtil.py:92 - Server at https://namenode.localdomain.com.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode:8440 is not reachable, sleeping for 10 seconds...
INFO 2014-12-17 10:43:18,359 NetUtil.py:48 - Connecting to https://namenode.localdomain.com.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode:8440/ca
WARNING 2014-12-17 10:43:18,360 NetUtil.py:71 - Failed to connect to https://namenode.localdomain.com.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode:8440/ca due to [Errno -2] Name or service not known  
WARNING 2014-12-17 10:43:18,360 NetUtil.py:92 - Server at https://namenode.localdomain.com.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode:8440 is not reachable, sleeping for 10 seconds...
INFO 2014-12-17 10:43:28,370 NetUtil.py:48 - Connecting to https://namenode.localdomain.com.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode:8440/ca
WARNING 2014-12-17 10:43:28,370 NetUtil.py:71 - Failed to connect to https://namenode.localdomain.com.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode:8440/ca due to [Errno -2] Name or service not known  
WARNING 2014-12-17 10:43:28,371 NetUtil.py:92 - Server at https://namenode.localdomain.com.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode:8440 is not reachable, sleeping for 10 seconds...
INFO 2014-12-17 10:43:31,082 main.py:83 - loglevel=logging.INFO
INFO 2014-12-17 10:43:31,082 main.py:55 - signal received, exiting.
INFO 2014-12-17 10:43:31,082 ProcessHelper.py:39 - Removing pid file
INFO 2014-12-17 10:43:31,083 ProcessHelper.py:46 - Removing temp files
INFO 2014-12-17 10:43:36,764 main.py:83 - loglevel=logging.INFO
INFO 2014-12-17 10:43:36,764 DataCleaner.py:36 - Data cleanup thread started
INFO 2014-12-17 10:43:36,765 DataCleaner.py:117 - Data cleanup started
INFO 2014-12-17 10:43:36,767 DataCleaner.py:119 - Data cleanup finished
INFO 2014-12-17 10:43:36,801 PingPortListener.py:51 - Ping port listener started on port: 8670
WARNING 2014-12-17 10:43:36,802 main.py:235 - Unable to determine the IP address of the Ambari server 'namenode.localdomain.com.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode'
INFO 2014-12-17 10:43:36,802 NetUtil.py:48 - Connecting to https://namenode.localdomain.com.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode:8440/ca
WARNING 2014-12-17 10:43:36,802 NetUtil.py:71 - Failed to connect to https://namenode.localdomain.com.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode:8440/ca due to [Errno -2] Name or service not known  
WARNING 2014-12-17 10:43:36,802 NetUtil.py:92 - Server at https://namenode.localdomain.com.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode.namenode:8440 is not reachable, sleeping for 10 seconds...
", None)

Connection to datanode10.localdomain.com closed.
SSH command execution finished
host=datanode10.localdomain.com, exitcode=0
Command end time 2014-12-17 10:43:39

Registering with the server...
Registration with the server failed.

What do you have specified as the hostname in /etc/ambari-agent/conf/ambari-agent.ini? I assume that it is 'namenode.localdomain.com'.
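To check that value on each datanode, the file can be read with a standard INI parser. This is a sketch under assumptions: it supposes the server name lives under a `[server]` section in a `hostname` key, and `SAMPLE_INI` is made-up content for illustration, not the asker's actual file. `server_hostname` is a hypothetical helper.

```python
import configparser

# Illustrative content only; the [server]/hostname layout is an assumption
# about ambari-agent.ini, not a copy of the asker's file.
SAMPLE_INI = """\
[server]
hostname=namenode.localdomain.com
url_port=8440
"""


def server_hostname(ini_text):
    """Return the hostname configured in the [server] section."""
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(ini_text)
    return parser.get("server", "hostname")


print(server_hostname(SAMPLE_INI))
```

On a real node the text would come from `/etc/ambari-agent/conf/ambari-agent.ini`. If the value printed there already contains the repeated `.namenode` suffix, the file itself is where the malformed name is stored.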
