AWS EMR cluster - scaling up did not update the dfs.replication value from 1 to 2

Question

I provisioned an AWS EMR HBase cluster with 1 master node and 1 core node (m5.xlarge). My cluster doesn't have any task nodes, as I plan to use it only for storage. The hdfs-site.xml file on both boxes had dfs.replication set to 1, which makes sense. I then manually added 5 more core nodes. I was hoping EMR would bump the replication factor from 1 to 2 as per their docs - https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-hdfs-config.html

As I understand it, EMR will set the replication factor to 2 if I supply 6 core nodes during bootstrap, but what about my use case, where I manually scaled the cluster up after it was already up and running?

Answer 1

Looks like EMR won't do it automatically. After scaling the cluster up, I need to change the replication factor myself by reconfiguring the instance group - https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-configure-apps-running-cluster.html

instanceGroups.json:

[
  {
    "InstanceGroupId": "<ig-1xxxxxxx9>",
    "Configurations": [
      {
        "Classification": "yarn-site",
        "Properties": {
          "yarn.nodemanager.disk-health-checker.enable": "true",
          "yarn.nodemanager.disk-health-checker.max-disk-utilization-per-disk-percentage": "100.0"
        },
        "Configurations": []
      }
    ]
  }
]

aws emr modify-instance-groups --cluster-id <j-2AL4XXXXXX5T9> \
  --instance-groups file://instanceGroups.json
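
Note that the sample JSON above sets yarn-site properties, not the replication factor. For dfs.replication, the same structure would presumably carry the hdfs-site classification instead. A minimal sketch, assuming the instance group ID is still the placeholder <ig-1xxxxxxx9> and that 2 is the target value (both are assumptions, not taken from a tested cluster):

```json
[
  {
    "InstanceGroupId": "<ig-1xxxxxxx9>",
    "Configurations": [
      {
        "Classification": "hdfs-site",
        "Properties": {
          "dfs.replication": "2"
        },
        "Configurations": []
      }
    ]
  }
]
```

It would be passed to the same modify-instance-groups command. As far as I know, dfs.replication only applies to files written after the change; existing HDFS files keep their old replication factor until it is changed explicitly, for example with hdfs dfs -setrep -w 2 <path>.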
