简体   繁体   English

Google(GKE)中的Kubernetes集群正在扩展节点

[英]Kubernetes cluster in Google (GKE) is over scaling nodes

I have a Kubernetes cluster in GCP running with several node pools with auto-scaling enabled. 我在GCP中有一个Kubernetes集群,其中运行了几个启用了自动缩放的节点池。 Seems like the auto-scaler is over scaling... Attached is the list of nodes and their usage (I'm using 32CPU machine type). 似乎自动缩放器已超出缩放比例...随附的是节点及其使用情况的列表(我使用的是32CPU机器类型)。

Can't understand why the auto-scaler doesn't merge some of these machines as the usage is far below the capacity and pods running on several machines can easily be merged into one. 无法理解为什么自动缩放器不能合并其中的某些机器,因为使用量远低于容量,并且可以轻松地将多台机器上运行的Pod合并为一个。

I'm not using any special taints/affinities so I can't see why this over scaling is done. 我没有使用任何特殊的污点/亲和力,所以我看不到为什么这样做会过度缩放。 Reading the node auto-scaler documentation doesn't explain this type of behavior either. 阅读节点自动定标器文档也无法解释此类行为。

Columns: status, requested CPU, Total CPU, requested RAM, Total RAM 列:状态,请求的CPU,总CPU,请求的RAM,总RAM

节点列表

As you confirmed about kube-system pods, you might have some kube-system pods running in the nodes that are preventing the autoscaler from removing the nodes; 正如您确认的关于kube-system pod的那样,您可能在节点中运行了一些kube-system pod,这阻止了自动缩放器删除节点。 please see this . 请看这个

Regarding the logs of the cluster-autoscaler under GKE, unfortunately I don't think you have such access. 关于GKE下的群集自动缩放器的日志,不幸的是,我认为您没有这种访问权限。 If you have access to the master machine, check Cluster Autoscaler logs in /var/log/cluster-autoscaler.log. 如果您有权访问主计算机,请在/var/log/cluster-autoscaler.log中检查Cluster Autoscaler日志。 Cluster Autoscaler logs a lot of useful information, including why it considers a pod unremovable or what was its scale-up plan. Cluster Autoscaler会记录许多有用的信息,包括为何认为Pod无法移动或其扩展计划是什么。 For more info, you can look at this . 有关更多信息,您可以查看

Please note that it is entirely possible that the node is underutilized, but the pod would not fit anywhere else and so that can be the reason the node can't be removed. 请注意,节点未充分利用的可能性很大,但吊舱无法在其他任何地方使用,因此这可能是无法删除节点的原因。 The logic is documented here . 逻辑记录在这里

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM