监视gke上的CPU / mem使用情况

Question

I recently launched with gke and kubernetes in production. 我最近在生产中推出了gke和kubernetes。 I have regular outages with no obvious reasons. 我有定期停电，没有明显的原因。 No event shows anything, pods are not restarting and seems stable. 没有任何事件显示任何内容，pods没有重启并且似乎稳定。 I have a similar qa env that has no issue at all whereas it's way smaller. 我有一个类似的qa env，根本没有问题，而它的方式更小。

Where can I find potential infos on the outage reason? 我在哪里可以找到停电原因的潜在信息？

Answer 1

You can see monitoring data for your cluster using Stackdriver . 您可以使用Stackdriver查看群集的监视数据。 There's a brief walkthrough of how to use it for GKE in this blog post . 在这篇博客文章中简要介绍了如何将它用于GKE。 You may also want to check out the general Kubernetes application troubleshooting guide . 您可能还想查看一般的Kubernetes应用程序故障排除指南。

What are the symptoms of the outage? 停电有什么症状？

Answer 2

Stack driver makes you pay and configure it... kubernetes comes with a tool for it... just use this: 堆栈驱动程序让你付费并配置它... kubernetes附带了一个工具...只需使用它：

kubectl top nodes

al@host:~/$ kubectl top nodes
NAME                             CPU(cores)   CPU%      MEMORY(bytes)   MEMORY%
gke-learn-pool-1-10f60e0a-s44c   104m         11%       1008Mi          86%

You can also go under clusters -> Cluster -> nodes -> Node 您也可以进入集群 - >集群 - >节点 - >节点

Update: Stack Driver deprecated all load monitoring plugins. 更新：堆栈驱动程序已弃用所有负载监视插件。 It's K8s or the highway now. 现在是K8或高速公路。

监视gke上的CPU / mem使用情况

问题描述

2 个解决方案

解决方案1
4 2016-03-29 01:06:59

解决方案2
3 2018-03-06 23:29:30

监视gke上的CPU / mem使用情况

问题描述

2 个解决方案

解决方案1 4 2016-03-29 01:06:59

解决方案2 3 2018-03-06 23:29:30

解决方案1
4 2016-03-29 01:06:59

解决方案2
3 2018-03-06 23:29:30