Monitoring CPU/mem usage on GKE
I recently launched GKE and Kubernetes in production. I have regular outages with no obvious cause. No events show anything, and the pods are not restarting and seem stable. I have a similar QA environment that has no issues at all, even though it is much smaller.
Where can I find information on the cause of the outages?
You can see monitoring data for your cluster using Stackdriver. There's a brief walkthrough of how to use it for GKE in this blog post. You may also want to check out the general Kubernetes application troubleshooting guide.
What are the symptoms of the outage?
Stackdriver makes you pay for it and configure it... Kubernetes comes with a tool for this... just use this:
kubectl top nodes
al@host:~/$ kubectl top nodes
NAME                             CPU(cores)   CPU%   MEMORY(bytes)   MEMORY%
gke-learn-pool-1-10f60e0a-s44c   104m         11%    1008Mi          86%
You can also go under Clusters -> Cluster -> Nodes -> Node.
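If you have many nodes, reading the `kubectl top nodes` table by eye gets tedious. Here is a minimal sketch that filters it for nodes running hot. It assumes the five-column layout shown in the output above; `hot_nodes` is a hypothetical helper name, and the default threshold of 80 is an arbitrary choice, not a Kubernetes recommendation.

```shell
# Hypothetical helper (not part of kubectl): reads `kubectl top nodes`
# output on stdin and prints every node whose CPU% or MEMORY% is at or
# above a threshold. The threshold is the first argument (default 80).
hot_nodes() {
  awk -v limit="${1:-80}" '
    NR == 1 { next }                      # skip the header row
    {
      cpu = $3; mem = $5                  # CPU% and MEMORY% columns
      sub(/%/, "", cpu); sub(/%/, "", mem)
      if (cpu + 0 >= limit || mem + 0 >= limit)
        printf "%s cpu=%s%% mem=%s%%\n", $1, cpu, mem
    }'
}

# Usage against a live cluster (needs cluster metrics to be available):
#   kubectl top nodes | hot_nodes 80
```

On the sample output above, the node at 86% memory would be reported, which is the kind of pressure worth ruling out when outages have no obvious cause.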
Update: Stackdriver deprecated all load monitoring plugins. It's K8s or the highway now.