简体   繁体   English

在 Prometheus 警报中的活动 state 中无法看到警报

[英]Unable to see Alert in active state in Prometheus Alerts

I am trying to setup alert manager with my prometheus server.我正在尝试使用我的普罗米修斯服务器设置警报管理器。 i can see following alert is triggered properly and notification is available in my slack channel.我可以看到以下警报已正确触发,并且我的松弛频道中提供了通知。 it is with simple expression and triggering if any exporter is down如果任何出口商关闭,它具有简单的表达和触发

ie up== 0即向上== 0

alerts:
    groups:
      - name: Exporter
        rules:
        - alert: exporter-down
          expr: up == 0
          for: 2m
          labels:
            severity: page
          annotations:
            Summary: "Exporter {{ $labels.job }} is down."
            Description: "{{ $labels.job }} has been down for more than 2 minutes."
            GrafanaDashboard: example.com
            Prometheus: example.com
            AlertManager: example.com
            Impact: Unavailability of {{ $labels.job }} will impact our monitoring. We will not able to get Insight of {{ $labels.job }}

when i try to set similar alert rule for response in some specific range.当我尝试为某些特定范围内的响应设置类似的警报规则时。

ie expr aws_applicationelb_target_response_time_average > 0.1 AND aws_applicationelb_target_response_time_average < 0.35即 expr aws_applicationelb_target_response_time_average > 0.1 AND aws_applicationelb_target_response_time_average < 0.35

- name: LoadBalancerWarning
        rules:
        - alert: slowResponseWarning
          expr: aws_applicationelb_target_response_time_average > 0.1 AND aws_applicationelb_target_response_time_average < 0.35
          labels:
            severity: warning
          annotations:
            Summary: "Load Balancer {{ $labels.load_balancer }} response is more than 0.1 seconds but less than 0.35 for {{ $labels.job }} "
            Description: "It is Warning Sign. "
            GrafanaDashboard: example.com
            Prometheus: example.com
            AlertManager: example.com
            Impact: Slow Response Impact User Experience  

在此处输入图像描述

在此处输入图像描述

Not sure if i am missing something.不确定我是否遗漏了什么。 it will be great if someone can provide me pointers for same.如果有人可以为我提供相同的指示,那就太好了。

I can see cloud watch has delay of 5 to 10 minutes.我可以看到云手表有 5 到 10 分钟的延迟。 when i add offset in this alert query for 5 to 10 minutes.当我在此警报查询中添加 5 到 10 分钟的偏移量时。 it is able to trigger alerts它能够触发警报

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM