在Debian上监控Kafka告警可参考以下方案,核心工具为kafka_exporter+Prometheus+Grafana,具体步骤如下:
# docker-compose.yml示例
services:
kafka-exporter:
image: bitnami/kafka-exporter:latest
command: "--kafka.server=broker1:9092 --kafka.server=broker2:9092 --kafka.version=3.2.1"
ports: ["9308:9308"]
scrape_configs指定监控目标。# prometheus.yml示例
scrape_configs:
- job_name: 'kafka-exporter'
metrics_path: '/metrics'
static_configs:
- targets: ['host1:9308', 'host2:9308']
# alert.yml示例
rules:
- alert: KAFKA_BROKER_DOWN
expr: up == 0
for: 1m
labels: { severity: "critical" }
annotations: { summary: "Kafka broker {{ $labels.instance }} is down" }
- alert: MESSAGE_BACKLOG_HIGH
expr: sum(kafka_consumergroup_lag_sum) by (group, topic) > 5000
for: 2m
labels: { severity: "warning" }
其他工具补充
以上方案可灵活组合,满足不同规模集群的监控需求。