animesh kumar
I’m currently using SigNoz to monitor my Kubernetes cluster, and I’m trying to set up alerts for two specific pod conditions:
CrashLoopBackOff
ImagePullBackOff
I tried the following PromQL query for CrashLoopBackOff:
kube_pod_container_status_waiting_reason{reason="CrashLoopBackOff"} > 0
However, that metric does not appear to be available in my setup. I also tried using k8s.container.restarts, but that led to a PromQL error (500 status) in the SigNoz alert builder.
Could you please guide me on:
Whether SigNoz supports alerting on these pod states out of the box?
What the recommended metrics and queries are for setting up such alerts?
Whether kube-state-metrics is required and, if so, how to integrate it with SigNoz properly?
Any assistance or reference documentation would be greatly appreciated.
Thank you in advance for your support.