@Srikanth Chekuri
I would be interested in this question too. Spark jobs can produce Prometheus metrics:
https://spark.apache.org/docs/latest/monitoring.html
The metrics are exposed once this setting is turned on:
spark.ui.prometheus.enabled=true
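For anyone who wants to check what this setting exposes before wiring up a collector, here is a rough Python sketch. It assumes the driver UI is reachable on the default port 4040 and that the executor metrics are served at `/metrics/executors/prometheus`, as described in the Spark monitoring docs. The helper names (`executor_metrics_url`, `parse_prometheus_text`, `fetch_executor_metrics`) are my own, and the parser is deliberately simplified: it assumes each sample line is `series value` with no trailing timestamp.

```python
from typing import Dict
from urllib.request import urlopen


def executor_metrics_url(driver_host: str, ui_port: int = 4040) -> str:
    """Build the URL of the executor metrics endpoint on the driver UI.

    Assumes the default UI port (4040) unless another one is passed in.
    """
    return f"http://{driver_host}:{ui_port}/metrics/executors/prometheus"


def parse_prometheus_text(text: str) -> Dict[str, float]:
    """Parse Prometheus text exposition lines into {series: value}.

    Simplified on purpose: comment lines (# HELP / # TYPE) are skipped and
    each sample line is assumed to be "<series> <value>" with no timestamp.
    """
    samples: Dict[str, float] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the last space so label values containing spaces survive.
        series, _, value = line.rpartition(" ")
        try:
            samples[series] = float(value)
        except ValueError:
            # Skip anything that does not end in a numeric value.
            continue
    return samples


def fetch_executor_metrics(driver_host: str, ui_port: int = 4040,
                           timeout: float = 5.0) -> Dict[str, float]:
    """Fetch and parse the executor metrics from a running Spark driver."""
    url = executor_metrics_url(driver_host, ui_port)
    with urlopen(url, timeout=timeout) as resp:
        return parse_prometheus_text(resp.read().decode("utf-8"))
```

On a cluster you would typically port-forward the driver pod's UI port first and then call `fetch_executor_metrics("localhost")`. This is only a sanity check; the actual scraping would still be done by whatever collector ends up in front of SigNoz.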
In my case, the Spark jobs run on an EKS cluster.
Additionally, Fluent Bit is used as the log processor and forwarder at the cluster level.