Hi everyone! I need some help understand Metrics ...
# support
e
Hi everyone! I need some help understand Metrics in Signoz, or OTEL to be more specific. We have EKS cluster which is being monitored by Signoz (logs/traces/metrics) and I try to show the memory utilization of out nodes on agraph. I've used k8s_node_memory_usage - but the results I get are way off the values I get when executing
kubectl top nodes
. Any idea why I get different results? Also, is there a document with better explenation regarding each metric? Even CPU utilization in Signoz seems to be far from the real utilization when watching with 'kubectl top nodes' or in EC2 dashboard.
s
What is the query? Or do you use some existing dashboard? We use these ourselves and they are correct https://github.com/SigNoz/dashboards/tree/main/k8s-infra-metrics.
e
this is just great.. Thank you for sharing this as I can see now what was wrong with my settings. while I was using
k8s_node_memory_usage
it looks like what I was looking for is
k8s_node_memory_working_set
also instead of
k8s_node_memory_available
i should have used
k8s_node_allocatable_memory
Is there any valid documentation to describe the meaning of each metric?
same goes for
k8s_node_cpu_utilization
why isnt that metric represents the used_cpu/allocatable_cpu?
e
Thanks for you help! I'll look into it
I can see now some explanation about CPU, but still cant make sense out of this
s
Which part is not clear?
e
what is usage an what is working set for example I would assume that usage is what i will get performing
kubectl top nodes
s
Usage is total memory used (this includes the cache such as page cache etc...); Some part of this memory can be evicted when the kernel gets a request from another process for memory. The working set is the amount of memory required for the process to run. working <= usage.
e
thanks for this detailed explanation