Hi, my current setup looks like this 1. A self ho...
# support
s
Hi, my current setup looks like this 1. A self hosted signoz cluster running on K8s with helmchat (signoz/signoz) 2. A k8s cluster where all my work loads are running, its pushing data to the self hosted and its working perfectly fine 3. I tried to onboard another cluster today. but i keep getting this error in agent logs
{"level":"error","ts":1723615085.4947934,"caller":"scraperhelper/scrapercontroller.go:200","msg":"Error scraping metrics","kind":"receiver","name":"hostmetrics","data_type":"metrics","error":"failed to read usage at /hostfs/var/vcap/data/nsx-kube-proxy/rootfs/etc/hosts: permission denied; failed to read usage at /hostfs/var/vcap/data/nsx-kube-proxy/rootfs/etc/hostname: permission denied; failed to read usage at /hostfs/var/vcap/data/nsx-kube-proxy/rootfs/usr/lib/os-release: permission denied; failed to read usage at /hostfs/var/vcap/data/nsx-kube-proxy/rootfs/var/log/nsx-ujo: permission denied; failed to read usage at /hostfs/var/vcap/data/nsx-kube-proxy/rootfs/etc/nsx-ujo/ncp.ini: permission denied; failed to read usage at /hostfs/var/vcap/data/nsx-kube-proxy/rootfs/etc/nsx-ujo/certs: permission denied; failed to read usage at /hostfs/var/vcap/data/nsx-node-agent/rootfs/etc/hosts: permission denied; failed to read usage at /hostfs/var/vcap/data/nsx-node-agent/rootfs/etc/hostname: permission denied; failed to read usage at /hostfs/var/vcap/data/nsx-node-agent/rootfs/usr/lib/os-release: permission denied; failed to read usage at /hostfs/var/vcap/data/nsx-node-agent/rootfs/var/log/nsx-ujo: permission denied; failed to read usage at /hostfs/var/vcap/data/nsx-node-agent/rootfs/etc/nsx-ujo/ncp.ini: permission denied; failed to read usage at /hostfs/var/vcap/data/nsx-node-agent/rootfs/etc/nsx-ujo/certs: permission denied; failed to read usage at /hostfs/var/vcap/data/nsx-node-agent/rootfs/var/vcap/data/garden-cni/container-netns: permission denied","scraper":"filesystem","stacktrace":"<http://go.opentelemetry.io/collector/receiver/scraperhelper.(*controller).scrapeMetricsAndReport|go.opentelemetry.io/collector/receiver/scraperhelper.(*controller).scrapeMetricsAndReport>\n\<http://tgo.opentelemetry.io/collector/receiver@v0.88.0/scraperhelper/scrapercontroller.go:200|tgo.opentelemetry.io/collector/receiver@v0.88.0/scraperhelper/scrapercontroller.go:200>\<http://ngo.opentelemetry.io/collector/receiver/scraperhelper.(*controller).startScraping.func1|ngo.opentelemetry.io/collector/receiver/scraperhelper.(*controller).startScraping.func1>\n\<http://tgo.opentelemetry.io/collector/receiver@v0.88.0/scraperhelper/scrapercontroller.go:176|tgo.opentelemetry.io/collector/receiver@v0.88.0/scraperhelper/scrapercontroller.go:176>"}
4. Then i noticed that deployment.envrionmental stopped working too, while checking through, i decided to downgrade the helm chart to version 0.11.7
helm upgrade --namespace=platform my-release signoz/k8s-infra -f vgfacematch2.yaml --version 0.11.7
everything now seem to work. what could be the issue here? how do i get to use the latest version 0.11.9. i could not find any documentation maybe i'm missing a step
s
You might be running into issues because of this change https://github.com/SigNoz/charts/pull/474. Please make it readable.
s
Thank you, My worker nodes do not run as root user, and i'm not able to make this change, what other alternatives can we have to get this to work