Hello everyone,
1)
We have kubernetes cluster it contains 6 nodes and out of 3 are GPU one, we running the LLM models on it. So have installed the signoz in our kubernetes cluster with helm chart. But not sure how can i do the live monitoring of GPU.
Any have any idea on this, that will be very helpful for me 🙂
👀 1
s
Srikanth Chekuri
07/22/2024, 11:32 AM
How do you generally collect the GPU monitoring data?
n
Nagesh Rathod
07/23/2024, 3:32 AM
we don't have any mechanism on place yet, through signoz we are looking for it first time.
s
Srikanth Chekuri
07/23/2024, 5:04 PM
SigNoz will work as a backend. You need some way to collect and send them to SigNoz.
n
Nagesh Rathod
08/01/2024, 3:17 AM
We are using a azure AKS cluster, now we are monitoring GPU through the azure portal.
But we want to use signoz as central dashboard to monitor everything, so let us know how i can send and catch the gpu stats to signoz. @Ankit Nayan@Srikanth Chekuri Please help me 🙂