Hey guys recently we moved our signoz instance from docker c SigNoz Community #support

John Silvan

05/15/2024, 5:55 AM

Hey guys, we recently moved our SigNoz instance from docker-compose to Kubernetes for better scalability and reliability. We are facing issues where the SigNoz instance crashes: the otel-collector queue fills up and the otel-collector runs out of memory. We have 10 replicas of the otel-collector and are still facing issues. Is this a problem with the ClickHouse database not being able to handle the load? If this is the case, how do I scale the database to handle this load?

nitya-signoz

05/15/2024, 6:32 AM

Are you adding batch processors before sending insert queries to ClickHouse? If yes, try increasing the batch size and the timeout.

John Silvan

05/15/2024, 6:43 AM

By default, doesn't the otel-collector do batch processing? How do I increase the batch size and the timeout? Srikanth had suggested that wouldn't be the problem.

John Silvan

05/15/2024, 6:52 AM

because I can see in the helm chart values that the default send_batch_size is 50k and the timeout is 1s

John Silvan

05/15/2024, 7:01 AM

this is the otel-collector config that I can see

nitya-signoz

05/15/2024, 8:51 AM

Can you try increasing the batch timeout to a higher number? ClickHouse is complaining because too many small insert requests are coming in. @Srikanth Chekuri @Prashant Shahi any more ideas?

John Silvan

05/15/2024, 9:36 AM

I've changed the timeout to 10s now and the batch size is 50k which I think should be good enough
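
For reference, a change like the one described in this message might look roughly like the fragment below. This is only a sketch: `send_batch_size` and `timeout` are the OpenTelemetry Collector batch processor settings named in this thread, but the actual config file was not included here, so where this block sits inside the Helm values is an assumption and should be checked against your own chart.

```yaml
# Sketch only: batch processor section of the otel-collector config.
# Values are the ones quoted in this thread; the surrounding layout
# (for example, nesting under the chart's collector config key) is assumed.
processors:
  batch:
    # Flush once this many items have been buffered...
    send_batch_size: 50000
    # ...or once this much time has passed, whichever comes first.
    # Raised from the 1s default mentioned above so that each replica
    # sends fewer, larger inserts to ClickHouse.
    timeout: 10s
```

The batch processor flushes on whichever limit is hit first, so with a short timeout and many replicas, each replica can still send small inserts even when the batch size is large.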

Srikanth Chekuri

05/25/2024, 4:05 AM

For anyone reading this, it was the disk causing slower merges.
