# support
c
Hello! We're evaluating SigNoz in Kubernetes with the Helm chart provided on the SigNoz website. Our ClickHouse pod restarted last night due to a node restart, but it's been stuck in a crash loop since then. Any ideas on how to resolve this, and how to stop it from happening in the future?
p
n
@Prashant Shahi should have some idea on this
p
The link shared by @panduu Vital is the right one.
c
that seems to have worked, thanks!
p
why does this happen, and how do we avoid it from occurring? any ideas? @Prashant Shahi
c
no idea, it keeps happening on my end and I have to add `/var/lib/clickhouse/flags/force_restore_data` every couple of days
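For reference, creating that flag can be sketched like this. ClickHouse checks for the `force_restore_data` flag file at startup and re-fetches broken replicated parts; on the pod the data directory is typically `/var/lib/clickhouse`, but the sketch below falls back to a temp dir so it runs anywhere:

```shell
# Sketch: create ClickHouse's force_restore_data flag so the server
# re-fetches broken replicated parts on its next start.
# On the actual pod, set CLICKHOUSE_DATA=/var/lib/clickhouse.
CLICKHOUSE_DATA="${CLICKHOUSE_DATA:-$(mktemp -d)}"
mkdir -p "${CLICKHOUSE_DATA}/flags"
touch "${CLICKHOUSE_DATA}/flags/force_restore_data"
echo "created ${CLICKHOUSE_DATA}/flags/force_restore_data"

# In-cluster equivalent (pod name and namespace are hypothetical):
#   kubectl exec -n platform chi-signoz-clickhouse-cluster-0-0-0 -- \
#     touch /var/lib/clickhouse/flags/force_restore_data
```

The flag is consumed (removed) by ClickHouse during startup, which is why it has to be re-created each time the pod crash-loops again.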
p
@Srikanth Chekuri could you please look into this?
c
So this keeps happening and I had to keep adding the flag, but I guess it has now gotten bad enough that it actually corrupted something important, and I'm unable to bring ClickHouse back online.
I just went back and manually deleted anything flagged with "is broken and need manual correction", but I'm unsure why this keeps happening.
We mostly run on spot nodes, so Kubernetes node replacements happen relatively often. I wonder if I need any new flags to make this more resilient? HA?
s
Run your database on a node where node replacements do not occur relatively often.
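One way to follow that advice is to pin the ClickHouse pods to an on-demand node group instead of spot capacity. A sketch of a Helm values fragment, assuming the chart passes a `nodeSelector` through to the ClickHouse pods (the key path is an assumption; verify against the chart's values.yaml) and that you run on EKS, where `eks.amazonaws.com/capacityType` distinguishes on-demand from spot nodes:

```yaml
# values.yaml fragment (key path is an assumption; check the chart)
clickhouse:
  nodeSelector:
    # EKS-managed node groups carry this label; other platforms
    # will need their own on-demand/spot distinguishing label.
    eks.amazonaws.com/capacityType: ON_DEMAND
```

This keeps the stateful database off nodes that get reclaimed, while stateless collectors can stay on spot.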
c
so this was mostly solved by upgrading the default gp2 EBS volume to gp3; the default volume provisioned by the Helm chart wasn't providing enough IOPS 👍
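For anyone hitting the same wall: a gp3 StorageClass can be defined roughly like this (assumes the AWS EBS CSI driver is installed; the name and the IOPS/throughput values are illustrative, though 3000 IOPS / 125 MiB/s is gp3's baseline regardless of volume size) and then referenced from the chart's storage class setting:

```yaml
# Sketch of a gp3 StorageClass for the AWS EBS CSI driver
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: gp3
provisioner: ebs.csi.aws.com
parameters:
  type: gp3
  # gp3 baseline; both can be raised independently of volume size
  iops: "3000"
  throughput: "125"
volumeBindingMode: WaitForFirstConsumer
allowVolumeExpansion: true
```

Unlike gp2, where IOPS scale with volume size, gp3 gives the baseline IOPS even on small volumes, which matters for ClickHouse's merge and restore activity.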