Hi Team I ve done a fresh setup of SigNoz with clickhouse on SigNoz Community #support

Hi Team, I've done a fresh setup of SigNoz with cl...

Divyansh Sharma

02/23/2025, 10:55 AM

Hi Team, I've done a fresh setup of SigNoz with clickhouse on EKS with 2 shards and 2 replicas (https://signoz.io/docs/operate/clickhouse/distributed-clickhouse/#kubernetes-installation), Now, whenever I do a helm upgrade, the signoz-schema-migrator-sync job runs and fails few times due to table not found errors then automatically succeeds.

Error: code: 60, message: There was an error on [chi-signoz-clickhouse-cluster-1-1:9000]: Code: 60. DB::Exception: Could not find table: time_series_v4. (UNKNOWN_TABLE) (version 24.1.2.5 (official build))

In clickhouse logs as well I see missing table errors. error logs of chi-signoz-clickhouse-cluster-0-0-0:

"message":"Code: 60. DB::Exception: Received from chi-signoz-clickhouse-cluster-1-1:9000. DB::Exception: Table signoz_metrics.samples_v4 does not exist.

"message":"Code: 60. DB::Exception: Received from chi-signoz-clickhouse-cluster-1-1:9000. DB::Exception: Table signoz_metrics.samples_v2 does not exist.

Then while setting the retention on UI, it just gets stuck and in the logs of the query service I see it is not able to do GetTTL from clickhouse:

"msg":"http: panic serving 10.10.249.136:55804: runtime error: invalid memory address or nil pointer dereference\ngoroutine 684 [running]:\nnet/http.(*conn).serve.func1()\n\t/home/runner/go/pkg/mod/golang.org/toolchain@v0.0.1-go1.22.7.linux-amd64/src/net/http/server.go:1903 +0xbe\npanic({0x22593a0?, 0x4155d20?})\n\t/home/runner/go/pkg/mod/golang.org/toolchain@v0.0.1-go1.22.7.linux-amd64/src/runtime/panic.go:770 +0x132\<http://ngo.signoz.io/signoz/pkg/query-service/app/clickhouseReader.(*ClickHouseReader).GetTTL(0xc00011f688|ngo.signoz.io/signoz/pkg/query-service/app/clickhouseReader.(*ClickHouseReader).GetTTL(0xc00011f688>, {0x2f132b8, 0xc000d074a0}

I cleared the SQLite db table as well, but it is still stuck. (https://signoz.io/docs/faqs/troubleshooting/#i-am-trying-to-change-the-retention-period-of-traces-but-the-process-gets-stuck-everytime) Am I missing something wrt to the db schemas? Is anyone able to make it work with the latest helm chart appVersion=0.73.0?

Divyansh Sharma

02/24/2025, 5:44 AM

@Prashant Shahi can you please help here

Prashant Shahi

02/24/2025, 6:27 AM

This usually happens the schema migrator doesn't run properly.

Prashant Shahi

02/24/2025, 6:27 AM

can you share the logs of the migrator pods?

Prashant Shahi

02/24/2025, 6:27 AM

Or try re-running it?

Divyansh Sharma

02/24/2025, 12:41 PM

While re-running it again getting this error:

Error: code: 60, message: There was an error on [chi-signoz-clickhouse-cluster-1-1:9000]: Code: 60. DB::Exception: Could not find table: time_series_v4. (UNKNOWN_TABLE) (version 24.1.2.5 (official build))

signoz-schema-migrator-sync-2n7f4.log

Divyansh Sharma

02/25/2025, 9:13 AM

@Prashant Shahi does it give any idea about schema migrator issue or do I need to share some more details?

Divyansh Sharma

02/26/2025, 3:56 AM

Hi Team, can anyone help here. Thanks in advance.

Prashant Shahi

02/26/2025, 2:44 PM

@Srikanth Chekuri can you please help here?

E H

03/13/2025, 10:42 AM

im seeing similar error and get this error during migration @Prashant Shahi

Partha Dev

04/14/2025, 9:48 PM

I am also facing issue with this schema-migrator when I try to helm upgrade my clickhouse cluster. my values file: clickhouse: layout: shardsCount: 2 replicasCount: 2 zookeeper: replicaCount: 3 schemaMigrator: enableReplication: true Doc: I followed: https://signoz.io/docs/operate/clickhouse/distributed-clickhouse/ Output: schema-migrator-sync in stuck: ss provided:

Partha Dev

04/14/2025, 9:50 PM

Is your issue solved ? @Divyansh Sharma

Nagesh Bansal

05/23/2025, 5:30 AM

Hey @Partha Dev @E H @Divyansh Sharma Are you still facing this issue?

Partha Dev

05/28/2025, 7:00 PM

Yes, I could not succeed by following https://signoz.io/docs/operate/clickhouse/distributed-clickhouse/ . I tried it on signoz 79.1. Could you please provide far more details of the docs for production readyness clickhouse with zookeeper ? @Nagesh Bansal

156 Views

Open in Slack

Previous Next