# contributing
h
Hey, I finally got around to trying inverted indexes. First image is with token_bf, where the query ended up doing a complete table scan over 25 million records. Second screenshot is after I modified the schema to use inverted indices. Noticed a significant query performance improvement, roughly 2.5 seconds down to 0.6 seconds. Unfortunately, I hammered the CH server a bit too hard to be able to show screenshots of the number of rows scanned. I didn't find it taking much more storage space than the bloom filter either. Happy to make a PR to modify the schema if you all find this useful. NB: Inverted indexes are an alpha feature: https://clickhouse.com/docs/en/engines/table-engines/mergetree-family/invertedindexes
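For reference, a minimal sketch of the two variants being compared; the table name, columns, and bloom filter parameters are placeholders, not the actual SigNoz schema:

```sql
-- Variant 1: token bloom filter skip index on the log body (parameters illustrative).
CREATE TABLE logs_bf
(
    timestamp DateTime64(9),
    body      String,
    INDEX body_idx body TYPE tokenbf_v1(10240, 3, 0) GRANULARITY 1
)
ENGINE = MergeTree
ORDER BY timestamp;

-- Variant 2: experimental inverted index (needs the experimental flag on current releases).
SET allow_experimental_inverted_index = 1;

CREATE TABLE logs_inverted
(
    timestamp DateTime64(9),
    body      String,
    INDEX body_idx body TYPE inverted(0) GRANULARITY 1
)
ENGINE = MergeTree
ORDER BY timestamp;
```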
a
Nice. We were experimenting with this back when it was not merged, and it had a huge storage impact. We need to check the index storage more closely and query without the cache to compare results. It might be a good idea to engage more on a PR.
We would need processing speed, EXPLAIN query output, and index storage numbers.
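For anyone reproducing this, a hedged sketch of how the EXPLAIN part can be pulled; the table, filter, and token are placeholders:

```sql
-- Show which skip indexes were applied and how many granules/parts they dropped.
EXPLAIN indexes = 1
SELECT count()
FROM logs_inverted
WHERE hasToken(body, 'error');
```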
cc: @nitya-signoz
h
I’ll set it up again and share more info 🙂
n
Yeah, agree with what Ankit suggested. If possible, please increase the sample size to something more than 100 million. Initially we designed our schema by testing it on 1 billion rows of data, because ClickHouse is generally fast and the main differences only show up at larger data scales. Also, from the results it looks like your log body has high cardinality, let me know if that is true.
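If it helps, a rough way to check that; `uniqCombined` gives an approximate distinct count, and `logs`/`body` stand in for the actual table and column:

```sql
-- Approximate distinct log bodies vs total rows, to gauge cardinality.
SELECT
    uniqCombined(body) AS approx_distinct_bodies,
    count()            AS total_rows
FROM logs;
```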
h
> Also, from the results it looks like your log body has high cardinality, let me know if that is true.
Yes. My setup was SigNoz's quickstart, generating traffic with Locust.
I'll find params to bump up the Locust traffic load. How do you recommend running the ClickHouse server? I managed to completely crash it on a 4 CPU, 8 GB RAM node with Docker. Thinking of installing it on K8s now using the ClickHouse operator.
I'm guessing I could also add quota limits to queries to avoid crashing the server.
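Something along these lines might keep runaway queries from taking the node down; the values are illustrative only and would need tuning to the node's actual capacity:

```sql
-- Per-session guards against queries that would exhaust the box.
SET max_memory_usage = 4000000000;   -- ~4 GB per query
SET max_execution_time = 30;         -- seconds
SET max_rows_to_read = 100000000;    -- abort scans beyond ~100M rows
```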
n
Okay, I tested it on a larger machine, a c6a.4xlarge, ingesting about 50k logs/s, using the setup provided here: https://github.com/SigNoz/logs-benchmark/tree/main/signoz . But that might be too much for you to set up; you can raise a PR with the 25 million-log results if your setup is 4 CPU and 8 GB RAM, and we can test it on larger instances.
h
Oh, 4 times more. So how much data was it able to handle? I'm guessing the number of partitions also matters.
n
We ingested 1 billion logs and ran our queries. We didn't push to see how much data it can hold, since the objective was to measure query performance. If the number of partitions is approximately the same and on disk, and you are comparing performance across different index types, then it shouldn't matter.
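A quick way to confirm the two tables have comparable partition counts, rows, and on-disk size before comparing index types; the table names are the placeholders from the earlier sketch:

```sql
-- Partition count, row count, and size on disk per table.
SELECT
    table,
    count(DISTINCT partition)              AS partitions,
    sum(rows)                              AS total_rows,
    formatReadableSize(sum(bytes_on_disk)) AS size_on_disk
FROM system.parts
WHERE active AND table IN ('logs_bf', 'logs_inverted')
GROUP BY table;
```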
h
So I ran this on an 8 CPU, 32 GB VM with 80 million rows. Noticed a couple of things: • Index size is low. • The number of rows processed / granules skipped doesn't show a huge difference when using Flog, but it isn't worse either. I wonder if the cardinality of the data is playing a role here, because I did see a major query performance improvement when logs were Locust-generated, with more API-like logs carrying status codes, etc. My guess is that Flog generates term X every Y minutes, which ends up going into granule Z. Since ClickHouse reads entire granules, it ends up reading almost all rows just because of how the data got distributed.
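On the rows-read point, one way to compare the two variants with caches cleared so repeated runs don't flatter the numbers; the LIKE filter is just a placeholder for whatever test query was run:

```sql
-- Clear caches so repeated runs don't hide the scan cost.
SYSTEM DROP MARK CACHE;
SYSTEM DROP UNCOMPRESSED CACHE;

-- After running the test queries against both tables, compare rows read and latency.
SELECT
    query,
    read_rows,
    query_duration_ms
FROM system.query_log
WHERE type = 'QueryFinish'
  AND query LIKE '%hasToken(body%'
ORDER BY event_time DESC
LIMIT 10;
```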
Ingestion speed seems to be around 6.5k c/s. Ignore the drops; I actually stopped generation during that period.
Looks like with the bloom filter, the index takes more space, and it doesn't seem to skip many granules on a high-cardinality dataset either.
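For the index-size side of the comparison, this should show on-disk size per skip index, assuming both table variants are still loaded (column names may differ slightly across ClickHouse versions):

```sql
-- Compressed and uncompressed size of each data-skipping index, per table.
SELECT
    table,
    name AS index_name,
    type AS index_type,
    formatReadableSize(data_compressed_bytes)   AS compressed,
    formatReadableSize(data_uncompressed_bytes) AS uncompressed
FROM system.data_skipping_indices
WHERE table IN ('logs_bf', 'logs_inverted');
```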