This message was deleted SigNoz Community #general

Join Slack

This message was deleted.

# general

Slackbot

10/23/2023, 1:10 AM

This message was deleted.

Nočnica Mellifera

10/24/2023, 5:07 PM

I'm going to look with the team at what's the max scale we're being used at @Christian Theilemann, will look at what data we can share. In general, SigNoz is using Clickhouse as its datastore, they've written a bit about optimizing performance for log storage, I should ping Dale to see if he ever wrote the promised follow-up

Ankit Nayan

10/25/2023, 12:18 PM

Hi @Christian Theilemann , it is difficult to do exact comparison but we have tested with 5TB/day and 1M active timeseries per minute.

Ankit Nayan

10/25/2023, 12:19 PM

It should definitely easily scale upto 5 times more

Christian Theilemann

10/25/2023, 1:52 PM

have you done something like search/query for a random string (like

foo

) on the entire data (and not just a pre-filtered dataset or specific column) of the last 2 days - which would, in the worst case - scan about 10TB of data? I know that things like that are very problematic in loki and some other systems (in loki this would almost certainly time out), but in humio they optimize queries like these via bloom filters.

Christian Theilemann

10/25/2023, 1:56 PM

Btw, you don't have by chance a way to ingest log data from vector.dev (which is what we're currently using) ? That would make it easy to test things out for me.

Ankit Nayan

10/30/2023, 8:25 AM

bloom filters are already in place. https://github.com/SigNoz/signoz-otel-collector/blob/main/migrationmanager/migrators/logs/migrations/000001_init_db.up.sql#L19 https://clickhouse.com/docs/en/optimize/skipping-indexes#bloom-filter-types

👀 1

Ankit Nayan

10/30/2023, 8:25 AM

You can also use inverted index (still experimental in clickhouse) https://clickhouse.com/docs/en/engines/table-engines/mergetree-family/invertedindexes

59 Views

Open in Slack

Previous Next