# general

Hi, I set up SigNoz for k8s monitoring via OTel. I want to connect directly to ClickHouse so I can create custom queries. I checked the `signoz_metrics` db and found `distributed_time_series_v4` (assuming this is what I should use for getting metrics). Where is the metric value stored?
It exists in the `samples_v4` table, with column name `value`.
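For example, a quick way to peek at the raw rows (a minimal sketch, assuming the `signoz_metrics` database name from above):

```sql
-- Peek at a few raw sample rows; each row is (series fingerprint,
-- timestamp in ms, metric value). Database name taken from the thread.
SELECT fingerprint, unix_milli, value
FROM signoz_metrics.distributed_samples_v4
LIMIT 5;
```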
Is there a reason why the table doesn't contain labels?
Storing the labels with each measurement would be redundant and make storage usage unnecessarily high.
So would joining on fingerprint give the correct result? Or is `unix_milli` required?
Joining on fingerprint will give the correct result. The `unix_milli` on the `samples` table indicates when the measurement was produced.
Awesome, thank you!
Oh, one last thing: for logs, is this good enough?
```sql
SELECT timestamp, body
FROM distributed_logs
WHERE arrayElement(
        resources_string_value,
        indexOf(resources_string_key, 'k8s.container.name')
    ) LIKE 'myapp%'
ORDER BY timestamp DESC
LIMIT 5;
```
yes
For metrics, here is my query:
```sql
SELECT ts.fingerprint, ts.metric_name, samples.unix_milli, samples.value
FROM distributed_time_series_v4 AS ts
JOIN distributed_samples_v4 AS samples
    ON ts.fingerprint = samples.fingerprint
WHERE ts.metric_name = 'container_cpu_utilization'
  AND JSONExtractString(ts.labels, 'k8s_container_name') LIKE 'myapp%'
ORDER BY samples.unix_milli DESC
LIMIT 10;
```
I thought consecutive `samples.unix_milli` values would differ by the scrape interval, but something seems wrong.
The `time_series_v4` table can contain duplicates; you should take care of that in the query. Why are you writing your own queries? Does the query builder not support what you are trying to achieve?
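For example, one way to handle the duplicates (a sketch, not necessarily the query SigNoz itself uses) is to deduplicate the time series table in a subquery before joining:

```sql
-- Sketch: deduplicate distributed_time_series_v4 on fingerprint before
-- joining, since the table can contain multiple rows per series.
SELECT ts.fingerprint, ts.metric_name, samples.unix_milli, samples.value
FROM (
    SELECT DISTINCT fingerprint, metric_name, labels
    FROM distributed_time_series_v4
    WHERE metric_name = 'container_cpu_utilization'
      AND JSONExtractString(labels, 'k8s_container_name') LIKE 'myapp%'
) AS ts
JOIN distributed_samples_v4 AS samples
    ON ts.fingerprint = samples.fingerprint
ORDER BY samples.unix_milli DESC
LIMIT 10;
```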
I need to do some custom logic and use it like an API, so I can use the ClickHouse API directly.
Would `distributed_time_series_v4_1day` contain metrics for the present day?
Yes. `distributed_time_series_v4` attempts to have one row for each unique time series, `distributed_time_series_v4_6hrs` one row per 6 hours, and `distributed_time_series_v4_1day` one row per day. That way, when we query a large duration, we can reduce the amount of data read, which means faster queries.