sudhanshu dev
01/09/2023, 10:23 AMAnkit Nayan
v0.12.0
David Bronke
01/09/2023, 12:08 PMDeploying to AWS
installation instructions with no modifications aside from naming my release signoz-stack
, but I'm getting stuck at the Verify the Installation step; the frontend
, otel-collector
, and otel-collector-metrics
pods have been stuck at Init:0/1
for almost an hour now. Looking at the logs, it seems like otel-collector
is having issues connecting to clickhouse, even though clickhouse-operator
says Running
and seems OK looking at its logs. There is a chi-signoz-stack-clickhouse-cluster-0-0-0
pod, but it's still in Pending
.
Logs from the signoz-stack-otel-collector-init
container:
wget: can't connect to remote host (172.20.85.116): Connection refused
waiting for clickhouseDB
wget: can't connect to remote host (172.20.85.116): Connection refused
waiting for clickhouseDB
Running signoz/troubleshoot
in the platform
namespace gives:
Error: not able to send data to SigNoz endpoint ...
rpc error: code = Unavailable desc = connection error: desc = "transport: Error while dialing dial tcp 172.20.58.69:4317: i/o timeout"
which I think is just because otel-collector
isn't running yet.Fabbio Borges
01/09/2023, 8:10 PMPaulo Henrique de Morais Santiago
01/10/2023, 9:53 PMPaulo Henrique de Morais Santiago
01/10/2023, 10:02 PM- name: HTTPS_PROXY
value: "<http://myproxy>"
- name: HTTP_PROXY
value: "<http://myproxy>"
But using the signoz UI the slack test still dont working. Looking for logs i see a request sended to query service. So after, i put the same envs HTTPS_PROXY and HTTP_PROXY on query-service pod. And strange, after this the front-end UI page alert channels become give only 500 error. I cant save anymore the configs. See the images on below the error after i configure proxys envs on query service pod.
Resume: The notification using slack channel dont works here.Arvin Corpuz
01/11/2023, 2:37 PM$ kubectl exec -it signoz-otel-collector-54889ffcc8-rz86z sh
kubectl exec [POD] [COMMAND] is DEPRECATED and will be removed in a future version. Use kubectl exec [POD] -- [COMMAND] instead.
W0111 22:29:10.199861 77630 gcp.go:119] WARNING: the gcp auth plugin is deprecated in v1.22+, unavailable in v1.26+; use gcloud instead.
To learn more, consult <https://cloud.google.com/blog/products/containers-kubernetes/kubectl-auth-changes-in-gke>
Defaulted container "signoz-otel-collector" out of: signoz-otel-collector, signoz-otel-collector-init (init)
/ $ cat /conf/otel-collector-config.yaml
exporters:
clickhouselogsexporter:
dsn: tcp://${CLICKHOUSE_HOST}:${CLICKHOUSE_PORT}/?username=${CLICKHOUSE_USER}&password=${CLICKHOUSE_PASSWORD}
retry_on_failure:
enabled: true
initial_interval: 5s
max_elapsed_time: 300s
max_interval: 30s
sending_queue:
queue_size: 100
timeout: 10s
clickhousemetricswrite:
endpoint: tcp://${CLICKHOUSE_HOST}:${CLICKHOUSE_PORT}/?database=${CLICKHOUSE_DATABASE}&username=${CLICKHOUSE_USER}&password=${CLICKHOUSE_PASSWORD}
resource_to_telemetry_conversion:
enabled: true
clickhousetraces:
datasource: tcp://${CLICKHOUSE_HOST}:${CLICKHOUSE_PORT}/?database=${CLICKHOUSE_TRACE_DATABASE}&username=${CLICKHOUSE_USER}&password=${CLICKHOUSE_PASSWORD}
prometheus:
endpoint: 0.0.0.0:8889
extensions:
health_check:
endpoint: 0.0.0.0:13133
pprof:
endpoint: localhost:1777
zpages:
endpoint: localhost:55679
processors:
batch:
send_batch_size: 50000
timeout: 1s
memory_limiter: null
resourcedetection:
detectors:
- env
- system
system:
hostname_sources:
- os
timeout: 2s
signozspanmetrics/prometheus:
dimensions:
- default: default
name: service.namespace
- default: default
name: deployment.environment
dimensions_cache_size: 100000
latency_histogram_buckets:
- 100us
- 1ms
- 2ms
- 6ms
- 10ms
- 50ms
- 100ms
- 250ms
- 500ms
- 1000ms
- 1400ms
- 2000ms
- 5s
- 10s
- 20s
- 40s
- 60s
metrics_exporter: prometheus
receivers:
hostmetrics:
collection_interval: 30s
scrapers:
cpu: {}
disk: {}
filesystem: {}
load: {}
memory: {}
network: {}
jaeger:
protocols:
grpc:
endpoint: 0.0.0.0:14250
thrift_http:
endpoint: 0.0.0.0:14268
otlp:
protocols:
grpc:
endpoint: 0.0.0.0:4317
max_recv_msg_size_mib: 16
http:
endpoint: 0.0.0.0:4318
otlp/spanmetrics:
protocols:
grpc:
endpoint: localhost:12345
service:
extensions:
- health_check
- zpages
pipelines:
logs:
exporters:
- clickhouselogsexporter
processors:
- batch
receivers:
- otlp
metrics:
exporters:
- clickhousemetricswrite
processors:
- batch
receivers:
- otlp
metrics/generic:
exporters:
- clickhousemetricswrite
processors:
- resourcedetection
- batch
receivers:
- hostmetrics
metrics/spanmetrics:
exporters:
- prometheus
receivers:
- otlp/spanmetrics
traces:
exporters:
- clickhousetraces
processors:
- signozspanmetrics/prometheus
- batch
receivers:
- jaeger
- otlp
telemetry:
metrics:
address: 0.0.0.0:8888
Nate Brunette
01/11/2023, 7:02 PMdocker run -it --rm signoz/troubleshoot checkEndpoint --endpoint=172.17.0.1:4317
Nate Brunette
01/11/2023, 7:02 PMError: not able to send data to SigNoz endpoint ...
rpc error: code = Unavailable desc = connection error: desc = "transport: Error while dialing dial tcp 172.17.0.1:4317: i/o timeout"
Nate Brunette
01/11/2023, 7:03 PMSrikanth Chekuri
01/11/2023, 7:37 PMCan I access the time variable in Signoz for dashboard queries?Yes, you could use
{{.SIGNOZ_START_TIME}}
and {{.SIGNOZ_END_TIME}}
.
If not then what is it’s purpose for in the dashboard tab?The configure tab allows you to define a variable whose values are queryable from ClickHouse.
Prim Mayora
01/11/2023, 7:44 PMAlexei Zenin
01/12/2023, 10:14 PMTravis Chambers
01/13/2023, 1:07 AMdeploy/docker/clickhouse-setup/otel-collector-metrics-config.yaml
. However, I can't seem to find any of these metrics showing up in SigNoz. Could I get some help troubleshooting?
more info in 🧵Vinayak Singh
01/13/2023, 3:26 AMoluchi orji
01/13/2023, 10:21 AMnode.js
and java
application. Is anything wrong, we are using the following guide hereVinayak Singh
01/13/2023, 3:35 PMPaulo Henrique de Morais Santiago
01/13/2023, 3:56 PMErros:
Warning Unhealthy 15s (x4 over 45s) kubelet Liveness probe failed: Get "<http://100.100.24.178:8080/api/v1/version>": dial tcp 100.100.24.178:8080: connect: connection refused ││ Warning Unhealthy 5s (x11 over 53s) kubelet Readiness probe failed: Get "<http://100.100.24.178:8080/api/v1/version>": dial tcp 100.100.24.178:8080: connect: connection refused
Apoorva
01/14/2023, 10:18 AMApoorva
01/14/2023, 11:56 AMabbot
01/15/2023, 2:12 AMakshay awate
01/15/2023, 6:45 AMcd pkg/query-service
go build -o build/query-service main.go
ClickHouseUrl=<tcp://localhost:9001> STORAGE=clickhouse build/query-service
when i run ./query-service
i get,
/opt/homebrew/Cellar/go/1.19.4/libexec/src/runtime/proc.go:250
2023-01-15T12:12:16.139+0530 FATAL query-service/main.go:68 Failed to create server{error 26 0 Storage type: is not supported in query service}
Any ideas/suggestion would be helpful.vipul vishwakarma
01/16/2023, 4:26 PMThomas Wiradikusuma
01/16/2023, 5:07 PM/metrics
pulled by Signoz from reading https://signoz.io/docs/userguide/send-metrics/#enable-a-prometheus-receiver. Now, I'm using Fly.io, and they already have their own Prometheus (https://fly.io/docs/reference/metrics/#prometheus-on-fly). How can I make my Signoz instance query from Fly.io's Prometheus? Without resorting to federation.Prim Mayora
01/16/2023, 8:40 PMSebastião Pamplona
01/16/2023, 11:13 PMcluster-apm
to host signoz and 2 other clusters cluster-1
and cluster-2
I want to deploy only signoz-k8s-infra-otel-agent
on cluster-1
and cluster-2
to collect metrics and push it to cluster-apm
is this possible using the values.yaml
? thanksJason Brewer
01/17/2023, 10:52 AMlevel
=== error
.
My JSON structured log looks like this:
{
"timestamp": 1673948790110000000,
"id": "2KPcDvuJzmCPDve2CDoFaLib3lM",
"trace_id": "",
"span_id": "",
"trace_flags": 0,
"severity_text": "",
"severity_number": 0,
"body": "⚠️ Auth error: Missing Authorization header",
"resources_string": {},
"attributes_string": {
"env": "dev",
"level": "error",
"log_file_path": "/tmp/logs/udx-services-ingress-out.log",
"service": "ingress"
},
"attributes_int": {},
"attributes_float": {}
}
These docs are a great start but it's not at all obvious to me how I write a query that rips out attributes_string.level
and attributes_string.env
I'm just playing around to see what is available in the query and so far (but no so good!) I've got this:
select
toStartOfInterval(fromUnixTimestamp64Nano(timestamp), INTERVAL 35 MINUTE) AS interval
, toFloat64(count()) as value
, attributes_string_key[1]
FROM signoz_logs.distributed_logs
WHERE timestamp BETWEEN {{.start_timestamp_nano}} AND {{.end_timestamp_nano}}
GROUP BY interval, attributes_string_key;
Which produces this Chart preview (see thread)Vinayak Singh
01/17/2023, 1:23 PMTeymour
01/17/2023, 3:11 PMTimothy Wigginton
01/17/2023, 3:31 PM