Pete Lorenz
08/16/2023, 11:59 PM
{"ts":"2023-08-16 23:26:36.638","level":"WARN","logger":"org.springframework.boot.actuate.redis.RedisReactiveHealthIndicator","thread":"boundedElastic-1","traceId":"0a7e69f9297f9c9a","spanId":"0a7e69f9297f9c9a","msg":"Redis health check failed","exc":"org.springframework.data.redis.RedisConnectionFailureException: Unable to connect to Redis; nested exception is org.springframework.data.redis.connection.PoolException: Could not get a resource from the pool; nested exception is io.lettuce.core.RedisException: Cannot obtain initial Redis Cluster topology\n\tat org.springframework.data.redis.connection.lettuce.LettuceConnectionFactory$ExceptionTranslatingConnectionProvider.translateException(LettuceConnectionFactory.java:1689)\n\tat
In addition, the check-postgres-db job in the initContainer step is stuck in both the metadata-api and sql-executor deployments (it appears to hang with no messages in the logs). We're running Redis 6.2.6 and Postgres 14.5. Please let us know how we can upgrade to GD.CN 2.4.0 while keeping our existing Redis and Postgres services on AWS.
Pete Lorenz
08/17/2023, 3:04 PM
Pete Lorenz
08/17/2023, 3:18 PM
Pete Lorenz
08/17/2023, 4:21 PM
Pete Lorenz
08/17/2023, 4:33 PM
Pete Lorenz
08/17/2023, 4:41 PM
Pete Lorenz
08/17/2023, 4:45 PM
Pete Lorenz
08/17/2023, 4:55 PM
{
"ts": "2023-08-17 16:53:43.025",
"level": "WARN",
"logger": "org.apache.pulsar.client.admin.internal.BaseResource",
"thread": "AsyncHttpClient-12-1",
"msg": "[<http://pulsar-broker.pulsar:8080/admin/v2/persistent/gooddata-cn/gooddata-cn/sql.select>] Failed to perform http put request: javax.ws.rs.ClientErrorException: HTTP 409 Conflict"
}
Robert Moucha
08/18/2023, 7:22 AM
<redis://gooddata-cn-redis.8pswjo.0001.usw2.cache.amazonaws.com?timeout=20s>]: ERR This instance has cluster support disabled
Did you change any Helm values related to Redis, most notably service.redis.clusterMode? It should be set to false (default) if your Redis doesn't have cluster mode enabled.
Robert Moucha
08/18/2023, 7:26 AM
SPRING_REDIS_CLUSTER_NODES that is set only when service.redis.clusterMode=true.
ElastiCache for Redis running in cluster mode is supported, but you need to explicitly turn it on in your AWS ElastiCache instance.
Robert Moucha
08/18/2023, 8:49 AM
psql client. You can connect to that pod's Bash shell using kubectl exec -it and check whether the database host is accessible with the credentials you use.
PGPASSWORD=yourpostgrespassword psql -U postgres -h your-db.host.name
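The two checks suggested above (whether Redis actually has cluster mode enabled, and whether Postgres is reachable with the configured credentials) could be scripted roughly as follows. This is a sketch under assumptions, not part of GoodData.CN: the function names, REDIS_HOST, PG_HOST, PG_PASSWORD, and the pod and namespace placeholders are all hypothetical, and it assumes redis-cli and psql are available wherever the script runs. The 5-second connect timeout is an addition that makes an unreachable database fail quickly instead of hanging.

```shell
#!/bin/sh
# Sketch only: hosts, password, pod and namespace names are placeholders.

# Read the reply of `redis-cli INFO cluster` (passed as $1) and print the
# value that service.redis.clusterMode should have: true, false, or unknown.
cluster_mode_from_info() {
  # INFO replies use CRLF line endings, so strip carriage returns first.
  case "$(printf '%s\n' "$1" | tr -d '\r' |
          awk -F: '$1 == "cluster_enabled" { print $2 }')" in
    1) echo true ;;
    0) echo false ;;
    *) echo unknown; return 1 ;;
  esac
}

# Try a trivial query against Postgres. $1 = host, $2 = password.
# PGCONNECT_TIMEOUT makes an unreachable host fail after 5 seconds.
check_postgres() {
  PGCONNECT_TIMEOUT=5 PGPASSWORD="$2" psql -U postgres -h "$1" -c 'SELECT 1'
}

# Example use against live services (not executed by this sketch):
#   kubectl exec -it <pod-name> -n <namespace> -- bash
#   cluster_mode_from_info "$(redis-cli -h "$REDIS_HOST" INFO cluster)"
#   check_postgres "$PG_HOST" "$PG_PASSWORD"
```

If the first function prints false, service.redis.clusterMode should stay at its default of false. If check_postgres fails or times out from inside the pod, the hang is more likely a network or credentials problem than a chart issue.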
Pete Lorenz
08/23/2023, 4:13 PM
{
"level": "warn",
"ts": "2023-08-23T16:01:20.885694Z",
"logger": "etcd-client",
"caller": "v3@v3.5.9/retry_interceptor.go:62",
"msg": "retrying of unary invoker failed",
"target": "<etcd-endpoints://0xc0001a8000/gooddata-cn-etcd-0.gooddata-cn-etcd-headless.gooddata-cn.svc.cluster.local:2379>",
"attempt": 0,
"error": "rpc error: code = NotFound desc = etcdserver: member not found"
}
In the other cluster (with identical configuration), the gooddata-cn-etcd StatefulSet is Ready with all 3 pods running. Where should we look to debug this further?
Robert Moucha
08/24/2023, 2:12 PM
useInternalQuiverEtcd: false in your custom Helm values file.
Sorry about that. Are there any other outstanding issues with your deployment? You mentioned an inaccessible database; did you manage to fix it?
Pete Lorenz
08/24/2023, 3:25 PM
Robert Moucha
08/24/2023, 3:27 PM
Pete Lorenz
08/24/2023, 3:27 PM