Has anyone tried to load the <http GoodData CN|GoodData CN> GoodData #gooddata-cn

Has anyone tried to load the <GoodData.CN> k8s ins...

Vincil Bishop

03/19/2023, 1:41 PM

Has anyone tried to load the GoodData.CN k8s installation using Docker Desktop Kubernetes? I know that the instructions say it requires at least 3 nodes... and Docker Desktop I think is limited to a single node installation? Should I just use the community edition for local development?

Robert Moucha

03/19/2023, 1:43 PM

Hi Vincil, it's possible to run on single node (for test purposes, of course), but there are some tweaks that need to be done in chart values, both for pulsar and for gooddata-cn

Robert Moucha

03/19/2023, 1:44 PM

I don't recall what exactly needs to be set, but basically you need to disable antiAffinity

Vincil Bishop

03/19/2023, 1:45 PM

whoah nice!

Vincil Bishop

03/19/2023, 1:45 PM

I am new to k8s but learning

Robert Moucha

03/19/2023, 1:46 PM

some components insist on running on a different node. But it is possible to convince it to run on a same node where other pod of the same type is already running

Vincil Bishop

03/19/2023, 1:46 PM

ok, and that would be overridable in the helm chart config

Vincil Bishop

03/19/2023, 1:46 PM

somehow

Robert Moucha

03/19/2023, 1:47 PM

you may also want to set

replicaCount: 1

so save resources on your host.

👍 1

Vincil Bishop

03/19/2023, 1:47 PM

if you come across any hints, let me know here... I will continue to investigate

Robert Moucha

03/19/2023, 1:47 PM

yes, everything you need can be set by extra custom values.yaml.

Vincil Bishop

03/19/2023, 1:47 PM

trying to make the dev environment as much like prod as possible... and Docker Desktop is the tool of choice for the moment...

Vincil Bishop

03/19/2023, 1:48 PM

appreciate the help...I will look

Robert Moucha

03/19/2023, 1:48 PM

I'd rather recommend k3d

Vincil Bishop

03/19/2023, 1:48 PM

ok... I will look

Robert Moucha

03/19/2023, 1:49 PM

k3d allows you running multi-node k8s cluster within docker containers so you don't need to tweak antiaffinity at all - bc you'll actually have 3 k8s worker nodes 😉

Vincil Bishop

03/19/2023, 1:50 PM

it looks like it's geared towards development and that's awesome

Vincil Bishop

03/19/2023, 1:51 PM

I was struggling with minikube... exposing services, etc pulling images in... didn't feel comfortable for an app dev workflow

Robert Moucha

03/19/2023, 1:51 PM

Check out my repo: https://github.com/mouchar/gooddata-cn-tools/tree/master/k3d It's somehow outdated (I don't have much time to maintain it) but you should get a basic insight how it works and what needs to be done

Vincil Bishop

03/19/2023, 1:51 PM

whoah nice!!!

Vincil Bishop

03/19/2023, 1:52 PM

being new to k8s anything helps

Robert Moucha

03/19/2023, 1:53 PM

It's targeted to k3d 4.x - the new 5.x versions have different structure of cmdline parameters, so it will not work out of the box.

Robert Moucha

03/19/2023, 1:57 PM

I was running and developing it on Linux. As far as I know, Docker on MacOS or Windows has some pecularities so may be the script will need some changes.

Vincil Bishop

03/19/2023, 2:06 PM

awesome, I will see what I can do

Vincil Bishop

03/19/2023, 2:06 PM

thanks so much!

Vincil Bishop

03/19/2023, 4:05 PM

This is the error I am facing with pulsar/zookeeper 2.9.2 now... undoubtedly something in pulsar did not start or it can't find something?

Copy code

2023-03-19T16:02:00,455+0000 [QuorumConnectionThread-[myid=1]-1] WARN  org.apache.zookeeper.server.quorum.QuorumCnxManager - Cannot open channel to 2 at election address pulsar-zookeeper-1.pulsar-zookeeper.pulsar.svc.cluster.local:3888
java.net.UnknownHostException: pulsar-zookeeper-1.pulsar-zookeeper.pulsar.svc.cluster.local
 at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:229) ~[?:?]
 at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[?:?]
 at java.net.Socket.connect(Socket.java:609) ~[?:?]
 at org.apache.zookeeper.server.quorum.QuorumCnxManager.initiateConnection(QuorumCnxManager.java:383) [org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
 at org.apache.zookeeper.server.quorum.QuorumCnxManager$QuorumConnectionReqThread.run(QuorumCnxManager.java:457) [org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
 at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) [?:?]
 at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) [?:?]
 at java.lang.Thread.run(Thread.java:829) [?:?]

Vincil Bishop

03/19/2023, 4:05 PM

These are the custom values I am using, and this is on Docker Desktop k8s, not k3s

customized-values-pulsar.yaml

Vincil Bishop

03/19/2023, 4:21 PM

Copy code

affinity:
  anti_affinity: false

in Pulsar is getting me a little farther... not failing... or will fail with different error LOL

Robert Moucha

03/19/2023, 5:12 PM

The error above looks like the zookeeper component pods didn't start. That might be caused by anti-affinity, when you have less than 3 nodes (e.g. if you have just one worker node, 1st pod is scheduled on it, but the 2nd and 3rd can not be placed on the same node - that's basically what anti-affinity is supposed to do). Btw, if you plan to run k8s cluster locally within docker desktop on mac/win, be aware you'll need to allocate a plenty of resources to docker VM - cpu, ram and sufficient disk. Otherwise, the pods will remain uscheduled due to resource starvation - you may tweak pod resource requests/limits, but only to some extent. That's why I was suggesting to set replicas to one.

Vincil Bishop

03/19/2023, 5:49 PM

ah nice... I've been playing with the settings, and the failure was caused by some errors that I had introduced... I actually have pulsar almost completely starting, except for the pulsar-broker-0, which is just in a pending state :

Copy code

ContainersNotInitialized
containers with incomplete status: [wait-bookkeeper-ready]

customized-values-pulsar.yaml

Vincil Bishop

03/19/2023, 5:50 PM

Here are my docker resources:

Robert Moucha

03/19/2023, 6:52 PM

Hm, that's probably related to older pulsar helm chart, that needs

--set initialize=true

to be passed during helm install (or add

initialize: true

into your customized-values-pulsar.yaml file. Newer versions do not need this setting as the chart automatically recognizes whether it is being installed or upgraded. This setting will add two extra k8s Jobs that will initialize bookkeeper and pulsar clusters.

Robert Moucha

03/19/2023, 6:54 PM

But it should not be an issue for decently new pulsar. IDK what version are you trying to install, i recommend using

2.9.4

Robert Moucha

03/19/2023, 6:56 PM

If you're trying to follow my steps in repo, it's basically ok as a general guidance, but some versions need to be updated (there's pulsar 2.7.2 and gooddata-cn 1.5.0 - a really old versions these days).

Vincil Bishop

03/19/2023, 10:00 PM

Ah yes, am just using the repo as a guide... taking some from the good data install instructions as well...I tried with 2.9.4 and the same result... let me try to pass the initialize true?

Vincil Bishop

03/19/2023, 10:45 PM

I've traced it to broker.configData.managedLedgerDefaultEnsembleSize, if there is only one replica... I think that number should be 1? I can get the wait-bookkeeper ready init code to run from another container successfully, but it still seems to not complete...continuing to try

Copy code

bin/apply-config-from-env.py conf/bookkeeper.conf; until bin/bookkeeper shell whatisinstanceid; do
  echo "bookkeeper cluster is not initialized yet. backoff for 3 seconds ...";
  sleep 3;
done; echo "bookkeeper cluster is already initialized"; bookieServiceNumber="$(nslookup -timeout=10 pulsar-bookie | grep Name | wc -l)"; until [ ${bookieServiceNumber} -ge 1 ]; do
  echo "bookkeeper cluster pulsar isn't ready yet ... check in 10 seconds ...";
  sleep 10;
  bookieServiceNumber="$(nslookup -timeout=10 pulsar-bookie | grep Name | wc -l)";
done; echo "bookkeeper cluster is ready";

Vincil Bishop

03/19/2023, 11:02 PM

bumping the resources wastefully high, to 1.0 CPU and 1024Mi RAM, and the

broker.configData.managedLedgerDefaultEnsembleSize: "1"

seemed to have gotten everything working!

Vincil Bishop

03/19/2023, 11:03 PM

all greens in the pulsar namespace now, thanks!!!

Robert Moucha

03/20/2023, 8:31 AM

Ah, right, when you change number of replicas of bookkeeper, you need to change some parameters. 1CPU/1Gi is too high - the settings in https://github.com/mouchar/gooddata-cn-tools/blob/master/k3d/k3d.sh#L288-L334 should work for small-scale deployments.

Vincil Bishop

03/20/2023, 2:11 PM

sure... I can probably lower it... I tried to start the GoodData-cn helm, and there are all kinds of issues running it locally... had to back off of that for now, and will need to revisit in the future

Vincil Bishop

03/20/2023, 2:11 PM

but really appreciate all your help... I learned a lot, and I know I will get the good data helm working soon

Robert Moucha

03/20/2023, 2:33 PM

Glad to help, just let us know when you get back to it.

Vincil Bishop

03/20/2023, 2:44 PM

yessir, so much appreciated!

Vincil Bishop

03/23/2023, 12:35 PM

@Robert Moucha had some more thoughts on all of this... when trying to run the helm charts manually... in a local instance of K8s/Docker Desktop... could it be a processor arch mismatch? I am on a Mac/M1... and I kept working in the cloud with the config... and am getting some failures on the charts when accidentally loading against a gravitron/arm64 arch.

Robert Moucha

03/23/2023, 12:40 PM

Yes, definitely it is the problem you're facing. Both gooddata-cn and pulsar images are amd64 only. Aarch64 is currently supported only in gooddata-cn-ce thanks to ugly hack I implmented to make Pulsar work on M1.

Vincil Bishop

03/23/2023, 12:40 PM

haha I remember that and it was much appreciated 🙂

Robert Moucha

03/23/2023, 12:41 PM

graviton cpus will have the same problem

Vincil Bishop

03/23/2023, 12:41 PM

sure... and it was an accident...I gave up on gravitron unfortunately

Vincil Bishop

03/23/2023, 12:42 PM

now am on the righteous path...and in EKS... I have 3 nodes configured... not trying to deploy on only a single node

Vincil Bishop

03/23/2023, 12:42 PM

each node is t3.medium with 4GB RAM and 2vcpus

Robert Moucha

03/23/2023, 12:42 PM

It's not a problem for us to start building multi-arch images for gooddata-cn. But we're still blocked by the lack of pulsar images for aarch.

Vincil Bishop

03/23/2023, 12:42 PM

sure... that can be a problem for another day

Vincil Bishop

03/23/2023, 12:43 PM

how long should pulsar take to start?

Robert Moucha

03/23/2023, 12:43 PM

t3.medium is too weak. I don't recommend using t3 burstable instances

Vincil Bishop

03/23/2023, 12:43 PM

Vincil Bishop

03/23/2023, 12:44 PM

Amazon EC2 T3 instances are the next generation burstable general-purpose instance type

Vincil Bishop

03/23/2023, 12:44 PM

so t3.large?

Vincil Bishop

03/23/2023, 12:44 PM

or another class

Robert Moucha

03/23/2023, 12:48 PM

no t3 or t3a. the problem with burstable instances is that they have too low baseline performance

Vincil Bishop

03/23/2023, 12:48 PM

ah sorry... I read your response wrong

Vincil Bishop

03/23/2023, 12:48 PM

compute or memory optimized?

Robert Moucha

03/23/2023, 12:48 PM

what gooddata version do you plan to install? 2.3.0?

Vincil Bishop

03/23/2023, 12:49 PM

yessir

Robert Moucha

03/23/2023, 12:52 PM

2.3.0 contains service for PDF exports, and it is memory and cpu intensive. https://www.gooddata.com/developers/cloud-native/doc/2.3/deploy-and-install/cloud-native/requirements/ try 3x c6a.xlarge

Vincil Bishop

03/23/2023, 12:54 PM

ugh accidentally reading old doc version

Open in Slack

Previous Next