Replies: 3 comments 7 replies
-
It can work with multiple Hadoop clusters with the ability that Spark provides in the same engine, IIUC. Or do you mean specifying different HADOOP_CONF_DIR per engine? I remember that you have already provided an option called |
Beta Was this translation helpful? Give feedback.
-
What's the cluster zookeeper storage means? |
Beta Was this translation helpful? Give feedback.
-
If I recall correctly, LB should be K8s vendor dependent |
Beta Was this translation helpful? Give feedback.
-
Motivation
For some company, there might be multiple Hadoop cluster(data center)s.
Currently, a Kyuubi server only serves one cluster.
So multiple Kyuubi servers are needed for multiple clusters.
It seems meaningful to enable Kyuubi as a unified gateway for multiple clusters.
Restriction
To unify multiple clusters, there are some restrictions.
The node that kyuubi service is on should be able to access all these clusters.
The authentication for these clusters should be consistent. For example, if the authentication method is KERBEROS, for these clusters, they need have the same KDC realm or the KDC are trusted for each other. If the authentication method is LDAP, the ldap server endpoint for these clusters should be the same.
There should be a service discovery(for kyuubi server side) that is visible for all these clusters.
For example, if the service discovery is by ZooKeeper, all these clusters should be able to access this Zookeeper service.
If Kyuubi is deployed on k8s, they can use k8s load balancer(name service) as service discovery space for kyuubi server side.
Solution
For Kyuubi on k8s use case(use k8s load balancer for kyuubi server discovery):
For other use case, they need have a service discovery space that could be accessible by all the clusters.
MISC
How do you think about this?
Is it general or just a specific use case?
Beta Was this translation helpful? Give feedback.
All reactions