Repository containing:
- Infrastructure for booting an instance of Apache Polaris that runs locally or in S3 using Spark
- Documentation on adding additional functionality to our polaris fork:
- Install task and initialize dev env
# init commitlint and secret files
task init
- Clone the polaris repository and build the docker image
git clone https://github.com/msc-open-metadata/polaris.git ..
# Apache Polaris is built using Gradle with Java 21+ and Docker 27+
task docker:build:polaris-local
- Clone and build the spark repository
git clone https://github.com/apache/spark.git ..
# Switch to 3.5
# git checkout branch-3.5
task docker:build:spark
- Building the spark-jupyter image and running the application
task docker:build:spark-jupyter-image docker:compose:up-polaris-spark-local
- Bootstrapping an engineer and hr principal:
task rest:bootstrap-engineer
task rest:bootstrap-hr
- Open local notebook. The spark-jupyter container outputs a URL with a token to the local jupyter instance
- Open then local notebook. Example: curl http://localhost:8888/?token=
- Add spec/open-dic-service.yml
1.Build project and generate API
./gradlew assemble
2.Add object and endpoint yaml specification to spec/polaris-management-service.yml