Skip to content

Docker setup to boot an instance of Apache Polaris that runs locally or in S3 using Spark

Notifications You must be signed in to change notification settings

msc-open-metadata/polaris-boot

Repository files navigation

polaris-boot

Repository containing:

  • Infrastructure for booting an instance of Apache Polaris that runs locally or in S3 using Spark
  • Documentation on adding additional functionality to our polaris fork:

Getting started:

  1. Install task and initialize dev env
# init commitlint and secret files
task init
  1. Clone the polaris repository and build the docker image
git clone https://github.com/msc-open-metadata/polaris.git ..

# Apache Polaris is built using Gradle with Java 21+ and Docker 27+
task docker:build:polaris-local
  1. Clone and build the spark repository
git clone https://github.com/apache/spark.git ..
# Switch to 3.5
# git checkout branch-3.5

task docker:build:spark
  1. Building the spark-jupyter image and running the application
task docker:build:spark-jupyter-image docker:compose:up-polaris-spark-local
  1. Bootstrapping an engineer and hr principal:
task rest:bootstrap-engineer
task rest:bootstrap-hr
  1. Open local notebook. The spark-jupyter container outputs a URL with a token to the local jupyter instance

Changes in open-metadata/polaris

  • Add spec/open-dic-service.yml

Adding a new endpoint to Polaris:

1.Build project and generate API

./gradlew assemble

2.Add object and endpoint yaml specification to spec/polaris-management-service.yml

Overview: img

About

Docker setup to boot an instance of Apache Polaris that runs locally or in S3 using Spark

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published