Apache Ozone Contribution guideline

Ozone is an Apache project. The bug tracking system for Ozone is under the Apache Jira project named HDDS.

This document summarize the contribution process:

What can I contribute?

We welcome contributions of:

Code. File a bug and submit a patch, or pick up any one of the unassigned Jiras.
- Newbie Ozone jiras
- All open and unassigned Ozone jiras
Documentation Improvements: You can submit improvements to either:
- Ozone website. Instructions are here: Modifying the Ozone Website
- Developer docs. These are markdown files checked into the Apache Ozone Source tree.
The wiki pages: Please contact us at dev@ozone.apache.org and we can provide you write access to the wiki.
Testing: We always need help to improve our testing
- Unit Tests (JUnit / Java)
- Acceptance Tests (docker + robot framework)
- Blockade tests (python + blockade)
- Performance: We have multiple type of load generator / benchmark tools (ozone freon), which can be used to test cluster and report problems.
Bug reports pointing out broken functionality, docs, or suggestions for improvements are always welcome!

Who To Contact

If you have any questions, please don't hesitate to contact

email: use dev@ozone.apache.org.
chat: You can find the #ozone channel at the ASF slack. Invite link is here
GitHub Discussions: You can also interact with the community using GitHub Discussions.
meeting: We have weekly meetings which is open to anybody. Feel free to join and ask any questions

Building from source

Requirements

Requirements to compile the code:

Unix System
JDK 1.8 or higher
Maven 3.6 or later
Internet connection for first build (to fetch all Maven and Ozone dependencies)

(Standard development tools such as make, gcc, etc. are required.)

Build the project

After installing the requirements (especially Maven) build is as simple as:

mvn clean verify -DskipTests

Useful Maven build options

Use -DskipShade to skip shaded Ozone FS jar file creation. Saves time, but you can't test integration with other software that uses Ozone as a Hadoop-compatible file system.
Use -DskipRecon to skip building Recon Web UI. It saves about 2 minutes.
Use -Pdist to build the binary tarball, similar to the one that gets released

Running Ozone in Docker

Additional requirements for running Ozone in pseudo cluster (including acceptance tests):

docker
docker-compose
jq (utility used heavily by acceptance tests)

After building Ozone locally, you can start your first pseudo cluster:

cd hadoop-ozone/dist/target/ozone-*-SNAPSHOT/compose/ozone
OZONE_REPLICATION_FACTOR=3 ./run.sh -d

See more details in the README and in the docs.

Jira guideline

When creating a new jira for any kind of new feature, improvement or bug, please follow below guideline:

Title: Title should be a one-liner stating the problem.
Description:
- What is the problem? Is it a feature, improvement or bug? Add as many details as possible and related design doc and discussion.
- For new features, add as many details as possible. If it is part of the big feature, attach parent jira.
- For improvement, add the value it will bring. Is it an optimization, code simplification or something else?
- For bugs, add steps to reproduce it. Where the root cause is unknown and needs investigation, it would be great to update the jira description or add the summary once the root cause is identified.
- If it is follow up of another issue, please link the previous jira to it so that context is preserve.
Jira examples: HDDS-9272, HDDS-9322, HDDS-9291, HDDS-8940, HDDS-9282

Contribute your modifications

We use GitHub pull requests for contributing changes to the repository. The main workflow is as follows:

Fork apache/ozone repository (first time) and clone it to your local machine
Enable the build-branch GitHub Actions workflow (defined in .github/workflows/post-commit.yml) in your fork
Ensure a Jira issue corresponding to the change exists in the HDDS project (eg. HDDS-1234)
- Please search Jira before creating a new issue, someone might have already reported the same.
- If this is your first issue, you might not be able to assign it to yourself. If so, please make a comment in the issue, indicating that your are working on it.
Create a local branch for your contribution (eg. git checkout -b HDDS-1234)
Make your changes locally.
- For complex changes, committing each logical part is recommended.
Push your changes to your fork of Ozone
Wait for the build-branch workflow to complete successfully for your commit.
Create a pull request for your changes
- Please include the Jira link, problem description and testing instruction (follow the template)
Set the Jira issue to "Patch Available" state
Address any review comments if applicable
- Create new, incremental commits in your branch. This makes it easy for reviewers to only review the new changes. The committer will take care to squash all your commits when merging the pull request.
- Push your commits in a batch, when no more changes are expected. This reduces the burden on automated CI checks.
- If you need to bring your PR up-to-date with the base branch (usually master), e.g. to resolve conflicts, please do so by merge, not rebase: git merge --no-edit origin/master.
- In general, please try to avoid force-push when updating the PR. Here are some great articles that explain why:
  - https://developers.mattermost.com/blog/submitting-great-prs/#4-avoid-force-pushing
  - https://www.freecodecamp.org/news/optimize-pull-requests-for-reviewer-happiness#request-a-review

Code convention and tests

Basic code conventions followed by Ozone:

2 spaces indentation
120-char line length limit
Apache license header required in most files
no @author tags, authorship is indicated by Git history

These are checked by tools like Checkstyle and RAT.

For IntelliJ users, it is recommended to import and select the Code Style scheme located at:

./hadoop-ozone/dev-support/intellij/ozone-style.xml

See https://www.jetbrains.com/help/idea/configuring-code-style.html#import-code-style for detailed instructions.

Check your contribution

The hadoop-ozone/dev-support/checks directory contains scripts to build and check Ozone. Most of these are executed by CI for every commit and pull request. Running them before creating a pull request is strongly recommended. This can be achieved by enabling the build-branch workflow in your fork and letting GitHub run all of the checks, but most of the checks can also be run locally.

build.sh: compiles Ozone
quick checks (less than 2 minutes)
- author.sh: checks for @author tags
- bats.sh: unit test for shell scripts
- rat.sh: checks for Apache license header
- docs.sh: sanity checks for Ozone documentation
- dependency.sh: compares list of jars in build output with known list
- checkstyle.sh: Checkstyle
moderate (around 10 minutes)
- findbugs.sh: SpotBugs
- kubernetes.sh: very limited set of tests run in Kubernetes environment
slow (around 1 hour or more)
- unit.sh: pure unit tests
- integration.sh: Java-based tests using single JVM "mini cluster"
- acceptance.sh: rather complete set of tests in Docker Compose-based environment

The set of tests run by integration and acceptance may be limited via arguments, please see the scripts for details. This is used by CI to run them in multiple splits to avoid taking too much time.

Some scripts require third-party tools, but most of these are installed during the first run, if needed.

Most scripts (except build.sh) output results in target/<name>, e.g. target/docs.

Using IDE

As Ozone uses Apache Maven it can be developed from any IDE. IntelliJ IDEA is a common choice, here are some suggestions to use it for Ozone development.

Run Ozone from IntelliJ

Ozone components depends on maven classpath. We generate classpath descriptor from the maven pom.xml files to use exactly the same classpath at runtime.

As a result, it's easy to start all the components from IDE as the right classpath (without provided scope) has already been set.

To start Ozone from IntelliJ:

Stop your IDE
Execute the ./hadoop-ozone/dev-support/intellij/install-runconfigs.sh helper script.
Start the IDE
New runner definitions are available from the Run menu.

You can use the installed Run configurations in the following order:

StorageContainerManagerInit (to initialize the SCM dir)
StorageContainerManger (to start SCM)
OzoneManagerInit (to initialize OM, it requires SCM)
OzoneManager
Recon (required by datanode)
Datanode1, Datanode2, Datanode3

Setting up Checkstyle

Checkstyle plugin may help to detect violations directly from the IDE.

Install Checkstyle+IDEA plugin from File -> Settings -> Plugins
Open File -> Settings -> Other settings -> Checkstyle and Add (+) a new Configuration File

Description: Ozone
Use a local checkstyle ./hadoop-hdds/dev-support/checkstyle/checkstyle.xml

Check the pom.xml for the current version of the used checkstyle and use the same version with the plugin (File -> Settings -> Other settings -> Checkstyle)
Open the Checkstyle Tool Window, select the Ozone rule and execute the check

Common problems

Too large generated classes

IntelliJ may not pick up protoc generated classes as they can be very huge. If the protoc files can't be compiled try the following:

Open Help -> Edit custom properties menu.
Add idea.max.intellisense.filesize=10000 entry
Restart your IDE

Bad class file

Sometimes during incremental build IDEA encounters the following error:

bad class file: hadoop-hdds/common/target/classes/org/apache/hadoop/ozone/common/ChunkBufferImplWithByteBufferList$1.class

Usually this can be fixed by removing the class file (outside of the IDE), but sometimes only by a full Rebuild.

CI

The Ozone project uses Github Actions for its CI system. The configuration is described in detail here.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CONTRIBUTING.md

CONTRIBUTING.md

Apache Ozone Contribution guideline

What can I contribute?

Who To Contact

Building from source

Requirements

Build the project

Useful Maven build options

Running Ozone in Docker

Jira guideline

Contribute your modifications

Code convention and tests

Check your contribution

Using IDE

Run Ozone from IntelliJ

Setting up Checkstyle

Common problems

Too large generated classes

Bad class file

CI

Files

CONTRIBUTING.md

Latest commit

History

CONTRIBUTING.md

File metadata and controls

Apache Ozone Contribution guideline

What can I contribute?

Who To Contact

Building from source

Requirements

Build the project

Useful Maven build options

Running Ozone in Docker

Jira guideline

Contribute your modifications

Code convention and tests

Check your contribution

Using IDE

Run Ozone from IntelliJ

Setting up Checkstyle

Common problems

Too large generated classes

Bad class file

CI