Skip to content

Larger than memory, on-disk Zarr dataset buffering #10107

Closed Answered by jackwayneright
jackwayneright asked this question in General
Discussion options

You must be logged in to vote

While trying to implement this, I believe I may have found my answer. If someone can confirm what I think I've found, that would be great though!

It looks like modifying the dataset does not change the Zarr file at all. You need to explicitly call to_zarr again to write to the file. This can be done in 4 ways (as described in this part of the documentation). Just calling to_zarr with the full dataset, thereby re-writing the entire dataset. Calling to_zarr with mode='a' to overwrite individual variables. Calling it with append_dim to append to a specific dimension of the Zarr storage. Or calling it with region to write to a specific region.

In my case, it would seem I will want to batch th…

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
1 reply
@TomNicholas
Comment options

Answer selected by jackwayneright
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants