Skip to content

Conversation

davisusanibar
Copy link
Contributor

Related to #288

Comment on lines +340 to +342
Java Vector module offer read files without compression codec, in case reading
compressed files is required consider to also add Java Compression module
dependency.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Java Vector module offer read files without compression codec, in case reading
compressed files is required consider to also add Java Compression module
dependency.
The ``arrow-vector`` module can only read uncompressed files by itself. Add
a dependency on ``arrow-compression`` to also be able to read compressed files.

Read - From Compressed File
---------------------------

We are providing a path with auto generated arrow files for testing purposes, change that at your convenience.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is it possible to provide a write-compressed-file example too?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmm. There's no practical way to write a compressed file.

import pyarrow as pa
pd.DataFrame({'key': range(4)}).to_feather('lz4.arrow', compression='lz4')
pd.DataFrame({'key': range(4)}).to_feather('zstd.arrow', compression='zstd')
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This isn't checked in?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants