Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for dask 2023.9.2 #99

Merged

Conversation

IzerOnadimQC
Copy link
Collaborator

@IzerOnadimQC IzerOnadimQC commented Sep 29, 2023

Adds support for dask 2023.9.2

Previously, dask would wrap all iterators passed to delayed() with list() or tuple(). As of 2023.9.2, this is no longer the case, therefore, we wrap map objects with list() before passing them into delayed(). Also, we now explicitly set the dask dataframe shuffle method - prior to 2023.9.2 this would be set for us depending on the pyarrow version, but this is no longer the case.

The dask tests are now passing again. I also tested the changes by pip installing this branch into quantcore.thek and running the test suite, which also passed.

@IzerOnadimQC IzerOnadimQC merged commit 1057cbd into data-engineering-collective:main Sep 29, 2023
@IzerOnadimQC IzerOnadimQC deleted the dask-issues branch September 29, 2023 16:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Investigate dask 2023.9.2 issue
2 participants